Fork me on GitHub

scraperby mnmldave

A Google Chrome extension for getting data out of web pages and into spreadsheets.

Get It

Download and install the extension from Google Chrome Extensions.

Use It

Scraper is pretty easy to use if you know some basic XPath syntax (and jQuery selectors). Even if you don't, you can probably scrape by.

Until I have time to write more comprehensive documentation, you can follow these steps to get started:

  1. Find a web page containing some data you want to "scrape".
  2. Highlight some data on the page which is similar to what you want. For example, select a row of a table if you want to scrape all rows.
  3. Right-click on your selection and select the, "Scrape similar..." option.
  4. Some reasonable defaults will be chosen for you based on your selection, but go ahead and tweak them if you want. Press "Scrape" to update the table based on your current options.
  5. When you've basically got the data you want, press "Export to Google Docs..." to save your data in a new spreadsheet.

Of course, once your data is in a spreadsheet, you can do wonderful things with your data such as plugging it into Google Refine.

Credits

Some of the icons used in this extension are from the generous Yusuke Kamiyamane.

Beneath the hood, jQuery is used pretty extensively pretty much everywhere.

Contact

This extension was crafted with care by Dave Heaton (dave@bit155.com). You can follow his Tumblr blog or find him on Twitter.

Developers

Developers can download this project in either zip or tar formats.

You can also clone the project with Git by running:

$ git clone git://github.com/mnmldave/scraper