-
Recent Posts
Our Tweets
- We're hiring a Product Marketing Manager - please pass on to marketing people you know who like data! scraperwiki.com/jobs/#swjob11 1 day ago
- RT @frabcus: Secret! Hidden in this quick start guide for developers to make data tools... is a Wikipedia image scraper https://t.co/4sgpWS… 1 day ago
- Make your own data tool with HTML, Javascript, and Python: bit.ly/18bgdYM 1 day ago
- beta.scraperwiki.com is back up. Engine ticking over nicely. Have a productive Friday everyone :-) 1 day ago
- In the meantime, make sure to check out our blog – including awesome post by @d4nt on spreadsheets and data: blog.scraperwiki.com 1 day ago
Find us on Facebook
Archives
Categories
Meta
Tag Archives: pdf
Scraping the Royal Society membership list
To a data scientist any data is fair game, from my interest in the history of science I came across the membership records of the Royal Society from 1660 to 2007 which are available as a single PDF file. I’ve … Continue reading
Scraping PDFs: now 26% less unpleasant with ScraperWiki
Scraping PDFs is a bit like cleaning drains with your teeth. It’s slow, unpleasant, and you can’t help but feel you’re using the wrong tools for the job. Coders try to avoid scraping PDFs if there’s any other option. But sometimes, there … Continue reading