Tag Archives: pdf

Scraping the Royal Society membership list

To a data scientist any data is fair game, from my interest in the history of science I came across the membership records of the Royal Society from 1660 to 2007 which are available as a single PDF file. I’ve … Continue reading

Posted in Scrapers | Tagged , | 2 Comments

Scraping PDFs: now 26% less unpleasant with ScraperWiki

Scraping PDFs is a bit like cleaning drains with your teeth. It’s slow, unpleasant, and you can’t help but feel you’re using the wrong tools for the job. Coders try to avoid scraping PDFs if there’s any other option. But sometimes, there … Continue reading

Posted in developer | Tagged , , , , | 2 Comments