Tag Archives | pdf

Henry Morris (CEO and social mobility start-up whizz) on getting contacts from PDF into his iPhone

Meet @henry__morris! He’s the inspirational serial entrepreneur that set up PiC and upReach.  They’re amazing businesses that focus on social mobility. We interviewed him for PDFTables.com He’s been using it to convert delegate lists that come as PDF into Excel and then into his Apple iphone. It’s his preferred personal Customer Relationship Management (CRM) system, it’s […]

Announcing PDFTables.com

PDFs were invented at the same time as the web.  As “digital paper”, they’re trustworthy and don’t change behind your back. This has a downside – often the definitive source of published data is a PDF. It’s hard to get tens of thousands of numbers out and into a spreadsheet or database. Copying and pasting is […]

The Tyranny of the PDF

Got a PDF you want to get data from? Try our easy web interface over at PDFTables.com! Why is ScraperWiki so interested in PDF files? Because the world is full of PDF files. The treemap above shows the scale of their dominance. In the treemap the area a segment covers is proportional to the number […]

Table Scraping Is Hard

The Problem NHS trusts have been required to publish data on their expenditure over £25,000 in a bid for greater transparency; A well known B2B publisher came to us to aggregate that data and provide them with information spanning across the hundreds of different trusts, such as: who are the biggest contractors across the NHS? […]

We're hiring!