Tag Archives: scraping
Digging Olympic Data at Londinium MMXII
This is a guest post by Makoto Inoue, one of the organisers of this weekend’s Londinium MMXII hackathon. The Olympics! Only a few days to go until seemingly every news camera on the planet is pointed at the East End … Continue reading
Three hundred thousand tonnes of gold
On 2 July 2012, the US Government debt to the penny was quoted at $15,888,741,858,820.66. So I wrote this scraper to read the daily US government debt for every day back to 1996. Unfortunately such a large number overflows the … Continue reading
Software Archaeology and the ScraperWiki Data Challenge at #europython
There’s a term in technical circles called “software archaeology” – it’s when you spend time studying and reverse-engineering badly documented code, to make it work, or make it better. Scraper writing involves a lot of this stuff. ScraperWiki’s data scientists … Continue reading
Local ScraperWiki Library
It quite annoyed me that you can only use the scraperwiki library on a ScraperWiki instance; most of it could work fine elsewhere. So I’ve pulled it out (well, for Python at least) so you can use it offline. How … Continue reading
Fine set of graphs at the Office of National Statistics
It’s difficult to keep up. I’ve just noticed a set of interesting interactive graphs over at the Office of National Statistics (UK). If the world is about people, then the most fundamental dataset of all must be: Where are the … Continue reading
How to get along with an ASP webpage
Fingal County Council of Ireland recently published a number of sets of Open Data, in nice clean CSV, XML and KML formats. Unfortunately, the one set of Open Data that was difficult to obtain was the list of sets of … Continue reading
Posted in developer, Scrapers
Tagged ASP, Fingal County Council, Ireland, scraperwiki, scraping
6 Comments
Scraping guides: Excel spreadsheets
Following on from the CSV scraping guide, we’ve now added one about scraping Excel spreadsheets. You can get to them from the documentation page. The Excel scraping guide is available in Ruby, Python and PHP. Just as with all documentation, you can … Continue reading
Access government in a way that makes sense to you? Surely not!
alpha.gov.uk uses ScraperWiki, a cutting-edge data-gathering tool, to deliver the results that citizens want. And radically for government, rather than tossing a finished product out onto the web with a team of defenders, this is an experiment in customer … Continue reading
Posted in news, users
Tagged Aidan McGuire, AlphaGov, data, ETL, scraperwiki, scraping, Tom Loosemore
5 Comments
ScraperWiki: A story about two boys, web scraping and a worm
“It’s like a buddy movie,” she said. Not quite the kind of story lead I’m used to. But what do you expect if you employ journalists in a tech startup? “Tell them about that computer game of his that you … Continue reading
Posted in developer, journalism, news, Scrapers
Tagged parliament, pocket money, scraping, spectrum, The Julian
Scrape it – Save it – Get it
I imagine I’m talking to a load of developers. Which is odd, seeing as I’m not a developer. In fact, I decided to lose my coding virginity by riding the ScraperWiki digger! I’m a journalist interested in data as a … Continue reading
Posted in developer, users
Tagged API, datastore, downloading, saving, scraperwiki, scraping