-
Recent Posts
Our Tweets
- RT @CMCLRN: @ScraperWiki Give your neighbours a RT? Less than a week to go until the ICTD Event in Liverpool!! goo.gl/qTsEh #N… 3 days ago
- Nice to see Google Doodle celebrating a Liverpudlian: bit.ly/13xf3mX 4 days ago
- The basic "View in a table" tool on beta.scraperwiki.com is lots faster after lots of work today from the team 5 days ago
- Images and Domains – @frabcus' 4th piece on automatically summarising data: bit.ly/YT40VF 5 days ago
- An interesting question on @quora – "How do I become a data scientist?" b.qr.ae/17Z5a2b 5 days ago
Find us on Facebook
Archives
Categories
Meta
Author Archives: Ian Hopkinson
Book review: Interactive Data Visualization for the web by Scott Murray
Next in my book reading, I turn to Interactive Data Visualisation for the web by Scott Murray (@alignedleft on twitter). This book covers the d3 JavaScript library for data visualisation, written by Mike Bostock who was also responsible for the Protovis … Continue reading
Posted in research
Leave a comment
Book review: JavaScript: The Good Parts by Douglas Crockford
This week I’ve been programming in JavaScript, something of a novelty for me. Jealous of the Dear Leader’s automatically summarize tool I wanted to make something myself, hopefully a future post will describe my timeline visualising tool. Further motivations are … Continue reading
Book Review: Machine Learning in Action by Peter Harrington
Machine learning is about prediction, and prediction is a valuable commodity. This sounds pretty cool and definitely the sort of thing a data scientist should be into, so I picked up Machine Learning in Action by Peter Harrington to get … Continue reading
Book Review: Data Visualization: a successful design process by Andy Kirk
My next review is of Andy Kirk’s book Data Visualization: a successful design process. Those of you on Twitter might know him as @visualisingdata, where you can follow his progress around the world as he delivers training. He also blogs … Continue reading
Book Review: R in Action by Robert I. Kabacoff
This is a review of Robert I. Kabacoff’s book R in Action which is a guided tour around the statistical computing package, R. My reasons for reading this book were two-fold: firstly, I’m interested in using R for statistical analysis and … Continue reading
Tools of the trade
With the experience of a whole week of ScraperWiki, I am starting to appreciate the core tools of the professional Data Scientist. In the past I’ve written scrapers in Matlab, C# and Python. However, the house language for scraping at … Continue reading
I am Ian, Ian I am*
I have an 8 year itch: I spent the first 8 years of my career as an academic ending up a lecturer in physics at UMIST. Then I was a research scientist at a large “fast moving consumer goods” company … Continue reading
Enterprise data analysis and visualization
The topic for today is a paper[1] by members of the Stanford Visualization Group on interviews with data analysts, entitled “Enterprise Data Analysis and Visualization: An Interview Study”. This is clearly relevant to us here at ScraperWiki, and thankfully their analysis fits in with … Continue reading
Posted in thoughts
Leave a comment
Scraping the Royal Society membership list
To a data scientist any data is fair game, from my interest in the history of science I came across the membership records of the Royal Society from 1660 to 2007 which are available as a single PDF file. I’ve … Continue reading