How journalists can use Google Refine to clean ‘dirty’ data sets