The European Investigative Journalism and Dataharvest Conference organised by Journalismfund.eu, is the most relevant networking event for investigative and data journalists in Europe. Dataharvest EIJC17 will take place on Friday 19, Saturday 20 and Sunday morning 21 May 2017, with a pre-conference Hack Day on Thursday 18 May.
Back To Schedule
Sunday, May 21 • 10:45 - 11:45
SubsidyStories.eu: the European Structural Investment Funds uncovered II

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Follow the money - hands-on workshop – finding your way through the data. In the hands-on session we will use the subsidy-story dataset to show how to start finding stories from a dataset that is over 2 million lines big. Participants will learn about the dataset itself, as well as acquire skills to quickly get to know a dataset that is beyond the size of a normal spreadsheet program, running basic statistics and validation strategies that we used. We will first demonstrate the work we did on the hack day for inspiration. This will show what tools can be used to work through a 2 million-line datasets, such as jupiter notebooks, SQL, and the Open Spending toolbox. Starting with the OpenSpending toolbox, we provide a guided tour through the data, adding different filters and answering simple questions by trying different visualisations, and sorting options. In the second part, we already go into advanced mode and take the participants through the SQL-data-miner that is linked to the Open Spending repository and runs on our datasets. We will run basic statistics and some tailored queries together to show how to easily get to know a dataset of this size. Finally, we show how we validated our dataset, walking the participants through different validation strategies we used from comparing it to raw data, to making sure certain variables are always present etc.

avatar for Anna Alberts

Anna Alberts

Project Manager, OKFDE
OKFDE, Open Data, OpenBudgets.EU, Financial Transparency, subsidystories.eu, ESIF, EU Transparency
avatar for Adrian Blanco

Adrian Blanco

Visual reporter, The Washington Post
Adrian Blanco is a Spanish data journalist currently working at The Washington Post. Data stuff, visualizations, reporting and a bit of coding.
avatar for Christian Jørgensen

Christian Jørgensen

Photojournalist, EUP-Berlin GbR
Christian Jørgensen is a photo journalist from Denmark with base in Berlin, who with Follow the Subsidies www.followthesubsidies.eu has been working on collecting and publishing all EU-Subsidies since april 2016 and now together with the Hungarian programmer Balazs Kirch tries to... Read More →
avatar for Michael Peters

Michael Peters

Data Scientist, Open Knowledge Foundation Deutschland
Michael joined OKF Germany as Data Scientist for the projects: EU Subsidy Stories, OpenBudgets.eu and OffenerHaushalt.de. He studied Public Economics and advocates Transparency of Fiscal Data. His main tasks are Data Mining and Data Analysis.
avatar for Marta Rey

Marta Rey

Freelance journalist
I am a freelance journalist and Digital Marketing specialist. Currently, I am a distance student of Data Driven Journalism Postgraduate at the University of Vic. I am a data analyst volunteer for the Open Knowledge Foundation Germany for the Subsidy Stories project.

Sunday May 21, 2017 10:45 - 11:45 CEST
C2.23 Thomas More journalism school at the Campus De Ham, Raghenoplein 21, 2800 Mechelen & Campus De Vest, Zandpoortvest 60, 2800 Mechelen, BELGIUM