Last week, we got another great and widely publicised case of Graph Databases' usefulness throw our way. The ICIJ published their FinCEN Files research, and on top of allowing you to explore the data on their website they also published an anonymised subset of the data as a series of CSV/JSON files. My friends and colleagues Michael Hunger, Will Lyon and the rest of the team, helped with the process of making this subset available as a Neo4j database (see this github repo), and there's even a super easy FinCEN Files Neo4j Sandbox that you can spin up in no time for some investigation fun.
So of course I had to take this data for a spin myself - it seems really important to me that more eyeballs are looking at this, and more people exposing the sometimes very questionable behaviour of the world's largest financial institutions.
Showing posts with label icij. Show all posts
Showing posts with label icij. Show all posts
Tuesday, 29 September 2020
Wednesday, 28 November 2018
Working with the ICIJ Medical Devices dataset in Neo4j
Just last weekend our friends at the ICIJ published another really interesting case of investigative journalism - tracking down and publishing the quite absurd and disturbing practices of the medical devices industry. The entire case with all of the developing stories can be found at https://medicaldevices.icij.org/ - take a look as it really is quite fascinating. Of course that meant that I wanted to see what that data looked like in Neo4j, and if I could have a play. I didn't have time for a full detailed exploration yet - but hopefully this will also give others the opportunity to chime in. So let's see.
The Medical Devices dataset as a graph
This turned out to be surprisingly easy. Just download the Zip file from the ICIJ website: https://medicaldevices.icij.org/download/icij-imddb-2018-11-25.zip, unzip this, and then we get 3 comma-separated-values files:- one for the Devices that are being reported on
- one for the Events that are being reported (whenever something happens to a device (eg. a recall) then that is logged and reported)
- one for the Manufacturers of the medical devices.
Labels:
apoc,
bloom,
graph database,
icij,
import,
investigative journalism,
medical devices,
neo4j,
research
Subscribe to:
Posts (Atom)