For years, Infogreffe has enjoyed a lucrative monopoly on the distribution of legal information about French companies. That monopoly is set to end in the near future, a change intended to foster innovation. As a result, Infogreffe is starting to adopt an open data approach, and it is now possible to access a registry of recently created or terminated companies.
The data is conveniently available as a CSV file. The Infogreffe data set for 2015 includes 13 columns and 108,175 lines. However, the tabular layout of a CSV file or spreadsheet makes it harder to analyze the relationships in the data. To get around that issue, we must think about the entities and relationships hidden in the tabular structure.
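To make the idea of hidden entities and relationships concrete, here is a minimal Python sketch. It is only an illustration: the column names (`name`, `city`, `activity`), the three sample rows, and the relationship names `LOCATED_IN` and `HAS_ACTIVITY` are assumptions made up for this example, not the actual Infogreffe headers or data.

```python
import csv
import io

# Hypothetical sample rows. The column names and values are placeholders
# invented for illustration; they are NOT the real Infogreffe headers or data.
SAMPLE_CSV = """name,city,activity
Company A,Paris,Retail
Company B,Paris,Consulting
Company C,Lyon,Retail
"""


def rows_to_graph(csv_text):
    """Turn tabular rows into a set of nodes and a list of edges."""
    nodes = set()  # (label, value) pairs, e.g. ("City", "Paris")
    edges = []     # (source node, relationship type, target node)
    for row in csv.DictReader(io.StringIO(csv_text)):
        company = ("Company", row["name"])
        city = ("City", row["city"])
        activity = ("Activity", row["activity"])
        # A value repeated across rows (e.g. "Paris") becomes a single node.
        nodes.update([company, city, activity])
        edges.append((company, "LOCATED_IN", city))
        edges.append((company, "HAS_ACTIVITY", activity))
    return nodes, edges


nodes, edges = rows_to_graph(SAMPLE_CSV)


def companies_in(city_name):
    """Follow LOCATED_IN edges back from a city node to its companies."""
    target = ("City", city_name)
    return sorted(
        src[1] for src, rel, dst in edges
        if rel == "LOCATED_IN" and dst == target
    )


print(companies_in("Paris"))
```

In the table, "Paris" is just a string repeated on two lines. In the graph it is a single node shared by two companies, so the relationship between them becomes something we can traverse directly.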
We do this by designing a network, or graph, model for our data: