Jump to content

Food Traceability Codes: Difference between revisions

no edit summary
No edit summary
No edit summary
Line 45: Line 45:
=== Bio Codes ===
=== Bio Codes ===
* https://github.com/openfoodfacts/bio-codes
* https://github.com/openfoodfacts/bio-codes
==Methodology to extract data==
===General===
* All data (url list and data files) can be found here https://github.com/openfoodfacts/eu-food-data
**There is a general csv file with the general link to the data repository for each country: https://github.com/openfoodfacts/eu-food-data/blob/master/list-eu-and-partner-countries.csv
**There are individual folders for each target country (specific url-list for each European Agreement Section and data files)
* A google sheet document is used to map all files available in the target countries. It also map the section name for every country in its own language (or translation in English) and the related European Section, which is used as a general taxonomy.
This Google Sheet can be found here: https://docs.google.com/spreadsheets/d/1egdo58Ds8PNi5G_4F2UtWOWC1V0k3tXBgPhZXs5FRqM/edit?usp=sharing
===France===
*I just build a script that takes all the french agreement info from Agriculture Ministry and concatenate them in one file. Next step is to do the same for UK. The step after that is to cleverly agregate the duplicates (some companies have several health agreements under the same agreement number)
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/FR-script.py
This script use this file to get the list of URL to retrieve https://github.com/openfoodfacts/eu-food-data/blob/master/fr/urls-fr.txt
*First work performed on
===UK===
*As UK is divided in 4 regions (Ireland, England, Wales and Scotland) and because they have different file format, we use a 3-file script
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/UK-urls.txt => all UK urls
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/UK-methods.txt => list which method to use depending on the file type
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/UK-script.py => the script itself
===DE===
*@vince has per


[[fr:Projet:Codes_propriétaires]]
[[fr:Projet:Codes_propriétaires]]