Food Traceability Codes
This project investigates packaging codes and other non-standard codes that are usually machine-printed on the packaging, such as batch numbers, quality numbers, etc., and aims to extract as much information from them as possible.
Codes
EMB Codes
EU Food Establishments
- http://en.wiki.openfoodfacts.org/Project:Food_establishments
- https://github.com/openfoodfacts/eu-food-data
- French list: http://agriculture.gouv.fr/liste-des-etablissements-agrees-ce
- List of EMB country codes: http://ec.europa.eu/food/food/biosafety/establishments/list_en.htm
EU Organic Codes
FSC Codes
MSC Codes
- https://github.com/openfoodfacts/msc-codes
- The MSC (sustainable fishing) label code can be looked up precisely, so it would be nice to add a link to the MSC website page that gives information about the fisherman.
For example, product 3256224531289 carries the MSC code "MML-C-1021". Putting this code in the MSC URL returns the name and origin of the fisherman: http://cert.msc.org/supplierdirectory/VController.aspx?CertNr=MML-C-1021
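The lookup described above amounts to appending the certificate code to a fixed URL. A minimal sketch, assuming the URL pattern from the example is still valid (the function name `msc_certificate_url` is hypothetical):

```python
from urllib.parse import urlencode

# Base URL copied from the example above; the MSC site may have changed since.
MSC_DIRECTORY_URL = "http://cert.msc.org/supplierdirectory/VController.aspx"


def msc_certificate_url(cert_number: str) -> str:
    """Return the MSC supplier directory URL for a certificate code."""
    # urlencode escapes any character that is not safe in a query string.
    query = urlencode({"CertNr": cert_number.strip()})
    return MSC_DIRECTORY_URL + "?" + query


print(msc_certificate_url("MML-C-1021"))
```

The same function could be used to generate a link for every product whose label carries an MSC code.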
PLU Codes
- http://www.plucodes.com/docs/PLU_List_Commodity_Varietal_Bilingual.pdf
- https://docs.google.com/spreadsheets/d/1YieZxWFHv9bAHO9zRW4I5_ghnId5B2oxVz72vMvTF-I/edit#gid=1530069957
- http://legufrulabelofolie.fr/index.php
- https://openfoodfacts.slack.com/files/teolemon/F02M8KZKV/minimalversion.xlsx
Soil Association
- www.soilassociation.org
- http://www.sacert.org/fooddrink/findalicensee
- They seem to provide a number such as DA20245
Spanish wines
I found the "Bodegas" directory on the "Consejo Regulador" site: http://es.riojawine.com/multimedia/files/directorio.xls There you can find the row for Pernod Ricard Winemakers Spain (Campo Viejo) with "Reg Embotellador" = "1.850-LO". Our number! It is in the city of Logroño (the capital of La Rioja), and the street is "La Rad de Santa Cruz, s/n".
Then, if you search the Spanish packaging codes site (for the RGSEAA number, which is used to build the ES RGSEAA EC number) at http://rgsa-web-aesan.msssi.es/rgsa/formulario_principal_js.jsp with that information (Razón Social = Pernod Ricard Winemakers Spain; Localidad = Logroño), you get a single number: 30.00641/LO (i.e. ES 30.00641/LO CE). The address also matches the data.
The winery cellar (bodega) is this one, just out of curiosity: http://www.pernodricardbodegas.com/bodegas/campo-viejo/la-bodega/?lang=en http://www.aesan.msssi.gob.es/aesa/web/AESA.jsp
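The step from the RGSEAA number to the EC-style code in the example above (30.00641/LO becomes ES 30.00641/LO CE) can be sketched as follows. The validation pattern is an assumption inferred from that single example, not an official specification, and the function name is hypothetical:

```python
import re

# Assumed shape, inferred only from "30.00641/LO": two digits, a dot,
# more digits, a slash, then a short letter suffix for the province.
RGSEAA_PATTERN = re.compile(r"^\d{2}\.\d+/[A-Z]{1,3}$")


def rgseaa_to_ec_code(rgseaa: str) -> str:
    """Wrap a Spanish RGSEAA number as 'ES <number> CE'."""
    number = rgseaa.strip().upper()
    if not RGSEAA_PATTERN.match(number):
        raise ValueError(f"unexpected RGSEAA number: {rgseaa!r}")
    return f"ES {number} CE"


print(rgseaa_to_ec_code("30.00641/LO"))
```

Note that the bottler registration number from the Rioja directory ("1.850-LO") has a different shape and is rejected by this check.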
Import / Export Codes of the Russian Federation
- http://www.fsvps.ru/fsvps/importExport?_language=en
- Example for a French company: http://www.fsvps.ru/fsvps/importExport/france/enterprise.html?id=18640&_language=en
Egg Codes
Bio Codes
Methodology to extract data
General
Get Data Files
- All data (url list and data files) can be found here https://github.com/openfoodfacts/eu-food-data
- There is a general csv file with the general link to the data repository for each country: https://github.com/openfoodfacts/eu-food-data/blob/master/list-eu-and-partner-countries.csv
- There are individual folders for each target country (specific url-list for each European Agreement Section and data files)
- A Google Sheet is used to map all files available in the target countries. It also maps the section name used in each country, in its own language (or translated into English), to the related European Section, which is used as a general taxonomy. This Google Sheet can be found here: https://docs.google.com/spreadsheets/d/1egdo58Ds8PNi5G_4F2UtWOWC1V0k3tXBgPhZXs5FRqM/edit?usp=sharing
- We can usually find txt or csv files, but for some countries the data is only available in PDF. Those files need OCR treatment before data extraction.
Build CSV files
- Several formats are used in EU countries, and each of them needs a specific approach. See below for details on each country.
Geocode
- Script geocoding + google maps
France
- I just built a script that takes all the French agreement info from the Agriculture Ministry and concatenates it into one file. The next step is to do the same for the UK. The step after that is to cleverly aggregate the duplicates (some companies have several health agreements under the same agreement number).
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/FR-script.py This script uses this file to get the list of URLs to retrieve: https://github.com/openfoodfacts/eu-food-data/blob/master/fr/urls-fr.txt
- First work performed on
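The concatenation and duplicate aggregation steps described for France can be illustrated with a small sketch. This is not the content of FR-script.py: the column names (`agreement_number`, `name`, `section`), the `;` delimiter and the sample rows are all made up for illustration.

```python
import csv
import io


def concatenate_csv(texts, delimiter=";"):
    """Merge several CSV texts sharing the same header into one list of rows."""
    rows = []
    for text in texts:
        rows.extend(csv.DictReader(io.StringIO(text), delimiter=delimiter))
    return rows


def aggregate_by_number(rows, key="agreement_number", merge_field="section"):
    """Keep one row per agreement number, joining its distinct sections."""
    merged = {}
    for row in rows:
        number = row[key]
        if number not in merged:
            merged[number] = dict(row)
            merged[number][merge_field] = [row[merge_field]]
        elif row[merge_field] not in merged[number][merge_field]:
            merged[number][merge_field].append(row[merge_field])
    for row in merged.values():
        row[merge_field] = "|".join(row[merge_field])
    return list(merged.values())


# Made-up sample data: the same company appears under two sections.
file_a = "agreement_number;name;section\n01.001.001;Example Dairy;Milk\n"
file_b = (
    "agreement_number;name;section\n"
    "01.001.001;Example Dairy;Meat\n"
    "02.002.002;Example Fishery;Fish\n"
)
for row in aggregate_by_number(concatenate_csv([file_a, file_b])):
    print(row)
```

Joining the sections into one field keeps a single row per establishment without losing the information that it holds several agreements.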
UK
- As the UK is divided into 4 regions (Northern Ireland, England, Wales and Scotland) that use different file formats, we use a 3-file script:
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/UK-urls.txt => all UK urls
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/UK-methods.txt => list which method to use depending on the file type
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/UK-script.py => the script itself
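The idea of choosing a parsing method from the file type can be sketched as a simple lookup table. This does not reproduce UK-methods.txt or UK-script.py: the parser functions, the extension mapping and the example URLs are hypothetical.

```python
import csv
import io
import os
from urllib.parse import urlparse


def parse_csv(text):
    """Parse comma-separated text into a list of rows."""
    return list(csv.reader(io.StringIO(text)))


def parse_tsv(text):
    """Parse tab-separated text into a list of rows."""
    return list(csv.reader(io.StringIO(text), delimiter="\t"))


# Hypothetical mapping from file extension to parsing method.
METHODS = {".csv": parse_csv, ".txt": parse_tsv}


def method_for(url):
    """Pick the parsing method from the extension of the URL path."""
    extension = os.path.splitext(urlparse(url).path)[1].lower()
    if extension not in METHODS:
        raise ValueError(f"no method registered for {extension!r}")
    return METHODS[extension]


parser = method_for("http://example.org/data/england.csv")
print(parser("number,name\n1,Example Ltd\n"))
```

Keeping the mapping separate from the parsers means a new file format only needs one new function and one new entry in the table.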
DE
- @vince has per