Food Traceability Codes
This project aims to investigate packaging codes and all non-standard codes, usually printed on the packaging using a machine, such as batch numbers, the quality numbers ... and get the most information possible.
Codes
EMB Codes
EU Food Establishments
- http://en.wiki.openfoodfacts.org/Project:Food_establishments
- https://github.com/openfoodfacts/eu-food-data
- French list: http://agriculture.gouv.fr/liste-des-etablissements-agrees-ce
- See list of country codes EMB: http://ec.europa.eu/food/food/biosafety/establishments/list_en.htm
EU Organic Codes
FSC Codes
MSC Codes
- https://github.com/openfoodfacts/msc-codes
- MSC (sustainable fishing) label can be easily accessed with precision, it would be nice to put a link to the website of the MSC with information from fisherman.
For example 3256224531289 Product code indicating the MSC "MML-C-1021." By putting this code in the URL of the MSC, it gets the name and origin of the fisherman http://cert.msc.org/supplierdirectory/VController.aspx?CertNr=MML-C-1021
PLU Codes
PLU or Price Look Up codes have been used by supermarkets since 1990 to make check-out and inventory control easier, faster and more accurate. PLU codes are used to identify bulk produce (and related items such as nuts and herbs). They tell the supermarket cashier whether an apple is a conventionally grown Fuji apple which may sell for $1.29 per pound versus an organically grown Fuji apple which may sell for $2.29 per pound. IFPS is the global organization that assigns PLU codes to produce items. PLU codes are 4 or 5 digit numbers and will appear on a small sticker applied to the individual piece of fresh produce. The PLU number identifies produce items based upon various attributes which can include the commodity, variety, growing methodology (e.g. organic), and the size. These numbers are assigned by the IFPS after rigorous review at both national and international levels. PLU codes ensure that the accurate price is paid by consumers by removing the need for cashiers to identify the product and whether or not it is conventionally or organically grown. The 4-digit PLU codes for produce are randomly assigned within a series of numbers within the 3000 and 4000 series. There is no intelligence built into the 4-digit code. For example, no one number within the 4-digit number represents anything in particular. The 4-digit codes are for conventionally grown produce. 5-digit codes are used to identify organic produce. The prefix of '9' would be placed in front of the 4-digit conventionally grown code for organic produce. You will not see the 5-digit codes in the PLU codes database since they are simply prefixes added on to the conventionally grown produce PLU codes. The PLU coding system is voluntary, not mandate by any governing body. There are currently over 1400 PLU codes issued for fresh produce and produce related items. Standardized Universal Product Codes (UPCs) for fresh fruits and vegetables are also available on a subscription basis through the Produce Marketing Association (PMA) website.
- PLU Search Database: http://www.ifpsglobal.com/Identification/PLU-Codes/PLU-codes-Search complete list is available if you just eat the search button without filling any field
- http://www.plucodes.com/docs/PLU_List_Commodity_Varietal_Bilingual.pdf
- https://docs.google.com/spreadsheets/d/1YieZxWFHv9bAHO9zRW4I5_ghnId5B2oxVz72vMvTF-I/edit#gid=1530069957
- http://legufrulabelofolie.fr/index.php
- https://openfoodfacts.slack.com/files/teolemon/F02M8KZKV/minimalversion.xlsx
Soil Association
- www.soilassociation.org
- http://www.sacert.org/fooddrink/findalicensee
- they seems to provide a number such as DA20245
Spanish wines
I found the "Bodegas" directory in the "Consejo Regulador" site: http://es.riojawine.com/multimedia/files/directorio.xls There you can find the row: Pernod Ricard Winemakers Spain (Campo Viejo) with "Reg Embotellador" = "1.850-LO" Our number!! This is in the city of LogroƱo (the capital of La Rioja) and the street is "La Rad de Santa Cruz, s/n".
Then if you look at the spanish packaging codes site (the RGSEAA number that is used to build the ES RGSEAA EC number) in the http://rgsa-web-aesan.msssi.es/rgsa/formulario_principal_js.jsp web, with that information (RazĆ³n Social=Pernod Ricard Winemakers Spain; Localidad=LogroƱo), you get the only number: 30.00641/LO (i.e. ES 30.00641/LO CE). The adress also match the data.
The winery celler (bodega) is this, just by curiosity :simple_smile: http://www.pernodricardbodegas.com/bodegas/campo-viejo/la-bodega/?lang=en http://www.aesan.msssi.gob.es/aesa/web/AESA.jsp
Import / Export Codes of the Russian Federation
- http://www.fsvps.ru/fsvps/importExport?_language=en
- Example for a French company: http://www.fsvps.ru/fsvps/importExport/france/enterprise.html?id=18640&_language=en
Egg Codes
Bio Codes
Methodology to extract data
General
Get Data Files
- All data (url list and data files) can be found here https://github.com/openfoodfacts/eu-food-data
- There is a general csv file with the general link to the data repository for each country: https://github.com/openfoodfacts/eu-food-data/blob/master/list-eu-and-partner-countries.csv
- There are individual folders for each target country (specific url-list for each European Agreement Section and data files)
- A google sheet document is used to map all files available in the target countries. It also map the section name for every country in its own language (or translation in English) and the related European Section, which is used as a general taxonomy. This Google Sheet can be found here: https://docs.google.com/spreadsheets/d/1egdo58Ds8PNi5G_4F2UtWOWC1V0k3tXBgPhZXs5FRqM/edit?usp=sharing
- We can usually find txt or csv file but for some countries the data is only available in PDF. Those need OCR treatment before data extraction.
Build CSV files
- Several formats are used in EU countries. A specific approach is needed for each of them. Refer below for details for each country.
Geocode
- Script geocoding + google maps
France
- I just build a script that takes all the french agreement info from Agriculture Ministry and concatenate them in one file. Next step is to do the same for UK. The step after that is to cleverly agregate the duplicates (some companies have several health agreements under the same agreement number)
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/FR-script.py This script use this file to get the list of URL to retrieve https://github.com/openfoodfacts/eu-food-data/blob/master/fr/urls-fr.txt
- First work performed on
UK
- As UK is divided in 4 regions (Ireland, England, Wales and Scotland) and because they have different file format, we use a 3-file script
https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/UK-urls.txt => all UK urls https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/UK-methods.txt => list which method to use depending on the file type https://github.com/openfoodfacts/eu-food-data/blob/master/scripts/UK-script.py => the script itself
DE
- @vince has per