Ingredients/Parsing: Difference between revisions
No edit summary |
|||
Line 7: | Line 7: | ||
*Van biologische oorsprong. | *Van biologische oorsprong. | ||
</pre> | </pre> | ||
== | == Comma separator == | ||
Ingredients are separated by a comma (,) generally followed by a single space | |||
== Dot == | |||
Indicates the end of the ingredients list. | |||
== Asterisk(s) at the start of an ingredient == | |||
Indicates an annotation for one or more ingredients that have the same number of asterisks at the end. For instance, some items above are indicated as of biological origin | |||
== Asterisk(s) at the end of an ingredient == | |||
Indicates that this ingredient has an annotation (see above) | |||
== Parenthesis == | == Parenthesis == | ||
Can indicate sub-components | Can indicate sub-components, that can also sometimes interpret to an E-Number. | ||
For instance: emulgator (sojalecithine) interprets to E322 | |||
== Percentage == | == Percentage == | ||
Indicates the quantity | Indicates the quantity | ||
== Order == | == Order == | ||
Items are required to be listed in order of largest to smallest quantity | Items are required to be listed in order of largest to smallest quantity | ||
== List of ingredients == | == List of ingredients == | ||
Line 24: | Line 30: | ||
* Wikidata (We could generate a list from Wikidata) | * Wikidata (We could generate a list from Wikidata) | ||
* List of ingredients for multilingual products: http://world.openfoodfacts.org/language/multilingual/ingredients | * List of ingredients for multilingual products: http://world.openfoodfacts.org/language/multilingual/ingredients | ||
* http://ec.europa.eu/food/safety/labelling_nutrition/labelling_legislation/index_en.htm | |||
* http://ec.europa.eu/dgs/health_food-safety/dgs_consultations/food/docs/consult_20150104_allergy-intolerance_guidance.pdf | |||
* http://www.fda.gov/Food/GuidanceRegulation/GuidanceDocumentsRegulatoryInformation/LabelingNutrition/ucm2006828.htm |
Revision as of 20:42, 24 September 2016
This page collects what we know about ingredient parsing.
rietsuiker*, plantaardige olie* (zonnebloem, palm), 13% _hazelnoot_*, 7.5% magere cacaopoeder*, magere _melk_poeder*, emulgator (_soja_lecithine), vanille*. *Van biologische oorsprong.
Comma separator
Ingredients are separated by a comma (,) generally followed by a single space
Dot
Indicates the end of the ingredients list.
Asterisk(s) at the start of an ingredient
Indicates an annotation for one or more ingredients that have the same number of asterisks at the end. For instance, some items above are indicated as of biological origin
Asterisk(s) at the end of an ingredient
Indicates that this ingredient has an annotation (see above)
Parenthesis
Can indicate sub-components, that can also sometimes interpret to an E-Number. For instance: emulgator (sojalecithine) interprets to E322
Percentage
Indicates the quantity
Order
Items are required to be listed in order of largest to smallest quantity
List of ingredients
- http://world.openfoodfacts.org/ingredients (will crash your browser)
- http://world.openfoodfacts.org/files/ingredients.20151117.txt (lighter text version of the above)
- https://files.slack.com/files-pri/T02KVRT1Q-F192GGZE3/download/top500ingredients.xls (normalised Excel version for top 500 of the above)
- Global ingredients taxonomy (Taxonomisation start)
- Wikidata (We could generate a list from Wikidata)
- List of ingredients for multilingual products: http://world.openfoodfacts.org/language/multilingual/ingredients
- http://ec.europa.eu/food/safety/labelling_nutrition/labelling_legislation/index_en.htm
- http://ec.europa.eu/dgs/health_food-safety/dgs_consultations/food/docs/consult_20150104_allergy-intolerance_guidance.pdf
- http://www.fda.gov/Food/GuidanceRegulation/GuidanceDocumentsRegulatoryInformation/LabelingNutrition/ucm2006828.htm