Holistic product view: Difference between revisions

From Open Food Facts wiki
 
(29 intermediate revisions by the same user not shown)
Line 1: Line 1:
One of the core features of FF is the ability to analyse, compare and select products based on a large set of possible characteristics of products.This only possible thanks to a large collection of taxonomies, which encode possible product characteristics. The size of these taxonomies is ever more growing and more difficult to maintain. This page describes some approaches to the clever usage of taxonomies, which uses the idea of combining facets, so that the user still can find and compare products. This approach might allow to reduce the number of entries in some taxonomies.  
One of the core features of OFF is the ability to analyse, compare and select products based on a large set of product characteristics.This is possible thanks to a large collection of taxonomies, which may be used to encode product characteristics.  


== The taxonomies ==
This page describes some approaches to the clever usage of taxonomies, which uses the idea of combining facets, so that the user can find and compare products.


=== Ingredients ===
== The taxonomies and facets ==
Various taxonomies and facets can be used to characterise a product:
 
*''category'' - describes the product in a generic sense. Think of aisles and shelves in the supermarket down to specific product categories;
* ''ingredients'' - for knowing what components a product is made of;
* ''labels'' - for knowing processes used in creating ingredients and/or product;
* ''origins'' (countries) - for the origins of ingredients and products (this is not yet a taxonomy);
* ''storage/preservation'' - instructions how to store unopened products and preserve opened products (not yet a taxonomy);
* ''packaging'' - to get information on how the product is packaged, which might allow filtering the best packaging  for the environment;
 
== Ingredients taxonomy ==
The ingredient taxonomy encodes the ingredients found on the product. This should be a one-on-one mapping between the ingredients list and the taxonomy. With this it is possible to filter out products containing a specific ingredient.
The ingredient taxonomy encodes the ingredients found on the product. This should be a one-on-one mapping between the ingredients list and the taxonomy. With this it is possible to filter out products containing a specific ingredient.
There are products where there is no ingredient list or a nicely parsable list with ingredients hidden in the text (even partially), then these can be gathered and entered as ingredients.


By exploiting the hierarchies defined in the ingredients taxonomy, it is possible to filter on a group of ingredients.
By exploiting the hierarchies defined in the ingredients taxonomy, it is possible to filter on a group of ingredients.


=== Labels ===
== Labels taxonomy ==
Labels are claims, logos, and other statements found on products. Sometimes these labels are statements which can be checked against the ingredients. For instance the label ''no added sugar'', should imply that the ingredient ''sugar'' is not on the ingredient list. The labels can refer to processes in the value chain, like ''fair-trade'' or ''organic''. These labels can be encoded on the ingredients list as well, but is a bit more obscure.
Labels are claims, logos, and other statements found on products. Sometimes these labels are statements which can be checked against the ingredients. For instance the label ''no added sugar'', should imply that the ingredient ''sugar'' is not on the ingredient list. The labels can refer to processes in the value chain, like ''fair-trade'' or ''organic''. These labels can be encoded on the ingredients list as well, but is a bit more obscure.


Line 17: Line 29:
The hierarchy is not (yet) much developed, but it is possible to get any ''organic'' or any ''fair-trade'' label,
The hierarchy is not (yet) much developed, but it is possible to get any ''organic'' or any ''fair-trade'' label,


=== Categories ===
== Category taxonomy ==
The categories taxonomy is more complicated as there is not a clear relationship with what is found on the product. Assigning the correct is a combination of interpreting marketing names, ingredient lists and labels found on the packaging. Often also knowledge of the current taxonomy and its hierarchy is required. And their is also a relationship with legislation, as naming a product is not always free.
The categories taxonomy is more complicated as there is not a clear relationship with what is found on the product. The name of the products is not always well matched to the ingredients. Assigning the correct is a combination of interpreting marketing names, ingredient lists and labels found on the packaging. Often also knowledge of the current taxonomy and its hierarchy is required. And their is also a relationship with legislation, as naming a product is not always free.
 
There are some basic principles behind the hierarchy found in the taxonomy;
 
* ''understandable'' - no abstract categories that a user does not understand or encounters in the supermarket aisles;
* ''ingredients driven'' - the ingredients determine to a large extent the category. This is most obvious for unprocessed single ingredient products;
* ''no doubling'' - there should be no doubling, i.e. the same concept appearing in multiple taxonomies. For instance why adding ''frozen pizzas'' as a new category, when ''frozen'' is already available as a label and ''pizzas'' as a category.
 
==== Convenience categories ====
There are some categories that can be called convenience categories, as they limit the work a user needs to do in order to classify products.These categories combine two characteristics into one, for example the category ''potatoes'' and the label ''frozen'' are converged into one category: ''frozen potatoes''.
 
==== Implied categories ====
It is possible to automatically apply a category to a product, based on two or more characteristics, without any intervention of the user. For instance if a ''herbal tea'' has the label ''organic'', it can be assigned the category ''organic herbal teas''. This in turn can be used user in any facet. It is possible to set up the rules for this in the taxonomies.
 
==== Oppositional categories ====
This a a pair of categories, where one is the opposite of the other. For instance the category ''sweetened beverages'' is opposite to the the ''unsweetened beverages''. Both can not be true at the same time.


== Examples ==
== Examples ==
Some examples of how this works in practice. Many more examples are needed.
Here are some examples of how this works in practice. This approach is used to better define categories and with that to be able to detect products that should not be part of a category. With this one can [[Monitoring added products|monitor]] the quality of a category.


The examples are a bit limited as only two facets can be specified at the same time.
The examples are a bit limited as only two facets can be specified at the same time (more in the coming months (after december 2023).


=== Apple juices/nectars ===
=== Apple juices/nectars ===
Line 47: Line 44:
* [https://world.openfoodfacts.org/category/apple-nectars/ingredient/sugar Nectars with sugar] - levering a single ingredient;
* [https://world.openfoodfacts.org/category/apple-nectars/ingredient/sugar Nectars with sugar] - levering a single ingredient;
* [https://world.openfoodfacts.org/category/apple-nectars/ingredient/sweetener Nectars with any sweetener] - levering multiple ingredients;
* [https://world.openfoodfacts.org/category/apple-nectars/ingredient/sweetener Nectars with any sweetener] - levering multiple ingredients;
* [https://world.openfoodfacts.org/category/apple-juices/ingredient/e300 Apple juices with citric acid (E300)] - which is one way to see whether there are additives
* Apple juices without additives - how can I do this?


===== Labels combination =====
==== Labels combination ====
By combining a category with a label, we filter even more.
By combining a category with a label, we filter even more.


* Organic apple juices
* [https://world.openfoodfacts.org/category/apple-juices/label/organic Organic apple juices]
* Unfiltered apple juices
* [https://world.openfoodfacts.org/category/apple-juices/label/unfiltered Unfiltered apple juices] and its opposite [https://world.openfoodfacts.org/category/apple-juices/label/filtered Filtered apple juices];
* Apple juices with additives
* [https://world.openfoodfacts.org/category/apple-juices/label/no%20additives Apple juices without additives] - this is based on the labels, but will miss out products that did not have a label.
[[Category:Global_Taxonomies]]

Latest revision as of 15:21, 16 December 2023

One of the core features of OFF is the ability to analyse, compare and select products based on a large set of product characteristics.This is possible thanks to a large collection of taxonomies, which may be used to encode product characteristics.

This page describes some approaches to the clever usage of taxonomies, which uses the idea of combining facets, so that the user can find and compare products.

The taxonomies and facets

Various taxonomies and facets can be used to characterise a product:

  • category - describes the product in a generic sense. Think of aisles and shelves in the supermarket down to specific product categories;
  • ingredients - for knowing what components a product is made of;
  • labels - for knowing processes used in creating ingredients and/or product;
  • origins (countries) - for the origins of ingredients and products (this is not yet a taxonomy);
  • storage/preservation - instructions how to store unopened products and preserve opened products (not yet a taxonomy);
  • packaging - to get information on how the product is packaged, which might allow filtering the best packaging for the environment;

Ingredients taxonomy

The ingredient taxonomy encodes the ingredients found on the product. This should be a one-on-one mapping between the ingredients list and the taxonomy. With this it is possible to filter out products containing a specific ingredient.

There are products where there is no ingredient list or a nicely parsable list with ingredients hidden in the text (even partially), then these can be gathered and entered as ingredients.

By exploiting the hierarchies defined in the ingredients taxonomy, it is possible to filter on a group of ingredients.

Labels taxonomy

Labels are claims, logos, and other statements found on products. Sometimes these labels are statements which can be checked against the ingredients. For instance the label no added sugar, should imply that the ingredient sugar is not on the ingredient list. The labels can refer to processes in the value chain, like fair-trade or organic. These labels can be encoded on the ingredients list as well, but is a bit more obscure.

The label can also encode other claimed characteristics, which refer to processes, for example filtered, artisanal, etc.

It is also possible to add storage instructions as labels (lacking better solutions), such as frozen or refrigerated.

The hierarchy is not (yet) much developed, but it is possible to get any organic or any fair-trade label,

Category taxonomy

The categories taxonomy is more complicated as there is not a clear relationship with what is found on the product. The name of the products is not always well matched to the ingredients. Assigning the correct is a combination of interpreting marketing names, ingredient lists and labels found on the packaging. Often also knowledge of the current taxonomy and its hierarchy is required. And their is also a relationship with legislation, as naming a product is not always free.

Examples

Here are some examples of how this works in practice. This approach is used to better define categories and with that to be able to detect products that should not be part of a category. With this one can monitor the quality of a category.

The examples are a bit limited as only two facets can be specified at the same time (more in the coming months (after december 2023).

Apple juices/nectars

Ingredients combination

By combining the category facet with an ingredients facet, it is possible to find:

Labels combination

By combining a category with a label, we filter even more.