Apple juices category - en: Difference between revisions

From Open Food Facts wiki
 
(20 intermediate revisions by the same user not shown)
Line 126: Line 126:
* vitamin C: < 60mg/100ml (need to check the effect of enriching)
* vitamin C: < 60mg/100ml (need to check the effect of enriching)
* potassium: < 150mg/100ml
* potassium: < 150mg/100ml
== Ingredients ==
A product is to a large extent defined by the ingredients that have been used to create. These ingredients define to a large extent the category.
=== Missing ingredients ===
The listed ingredients do not always correspond to the product:
* the listing of ''water'' implies concentrated apple juice and in turn ''apple juice from concentrate;''
* It is impossible to detect the occurrence of ''pure apple juice'' AND ''apple juice from concentrate'' at the same time;
* cider seems to imply the ingredient alcohol. In the use it is also a type of apple;
=== Inventory ===
Most common ingredients tags are:
* pure apple juice
* apple juice from concentrate
* water, need to rehydrate the apple concentrate
* ascorbic acid for vitamin C
* potassium sorbate as a source of potassium and preservative
* malic acid as a preservative
* citric acid as a preservative
* calcium lactate
* flavouring
* carbon dioxide
=== Recipes ===
Often one or more ingredients are used to create a product. This is a recipe. In order to analyse these recipes, the products with ingredients were selected (1627 products) ([https://mirabelle.openfoodfacts.org/products?sql=select+rowid%2C+code%2C+url%2C+%0D%0Aingredients_text%2C+ingredients_tags%2C+ingredients_analysis_tags%0D%0Afrom+%5Ball%5D%0D%0Awhere+categories_tags+like+%22%25apple-juices%25%22+and+ingredients_tags+%21%3D+%22%22%0D%0A Mirabelle download]). A set analysis (all juices with ingredients) using [https://upset.multinet.app/?workspace=OFF&table=AppleJuices UpSet], gives the following main recipes:
* pure apple juices (only) (37%);
* apple juices from concentrate (only) (14%)
* pure apple juices  + vitamin C (5%)
* apple juices from concentrate + vitamin C (5%)
* apple juices with flavouring (3%)
* apple juices with preservatives (2%)
=== Conclusion ===
401 products did not contain normalised ingredients, either the data is incorrect or the values are missing in the ingredients taxonomy. A lot of cleanup is still required.
The wrong main language introduces quite some errors as the ingredients can not be parsed;


== Characteristics ==
== Characteristics ==
Apple juice products often have labels or other claims on the packaging.
Apple juice products often have labels or other claims on the packaging.
=== Labels ===
=== Labels ===
Several labels can currently been found ([https://world.openfoodfacts.org/category/apple-juices/labels/ the list]). Some are useful, but other unnecessary. A quick look at some labels:
* ''organic'' (and variants) has 638 products;
* ''vegan'' (and thus vegetarian) has 109 products. The filtering process might not be vegan, as [https://en.wikipedia.org/wiki/Isinglass isenglass] can be used;
* ''no added sugar'' (326 products). The EU legislation does not allow any added sugar, so it is not really necessary. But this label might be useful for other countries.
* ''no preservatives'' (245 products) - preservatives are allowed by EU legislation (think antioxidants like malic acid, citric acid or ascorbic acid), so it might fe found in the ingredients list. This label could be added if these ingredients are not on the package.
* ''no colorings'' (151 products) - are there products that do have colorings?
* ''EU Agriculture'' (and variants) (103 products) - should also be in the origin field.
* ''pure juice'' (70 products) - this is not a label
* ''no additives'' (51 products) - This seems the same as pure juice. This can be added based on the ingredient list?
* ''Demeter'' (34 products) - this is not a label, but a brand?
* ''low or no sugar'' (27 products) - these seem be wrongly labelled and should read ''no added sugar''.
* ''no gluten'' (25 products) - seems unnecessary.
* ''no artificial flavours'' (12 products) - might be useful, there might be products that have flavouring
* ''no GMOs'' (11 products) - are the apples that are GMO? Or is this just marketing?
* ''kosher'' (10 products) - assume that a  production process can be non-kosher.
* ''fair trade'' (7 products) - guess that farmers are better paid.
* ''pasteurized product'' (7 products) - useful label (why so few?).
* ''no alcohol'' (6 products) - so it is not a cider, sounds like a default label
* ''100% natural'' (5 products) - does this mean anything?
* ''rich in vitamin C'' (5 products) - when can a juice be called rich? What is the value of this claim?
=== Missing labels ===
There are a few labels that have not been used, but might be useful
* ''artisanal fabrication'' - this is seen on many juices.
* ''filtering'' - some products are not filtered and are thus cloudy. This is not stated on the ingredients, but might be found in the product name (trouble).
* ''pasteurisation'' - add ''pasteurized'' or ''unpasteurized''
* ''extraction'' - ''cold pressed''
=== Observations ===
=== Observations ===
* when should something be added as a label, an adjective of an ingredient, or even a dedicated category?
* when should something be added as a label, an adjective of an ingredient, or even a dedicated category?
* there are implied labels, like ''no lactose'', ''no gluten''. How can this be expressed? It can be part of the category definition and noted in the taxonomy.
== Quality Monitoring ==
== Quality Monitoring ==
With the definition of this category in terms of ingredients, nutritional values, etc, it is possible to set up a quality envelope. The idea is that this envelope contains 90% (or something like that) of the products of this category.  
With the definition of this category in terms of ingredients, nutritional values, etc, it is possible to set up a quality envelope. The idea is that this envelope contains 90% (or something like that) of the products of this category.  
Line 143: Line 211:
* related categories - what categories are related?
* related categories - what categories are related?
* determination - helping the user to classify and characterise a likely apple juice product.
* determination - helping the user to classify and characterise a likely apple juice product.
A first attempt: [[Category/Apple juices]]
[[Category:Global_Taxonomies]]

Latest revision as of 14:25, 8 March 2023

This page describes an in-depth look at the apple juices category on OFF. The purpose was to increase the quality of this category in terms of data and contained products.

What is apple juice?

This category is part of more generic categories as fruit juices and is defined as a beverage. There are no formal definitions for these categories yet, so we have to come up with some guidelines:

  • drinkable, probably without any additional preparation
  • fruit based, so not the fruit itself
  • apple based, and probably lots of apple

It is necessary to loop better at the products that are now in the category to get an idea what people did put in. And then we might come up with a better definition. We also like to know what is not included in this category.

There is also EU legislation that defines what a (apple) juice may be (link, link).

Current situation

At the moment of writing (27 feb 2023) there are 3132 products in the category, of which 2800 have nutrition facts. When scrolling through the products most are called apple juice (jus de pomme).

Imposters

There are some products however that seem not to belong to this category:

Subcategories

Several subcategories were already defined:

Ingredients

Most apple juices contain only juice, i.e. apples. Some juices have added conservants (malic acid), vitamins (C) or flavours(?). In most cases the ingredients are missing and do not specify what kind of apples have been used.

Other observations

  • Should we distinguish between filtered and unfiltered?
  • Should artisanal products be a subcategory?
  • How are Apple Juices different from Smoothies, Nectars, Puree and Apple Syrups (such as Diksap).
  • The percentage of apple juice is not always stated, so difficult to see whether it is a flavoured water, nectar of juice.

Category assignment cleaning

This first step is moving the imposters to a more appropriate category:

It might be necessary to rename and create some subcategories:

  • Apple juices (enhanced) for products that have added vitamins and minerals? Enriched apple juices is better;
  • Concentrated apple juices rename to Reconstituted apple juices? Or Apple Juices from concentrate is clearer to users
  • Squeezed apple juices rename to Pure apple juices?
  • Refrigerated apple juices to be used for products that are sold in the refrigerated aisles and must be kept refrigerated at home.
  • Flavoured apple juices for juices with spices like ginger, curcuma, etc. The percentage of apple juice must be high (99%), otherwise these are nectars
  • Single variety apple juices for which the apple variety has been specified, mono-varietal.
  • Should we distinguish between apple juices without additives and with additives?
  • Some products have much less than 100% apple juice. These will be parked in the Apple nectars;

Implied categories

In addition there are some implied categories that are now lacking. These are categories that are a consequence of the definition of apple juices (but not the parent categories). These can be a part of the taxonomy. Thinking of:

  • Non-alcoholic beverages for Apple Juices and subcategories
  • Carbonated drinks for Sparkling apple juices (rename drinks to beverages? Sparkling or Carbonated?
  • Should all drinks be renamed to beverages?

Nutritional data cleaning

If the category cleanup worked well, it should be possible to define good average nutritional values for products in this category. But before we can do that it is necessary to clean up the data and remove all obvious errors.

Unfortunately many products only indicate an upper limit, like < 1 g. This is not very informative. The producer probably did not bother to do a real measurement. The same goes for the value of zero, which is also very frequent. Ideally I filter out the zero entries and the less than entries. (I do not know how to de the latter).

The results below are before category cleanup, but after nutritional values cleanup. The median values are used as this excludes any strange values.

Carbohydrates / Sugars

Carbohydrates versus sugars plot for apple juices showing a linear relationship (27-feb-2023)
Carbohydrates versus sugars plot for apple juices showing a linear relationship (27-feb-2023)

There is a linear relationship between the sugars and carbohydrates. For many producers this is even one-one relationship. The medians 10.4g/100ml and 11g/100ml seem to indicate that the sugers are 6% smaller than the carbohydrates. The spread around this values is only 1.5g/100ml. The outliers (above 20g/100ml) seem to be due to american products with values per serving and apple juice concentrates. But also dus to producers that mixup 100ml and serving size.

Energy Carbohydrates

The relation between carbohydrates and energy for the apple juices category (27-feb-2023)
The relation between carbohydrates and energy for the apple juices category (27-feb-2023)

The sugars relate directly to the energy, with 192kJ/100ml and 11g/100ml. The outliers at the top are due to the concentrated apple juices. At the bottom there seem to be diluted apple juices and at the top also errors made by the producers.

Fat level

Fat distribution for apple juices
Fat distribution for apple juices

Apple juices contain hardly any fat (and thus saturated fat). The median fat level is 100 mg/100ml. The spread is very influenced by maximum levels.

Proteins level

Distribution of proteins for apple juices
Distribution of proteins for apple juices

Apple juices contain hardly any proteins, median value is 100 mg/100 ml. The spread is very influenced by maximum levels.

Salt level

Salt distribution for apple juices
Salt distribution for apple juices

Apple juices hardly contain any salt. The median value is 8 mg per 100 ml. As can be seen in the graph, there are a lot of products with a salt level over 200 mg/100 ml.

Vitamin C

Vitamin C distribution for the apple juices category (28-feb-2023)
Vitamin C distribution for the apple juices category (28-feb-2023)

Median 25mg/100ml; 80% percentile range: 0.5-33.9mg/100ml

Potassium

Potassium histogram for apple juices (28-feb-2023)
Potassium histogram for apple juices (28-feb-2023)

Potassium has a median of 105mg/100ml, 51-125mg/100ml for 80%. Wonder where the low values come from.

Observations

  • The nutritional data from the USDA import (?) is a mess. I had to correct many. Ideally the import should be redone without translation to international units, but with a very good filled serving field (more structure)

Category Definition

Using the ingredients found for Apple Juice products, it is possible to create a definition for products belonging to this category:

  • required ingredients:
    • apple juice at least 99% ?? There seems to be a EU legal requirement
  • optional ingredients:
    • water for apple juices from concentrate, instead of apple juice it might say apple juice from concentrate
    • carbon dioxide for sparkling apple juice
    • ascorbic acid for enriched apple juices
    • vanilla, ginger, curcuma, guarana for flavoured apple juices (not a complete list)

The nutritional values given an additional definition:

  • energy: 176kJ >< 212kJ
  • fat: < 0.5g/100ml
  • saturated fat: < 0.5g/100ml
  • carbohydrate: 10g/100ml >< 12.5g/100ml
  • sugars: 9.69/100ml >< 11.7g/100ml
  • proteins: < 0.5g/100ml
  • salt: < 0.3g/100ml
  • vitamin C: < 60mg/100ml (need to check the effect of enriching)
  • potassium: < 150mg/100ml

Ingredients

A product is to a large extent defined by the ingredients that have been used to create. These ingredients define to a large extent the category.

Missing ingredients

The listed ingredients do not always correspond to the product:

  • the listing of water implies concentrated apple juice and in turn apple juice from concentrate;
  • It is impossible to detect the occurrence of pure apple juice AND apple juice from concentrate at the same time;
  • cider seems to imply the ingredient alcohol. In the use it is also a type of apple;

Inventory

Most common ingredients tags are:

  • pure apple juice
  • apple juice from concentrate
  • water, need to rehydrate the apple concentrate
  • ascorbic acid for vitamin C
  • potassium sorbate as a source of potassium and preservative
  • malic acid as a preservative
  • citric acid as a preservative
  • calcium lactate
  • flavouring
  • carbon dioxide

Recipes

Often one or more ingredients are used to create a product. This is a recipe. In order to analyse these recipes, the products with ingredients were selected (1627 products) (Mirabelle download). A set analysis (all juices with ingredients) using UpSet, gives the following main recipes:

  • pure apple juices (only) (37%);
  • apple juices from concentrate (only) (14%)
  • pure apple juices + vitamin C (5%)
  • apple juices from concentrate + vitamin C (5%)
  • apple juices with flavouring (3%)
  • apple juices with preservatives (2%)

Conclusion

401 products did not contain normalised ingredients, either the data is incorrect or the values are missing in the ingredients taxonomy. A lot of cleanup is still required.

The wrong main language introduces quite some errors as the ingredients can not be parsed;

Characteristics

Apple juice products often have labels or other claims on the packaging.

Labels

Several labels can currently been found (the list). Some are useful, but other unnecessary. A quick look at some labels:

  • organic (and variants) has 638 products;
  • vegan (and thus vegetarian) has 109 products. The filtering process might not be vegan, as isenglass can be used;
  • no added sugar (326 products). The EU legislation does not allow any added sugar, so it is not really necessary. But this label might be useful for other countries.
  • no preservatives (245 products) - preservatives are allowed by EU legislation (think antioxidants like malic acid, citric acid or ascorbic acid), so it might fe found in the ingredients list. This label could be added if these ingredients are not on the package.
  • no colorings (151 products) - are there products that do have colorings?
  • EU Agriculture (and variants) (103 products) - should also be in the origin field.
  • pure juice (70 products) - this is not a label
  • no additives (51 products) - This seems the same as pure juice. This can be added based on the ingredient list?
  • Demeter (34 products) - this is not a label, but a brand?
  • low or no sugar (27 products) - these seem be wrongly labelled and should read no added sugar.
  • no gluten (25 products) - seems unnecessary.
  • no artificial flavours (12 products) - might be useful, there might be products that have flavouring
  • no GMOs (11 products) - are the apples that are GMO? Or is this just marketing?
  • kosher (10 products) - assume that a production process can be non-kosher.
  • fair trade (7 products) - guess that farmers are better paid.
  • pasteurized product (7 products) - useful label (why so few?).
  • no alcohol (6 products) - so it is not a cider, sounds like a default label
  • 100% natural (5 products) - does this mean anything?
  • rich in vitamin C (5 products) - when can a juice be called rich? What is the value of this claim?

Missing labels

There are a few labels that have not been used, but might be useful

  • artisanal fabrication - this is seen on many juices.
  • filtering - some products are not filtered and are thus cloudy. This is not stated on the ingredients, but might be found in the product name (trouble).
  • pasteurisation - add pasteurized or unpasteurized
  • extraction - cold pressed

Observations

  • when should something be added as a label, an adjective of an ingredient, or even a dedicated category?
  • there are implied labels, like no lactose, no gluten. How can this be expressed? It can be part of the category definition and noted in the taxonomy.

Quality Monitoring

With the definition of this category in terms of ingredients, nutritional values, etc, it is possible to set up a quality envelope. The idea is that this envelope contains 90% (or something like that) of the products of this category.

This envelop can be used to monitor new additions to this category. And anything that falls outside the envelope can be flagged for inspection.

OFFwiki

Using this analysis (and other information) it might be possible to create a kind of (structured) wikipage for this category. This might comprise:

  • definition - what is an apple juice (process, ingredients, origin)?
  • characteristics - what defines an apple juice (ingredients, nutritional values, labels)
  • related categories - what categories are related?
  • determination - helping the user to classify and characterise a likely apple juice product.

A first attempt: Category/Apple juices