Brands: Difference between revisions
(parent brands) |
|||
Line 70: | Line 70: | ||
== Use cases == | == Use cases == | ||
The brands taxonomy has multiple applications within OFF. Theres are: | The brands taxonomy has multiple applications within OFF. Theres are: | ||
*1 Display the brands of a product, in the language requested by the user | *1 Display the brands of a product, in the language requested by the user; | ||
*2 Have a way to list all products of a brand | *2 Have a way to list all products of a brand; | ||
*3 Let users enter brands for a product, as they appear on the package | *3 Let users enter brands for a product, as they appear on the package; | ||
*4 Suggests a brand to the user based on the manufacturer part of the barcode and other information; | *4 Let the user select the correct brand category(?), if a different brands use the same name; | ||
* | *5 Suggests a brand to the user based on the manufacturer part of the barcode and other information; | ||
* | *6 Infer category and labels from brand - some brands are only used for specific products. This implies that the product category and possible labels can be implies (suggested); | ||
* | *7 Infer brand from manufacturer code within the barcode; | ||
*8 Barcode/Brand quality check - if the manufacturer part of the barcode does not match the specified brand, there is an error in either of them; | |||
== Design considerations == | == Design considerations == | ||
The observations and use cases lead to several design considerations: | The observations and use cases lead to several design considerations: |
Revision as of 09:29, 26 April 2022
This is the brands of the product. The main brand, generally clearly displayed on the front pack, should be entered first. A product can have other brands:
- when a product is a brand sold by a big company:
Actimel
is sold byDanone
, see https://world.openfoodfacts.org/product/4009700036810/actimel-granatapfel - when a product is sold with its brand translated in two languages:
Nature Valley
is sometimes writtenVal Nature
; see https://world.openfoodfacts.org/product/0065633280267/barre-granola-erable-et-cassonade-nature-valleyNo Name
is also writtenSans nom
; represent 100+ products for No Name and 170+ products for Sans Nom.
When a product has more than one brand, the first brand in the field is taken as the main brand.
There's no taxonomy for brands for the moment, so just do your best and don't waste too much time to enter brands.
Data issues
Some brands are difficult to read:
- is it
Coop
,coop
,COOP
,CO OP
?
Some brands can be related to different companies in different countries:
Some brands are related to a common name or an ingredient, which can be confusing:
- Racines (means "roots" in French)
- La Truffe (means "the truffle" in French)
- Pure Protein
- Best Choice is a brand, not a tagline
- Great Value also
- The Belgian, often entered
Belgian
; it produces many false positive from our AI.
Brands containing an apostrophe are often missing this one:
Brands are changing sometimes. How to deal with that?
Some particular cases for a brand
- A brand can contain only numbers, such as
1664
,1883
or365
. - A brand can contain quotes, such as
The Muffin "Mam" Inc
- A brand can contain &, such as
M&M's
- A brand can contain commas, such as
Williams, West & Witt's Products
; https://world.openfoodfacts.org/brand/williams-west-witt-s-prods - A brand can have a sub-brand containing its own name:
Sainsbury's
use a brand calledBy Sainsbury's
.
Some particular cases for a product
- A product can have more than 2 brands; eg:
- Coop, in Switzerland, can add up to three brands on a product:
Betty Bossi
,Karma
andCoop
in this product; they justify it: "We offer a wide range of own-label brands and brand worlds." (source) - In this product,
Lipton
belongs toUnilever
but this product is distributed byPepsico
- Coop, in Switzerland, can add up to three brands on a product:
Implementation in Open Food Facts
In the database, this field is called brands
.
Preventing OFF AI to detect some false brands: see brand_taxonomy_blacklist.txt.
Help to collect brands
The IA of Open Food Facts, called Robotoff, is trying to identify brands. The annotations made by Robotoff are provided to users, asking them to answer to a simple question. There are also used by Hunger Game. Everyone can use Hunger Game, but be careful to the issues mentioned in this current page.
- on 2020-10-29, there were 43200 annotations and 22748 resting
- on 2020-11-17, there were 46911 annotations and 22720 resting
Observations summary
Summarising the observations note above, we see the following brands:
- 1 universal brand, exact same name used in all countries and languages. e.g. "Nutella"
- 2 brand that is translated in different languages "The Laughing Cow", "La vache qui rit"
- 3 brands that have the same name, but used in different languages
- 4 brands that have the same name, but used in different countries
- 5 brands that have the same name, and used in the same country. (e.g. "Ferrero" in Italy: there's also a pasta brand).
- 6 brands in non-latin scripts, which can not be latinised
- 7 parent brands are sometimes shown on packaging
Use cases
The brands taxonomy has multiple applications within OFF. Theres are:
- 1 Display the brands of a product, in the language requested by the user;
- 2 Have a way to list all products of a brand;
- 3 Let users enter brands for a product, as they appear on the package;
- 4 Let the user select the correct brand category(?), if a different brands use the same name;
- 5 Suggests a brand to the user based on the manufacturer part of the barcode and other information;
- 6 Infer category and labels from brand - some brands are only used for specific products. This implies that the product category and possible labels can be implies (suggested);
- 7 Infer brand from manufacturer code within the barcode;
- 8 Barcode/Brand quality check - if the manufacturer part of the barcode does not match the specified brand, there is an error in either of them;
Design considerations
The observations and use cases lead to several design considerations:
- Unique brand key - as the same brand (string) can exist in multiple geographic areas or within the same geographic area, there must be a way to uniquely distinguish between the various brands. Otherwise a user can not enter the correct brand (UC1), nor can we list all brands (UC2);
Taxonomy encoding
Taking the observations, use cases and design considerations into account, it is possible to specify how this can be encoded in a taxonomy. The same approach as all the other taxonomies will be used for thus.
An overview of the encoding used in the brands taxonomy:
- blocks/white lines - a single brand is defined by a block of text and are separated by white lines;
- definition - a single line defines the brand in a specific language, an attribute, etc. Each line starts with a name followed by parameters and separated by a colon
- parameters:
- parent (<zz:) - a reference to another (parent) brand. Thus any attributes shared between parent and child only have to be specified in the parent;
- key (zz:) - an unique is required, so that we can distinguish between brands with the same name;
- default (xx:) - a default value to be used for any language that is not specifically listed;
- language (e.g. ru:) - the brand-name in the language ru. A brand might be defined in multiple languages as needed, with a single line for each language. For instance the ru: might be used to specify a brand in Cyrillic. Sometimes also country specific brands are required.
- attributes - each brand can have one or more attributes
- barcodeprefix:en: - the first 8(?) numbers of the barcode that belong to the brand. With this brands can be automatically assigned to a barcode. Also the barcodes of existing products can be checked;
- wikipedia:en: - the full link to a page, which explains the brand(!!!). Note that brands are not well available on wikipedia;
- wikidata:en: - the identifaction of the wikidata entry for the brand(!!). Note that brands are not well available on wikidata;
- brand_owner_opencorporates:xx: - an identification of the probable brand owner/distributor on opencorporates. It is not always clear what should be written down here.
- idea: local customer service address (one for each country where the product is distributed, and a generic one)
- idea: eu trademark - maybe only useful for name trademarks? (not logo's). But not relevant to the consumer.
- idea: stores - the stores where the brand is sold. Maybe better to create a store taxonomy and link from there to brands?
Example
zz:laespanola
xx:La Española
barcodeprefix:en:8410226
barcodeprefix:en:8410660
category:en en:Olive tree products
brand_owner_opencorporates:ES:80245129
#94 products @2022-04-23
zz:latrappe
xx:La Trappe
barcodeprefix:en:8711406
category:en en:belgian-beers
label:en: en:authentic-trappist-product
website:nl:https://nl.latrappetrappist.com/nl/nl.html
brand_owner_opencorporates:BE:0838355558
wikidata:en:Q590921
#11 products @2022-04-23
Questions / Issues
- company structure - do we want to list (and research) all the relationships between owners, marketing companies, etc. I would suggest that we do NOT and limit ourselves to the brands and brand owner (and maybe the production company);
- overkill - it is very tempting to add al kinds details for owners, etc. The actual owners of the product, brand, etc, do not seem useful to the consumer. We just should provide links to third parties for this kind information. I.e. wikipedia, wikidata,, opencorporates, ipo europe, ...
- parent brand - when should the parent brand be added? Only if the parent brand is available on the front of the packaging, or also when it is shown on the back of the packaging, or when we can find out the legal final parent owner of a brand? This choice might have an implication for how we structure the data.
- EAN manufacturer codes - is there an open database which we could use?