== Normalisation ==
As shown in the previous section, the Olive Oils category can be split into 3 (or more) subgroups, based on how the nutritional values are reported on the package:
* per 100g
* per 100ml
If we know how the nutritional data for a product is reported, we can normalise that data. The normalised data gives us a consistent dataset, which can be used to get the real nutritional values.
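As a minimal sketch of what such a normalisation could look like, the Python snippet below converts values reported per 100ml into values per 100g. The function name, the basis labels <code>"100g"</code> and <code>"100ml"</code>, and the density of 0.92 g/ml are assumptions made for illustration (olive oil is roughly 0.91 to 0.92 g/ml at room temperature). They are not taken from the data itself.

```python
# Assumed density of olive oil in g/ml (an approximation, not a measured value).
OLIVE_OIL_DENSITY_G_PER_ML = 0.92


def normalise_to_100g(values, basis, density=OLIVE_OIL_DENSITY_G_PER_ML):
    """Return the nutritional values expressed per 100g.

    values: dict mapping a nutrient name to the reported amount
    basis:  "100g" or "100ml" (hypothetical labels for the subgroups)
    """
    if basis == "100g":
        # Already on the target basis, return a copy unchanged.
        return dict(values)
    if basis == "100ml":
        # 100 ml weighs (100 * density) grams, so divide by the density
        # to scale the amounts up to 100 g.
        return {name: amount / density for name, amount in values.items()}
    raise ValueError(f"unknown basis: {basis!r}")


if __name__ == "__main__":
    print(normalise_to_100g({"fat": 92.0, "saturated-fat": 13.0}, "100ml"))
```

Products in any other subgroup raise an error here, because a conversion for them would need information (such as a serving size) that this sketch does not assume.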
=== Categorisation ===
The first step is to categorise each product into one of the groups.
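A rough sketch of this categorisation step in Python is shown here. It assumes each product carries a free-text note describing how its values are reported. The field name <code>reported_per</code>, the product records, and the group labels are hypothetical. Anything that is neither per 100g nor per 100ml is placed in an <code>"unknown"</code> group for manual review.

```python
import re
from collections import defaultdict


def categorise(reported_per):
    """Map a free-text reporting note to a group label."""
    # Lower-case and drop spaces so "per 100 ML" and "100ml" look alike.
    text = reported_per.lower().replace(" ", "")
    # The lookbehind avoids matching a longer number such as "1100ml".
    if re.search(r"(?<!\d)100ml", text):
        return "100ml"
    if re.search(r"(?<!\d)100g", text):
        return "100g"
    return "unknown"


def group_products(products):
    """Group product names by the basis their values are reported on."""
    groups = defaultdict(list)
    for product in products:
        label = categorise(product.get("reported_per", ""))
        groups[label].append(product["name"])
    return dict(groups)


if __name__ == "__main__":
    # Hypothetical example records, not real products.
    sample = [
        {"name": "Oil A", "reported_per": "per 100 g"},
        {"name": "Oil B", "reported_per": "Per 100 ml"},
        {"name": "Oil C", "reported_per": "per serving"},
    ]
    print(group_products(sample))
```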