
Data quality: Difference between revisions

Refactoring
(Big refactoring (to be continued))
(Refactoring)
Line 1: Line 1:
Some important things to know:
Data quality at Open Food Facts is guided by 3 main principles:
* Quality does not make sense in itself: quality depends on usage.
* Quality does not make sense in itself: quality depends on usage.
* No database can claim to have zero defects.
* We always try to favor ease of use and ease of collection. Data quality control, data verification, and data fixes should never be an obstacle to data gathering.
* With more than 2 700 000 products, there are quality concerns: our goal is to reduce the impact of these issues.
* No database can claim to have zero defects. With more than 2 700 000 products, there are quality concerns. Data quality is handled on a "best effort" basis. This effort includes measuring quality, setting objectives, publishing them, and implementing them.
These principles and our high-level objectives are described in this short document (~5 pages), which you should read first if you want to go further:


== Data quality principles at Open Food Facts ==
https://link.openfoodfacts.org/data-quality  
[summary + link]


== How do we manage data quality? ==
== Measures ==
We have started an initiative to continuously measure and publish data quality stats. We have created a page dedicated to [[data quality stats]].


== How to help? ==
== How to help? ==
Line 36: Line 37:


=== Data quality measurement ===
=== Data quality measurement ===
We have started an initiative to continuously measure and publish data quality stats. We have created a page dedicated to [[data quality stats]].
See [[data quality stats]].


=== Quality facets ===
=== Quality facets ===