Jump to content

Reusing Open Food Facts Data: Difference between revisions

m
Uncompressed size
(Add jsonl link and documentation)
m (Uncompressed size)
Line 20: Line 20:


==== The jsonl daily export ====
==== The jsonl daily export ====
While still undocumented, there is a daily export of the whole database in jsonl format. It represents the same data as the MongoDB export. It's very big! More than 17GB uncompressed.
While still undocumented, there is a daily export of the whole database in jsonl format. It represents the same data as the MongoDB export. It's very big! More than 14GB uncompressed.


You can find it at https://static.openfoodfacts.org/data/openfoodfacts-products.jsonl.gz
You can find it at https://static.openfoodfacts.org/data/openfoodfacts-products.jsonl.gz
Line 81: Line 81:


==== jq ====
==== jq ====
* start decompress the file (be carreful => 17GB after decompression):
* start decompress the file (be carreful => 14GB after decompression):
  $ gunzip openfoodfacts-products.jsonl.gz
  $ gunzip openfoodfacts-products.jsonl.gz
* work on a small subset to test. E.g. for 100 products:
* work on a small subset to test. E.g. for 100 products: