Jump to content

Student projects/GSOC/Proposals: Difference between revisions

no edit summary
No edit summary
No edit summary
Line 36: Line 36:
=== Offline mode ===
=== Offline mode ===


* A small version of the database needs to be included in the app (at install, and then synched regularly)
* A small version of the database needs to be included in the app (at install, and then synced regularly)
** All products, but only key data
** All products, but only key data
* When scanning products, key data should be shown instantly, even if there's no network
* When scanning products, key data should be shown instantly, even if there's no network
Line 102: Line 102:
== Data science ==
== Data science ==


Why it's important: our product database is growing rapidly (10k new products every Month in early 2019), we need automated ways to extract and validate data
Why it's important: our product database is growing rapidly (730k products, with 10k new products every month in early 2019), we need automated ways to extract and validate data


Background: to date, we have done very little in this area
Background: We have started in the past year to ramp up effort, and we have processed 1,5 million images with OCR and general entity, barcode and QR-code recognition. The result is 1,5 million matching JSON files with bounding boxes.


=== Automatically classify products ===
=== Automatically classify products ===
Line 135: Line 135:
* We currently have performance issues with MongoDB requests that could be optimized
* We currently have performance issues with MongoDB requests that could be optimized
* There are many other areas for improvements for the backend, see https://github.com/openfoodfacts/openfoodfacts-server
* There are many other areas for improvements for the backend, see https://github.com/openfoodfacts/openfoodfacts-server


== 2018- 2017 Project ideas ==
== 2018- 2017 Project ideas ==