| [https://world.openfoodfacts.org Open Food Facts] was a 2022 Google Summer of Code candidate, and was selected as one of the mentor organizations for the [https://summerofcode.withgoogle.com/organizations/5282542639382528/ 2018 Google Summer of Code].
|
| |
|
| The next step is for students and us to work together so that students can build the strongest and most impactful proposals to submit for the Summer of Code program.
| | '''IMPORTANT:''' for GSOC 2024: see [[GSOC/2024 ideas list]] |
| Β | | [[Category:Project]] |
| This page lists the key areas where we need the most help. You are of course welcome to propose other project ideas, and we are looking forward to discussing these ideas and yours.
| |
| Β | |
| = Building strong proposals =
| |
| Β | |
| Project ideas will need to be turned into strong project proposals. Here are some guidelines on [http://write.flossmanuals.net/gsocstudentguide/writing-a-proposal/ how to write strong proposals for the Google Summer of Code].
| |
| Β | |
| To make your proposals more relevant, please take some time to familiarize yourself with the Open Food Facts project, and how the database is crowdsourced:
| |
| * Explore our web site https://world.openfoodfacts.org, starting with the [https://world.openfoodfacts.org/discover Discover] and [https://world.openfoodfacts.org Contribute] pages.
| |
| * Install our Android or iOS mobile app, scan some food products, and add photos for a few products from your country
| |
| * Create an account on the web site, look up the products that you added, and edit the product pages to fill in the data for ingredients, nutrition facts etc.
| |
| * Join us on Slack, [https://slack-ssl-openfoodfacts.herokuapp.com/ request an instant invite]
| |
| Β | |
| To discuss ideas, please join us on our Slack:
| |
| * #summerofcode channel: https://openfoodfacts.slack.com/messages/summerofcode
| |
| Β | |
| = Google Summer of Code 2018 Project ideas =
| |
| Β | |
| == New Native Android and iOS apps to drive mass adoption and mass contribution ==
| |
| Β | |
| Why it's important: most of the data in the Open Food Facts database come from crowdsourcing through mobile apps: users scan barcodes of products and send us photos and data for missing products. We need Android and iOS apps that bring a lot of value to users so that we gain mass adoption, and that have powerful features to contribute photos and data as easily and quickly as possible.
| |
| Β | |
| Background: We currently have a basic app made with Cordova on the Google Play Store and the Apple App Store. Work has started on native apps, but they are missing key features.
| |
| Β | |
| Key features needed:
| |
| Β | |
| === Augmented reality and continuous scan ===
| |
| Β | |
| * Users need to be able to use the viewfinder of their camera to continuously scan for barcodes of products
| |
| * When a barcode is recognized, an overlay displays key information (e.g. A to E nutrition grades), with a link to the full product page
| |
| * While the overlay is on, it needs to be possible to recognize new barcodes that come into view
| |
| * Stretch goal: recognize products without scanning a barcode, using technologies like Pastec
| |
| Β | |
| === Offline mode ===
| |
| Β | |
| * A small version of the database needs to be included in the app (at install, and then synched regularly)
| |
| ** All products, but only key data
| |
| * When scanning products, key data should be shown instantly, even if there's no network
| |
| * The history of scanned products and the full data for these products should be saved locally on the device
| |
| * Offline contribution
| |
| ** While offline (e.g. in a store with no network), users need to be able to scan and take photos for lots of products
| |
| ** Photos should be sent when network becomes available
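The offline contribution flow above can be sketched as a simple upload queue. This is only an illustration in Python, not the apps' actual code: the names <code>PendingPhoto</code> and <code>UploadQueue</code> and the <code>send</code> callback are hypothetical, and a real app would persist the queue on the device rather than keep it in memory.

```python
from collections import deque
from dataclasses import dataclass
from typing import Callable, Deque


@dataclass
class PendingPhoto:
    """A photo taken while offline (hypothetical structure)."""
    barcode: str
    path: str  # local file path of the photo on the device


class UploadQueue:
    """Keeps photos taken offline and sends them once the network is back."""

    def __init__(self, send: Callable[[PendingPhoto], bool]) -> None:
        # `send` is an assumed callback that uploads one photo
        # and returns True on success.
        self._send = send
        self._pending: Deque[PendingPhoto] = deque()

    def add(self, photo: PendingPhoto) -> None:
        self._pending.append(photo)

    def flush(self, online: bool) -> int:
        """Send queued photos in order; stop at the first failure."""
        sent = 0
        while online and self._pending:
            if not self._send(self._pending[0]):
                break  # keep the photo queued and retry later
            self._pending.popleft()
            sent += 1
        return sent

    def __len__(self) -> int:
        return len(self._pending)
```

The app would call <code>flush</code> whenever connectivity changes. A photo is removed from the queue only after a successful upload, so nothing is lost if the network drops again.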
| |
| Β | |
| === Drip editing ===
| |
| Β | |
| * Every little helps. Drip editing means asking Open Food Facts users small questions about the product they are looking at. Each question should take only a split second to answer. Put together, the answers help complete products more quickly, update existing products and ensure quality. This project is about introducing drip editing, in collaboration with the backend team, in either the Android or the iOS version.
| |
| Β | |
| === Personalization and recommendations ===
| |
| Β | |
| * Users should be able to provide data about themselves (age, sex, weight etc.), their diet restrictions (e.g. allergens, vegan, religious) and their preferences (organic, no GMOs, no palm oil etc.)
| |
| * This data needs to be stored locally on device, and not sent to Open Food Facts and 3rd parties
| |
| * Grade scanned products based on this data
| |
| * Display product recommendations / alternatives that better match the user preferences
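As a rough illustration of grading and recommending products against locally stored preferences, here is a minimal Python sketch. The <code>Preferences</code> and <code>Product</code> structures and the three result levels ("avoid", "partial", "good") are assumptions made for this example and are not part of any existing app.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Preferences:
    """User preferences, kept on the device only (hypothetical structure)."""
    avoid_allergens: Set[str] = field(default_factory=set)
    require_labels: Set[str] = field(default_factory=set)


@dataclass
class Product:
    name: str
    allergens: Set[str]
    labels: Set[str]


def match(product: Product, prefs: Preferences) -> str:
    """Return 'avoid', 'partial' or 'good' for one product."""
    if product.allergens & prefs.avoid_allergens:
        return "avoid"  # contains at least one allergen the user avoids
    if prefs.require_labels - product.labels:
        return "partial"  # safe, but at least one wanted label is missing
    return "good"


def recommend(products: List[Product], prefs: Preferences) -> List[Product]:
    """Keep only the products that fully match the preferences."""
    return [p for p in products if match(p, prefs) == "good"]
```

Because both functions only read data passed to them, the whole evaluation can run on the device without sending the preferences anywhere.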
| |
| Β | |
| == Computer vision ==
| |
| Β | |
| Why it's important: all product data comes from photos of the product and labels. Today most of this data is entered manually. In order to be able to scale, we need to extract more data from photos automatically.
| |
| Β | |
| Background: We currently only do basic OCR for ingredients. There is a lot of room for improvement.
| |
| Β | |
| === Improve OCR for ingredients ===
| |
| Β | |
| * Create golden test sets to measure accuracy of the current OCR and improvements
| |
| * Train OCR models targeted for ingredients
| |
| * Automatic cropping of ingredients lists
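One possible way to measure OCR accuracy against a golden test set is the character error rate: the edit distance between the OCR output and the hand-checked reference text, divided by the length of the reference. The Python sketch below assumes the golden set is a list of (reference, OCR output) pairs. That format and the function names are illustrative, not an existing tool.

```python
from typing import List, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def character_error_rate(golden_set: List[Tuple[str, str]]) -> float:
    """Total edits divided by total reference characters."""
    total_edits = sum(edit_distance(ref, hyp) for ref, hyp in golden_set)
    total_chars = sum(len(ref) for ref, _ in golden_set)
    return total_edits / total_chars if total_chars else 0.0
```

Running the same golden set through the current OCR and through a candidate model gives two comparable numbers, where lower is better.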
| |
| Β | |
| === OCR for Nutrition Facts tables ===
| |
| Β | |
| * Automatic recognition and cropping of nutrition facts table
| |
| * OCR for the nutrition facts table
| |
| Β | |
| === Brands and labels detection ===
| |
| Β | |
| * Automatically recognize brands and labels
| |
| Β | |
| Β | |
| == Data science ==
| |
| Β | |
| Why it's important: our product database is growing rapidly (10k new products every month in early 2018), so we need automated ways to extract and validate data.
| |
| Β | |
| Background: to date, we have done very little in this area
| |
| Β | |
| === Automatically classify products ===
| |
| Β | |
| * Predict field values from other field values or from a bag of words built from the OCR text
| |
| ** Categories
| |
| ** Brands (in some cases, a strong feature can be the barcode prefix)
| |
| ** Labels
| |
| * When certain, detected values can be applied immediately
| |
| * When less certain, we can ask users to confirm suggestions
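The split between applying a value immediately and asking users to confirm it can be sketched as a confidence threshold on each prediction. In the Python example below, the threshold values (0.95 and 0.6) and the shape of the <code>predictions</code> dictionary are assumptions chosen for illustration. Suitable values would have to be tuned on real data.

```python
from typing import Dict, Tuple


def route_predictions(
    predictions: Dict[str, Tuple[str, float]],
    apply_threshold: float = 0.95,   # assumed value, to be tuned
    suggest_threshold: float = 0.6,  # assumed value, to be tuned
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Split predictions into values to apply and values to confirm.

    `predictions` maps a field name to (predicted value, confidence).
    Predictions below `suggest_threshold` are dropped.
    """
    applied: Dict[str, str] = {}
    to_confirm: Dict[str, str] = {}
    for field_name, (value, confidence) in predictions.items():
        if confidence >= apply_threshold:
            applied[field_name] = value
        elif confidence >= suggest_threshold:
            to_confirm[field_name] = value
    return applied, to_confirm
```

The values in <code>to_confirm</code> are natural candidates for short yes/no questions shown to users.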
| |
| Β | |
| === Automatically detect errors ===
| |
| Β | |
| * Bad nutrition facts
| |
| ** e.g. by looking at outliers for products of the same category
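As one possible way to look for outliers within a category, the Python sketch below flags values that lie far from the category median, using the median absolute deviation (a robust alternative to the standard deviation). The 3.5 cut-off is a commonly used rule of thumb and the function name is hypothetical. It is only an illustration of the idea.

```python
from statistics import median
from typing import Dict, List


def find_outliers(values: Dict[str, float], threshold: float = 3.5) -> List[str]:
    """Return the product codes whose value is far from the category median.

    `values` maps a product code to one nutrient value (e.g. sugars per
    100 g) for products of the same category.
    """
    if not values:
        return []
    med = median(values.values())
    # Median absolute deviation: typical distance from the median.
    mad = median(abs(v - med) for v in values.values())
    if mad == 0:
        return []  # values are too uniform to score this way
    return [
        code
        for code, v in values.items()
        if 0.6745 * abs(v - med) / mad > threshold  # modified z-score
    ]
```

A value such as 106 g of sugars per 100 g among products that cluster around 11 g would be flagged, which is typical of a misplaced decimal point.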
| |
| Β | |
| Β | |
| == Other projects ==
| |
| Β | |
| === Taxonomy Editor ===
| |
| Β | |
| * We define and use multilingual taxonomies for categories, labels, ingredients and other fields.
| |
| * Those taxonomies are directed acyclic graphs (hierarchies where a child can have multiple parents).
| |
| * They are currently defined in text files hosted on our wiki: https://en.wiki.openfoodfacts.org/Global_taxonomies. These files are becoming unmanageable (the biggest taxonomy, for categories, is 37k lines long).
| |
| * We need a tool that makes it easy to edit the taxonomy and translate it.
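To illustrate what a hierarchy where a child can have multiple parents means for an editing tool, here is a minimal Python sketch that stores a taxonomy as a mapping from each entry to its parents and collects all ancestors of an entry. The entry names are made up for the example and the mapping is not the actual taxonomy file format.

```python
from typing import Dict, List, Set


def ancestors(parents: Dict[str, List[str]], node: str) -> Set[str]:
    """Return every ancestor of `node` in a directed acyclic graph.

    `parents` maps an entry to its direct parents. An entry may have
    several parents, so each ancestor is visited only once.
    """
    seen: Set[str] = set()
    stack = list(parents.get(node, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, []))
    return seen


# Hypothetical entries, for illustration only.
example = {
    "chocolate biscuits": ["biscuits", "chocolate products"],
    "biscuits": ["snacks"],
    "chocolate products": ["snacks"],
}
```

Here <code>ancestors(example, "chocolate biscuits")</code> reaches "snacks" through both parents but returns it only once, which is the kind of check an editor needs before moving or deleting an entry.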
| |
| Β | |
| == Your ideas ==
| |
| Β | |
| Please feel free to submit proposals for other ideas. If you have one, please talk to us about it as early as possible, so that we can give you early feedback.
| |
| Β | |
| Thank you!
| |