Artificial Intelligence/Robotoff/Roadmap

From Open Food Facts wiki
Revision as of 19:31, 1 February 2021 by Teolemon (talk | contribs)

2021 Roadmap draft

“More themes for detections, more efficient detections, with more techniques, more broadly automatically applied, and more broadly distributed and validated by contributors”

Pre-requisites

Developper experience

  • Document installation better including simulating a Product Opener sending and receiving to and from Robotoff
  • Make it easy to assess impact of Regex on existing OCRs (or the need for it could be replaced by the self service regex system)

Solve current performance issues

More detections

There are a lot of things to detect on a packaging. While we can add many individual detections (and that's very fine, there are some good and impactful examples in this project), we should aim to build general purpose systems that will scale across languages and kinds of detection (eg a system able to detect any kind of recurring text pattern)

Put an initial version of nutrition table extraction in production

Integrate spellcheck into Product Opener

  • Integrate the spellcheck into Product Opener (already available in the Robotoff API)

Categories and ingredients

Other

Support quality

(Probably doable in Q2, after the Wild School prototype completes)

  • Leverage the historization databases to:

Support the Eco-Score

Extract the various variable of the Eco-Score:

Distribution

Increase insight validation

Increase insight validation, either by

  • increasing distribution (though apps, 3rd party apps, games…)
  • increasing ability to validate them automatically
    • More aggressive application of many insights
      • EMLyon project
        • apply top level categories for the Nutri-Score
        • Secondary goal: OCR-based category prediction

Hunger Games

  • Bring the logos and labels games to the next level
  • Bind the nutrition game to the nutrition prediction
  • Add a nudge to login to Open Food Facts
  • Add a mission page (could be a link to a wiki page)

Speculative projects

  • Being able to auto-extract recurring patterns of text, and create rules to populate fields from them (or create insights) (Le Wagon project)

Next-gen Robotoff and high impact tooling

  • Secondary insights
  • Composite insights - "IF & IF THEN" insights
  • Add a self-service Elastic Search interface to let power users create new Regex rules (search within all the OCRs for a keyword or pattern, to help find the potential volumetry and refine the REGEX, and then add a Rule-building interface that saves the detection in a new BDD table that will then be used by Robotoff)