OCR/Roadmap: Difference between revisions

From Open Food Facts wiki
(Created page with "Currently, all products are edited manually. This project is about automatic or semi-automatic detection of a number of things using OCR and Computer vision. Tools: * Google ...")
 
No edit summary
Line 6: Line 6:
* OpenCV
* OpenCV
* Moodstocks
* Moodstocks
Targets:
* Logos (standardized)
* Text
* Standardized layouts (US Nutrition labels)
* Standardized text (quantities, EU Packaging codes)
* Barcodes (extraction in uploaded images)
* Image orientation: check that the text is properly oriented to guess if the image is properly oriented.
Extracting areas is already great work: if we can extract logos or patterns, it will be faster for humans to double check and turn that into text.

Revision as of 17:16, 1 November 2013

Currently, all products are edited manually. This project is about automatic or semi-automatic detection of a number of things using OCR and Computer vision.

Tools:

  • Google Drive OCR or Google Goggles
  • Ocropus
  • OpenCV
  • Moodstocks

Targets:

  • Logos (standardized)
  • Text
  • Standardized layouts (US Nutrition labels)
  • Standardized text (quantities, EU Packaging codes)
  • Barcodes (extraction in uploaded images)
  • Image orientation: check that the text is properly oriented to guess if the image is properly oriented.

Extracting areas is already great work: if we can extract logos or patterns, it will be faster for humans to double check and turn that into text.