OCR/Roadmap

Currently, all products are edited manually. This project is about automatic or semi-automatic detection of a number of things using OCR and Computer vision.

Tools:

Google Drive OCR or Google Goggles
Ocropus
OpenCV
Moodstocks

Targets:

Logos (standardized)
Text
Standardized layouts (US Nutrition labels)
Standardized text (quantities, EU Packaging codes)
Barcodes (extraction in uploaded images)
Image orientation: check that the text is properly oriented to guess if the image is properly oriented.

Extracting areas is already great work: if we can extract logos or patterns, it will be faster for humans to double check and turn that into text.