OCR/Roadmap: Difference between revisions

Revision as of 17:16, 1 November 2013

Currently, all products are edited manually. This project is about automatic or semi-automatic detection of a number of things using OCR and Computer vision.

Tools:

Google Drive OCR or Google Goggles
Ocropus
OpenCV
Moodstocks

Targets:

Logos (standardized)
Text
Standardized layouts (US Nutrition labels)
Standardized text (quantities, EU Packaging codes)
Barcodes (extraction in uploaded images)
Image orientation: check that the text is properly oriented to guess if the image is properly oriented.

Extracting areas is already great work: if we can extract logos or patterns, it will be faster for humans to double check and turn that into text.

@@ Line 6: / Line 6: @@
 * OpenCV
 * Moodstocks
+Targets:
+* Logos (standardized)
+* Text
+* Standardized layouts (US Nutrition labels)
+* Standardized text (quantities, EU Packaging codes)
+* Barcodes (extraction in uploaded images)
+* Image orientation: check that the text is properly oriented to guess if the image is properly oriented.
+Extracting areas is already great work: if we can extract logos or patterns, it will be faster for humans to double check and turn that into text.