From Open Food Facts wiki
Current state
- On-demand OCR extraction of ingredients using Tesseract 2 (production) and Tesseract 3 (.net) and Google Cloud Vision
- Uses the French dictionary for all languages
-- /home/off-fr/cgi# grep get_ocr *
Ingredients.pm:use Image::OCR::Tesseract 'get_ocr';
Ingredients.pm: $text = decode utf8=>get_ocr($image,undef,'fra');
- Has a small custom dictionary for French ( /usr/share/tesseract-ocr/tessdata/fra.user-words)
Roadmap
OCR/Roadmap
Exploiting OCR results
OCR/Results