OCR

From Open Food Facts wiki
Revision as of 16:51, 30 January 2016 by Teolemon (talk | contribs)
Slack channel

Current state

  • On-demand OCR extraction of ingredients using Tesseract 2 (production) and Tesseract 3 (.net)
  • Uses the French dictionary ('fra') for all languages; the language argument is hard-coded in Ingredients.pm:
    /home/off-fr/cgi# grep get_ocr *
    Ingredients.pm:use Image::OCR::Tesseract 'get_ocr';
    Ingredients.pm: $text =  decode utf8=>get_ocr($image,undef,'fra');
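
Since the third argument of get_ocr is fixed to 'fra', one possible improvement is to pick the Tesseract language from the product's language instead. The following is only a minimal sketch of that idea, not the code that runs in production: the function name tesseract_lang is hypothetical, the list of languages is an illustrative subset, and it assumes the product language is available as a two-letter code and that the matching Tesseract language data is installed on the server.

```shell
#!/bin/sh
# Hypothetical helper (not part of the Open Food Facts code base):
# map a two-letter product language code to the three-letter code
# that Tesseract uses to name its language data.
# Unknown codes fall back to 'fra', which matches the current behaviour.
tesseract_lang() {
    case "$1" in
        fr) echo fra ;;
        en) echo eng ;;
        de) echo deu ;;
        es) echo spa ;;
        it) echo ita ;;
        nl) echo nld ;;
        pt) echo por ;;
        *)  echo fra ;;   # assumption: keep French as the default
    esac
}

# Example: print the Tesseract code for a German product
tesseract_lang de
```

The value returned could then take the place of the literal 'fra' in the get_ocr call, or be passed to the tesseract command line through its -l option. Falling back to 'fra' keeps the existing behaviour for any language that has no mapping yet.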

Roadmap

OCR/Roadmap

Exploiting OCR results

OCR/Results