OCR: Difference between revisions
No edit summary |
|||
Line 1: | Line 1: | ||
{{Box | |||
| 1 = Slack channel | |||
| 2 = [https://openfoodfacts.slack.com/messages/ocr/ #ocr] | |||
}} | |||
== Current state == | == Current state == | ||
* On-demand OCR extraction of ingredients using Tesseract 2 (production) and Tesseract 3 (.net) | * On-demand OCR extraction of ingredients using Tesseract 2 (production) and Tesseract 3 (.net) |
Revision as of 12:43, 24 January 2016
|
---|
Current state
- On-demand OCR extraction of ingredients using Tesseract 2 (production) and Tesseract 3 (.net)
- Uses the French dictionary for all languages
-- /home/off-fr/cgi# grep get_ocr * Ingredients.pm:use Image::OCR::Tesseract 'get_ocr'; Ingredients.pm: $text = decode utf8=>get_ocr($image,undef,'fra');
- Has a small custom dictionary for French ( /usr/share/tesseract-ocr/tessdata/fra.user-words)