General Feature Overview
How to use the CLI application? - Samples
abbyyocr -if sample.jpg -f HTML -hkl -of sample.html -f RTF -rmp -of sample.rtf
The sample.jpg file will be recognised
The results will be exported to
The original lines in the recognised text will be retained during export to
HTML format (-hkl).
The source page layout will not be retained when exporting recognised text to RTF format (-rmp).
abbyyocr -ii -fm -if sample.jpg -tet UTF8 -of sample.txt
The sample.jpg file will be recognised in fast mode (-fm).
The colours of the prepared image will be inverted during conversion to the internal format (-ii).
The results will be exported to an Unicode UTF8 type text file (-tet UTF8).
Features + Functionality
ABBYY FineReader Engine CLI for Linux offers easy and instant access to ABBYY’s high quality OCR technology on the Linux platform. Processing can be easily controlled and automated via terminal/command line calls.
The following image and document formats can be opened and processed:
PDF
BMP
PCX
DCX
JPEG
JPEG2000
TIFF
PNG
b) Processing and Recognition Features:
The image processing and recognition are controlled through a set of parameters:
Image processing
Skew correction, image format, compression settings, image resolution, cleaning images, colour inversion, splitting of dual pages
Recognition Keys
Fast/balanced mode, format recognition (e.g. Italic), recognition languages that should be used, Recognition of mixed font types, such as normal text, typewriter, dot-matrix, OCR-A, OCR-B and MICR (E13b)
Barcode Keys
17 most popular
1D barcodes,
2D: PDF417, Aztec, DataMatrix, QRCode
positioned at any angle on a document
c) Export Options:
FineReader Engine CLI for Linux offers sophisticated output options and formats:
Synthesis Keys
Settings how the recognition result export should be exported, e.g. fonts, paragraphs, text colour, hyperlinks…
The recognition results can be exported to these formats:
-
text only
text on image
image on text
image only
protected PDFs
Further details can be found in the documentation.
OCR Languages
ABBYY FineReader Engine for Linux recognises over 190 OCR languages.
Read more...
Barcode Types
1D: Check Code 39, Check Interleaved 25, Code 128, Code 39, EAN 13, EAN 8, Interleaved 25, CODABAR (without checksum), UCC Code 128, Code 2 of 5 (Industrial, IATA, Matrix), Code 93, UPC-A, UPC-E and Postnet.
2D: PDF 417, Aztec, DataMatrix, QRCode
Licence Add ons