OCR & Processing Speed

KB-Type:
Tips & How to Information
Version:
9.0, 11
Category:
Recognition, Processing

General

Processing speed matters when high volumes of scanned paper documents have to be converted using Optical Character Recognition (OCR). Some notes on this topic:

  • Optical Character Recognition:
    • It is a multi-step process, and each processing step can be very CPU-intensive (e.g. image pre-processing or layout analysis).
    • A high number of images and PDFs requires fast hard disk throughput.
    • Processing speed also depends on the complexity of the input document, for example the document type, structure, image quality, languages, etc.
  • OCR quality and processing speed are linked. The general rule is:
    • As a rule of thumb, higher recognition quality requires more processing time.
    • Low-quality documents need more CPU time and are processed more slowly than high-quality document images.

How to increase the processing speed?

  • CLI Tool offers a fast processing mode that is specifically designed to optimise the balance between processing speed and accuracy for images of good quality.
  • If the image quality is not known in advance, it is recommended to use the balanced mode, in which the technology makes the decision internally.
  • Using multiple CPU cores should be considered to reduce processing time and effectively increase throughput.
  • The good news: OCR scalability is built into CLI Tool (see the key for multiprocessing mode).
  • The type of OCR processing also influences the processing time and throughput.
  • Pure text extraction without document layout retention is faster than exporting to a format where the layout has to be reconstructed.
  • Image-resolution changes and MRC compression of the generated PDFs will influence the processing speed.
    • Different PDF export parameters are available: Balanced, MaxSpeed, MaxQuality, MinSize (see the PDF export keys).
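
The multi-core point above can be illustrated with a small Python sketch that runs one OCR process per image and keeps several of them running at once. This is only an illustration of the idea, not the built-in multiprocessing mode of CLI Tool: the command name "ocr-tool" and its arguments are placeholders assumed for this example, and the helper names build_command, run_command and process_batch are hypothetical.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Callable, Dict, List, Sequence


def build_command(image: Path, out_dir: Path) -> List[str]:
    """Build the command line for one image.

    ASSUMPTION: "ocr-tool" and its two positional arguments are placeholders,
    not the real CLI Tool syntax. Replace them with the documented keys.
    """
    return ["ocr-tool", str(image), str(out_dir / (image.stem + ".txt"))]


def run_command(cmd: List[str]) -> int:
    """Run one OCR process and return its exit code."""
    return subprocess.run(cmd, check=False).returncode


def process_batch(
    images: Sequence[Path],
    out_dir: Path,
    workers: int = 4,
    runner: Callable[[List[str]], int] = run_command,
) -> Dict[Path, int]:
    """Process images with up to `workers` OCR processes running at once.

    Threads are sufficient here because each job is a separate OS process,
    so the operating system can schedule the jobs on different cores.
    """
    commands = [build_command(image, out_dir) for image in images]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the input commands.
        exit_codes = list(pool.map(runner, commands))
    return dict(zip(images, exit_codes))
```

A call such as process_batch(sorted(Path("scans").glob("*.tif")), Path("out"), workers=4) would return a mapping from each image to its exit code, so failed pages can be retried. Setting workers close to the number of available cores is a reasonable starting point. Beyond that, disk throughput tends to become the limiting factor, as noted in the General section.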
