OptimiDoc OCR processing options



  • Image processing
    • Remove punch holes - OCR engine will detect the hole leaving black dot over places where the punch hole is and will remove it from the final document.
    • Remove black borders - if yes, OCR engine automatically recognizes black borders of the document and removes them.
    • Auto-detection of page orientation - the system automatically checks the orientation of each page and corrects it if required.
    • Automated image de-skewing - automated image de-skewing is an essential document imaging function which is applied to scanned documents requiring compensation for image skew. It does not require leading edge borders or lines.
    • Image despeckling - when scanning poor to medium quality documents, you may get very noisy images with lots of dot speckles on them. These speckles, when they appear close to letters or numbers, may affect the quality of OCR. This feature removes such noise.
    • Separation - separation by the zone is missing, the rest of options stays the same as in case of ABBYY OCR.
    • Remove blank pages
      • Include all pages - all pages will be included in document
      • Use device - blank pages will be removed by device (supported only by selected Xerox devices)
      • Use OCR - OCR engine will be used for blank page removal
    • OCR recognition mode - type of recognition processing.
      • Accuracy - accurate mode for achieving the highest quality of recognition.
      • Speed - designed for high-volume document processing.
    • OCR language - language of document. It is recommended to select just language of the scanned document for better result of recognition.



Processing options available with ABBYY extension, define the document processing and distribution.



  • Image processing
    • Auto-detection of page orientation - the system automatically checks the orientation of each page and corrects it if required.
    • Splitting facing and dual pages - recognition and layout analysis are then performed separately for each page.
    • Automated image de-skewing - automated image de-skewing is an essential document imaging function which is applied to scanned documents requiring compensation for image skew. It does not require leading edge borders or lines.
    • Image despeckling - when scanning poor to medium quality documents, you may get very noisy images with lots of dot speckles on them. These speckles, when they appear close to letters or numbers, may affect the quality of OCR. This feature removes such noise.
    • Texture filtering - Texture filtering technology helps to filter out background “noise” such as color and texture, accuracy for difficult to read documents such as newsprint, color documents, faxes, etc.
  • Remove blank pages
    • Include all pages - all pages will be included in document
    • Use device - blank pages will be removed by device (supported only by selected Xerox devices)
    • Use OCR - OCR engine will be used for blank page removal
  • Separation
    • One Document - scanned documents will be processed as one document.
    • Barcode - scanned documents will be separated to multiple documents by barcodes. Barcode represents first page of the new document.
    • Barcode - defines the barcode type
    • Remove page with barcode - OptimiDoc removes the page with barcode
    • Regular expression - definition of regular expression
    • BlankPage - scanned documents will be separated by a blank page. Blank page is automatically removed.
    • One Document with Barcode - scanned documents will processed as one document with barcode recognition.
      • - Remove page with barcode - OptimiDoc removes the page with the barcode
    • Number of pages - document will be separated based on a predefined number of pages.
    • Zone - selected zone will be used for document separation.
  • OCR recognition mode - type of recognition processing.
    • Accuracy - accurate mode for achieving the highest quality of recognition.
    • Speed - designed for high-volume document processing.
  • OCR language - language of document. It is recommended to select just language of the scanned document for better result of recognition.