Pdf2Data 5.0.1 API
iText.Pdf2Data.Ocr.Engine.Tesseract4BasedEngine Class Reference

Engine which uses iText.Pdfocr.Tesseract4.Tesseract4LibOcrEngine as its base OCR engine.

Inheritance diagram for iText.Pdf2Data.Ocr.Engine.Tesseract4BasedEngine:
iText.Pdf2Data.OcrWithPostProcessingEngine

Classes

class   Builder
  Builder for Tesseract4BasedEngine.
 

Static Public Member Functions

static Tesseract4BasedEngine.Builder  CreateBuilder (IList< String > languages, FileInfo tessDataPath)
  Creates a new Builder.
 

Additional Inherited Members

- Public Member Functions inherited from iText.Pdf2Data.OcrWithPostProcessingEngine
  OcrWithPostProcessingEngine (IOcrEngine baseOcrEngine, IList< IOcrEnginePostProcessor > postProcessors, bool isTaggingSupported)
  Creates a new OcrWithPostProcessingEngine instance.
 
virtual bool  IsTaggingSupported ()
  Gets whether results will be tagged or not.
 
virtual IDictionary< int, IList< TextInfo > >  DoImageOcr (FileInfo input)
  Performs OCR with post-processing on the input file.
 
virtual IDictionary< int, IList< TextInfo > >  DoImageOcr (FileInfo input, OcrProcessContext ocrProcessContext)
  Performs OCR with post-processing on the input file, using the given OcrProcessContext.
 
virtual void  CreateTxtFile (IList< FileInfo > inputImages, FileInfo txtFile)
  Performs OCR using the provided iText.Pdfocr.IOcrEngine for the given list of input images and saves the output to a text file at the provided path.
 
virtual void  CreateTxtFile (IList< FileInfo > inputImages, FileInfo txtFile, OcrProcessContext ocrProcessContext)
  Performs OCR using the provided iText.Pdfocr.IOcrEngine for the given list of input images and saves the output to a text file at the provided path, using the given OcrProcessContext.
 

Detailed Description

Engine which uses iText.Pdfocr.Tesseract4.Tesseract4LibOcrEngine as its base OCR engine and applies Pdf2DataTATRPostProcessor afterwards if needed.

Member Function Documentation

◆ CreateBuilder()

static Tesseract4BasedEngine.Builder iText.Pdf2Data.Ocr.Engine.Tesseract4BasedEngine.CreateBuilder ( IList< String > languages,
FileInfo tessDataPath )
inline static

Creates a new Builder.

Note that you must provide the path to your trained Tesseract data directory.

Parameters
languages IList< String > of languages to recognize in the image. If missing or empty, English is used as the default language.
tessDataPath path to your trained Tesseract data directory as System.IO.FileInfo
Returns
a new instance of Builder
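
The call described above might look roughly like the following. This is a minimal sketch rather than an official usage example: the language codes ("eng", "deu") assume standard Tesseract trained-data names, the tessdata path is a hypothetical placeholder, and the class name Tesseract4BuilderSketch is made up for illustration. Only CreateBuilder and its two parameters are taken from this page; the members of Builder are not documented here, so the sketch stops once the builder is obtained.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using iText.Pdf2Data.Ocr.Engine;

// Hypothetical wrapper class, used only to make the sketch self-contained.
public static class Tesseract4BuilderSketch
{
    public static void Main()
    {
        // Assumption: languages are given as Tesseract trained-data names.
        // Per the parameter description, a missing or empty list falls back to English.
        IList<String> languages = new List<String> { "eng", "deu" };

        // Hypothetical location of the trained Tesseract data directory.
        // The API expects it wrapped in a System.IO.FileInfo.
        FileInfo tessDataPath = new FileInfo("/path/to/tessdata");

        // Static factory documented on this page; returns a new Builder.
        Tesseract4BasedEngine.Builder builder =
            Tesseract4BasedEngine.CreateBuilder(languages, tessDataPath);

        // Further configuration and construction of the engine go through
        // the Builder's own members, which are described on the Builder class page.
    }
}
```

Because tessDataPath is required, it is probably worth checking that the directory exists and contains the trained data for every requested language before creating the builder.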