Package com.itextpdf.pdf2data.ocr.engine
Class Tesseract4BasedEngine.Builder
java.lang.Object
com.itextpdf.pdf2data.ocr.engine.Tesseract4BasedEngine.Builder
- Enclosing class:
- Tesseract4BasedEngine
Builder for Tesseract4BasedEngine.
Method Summary
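| Modifier and Type | Method | Description |
|---|---|---|
| OcrWithPostProcessingEngine | build() | Creates a new OcrWithPostProcessingEngine engine. |
| Tesseract4BasedEngine.Builder | enableTATRPostProcessing() | Enables TATR post-processing if called. |
| Tesseract4BasedEngine.Builder | textPositioning(com.itextpdf.pdfocr.tesseract4.TextPositioning textPositioning) | Defines the way text is retrieved from Tesseract output using TextPositioning. |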
Method Details
textPositioning
public Tesseract4BasedEngine.Builder textPositioning(com.itextpdf.pdfocr.tesseract4.TextPositioning textPositioning)
Defines the way text is retrieved from Tesseract output using TextPositioning.
- Parameters:
textPositioning - the way text is retrieved
- Returns:
instance of Tesseract4BasedEngine.Builder
enableTATRPostProcessing
Enables TATR post-processing if called. Note that the table models must be initialized first; see Pdf2DataTATRPostProcessorStaticInitializer.
- Returns:
instance of Tesseract4BasedEngine.Builder
build
Creates a new OcrWithPostProcessingEngine engine. Tesseract4LibOcrEngine will be used as the base engine.
- Returns:
instance of OcrWithPostProcessingEngine
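The fluent call chain described above can be sketched as follows. This is a minimal, self-contained illustration and not the library's code: `Builder`, `Engine`, and the `TextPositioning` enum below are hypothetical stand-ins that only mirror the documented method names and return types, and the default positioning value is an assumption. The sketch does not show how the real Tesseract4BasedEngine.Builder is obtained, since this page does not document that. With the real library, the table models would need to be initialized (see Pdf2DataTATRPostProcessorStaticInitializer) before calling enableTATRPostProcessing.

```java
import java.util.Objects;

public class BuilderUsageSketch {

    // Stand-in for com.itextpdf.pdfocr.tesseract4.TextPositioning (constants assumed).
    enum TextPositioning { BY_LINES, BY_WORDS }

    // Stand-in for OcrWithPostProcessingEngine; it only records the chosen settings.
    static final class Engine {
        final TextPositioning textPositioning;
        final boolean tatrPostProcessing;

        Engine(TextPositioning textPositioning, boolean tatrPostProcessing) {
            this.textPositioning = textPositioning;
            this.tatrPostProcessing = tatrPostProcessing;
        }
    }

    // Stand-in for Tesseract4BasedEngine.Builder with the three documented methods.
    static final class Builder {
        // Assumed default; the real default is not stated on this page.
        private TextPositioning textPositioning = TextPositioning.BY_LINES;
        private boolean tatrPostProcessing = false;

        // Configuration methods return the builder so calls can be chained.
        Builder textPositioning(TextPositioning textPositioning) {
            this.textPositioning = Objects.requireNonNull(textPositioning);
            return this;
        }

        // TATR post-processing stays off unless this method is called.
        Builder enableTATRPostProcessing() {
            this.tatrPostProcessing = true;
            return this;
        }

        // Terminal call: produces the engine from the collected settings.
        Engine build() {
            return new Engine(textPositioning, tatrPostProcessing);
        }
    }

    public static void main(String[] args) {
        Engine engine = new Builder()
                .textPositioning(TextPositioning.BY_WORDS)
                .enableTATRPostProcessing()
                .build();
        // prints "BY_WORDS true"
        System.out.println(engine.textPositioning + " " + engine.tatrPostProcessing);
    }
}
```

Because each configuration method returns the builder, the optional steps can be left out: calling only build() in this sketch yields an engine with the assumed default positioning and TATR post-processing disabled.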