Package com.itextpdf.pdf2data.ocr.engine
Class Tesseract4BasedEngine.Builder
java.lang.Object
com.itextpdf.pdf2data.ocr.engine.Tesseract4BasedEngine.Builder
- Enclosing class:
- Tesseract4BasedEngine
Builder for Tesseract4BasedEngine.
-
Method Summary
Modifier and Type | Method | Description
OcrWithPostProcessingEngine | build() | Creates new OcrWithPostProcessingEngine engine.
Tesseract4BasedEngine.Builder | enableTATRPostProcessing() | Enables TATR post-processing if called.
Tesseract4BasedEngine.Builder | textPositioning(com.itextpdf.pdfocr.tesseract4.TextPositioning textPositioning) | Defines the way text is retrieved from tesseract output using TextPositioning.
-
Method Details
-
textPositioning
public Tesseract4BasedEngine.Builder textPositioning(com.itextpdf.pdfocr.tesseract4.TextPositioning textPositioning)
Defines the way text is retrieved from tesseract output using TextPositioning.
- Parameters:
textPositioning - the way text is retrieved
- Returns:
instance of Tesseract4BasedEngine.Builder
-
enableTATRPostProcessing
public Tesseract4BasedEngine.Builder enableTATRPostProcessing()
Enables TATR post-processing if called. Note that you should initialize the table models first; see Pdf2DataTATRPostProcessorStaticInitializer.
- Returns:
instance of Tesseract4BasedEngine.Builder
-
build
public OcrWithPostProcessingEngine build()
Creates a new OcrWithPostProcessingEngine engine. Tesseract4LibOcrEngine will be used as the base engine.
- Returns:
instance of OcrWithPostProcessingEngine
-
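The three methods above form a fluent chain: each configuration call returns the builder, and build() produces the engine. The sketch below illustrates that calling pattern only. It does not use the real iText classes, so it can run without the library: BuilderSketch, Engine, the nested Builder, and the two-value TextPositioning enum are hypothetical stand-ins, and the default values are assumptions. How a real Tesseract4BasedEngine.Builder is obtained is not described on this page, so the no-argument constructor here is also an assumption.

```java
// Self-contained sketch of the fluent builder pattern described above.
// Every type here is a hypothetical stand-in, not the iText pdf2Data API.
public class BuilderSketch {

    /** Stand-in for com.itextpdf.pdfocr.tesseract4.TextPositioning. */
    enum TextPositioning { BY_LINES, BY_WORDS }

    /** Stand-in for the engine type returned by build(). */
    static final class Engine {
        final TextPositioning textPositioning;
        final boolean tatrPostProcessing;

        Engine(TextPositioning textPositioning, boolean tatrPostProcessing) {
            this.textPositioning = textPositioning;
            this.tatrPostProcessing = tatrPostProcessing;
        }
    }

    /** Stand-in for Tesseract4BasedEngine.Builder. */
    static final class Builder {
        // Assumed defaults; the real defaults are not stated in the reference.
        private TextPositioning textPositioning = TextPositioning.BY_LINES;
        private boolean tatrPostProcessing = false;

        /** Stores the positioning mode and returns this builder for chaining. */
        Builder textPositioning(TextPositioning textPositioning) {
            this.textPositioning = textPositioning;
            return this;
        }

        /** Turns post-processing on; it stays off unless this is called. */
        Builder enableTATRPostProcessing() {
            this.tatrPostProcessing = true;
            return this;
        }

        /** Creates the engine from the collected settings. */
        Engine build() {
            return new Engine(textPositioning, tatrPostProcessing);
        }
    }

    public static void main(String[] args) {
        Engine engine = new Builder()
                .textPositioning(TextPositioning.BY_WORDS)
                .enableTATRPostProcessing()
                .build();
        System.out.println(engine.textPositioning + " " + engine.tatrPostProcessing);
    }
}
```

With the real library, the table models would need to be initialized before enableTATRPostProcessing() is used, as noted in the method description; that step is omitted here because its API is not shown on this page.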