IOcrEngine
.
IOcrEngine
and creates PDF using provided PdfWriter
.
IOcrEngine
and creates PDF using provided PdfWriter
and PdfOutputIntent
.
IOcrEngine
for the given list of input images and saves output to a text file using provided path.
IOcrEngine
for the given list of input images and saves output to a text file using provided path.
FontProvider
.
null
a new instance of PdfOcrFontProvider
is returned.
- getImageLayerName() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets name of image layer.
- getLanguages() - Method in class com.itextpdf.pdfocr.OcrEngineProperties
-
Gets list of languages required for provided images.
- getLanguagesAsString() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Gets list of languages concatenated with "+" symbol to a string in format required by tesseract.
- getMessage() - Method in exception com.itextpdf.pdfocr.OcrException
- getMessageParams() - Method in exception com.itextpdf.pdfocr.OcrException
-
Gets additional params for Exception message.
- getOcrEngine() - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Gets used IOcrEngine.
- getOcrPdfCreatorProperties() - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Gets properties for OcrPdfCreator.
- getOriginId() - Method in class com.itextpdf.pdfocr.tesseract4.events.PdfOcrTesseract4Event
-
- getPageSegMode() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Gets Page Segmentation Mode.
- getPageSize() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets required size for output PDF document.
- getPathToExecutable() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
-
Gets path to tesseract executable.
- getPathToTessData() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Gets path to directory with tess data.
- getPdfDocumentType() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorMetaInfo
-
Gets the type of the document which is created during the corresponding OCR event.
- getPdfLang() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets PDF language.
- getScaleMode() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets scale mode for input images using available options from ScaleMode enumeration.
- getTesseract4OcrEngineProperties() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Gets properties for AbstractTesseract4OcrEngine.
- getTesseractInstance() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
-
Gets tesseract instance.
- getText() - Method in class com.itextpdf.pdfocr.TextInfo
-
Gets text element.
- getTextColor() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets text color in output PDF document.
- getTextLayerName() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets name of text layer.
- getTextPositioning() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Defines the way text is retrieved from tesseract output using TextPositioning.
- getThreadLocalMetaInfo() - Method in interface com.itextpdf.pdfocr.events.IThreadLocalMetaInfoAware
-
Gets the meta info which is held by the interface.
- getThreadLocalMetaInfo() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Gets the meta info which is held by the interface.
- getTitle() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets PDF document title.
- getWrappedMetaInfo() - Method in interface com.itextpdf.pdfocr.IMetaInfoWrapper
-
Gets the wrapped meta info.
- getWrappedMetaInfo() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorMetaInfo
-
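As an illustration of the "+"-separated format mentioned for getLanguagesAsString(), here is a minimal sketch in plain Java. The class and method names (LanguageJoinSketch, joinLanguages) are hypothetical and are not part of the pdfOCR API; the sketch only shows the string format that tesseract accepts for its language option (for example eng+deu), not how the library builds it.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of pdfOCR: shows the "+"-separated format only.
class LanguageJoinSketch {

    // Concatenates language codes with "+", e.g. ["eng", "deu"] -> "eng+deu".
    static String joinLanguages(List<String> languages) {
        return String.join("+", languages);
    }

    public static void main(String[] args) {
        // Prints the language string for English plus German.
        System.out.println(joinLanguages(Arrays.asList("eng", "deu")));
    }
}
```

An empty list yields an empty string in this sketch; how the library treats an empty language list is not stated in this index.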
I
- identifyOsType() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Identifies the type of the current OS and returns it (win, linux).
- IMetaInfoWrapper - Interface in com.itextpdf.pdfocr
-
The meta info wrapper that holds some meta info.
- INCORRECT_INPUT_IMAGE_FORMAT - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
- INCORRECT_LANGUAGE - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
- initializeTesseract(OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
-
Initializes the tesseract instance if it has not been initialized yet or has been disposed, and sets all the required properties.
- IOcrEngine - Interface in com.itextpdf.pdfocr
-
IOcrEngine interface is used for instantiating new OcrReader objects.
- isPreprocessingImages() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Checks whether image preprocessing is needed.
- isWindows() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Checks current os type.
- IThreadLocalMetaInfoAware - Interface in com.itextpdf.pdfocr.events
-
The interface which holds a thread local meta info, meaning different threads operate with independent and different meta infos.
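The identifyOsType() and isWindows() entries above describe OS detection. The following sketch shows one common way to do this in plain Java using the os.name system property. It is an assumption about the general technique, not the library's implementation; OsTypeSketch, classify and identifyCurrentOs are hypothetical names, and the sketch lumps every non-Windows system under "linux" to match the two values named in the index.

```java
import java.util.Locale;

// Hypothetical helper, not part of pdfOCR: classifies an OS name as "win" or "linux".
class OsTypeSketch {

    // Uses startsWith("windows") rather than contains("win"),
    // because "darwin" also contains the substring "win".
    static String classify(String osName) {
        String normalized = osName == null ? "" : osName.toLowerCase(Locale.ROOT);
        return normalized.startsWith("windows") ? "win" : "linux";
    }

    // Reads the standard "os.name" system property of the running JVM.
    static String identifyCurrentOs() {
        return classify(System.getProperty("os.name"));
    }

    public static void main(String[] args) {
        System.out.println(identifyCurrentOs());
    }
}
```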
L
- LANGUAGE_IS_NOT_IN_THE_LIST - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
M
- MAJOR_VERSION - Static variable in class com.itextpdf.pdfocr.tesseract4.PdfOcrTesseract4ProductInfo
-
The major version number.
- MINOR_VERSION - Static variable in class com.itextpdf.pdfocr.tesseract4.PdfOcrTesseract4ProductInfo
-
The minor version number.
N
- NUMBER_OF_PAGES_IN_IMAGE - Static variable in class com.itextpdf.pdfocr.PdfOcrLogMessageConstant
-
O
- OcrEngineProperties - Class in com.itextpdf.pdfocr
-
- OcrEngineProperties() - Constructor for class com.itextpdf.pdfocr.OcrEngineProperties
-
Creates a new OcrEngineProperties instance.
- OcrEngineProperties(OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.OcrEngineProperties
-
Creates a new OcrEngineProperties instance based on another OcrEngineProperties instance (copy constructor).
- OcrException - Exception in com.itextpdf.pdfocr
-
Exception class for custom exceptions.
- OcrException(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.OcrException
-
Creates a new OcrException.
- OcrException(String) - Constructor for exception com.itextpdf.pdfocr.OcrException
-
Creates a new OcrException.
- OcrPdfCreator - Class in com.itextpdf.pdfocr
-
OcrPdfCreator is the class that creates PDF documents containing input images and text that was recognized using the provided IOcrEngine.
- OcrPdfCreator(IOcrEngine) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreator
-
Creates a new OcrPdfCreator instance.
- OcrPdfCreator(IOcrEngine, OcrPdfCreatorProperties) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreator
-
Creates a new OcrPdfCreator instance.
- OcrPdfCreatorMetaInfo - Class in com.itextpdf.pdfocr
-
The meta info that is used internally by pdfOcr to pass wrapped custom meta data.
- OcrPdfCreatorMetaInfo(IMetaInfo, UUID, OcrPdfCreatorMetaInfo.PdfDocumentType) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreatorMetaInfo
-
Creates an inner meta info wrapper.
- OcrPdfCreatorMetaInfo.PdfDocumentType - Enum in com.itextpdf.pdfocr
-
The enum which represents types of documents, for which pdfOcr sends different events.
- OcrPdfCreatorProperties - Class in com.itextpdf.pdfocr
-
Properties that will be used by the OcrPdfCreator.
- OcrPdfCreatorProperties() - Constructor for class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Creates a new OcrPdfCreatorProperties instance.
- OcrPdfCreatorProperties(OcrPdfCreatorProperties) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Creates a new OcrPdfCreatorProperties instance based on another OcrPdfCreatorProperties instance (copy constructor).
- OutputFormat - Enum in com.itextpdf.pdfocr.tesseract4
-
Enumeration of the available output formats.
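Several entries in this section (OcrEngineProperties, OcrPdfCreatorProperties) mention a copy constructor. The sketch below illustrates the general pattern with a hypothetical EnginePropertiesSketch class holding only a language list. It is not the library's code, and whether the real classes copy their collections defensively is an assumption that this index does not confirm.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical properties class, not part of pdfOCR: demonstrates a copy constructor.
class EnginePropertiesSketch {

    private List<String> languages = new ArrayList<>();

    // Creates an instance with an empty language list.
    EnginePropertiesSketch() {
    }

    // Copy constructor: copies the list so later changes to either
    // instance do not leak into the other.
    EnginePropertiesSketch(EnginePropertiesSketch other) {
        this.languages = new ArrayList<>(other.languages);
    }

    // Returns a copy so callers cannot mutate internal state.
    List<String> getLanguages() {
        return new ArrayList<>(languages);
    }

    void setLanguages(List<String> languages) {
        this.languages = new ArrayList<>(languages);
    }
}
```

With this design, a caller can take a snapshot of existing properties, adjust the copy for a single run, and leave the original untouched.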
P
- PAGE_NUMBER_IS_INCORRECT - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
-
- parseHocrFile(List, TextPositioning) - Static method in class com.itextpdf.pdfocr.tesseract4.TesseractHelper
-
Parses each hocr file from the provided list, retrieves text, and returns data in the format described below.
- PATH_TO_TESS_DATA_DIRECTORY_IS_INVALID - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
- PATH_TO_TESS_DATA_IS_NOT_SET - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
- PDF_LANGUAGE_PROPERTY_IS_NOT_SET - Static variable in class com.itextpdf.pdfocr.PdfOcrLogMessageConstant
-
- PdfOcrFontProvider - Class in com.itextpdf.pdfocr
-
- PdfOcrFontProvider() - Constructor for class com.itextpdf.pdfocr.PdfOcrFontProvider
-
Creates a new PdfOcrFontProvider instance with the default font and the default font family.
- PdfOcrFontProvider(FontSet, String) - Constructor for class com.itextpdf.pdfocr.PdfOcrFontProvider
-
Creates a new PdfOcrFontProvider instance.
- PdfOcrLogMessageConstant - Class in com.itextpdf.pdfocr
-
- PdfOcrMetaInfo - Class in com.itextpdf.pdfocr
-
- PdfOcrMetaInfo() - Constructor for class com.itextpdf.pdfocr.PdfOcrMetaInfo
-
- PdfOcrTesseract4Event - Class in com.itextpdf.pdfocr.tesseract4.events
-
Class for OCR events.
- PdfOcrTesseract4ProductInfo - Class in com.itextpdf.pdfocr.tesseract4
-
Product info about this iText add-on.
- PdfOcrTesseract4ProductInfo() - Constructor for class com.itextpdf.pdfocr.tesseract4.PdfOcrTesseract4ProductInfo
-
- PRODUCT_NAME - Static variable in class com.itextpdf.pdfocr.tesseract4.PdfOcrTesseract4ProductInfo
-
The product name.
- PROVIDED_FONT_PROVIDER_IS_INVALID - Static variable in class com.itextpdf.pdfocr.PdfOcrLogMessageConstant
-
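The parseHocrFile entry above refers to retrieving text and positions from hOCR output. In hOCR, each element carries a title attribute with a bounding box written as "bbox x0 y0 x1 y1". The sketch below extracts those four integers with a regular expression. HocrBboxSketch and parseBbox are hypothetical names; this illustrates the hOCR bbox convention only and makes no claim about how TesseractHelper parses files or what data structure it returns.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of pdfOCR: reads the bbox from an hOCR title attribute.
class HocrBboxSketch {

    // Matches "bbox" followed by four whitespace-separated integers.
    private static final Pattern BBOX =
            Pattern.compile("bbox\\s+(-?\\d+)\\s+(-?\\d+)\\s+(-?\\d+)\\s+(-?\\d+)");

    // Returns [x0, y0, x1, y1], or an empty list when no bbox is present.
    static List<Integer> parseBbox(String title) {
        List<Integer> bbox = new ArrayList<>();
        Matcher matcher = BBOX.matcher(title);
        if (matcher.find()) {
            for (int group = 1; group <= 4; group++) {
                bbox.add(Integer.parseInt(matcher.group(group)));
            }
        }
        return bbox;
    }

    public static void main(String[] args) {
        // A title value in the style produced for hOCR word elements.
        System.out.println(parseBbox("bbox 36 92 618 184; x_wconf 95"));
    }
}
```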
S
- ScaleMode - Enum in com.itextpdf.pdfocr
-
Enumeration of the possible scale modes for input images.
- setBbox(List) - Method in class com.itextpdf.pdfocr.TextInfo
-
Sets bbox coordinates.
- setFontProvider(FontProvider) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets font provider.
- setFontProvider(FontProvider, String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets font provider and default font family.
- setImageLayerName(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets name for the image layer.
- setLanguages(List) - Method in class com.itextpdf.pdfocr.OcrEngineProperties
-
Sets list of languages to be recognized in provided images.
- setMessageParams(String...) - Method in exception com.itextpdf.pdfocr.OcrException
-
Sets additional params for Exception message.
- setOcrEngine(IOcrEngine) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Sets IOcrEngine reader object to perform OCR.
- setOcrPdfCreatorProperties(OcrPdfCreatorProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Sets properties for OcrPdfCreator.
- setPageSegMode(Integer) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Sets Page Segmentation Mode.
- setPageSize(Rectangle) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets required size for output PDF document.
- setPathToExecutable(String) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
-
Sets path to tesseract executable.
- setPathToTessData(File) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Sets path to directory with tess data.
- setPdfLang(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Specify PDF natural language, and optionally locale.
- setPreprocessingImages(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Sets true if image preprocessing is needed.
- setScaleMode(ScaleMode) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets scale mode for input images using available options from ScaleMode enumeration.
- setTesseract4OcrEngineProperties(Tesseract4OcrEngineProperties) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Sets properties for AbstractTesseract4OcrEngine.
- setText(String) - Method in class com.itextpdf.pdfocr.TextInfo
-
Sets text element.
- setTextColor(Color) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets text color in output PDF document.
- setTextLayerName(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets name for the text layer.
- setTextPositioning(TextPositioning) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Defines the way text is retrieved from tesseract output using TextPositioning.
- setThreadLocalMetaInfo(IMetaInfo) - Method in interface com.itextpdf.pdfocr.events.IThreadLocalMetaInfoAware
-
Sets a thread local meta info.
- setThreadLocalMetaInfo(IMetaInfo) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Sets a thread local meta info.
- setTitle(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets PDF document title.
- START_OCR_FOR_IMAGES - Static variable in class com.itextpdf.pdfocr.PdfOcrLogMessageConstant
-
- START_OCR_FOR_IMAGES - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
-
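The setThreadLocalMetaInfo and getThreadLocalMetaInfo entries describe meta info that is independent per thread. A minimal sketch of that idea with java.lang.ThreadLocal follows. MetaInfoHolderSketch is a hypothetical class that stores a plain String instead of an IMetaInfo, so it shows the threading behaviour only and is not the library's implementation.

```java
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical holder, not part of pdfOCR: each thread sees only its own value.
class MetaInfoHolderSketch {

    private final ThreadLocal<String> metaInfo = new ThreadLocal<>();

    // Stores the value for the calling thread only.
    void setThreadLocalMetaInfo(String info) {
        metaInfo.set(info);
    }

    // Returns the calling thread's value, or null if that thread never set one.
    String getThreadLocalMetaInfo() {
        return metaInfo.get();
    }

    // Starts a fresh thread and reports what that thread reads from the holder.
    static String valueSeenByNewThread(MetaInfoHolderSketch holder) {
        AtomicReference<String> seen = new AtomicReference<>("unset");
        Thread worker = new Thread(() -> seen.set(holder.getThreadLocalMetaInfo()));
        worker.start();
        try {
            worker.join();
        } catch (InterruptedException e) {
            // Restore the interrupt flag and fall through with whatever was read.
            Thread.currentThread().interrupt();
        }
        return seen.get();
    }
}
```

A value set on the calling thread stays visible to that thread, while a newly started thread reads null until it sets its own value.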
T
- TESSERACT4_IMAGE_OCR - Static variable in class com.itextpdf.pdfocr.tesseract4.events.PdfOcrTesseract4Event
-
- TESSERACT4_IMAGE_TO_PDF - Static variable in class com.itextpdf.pdfocr.tesseract4.events.PdfOcrTesseract4Event
-
- TESSERACT4_IMAGE_TO_PDFA - Static variable in class com.itextpdf.pdfocr.tesseract4.events.PdfOcrTesseract4Event
-
- Tesseract4ExecutableOcrEngine - Class in com.itextpdf.pdfocr.tesseract4
-
The implementation of AbstractTesseract4OcrEngine for tesseract OCR.
- Tesseract4ExecutableOcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
-
Creates a new Tesseract4ExecutableOcrEngine instance.
- Tesseract4ExecutableOcrEngine(String, Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
-
Creates a new Tesseract4ExecutableOcrEngine instance.
- Tesseract4LibOcrEngine - Class in com.itextpdf.pdfocr.tesseract4
-
The implementation of AbstractTesseract4OcrEngine for tesseract OCR.
- Tesseract4LibOcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
-
Creates a new Tesseract4LibOcrEngine instance.
- Tesseract4LogMessageConstant - Class in com.itextpdf.pdfocr.tesseract4
-
- Tesseract4OcrEngineProperties - Class in com.itextpdf.pdfocr.tesseract4
-
Properties that will be used by the IOcrEngine.
- Tesseract4OcrEngineProperties() - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Creates a new Tesseract4OcrEngineProperties instance.
- Tesseract4OcrEngineProperties(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Creates a new Tesseract4OcrEngineProperties instance based on another Tesseract4OcrEngineProperties instance (copy constructor).
- Tesseract4OcrException - Exception in com.itextpdf.pdfocr.tesseract4
-
- Tesseract4OcrException(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
Creates a new Tesseract4OcrException.
- Tesseract4OcrException(String) - Constructor for exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
Creates a new Tesseract4OcrException.
- TESSERACT_FAILED - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
-
- TESSERACT_FAILED - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
- TESSERACT_LIB_NOT_INSTALLED - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
- TESSERACT_LIB_NOT_INSTALLED_WIN - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
- TESSERACT_NOT_FOUND - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
-
- TesseractHelper - Class in com.itextpdf.pdfocr.tesseract4
-
Helper class.
- TextInfo - Class in com.itextpdf.pdfocr
-
This class describes how recognized text is positioned on the image providing bbox for each text item (could be a line or a word).
- TextInfo() - Constructor for class com.itextpdf.pdfocr.TextInfo
-
Creates a new TextInfo instance.
- TextInfo(String, List) - Constructor for class com.itextpdf.pdfocr.TextInfo
-
Creates a new TextInfo instance.
- TextPositioning - Enum in com.itextpdf.pdfocr.tesseract4
-
Enumeration of the possible types of text positioning.
V
- validateLanguages(List) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Validates list of provided languages and checks if they all exist in given tess data directory.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.OcrPdfCreatorMetaInfo.PdfDocumentType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.ScaleMode
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum com.itextpdf.pdfocr.OcrPdfCreatorMetaInfo.PdfDocumentType
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.ScaleMode
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
-
Returns an array containing the constants of this enum type, in the order they are declared.
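The validateLanguages entry above checks that every requested language exists in the tess data directory. Tesseract stores trained data in files named after the language code with the .traineddata extension, so a check along those lines might look like the sketch below. LanguageDataCheckSketch and missingLanguages are hypothetical names, and returning a list of missing codes is a design choice of this sketch; the index does not say how the library reports a missing language.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of pdfOCR: finds languages without a traineddata file.
class LanguageDataCheckSketch {

    // Returns the language codes for which "<code>.traineddata" is absent in the directory.
    static List<String> missingLanguages(File tessDataDir, List<String> languages) {
        List<String> missing = new ArrayList<>();
        for (String language : languages) {
            File trainedData = new File(tessDataDir, language + ".traineddata");
            if (!trainedData.isFile()) {
                missing.add(language);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // The directory path is a placeholder; pass a real tess data directory as the first argument.
        File dir = new File(args.length > 0 ? args[0] : "tessdata");
        System.out.println(missingLanguages(dir, Arrays.asList("eng", "deu")));
    }
}
```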
Copyright © 1998–2020 iText Group NV. All rights reserved.