All Classes and Interfaces (pdfOCR 5.0.0 API)

Class

Description

AbstractOnnxPredictor<T, R>

Abstract predictor, based on models running over ONNX runtime.

AbstractOnnxPredictorProperties

Properties for configuring ONNX models.

AbstractPdfOcrEventHelper

Helper class for working with events.

AbstractTesseract4OcrEngine

The implementation of IOcrEngine.

ArtifactItem

This class represents an artifact structure tree item.

BasicDetectionPostProcessor

Implementation of a text detection predictor post-processor, which is used as a basis for creating post-processors for handling OnnxTR, EasyOCR and PaddleOCR model outputs.

BasicLabelPostProcessor

Abstract implementation of a basic text recognition predictor post-processor.

Batching

Static utility class to help with batching.

BatchProcessingGenerator<T, R>

Generator with batch processing.

BoxType

Enum for values under the box_type key within a DBPostProcess PostProcessor object in a config file.

BufferedImageUtil

Additional algorithms for working with BufferedImage.

ConfigParserException

Exception class for exceptions during configuration file parsing.

CrnnPostProcessor

Implementation of a text recognition predictor post-processor, used for OnnxTR CRNN model outputs.

CtcLabelDecode

POJO for the CTCLabelDecode post-processor object under a PostProcess key in a config file.

CtcLabelPostProcessor

Implementation of a text recognition predictor post-processor, used for EasyOCR and PaddleOCR model outputs.

DbPostProcess

POJO for the DBPostProcess post-processor object under a PostProcess key in a config file.

DecodeImage

POJO for the DecodeImage transform operation within a PreProcess object in a config file.

DefaultOrientationMapper

Default implementation for mapping output of a crop orientation model to TextOrientation values.

DefaultOrtSessionOptionsCreator

Default implementation of IOrtSessionOptionsCreator.

DetResizeForTest

POJO for the DetResizeForTest transform operation within a PreProcess object in a config file.

Dimensions2D

A basic 2-element tuple with a width and a height.

EasyOcrDetectionPostProcessor

Implementation of a text detection predictor post-processor, used for EasyOCR model outputs.

EasyOcrMapper

Label mapper for EasyOCR text recognition models.

EasyOcrTextBoxMerger

Text box merger, based on the algorithm used in EasyOCR.

EndOfStringPostProcessor

Implementation of a text recognition predictor post-processor, used for OnnxTR non-CRNN model outputs.

FloatBufferMdArray

Multidimensional array with a FloatBufferWrapper backing storage.

FloatBufferWrapper

Wrapper class around FloatBuffer.

IBatchProcessor<T, R>

Batch processor mapper interface.

IDetectionPostProcessor

Interface for post-processors, which convert raw output of an ML model and return rotated boxes with the detected objects.

IDetectionPredictor

Interface for predictors, which take a full image and find text boxes on it.

IImageRotationHandler

Rotation information may be stored in image metadata.

ImageChannelConfiguration

Enumeration of supported image channel configurations for buffers.

ImagePreprocessingOptions

Additional options applied at the image preprocessing step.

ImageResizeOptions

Options that describe how an image will be resized before being converted to a tensor for an ML model input.

ImgMode

Enum for values under the img_mode key within a DecodeImage transform operation object in a config file.

InferenceConfig

POJO for the root object in a config file.

InferenceConfigParser

Static class with functions for parsing PaddleOCR YAML config files into an InferenceConfig POJO.

IOcrEngine

IOcrEngine interface is used for instantiating new OcrReader objects.

IOcrProcessProperties

OCR properties passed to the OCR engine as part of OcrProcessContext.

IOrientationPredictor

Interface for predictors, which take a cropped image of text and determine its orientation.

IOrtSessionOptionsCreator

Interface for ONNX runtime session options creators.

IOutputLabelMapper<T>

Interface for mapping an integer index (continuous from 0) to output values.

IPredictor<T, R>

Interface of a generic predictor.

IProductAware

The interface that holds information about product data and meta info.

IRecognitionPostProcessor

Interface for post-processors, which convert raw output of an ML model and return recognized characters as a string.

IRecognitionPredictor

Interface for predictors, which take a cropped image of text and recognize text characters on it.

IScoreCalculator

Interface for abstracting away score calculation over a text contour in the text detection post-processor.

ITextBoxMerger

Interface for a processing class, which handles merging text boxes, received from a text detection routine.

LeptonicaImageRotationHandler

Leptonica based implementation of IImageRotationHandler.

LogicalStructureTreeItem

This class represents the structure tree item of a text item put into the PDF document.

MathUtil

Additional math functions.

MaxScoreCalculator

Score calculator, which returns the biggest observed sample.

MeanScoreCalculator

Score calculator, which calculates the mean value over the observed samples.

NormalizeImage

POJO for the NormalizeImage transform operation within a PreProcess object in a config file.

OcrEngineProperties

This class contains additional properties for the OCR engine.

OcrPdfCreator

OcrPdfCreator is the class that creates PDF documents containing input images and text that was recognized using the provided IOcrEngine.

OcrPdfCreatorProperties

Properties that will be used by the OcrPdfCreator.

OcrProcessContext

Class for storing the OCR processing context.

OnnxDetectionPostProcessor

Implementation of a text detection predictor post-processor, used for OnnxTR model outputs.

OnnxDetectionPredictor

A text detection predictor implementation, which uses ONNX Runtime and its ML models to find where text is located on an image.

OnnxDetectionPredictorProperties

Properties for configuring text detection ONNX models.

OnnxEngineProperties

Properties that are used by the OnnxOcrEngine.

OnnxInputProperties

Properties of the input of an ONNX model, which expects an image.

OnnxOcrEngine

IOcrEngine implementation, based on OnnxTR/DocTR machine learning OCR projects.

OnnxOrientationPredictor

A crop orientation predictor implementation, which uses ONNX Runtime and its ML models to figure out how text is oriented in a cropped image of text.

OnnxOrientationPredictorProperties

Properties for configuring crop orientation ONNX models.

OnnxRecognitionPredictor

A text recognition predictor implementation, which uses ONNX Runtime and its ML models to recognize text characters on an image.

OnnxRecognitionPredictorProperties

Properties for configuring text recognition ONNX models.

OpenCvUtil

Static class with OpenCV utility functions.

OutputFormat

Enumeration of the available output formats.

PaddingStrategy

Enumeration of implemented padding strategies for padding images.

PaddleOcrDetectionPostProcessor

Implementation of a text detection predictor post-processor, used for PaddleOCR model outputs.

PaddleOcrInitException

Exception class for exceptions during PaddleOCR initialization.

ParagraphTreeItem

A convenience class to associate certain text items with the paragraph structure item.

PdfOcrException

Exception class for custom exceptions.

PdfOcrExceptionMessageConstant

Class that bundles all the exception message templates as constants.

PdfOcrFileUtil

Utility class for working with files.

PdfOcrFontProvider

FontProvider extension for the OCR engine.

PdfOcrInputException

Exception class for input related exceptions.

PdfOcrInputTesseract4Exception

Exception class for Tesseract4 input related exceptions.

PdfOcrLogMessageConstant

Class that bundles all the log message templates as constants.

PdfOcrMetaInfoContainer

Container to keep meta info.

PdfOcrOnnxExceptionMessageConstant

Class that bundles all the error message templates as constants.

PdfOcrOnnxProductData

Stores an instance of ProductData related to iText pdfOcr Onnx module.

PdfOcrOnnxProductEvent

Class that represents events registered in the iText pdfOcr Onnx module.

PdfOcrOutputType

pdfOcr output types for statistics.

PdfOcrOutputTypeStatisticsEvent

Class which represents an event for specifying the type of OCR processing.

PdfOcrTesseract4Exception

Exception class for Tesseract4 exceptions.

PdfOcrTesseract4ExceptionMessageConstant

Class that bundles all the error message templates as constants.

PdfOcrTesseract4ProductData

Stores an instance of ProductData related to iText pdfOcr Tesseract4 module.

PdfOcrTesseract4ProductEvent

Class that represents events registered in the iText pdfOcr Tesseract4 module.

PdfOcrTextBuilder

Class to build text output from the provided image OCR result and write it to a TXT file.

PostProcess

Interface for objects under the PostProcess key in a config file.

PreProcess

POJO for the object under the PreProcess key in a config file.

RecResizeImg

POJO for the RecResizeImg transform operation within a PreProcess object in a config file.

ScaleMode

Enumeration of the possible scale modes for input images.

ScoreMode

Enum for values under the score_mode key within a DBPostProcess PostProcessor object in a config file.

SpanTreeItem

A convenience class to associate certain text items with the span structure item.

StringMapper

Look-up table for mapping text recognition model results to strings.

TableCellTreeItem

A convenience class to associate certain text items with the table cell structure item.

TableRowTreeItem

A convenience class to associate certain text items with the table row structure item.

TableTreeItem

A convenience class to associate certain text items with the table structure item.

Tesseract4ExecutableOcrEngine

The implementation of AbstractTesseract4OcrEngine for tesseract OCR.

Tesseract4LibOcrEngine

The implementation of AbstractTesseract4OcrEngine for tesseract OCR.

Tesseract4LogMessageConstant

Class that bundles all the log message templates as constants.

Tesseract4OcrEngineProperties

Properties that will be used by the IOcrEngine.

TesseractHelper

Helper class.

TextInfo

This class describes how recognized text is positioned on the image, providing a bounding box (bbox) for each text item (which could be a line or a word).

TextOrientation

Enumeration of supported text orientations.

TextPositioning

Enumeration of the possible types of text positioning.

TextPositioning

Enumeration of the possible types of text positioning.

TiffImageUtil

Utility class to handle tiff images.

TransformOp

Interface for objects inside the transform_ops array within a PreProcess object in a config file.

Vocabulary

A string-based LUT for mapping text recognition model results to characters.

YamlUtil

Functions for working with YAML documents.
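
The MaxScoreCalculator and MeanScoreCalculator entries above describe two ways of reducing the samples observed over a text contour to a single score. The following is a minimal, self-contained sketch of those two reductions only. This index does not show the IScoreCalculator signature, so the class and method names below (ScoreSketch, maxScore, meanScore) are hypothetical and are not part of the pdfOCR API; returning 0 for an empty sample set is also an assumption.

```java
// Hypothetical illustration only: ScoreSketch, maxScore and meanScore
// are NOT pdfOCR classes or methods. They mirror the descriptions of
// MaxScoreCalculator and MeanScoreCalculator in the index above.
final class ScoreSketch {
    private ScoreSketch() {
    }

    /** Returns the biggest observed sample (assumed 0 when empty). */
    static float maxScore(float[] samples) {
        if (samples.length == 0) {
            return 0f;
        }
        float max = samples[0];
        for (float sample : samples) {
            max = Math.max(max, sample);
        }
        return max;
    }

    /** Returns the mean over the observed samples (assumed 0 when empty). */
    static float meanScore(float[] samples) {
        if (samples.length == 0) {
            return 0f;
        }
        double sum = 0.0; // accumulate in double to limit rounding error
        for (float sample : samples) {
            sum += sample;
        }
        return (float) (sum / samples.length);
    }

    public static void main(String[] args) {
        float[] samples = {0.25f, 0.5f, 0.75f};
        System.out.println(maxScore(samples));  // prints 0.75
        System.out.println(meanScore(samples)); // prints 0.5
    }
}
```

Under these assumptions, the maximum rewards a contour that contains at least one high-confidence sample, while the mean penalizes contours whose samples are mostly low. Which behaviour the library selects for a given model is not stated in this index.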