All Classes and Interfaces
Class
Description
Abstract predictor, based on models running over ONNX runtime.
Properties for configuring ONNX models.
Helper class for working with events.
The implementation of
IOcrEngine.
This class represents artifact structure tree item.
Implementation of a text detection predictor post-processor, which is used as a basis for creating post-processors for handling OnnxTR, EasyOCR and PaddleOCR model outputs.
Abstract Implementation of a basic text recognition predictor post-processor.
Static utility class to help with batching.
Generator with batch processing.
Enum for values under the
box_type key within a DBPostProcess PostProcessor object in a config file.
Additional algorithms for working with
BufferedImage.
Exception class for exceptions during configuration file parsing.
Implementation of a text recognition predictor post-processor, used for OnnxTR CRNN model outputs.
POJO for the CTCLabelDecode post-processor object under a
PostProcess key in a config file.
Implementation of a text recognition predictor post-processor, used for EasyOCR and PaddleOCR model outputs.
POJO for the DBPostProcess post-processor object under a
PostProcess key in a config file.
POJO for the DecodeImage transform operation within a
PreProcess object in a config file.
Default implementation for mapping output of a crop orientation model to
TextOrientation values.
Default implementation of
IOrtSessionOptionsCreator.
POJO for the DetResizeForTest transform operation within a
PreProcess object in a config file.
A basic 2-element tuple with a width and a height.
Implementation of a text detection predictor post-processor, used for EasyOCR model outputs.
Label mapper for EasyOCR text recognition models.
Text box merger, based on the algorithm used in EasyOCR.
Implementation of a text recognition predictor post-processor, used for OnnxTR non-CRNN model outputs.
Multidimensional array with a
FloatBufferWrapper backing storage.
Wrapper class around
FloatBuffer.
Batch processor mapper interface.
Interface for post-processors, which convert raw output of an ML model and returns rotated boxes with the detected objects.
Interface for predictors, which take a full image and find text boxes on them.
Rotation information may be stored in image metadata.
Enumeration of supported image channel configuration for buffers.
Additional options applied on image preprocessing step.
Options, that describe the way an image will be resized before being converted to a tensor for an ML model input.
Enum for values under the
img_mode key within a DecodeImage transform operation object in a config file.
POJO for the root object in a config file.
Static class with functions for parsing PaddleOCR YAML config files into a
InferenceConfig POJO.
IOcrEngine interface is used for instantiating new OcrReader objects.
OCR properties passed to the OCR engine as part of
OcrProcessContext.
Interface for predictors, which take a cropped image of text and determine its orientation.
Interface for ONNX runtime session options creators.
Interface for mapping an integer index (continuous from 0) to output values.
Interface of a generic predictor.
The interface that holds information about product data and meta info.
Interface for post-processors, which convert raw output of an ML model and returns recognized characters as a string.
Interface for predictors, which take a cropped image of text and recognize text characters on it.
Interface for abstracting away score calculation over a text contour in the text detection post-processor.
Interface for a processing class, which handles merging text boxes, received from a text detection routine.
Leptonica based implementation of
IImageRotationHandler.
This class represents structure tree item of the text item put into the pdf document.
Additional math functions.
Score calculator, which returns the biggest observed sample.
Score calculator, which calculates the mean values over the observed samples.
POJO for the NormalizeImage transform operation within a
PreProcess object in a config file.
This class contains additional properties for ocr engine.
OcrPdfCreator is the class that creates PDF documents containing input images and text that was recognized using provided IOcrEngine.
Properties that will be used by the
OcrPdfCreator.
Class for storing ocr processing context.
Implementation of a text detection predictor post-processor, used for OnnxTR model outputs.
A text detection predictor implementation, which is using ONNX Runtime and its ML models to find, where text is located on an image.
Properties for configuring text detection ONNX models.
Properties that are used by the
OnnxOcrEngine.
Properties of the input of an ONNX model, which expects an image.
IOcrEngine implementation, based on OnnxTR/DocTR machine learning OCR projects.
A crop orientation predictor implementation, which is using ONNX Runtime and its ML models to figure out, how text is oriented in a cropped image of text.
Properties for configuring crop orientation ONNX models.
A text recognition predictor implementation, which is using ONNX Runtime and its ML models to recognize text characters on an image.
Properties for configuring text recognition ONNX models.
Static class with OpenCV utility functions.
Enumeration of the available output formats.
Enumeration of implemented padding strategies for padding images.
Implementation of a text detection predictor post-processor, used for PaddleOCR model outputs.
Exception class for exceptions during PaddleOCR initialization.
A convenience class to associate certain text items with the paragraph structure item.
Exception class for custom exceptions.
Class that bundles all the exception message templates as constants.
Utility class for working with files.
FontProvider extension for ocr engine.
Exception class for input related exceptions.
Exception class for Tesseract4 input related exceptions.
Class that bundles all the log message templates as constants.
Container to keep meta info.
Class that bundles all the error message templates as constants.
Stores an instance of
ProductData related to iText pdfOcr Onnx module.
Class represents events registered in iText pdfOcr Onnx module.
pdfOcr output types for statistics.
Class which represents an event for specifying type of an ocr processing.
Exception class for Tesseract4 exceptions.
Class that bundles all the error message templates as constants.
Stores an instance of
ProductData related to iText pdfOcr Tesseract4 module.
Class represents events registered in iText pdfOcr Tesseract4 module.
Class to build text output from the provided image OCR result and write it to the TXT file.
Interface for objects under the
PostProcess key in a config file.
POJO for the object under the
PreProcess key in a config file.
POJO for the RecResizeImg transform operation within a
PreProcess object in a config file.
Enumeration of the possible scale modes for input images.
Enum for values under the
score_mode key within a DBPostProcess PostProcessor object in a config file.
A convenience class to associate certain text items with the span structure item.
Look-up table for mapping text recognition model results to strings.
A convenience class to associate certain text items with the table cell structure item.
A convenience class to associate certain text items with the table row structure item.
A convenience class to associate certain text items with the table structure item.
The implementation of
AbstractTesseract4OcrEngine for tesseract OCR.
The implementation of
AbstractTesseract4OcrEngine for tesseract OCR.
Class that bundles all the log message templates as constants.
Properties that will be used by the
IOcrEngine.
Helper class.
This class describes how recognized text is positioned on the image providing bbox for each text item (could be a line or a word).
Enumeration of supported text orientations.
Enumeration of the possible types of text positioning.
Enumeration of the possible types of text positioning.
Utility class to handle tiff images.
Interface for objects inside the
transform_ops array within a PreProcess object in a config file.
A string-based LUT for mapping text recognition model results to characters.
Functions for working with YAML documents.