Index

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form

A

AbstractOnnxPredictor<T, R> - Class in com.itextpdf.pdfocr.onnx
Abstract predictor, based on models running over ONNX runtime.
AbstractOnnxPredictor(AbstractOnnxPredictorProperties, long[]) - Constructor for class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor
Creates a new abstract predictor.
AbstractOnnxPredictorProperties - Class in com.itextpdf.pdfocr.onnx
Properties for configuring ONNX models.
AbstractOnnxPredictorProperties(String, OnnxInputProperties, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
Creates new predictor properties.
AbstractPdfOcrEventHelper - Class in com.itextpdf.pdfocr
Helper class for working with events.
AbstractPdfOcrEventHelper() - Constructor for class com.itextpdf.pdfocr.AbstractPdfOcrEventHelper
 
AbstractTesseract4OcrEngine - Class in com.itextpdf.pdfocr.tesseract4
The implementation of IOcrEngine.
AbstractTesseract4OcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Creates a new Tesseract4OcrEngineProperties instance based on another Tesseract4OcrEngineProperties instance (copy constructor).
addCell(TableCellTreeItem) - Method in class com.itextpdf.pdfocr.structuretree.TableRowTreeItem
Add a new table cell structure tree item to the table row.
addChild(LogicalStructureTreeItem) - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
Add child structure tree item.
addRow(TableRowTreeItem) - Method in class com.itextpdf.pdfocr.structuretree.TableTreeItem
Add a new row structure tree item to the table.
AFRIKAANS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ALBANIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
allocate(int) - Static method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Allocates a new float buffer.
ANCIENT_GREEK - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
appendLabel(StringBuilder, int) - Method in class com.itextpdf.pdfocr.onnx.recognition.BasicLabelPostProcessor
Adds label to the string output, based on the label's index.
appendLabel(StringBuilder, int) - Method in class com.itextpdf.pdfocr.onnx.recognition.CrnnPostProcessor
Adds label to the string output, based on the label's index.
appendLabel(StringBuilder, int) - Method in class com.itextpdf.pdfocr.onnx.recognition.CtcLabelPostProcessor
Adds label to the string output, based on the label's index.
applyRotation(ImageData) - Method in interface com.itextpdf.pdfocr.IImageRotationHandler
Apply rotation to image data.
applyRotation(ImageData) - Method in class com.itextpdf.pdfocr.tesseract4.LeptonicaImageRotationHandler
 
applyTextBoxMerger(List) - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
The text detection model from EasyOCR, for the most part, returns words or small groups of words.
ARABIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ARABIC_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ARABIC_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ARABIC_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
ARABIC_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ARABIC_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
argmax(float[]) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
Returns the index of the maximum value in the given array.
ARMENIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
array() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Returns the float array that backs this buffer.
arrayOffset() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Returns the offset within this buffer's backing array of the first element of the buffer.
ArtifactItem - Class in com.itextpdf.pdfocr.structuretree
This class represents artifact structure tree item.
ASCII_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ASCII_LOWERCASE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ASCII_UPPERCASE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
AZERBAIJANI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

B

BasicDetectionPostProcessor - Class in com.itextpdf.pdfocr.onnx.detection
Implementation of a text detection predictor post-processor, which is used as a basis for creating post-processors for handling OnnxTR, EasyOCR and PaddleOCR model outputs.
BasicDetectionPostProcessor(float, float, int) - Constructor for class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Creates a new post-processor.
BasicLabelPostProcessor - Class in com.itextpdf.pdfocr.onnx.recognition
Abstract Implementation of a basic text recognition predictor post-processor.
BasicLabelPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.recognition.BasicLabelPostProcessor
 
BASQUE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BATCH_SIZE_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
Batching - Class in com.itextpdf.pdfocr.onnx.util
Static utility class to help with batching.
BatchProcessingGenerator<T, R> - Class in com.itextpdf.pdfocr.onnx.util
Generator with batch processing.
BatchProcessingGenerator(Iterator>, IBatchProcessor) - Constructor for class com.itextpdf.pdfocr.onnx.util.BatchProcessingGenerator
Creates a new generator with the provided batch iterator and processor.
BELARUSIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BENGALI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BENGALI_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BENGALI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BENGALI_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
BENGALI_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BENGALI_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BENGALI_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BENGALI_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BENGALI_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BGR - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
Value for BGR config value.
BGR - Enum constant in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
Image is represented with three channels: blue, green, red.
binarizeMdArray(FloatBufferMdArray, float) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
Takes a two-dimensional MD-array and returns a binarized version of it as an OpenCV Mat.
BOSNIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BOTTOM_RIGHT_BLACK - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Image will be put into the top-left corner.
BOTTOM_RIGHT_EDGE - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Image will be put into the top-left corner.
BOTTOM_RIGHT_GRAY - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Image will be put into the top-left corner.
BOTTOM_RIGHT_WHITE - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Image will be put into the top-left corner.
BOX_TYPE_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
BoxType - Enum in com.itextpdf.pdfocr.onnx.conf.paddle.model
Enum for values under the box_type key within a DBPostProcess PostProcessor object in a config file.
boxTypeIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates an exception for cases, when the detection model does not return quads.
BufferedImageUtil - Class in com.itextpdf.pdfocr.onnx.util
Additional algorithms for working with BufferedImage.
buildText(Map>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder
Constructs string output from the provided IOcrEngine.doImageOcr(java.io.File) result.
buildTextContourPredictionMask(Mat, Rect) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Builds and return a mask for calculating prediction score for the provided contour.
buildTextContourPredictionMask(Mat, Rect) - Method in class com.itextpdf.pdfocr.onnx.detection.PaddleOcrDetectionPostProcessor
Builds and return a mask for calculating prediction score for the provided contour.
BULGARIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BURMESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BURMESE_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BURMESE_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BURMESE_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BURMESE_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BURMESE_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BURMESE_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
BY_LINES - Enum constant in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
Text will be grouped by lines.
BY_LINES - Enum constant in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
Text will be located by lines retrieved from hocr file.
BY_WORDS - Enum constant in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
Text will be grouped by words.
BY_WORDS - Enum constant in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
Text will be located by words retrieved from hocr file.
BY_WORDS_AND_LINES - Enum constant in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
Similar to BY_WORDS mode, but top and bottom of word BBox are inherited from line (default value).
BY_WORDS_AND_LINES - Enum constant in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
Similar to BY_WORDS mode, but top and bottom of word BBox are inherited from line.

C

calcOutputDimensions(BufferedImage, ImageResizeOptions) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil
Based on the provided ImageResizeOptions, calculates the dimensions of the output image, to where there original image should be scaled and placed with padding.
calcOutputDimensions(Collection, ImageResizeOptions) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil
Based on the provided ImageResizeOptions, calculates the dimensions to which a batch of images should be scaled and padded.
calcTextBoxEnlargement(double, double) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Calculates by how much the dimensions of a text box should be enlarged compared to the ones gotten from the model output.
calcTextBoxEnlargement(double, double) - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
Calculates by how much the dimensions of a text box should be enlarged compared to the ones gotten from the model output.
calcTextBoxEnlargement(double, double) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
Calculates by how much the dimensions of a text box should be enlarged compared to the ones gotten from the model output.
calcTextBoxEnlargement(double, double) - Method in class com.itextpdf.pdfocr.onnx.detection.PaddleOcrDetectionPostProcessor
Calculates by how much the dimensions of a text box should be enlarged compared to the ones gotten from the model output.
calculate() - Method in interface com.itextpdf.pdfocr.onnx.detection.score.IScoreCalculator
Calculate the score based on the observed samples.
calculate() - Method in class com.itextpdf.pdfocr.onnx.detection.score.MaxScoreCalculator
Calculate the score based on the observed samples.
calculate() - Method in class com.itextpdf.pdfocr.onnx.detection.score.MeanScoreCalculator
Calculate the score based on the observed samples.
calculateLevenshteinDistance(String, String) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
Calculates the Levenshtein distance between two input strings.
CANNOT_ADD_DATA_TO_PDF_DOCUMENT - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
The constant CANNOT_ADD_DATA_TO_PDF_DOCUMENT.
CANNOT_BINARIZE_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_CONVERT_IMAGE_TO_GRAYSCALE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_CREATE_BUFFERED_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_CREATE_PDF_DOCUMENT - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
CANNOT_DELETE_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_FIND_PATH_TO_TESSERACT_EXECUTABLE - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
CANNOT_GET_TEMPORARY_DIRECTORY - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_OCR_INPUT_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_PARSE_NODE_BBOX - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_PROCESS_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_READ_DEFAULT_FONT - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
The constant CANNOT_READ_DEFAULT_FONT.
CANNOT_READ_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_READ_IMAGE_METADATA - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_READ_INPUT_IMAGE - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
CANNOT_READ_INPUT_IMAGE - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
The constant CANNOT_READ_INPUT_IMAGE.
CANNOT_READ_INPUT_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_READ_INPUT_IMAGE_PARAMS - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
CANNOT_READ_PROVIDED_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
CANNOT_RESOLVE_PROVIDED_FONTS - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
CANNOT_RETRIEVE_PAGES_FROM_IMAGE - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
 
CANNOT_START_ANOTHER_MAPPING_OPERATION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
CANNOT_USE_USER_WORDS - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
CANNOT_WRITE_TO_FILE - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
CATALAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
CHANNEL_FIRST_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
channelFirstIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates an exception to assert channel_first is set to false.
clamp(double, double, double) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
Clamps a value between a specified minimum and maximum range.
close() - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor
close() - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
collectWordsIntoLines(Map>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder
Merges the provided IOcrEngine.doImageOcr(java.io.File) result into lines and updates line bounding boxes to match the largest words.
com.itextpdf.pdfocr - package com.itextpdf.pdfocr
 
com.itextpdf.pdfocr.exceptions - package com.itextpdf.pdfocr.exceptions
 
com.itextpdf.pdfocr.logs - package com.itextpdf.pdfocr.logs
 
com.itextpdf.pdfocr.onnx - package com.itextpdf.pdfocr.onnx
 
com.itextpdf.pdfocr.onnx.actions.data - package com.itextpdf.pdfocr.onnx.actions.data
 
com.itextpdf.pdfocr.onnx.actions.events - package com.itextpdf.pdfocr.onnx.actions.events
 
com.itextpdf.pdfocr.onnx.conf.paddle.model - package com.itextpdf.pdfocr.onnx.conf.paddle.model
 
com.itextpdf.pdfocr.onnx.conf.paddle.parser - package com.itextpdf.pdfocr.onnx.conf.paddle.parser
 
com.itextpdf.pdfocr.onnx.detection - package com.itextpdf.pdfocr.onnx.detection
 
com.itextpdf.pdfocr.onnx.detection.score - package com.itextpdf.pdfocr.onnx.detection.score
 
com.itextpdf.pdfocr.onnx.exceptions - package com.itextpdf.pdfocr.onnx.exceptions
 
com.itextpdf.pdfocr.onnx.merging - package com.itextpdf.pdfocr.onnx.merging
 
com.itextpdf.pdfocr.onnx.orientation - package com.itextpdf.pdfocr.onnx.orientation
 
com.itextpdf.pdfocr.onnx.recognition - package com.itextpdf.pdfocr.onnx.recognition
 
com.itextpdf.pdfocr.onnx.text - package com.itextpdf.pdfocr.onnx.text
 
com.itextpdf.pdfocr.onnx.util - package com.itextpdf.pdfocr.onnx.util
 
com.itextpdf.pdfocr.statistics - package com.itextpdf.pdfocr.statistics
 
com.itextpdf.pdfocr.structuretree - package com.itextpdf.pdfocr.structuretree
 
com.itextpdf.pdfocr.tesseract4 - package com.itextpdf.pdfocr.tesseract4
 
com.itextpdf.pdfocr.tesseract4.actions.data - package com.itextpdf.pdfocr.tesseract4.actions.data
 
com.itextpdf.pdfocr.tesseract4.actions.events - package com.itextpdf.pdfocr.tesseract4.actions.events
 
com.itextpdf.pdfocr.tesseract4.exceptions - package com.itextpdf.pdfocr.tesseract4.exceptions
 
com.itextpdf.pdfocr.tesseract4.logs - package com.itextpdf.pdfocr.tesseract4.logs
 
com.itextpdf.pdfocr.util - package com.itextpdf.pdfocr.util
 
COMMAND_FAILED - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
concat(Vocabulary...) - Static method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
Creates a new vocabulary by concatenating multiple ones.
ConfigParserException - Exception in com.itextpdf.pdfocr.onnx.exceptions
Exception class for exceptions during configuration file parsing.
ConfigParserException(String) - Constructor for exception com.itextpdf.pdfocr.onnx.exceptions.ConfigParserException
Creates new ConfigParserException instance.
correctRotationAngle(Map>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder
Processes all text infos to round the rotation angle to either 0, 90, 180 or 270 degrees.
COULD_NOT_FIND_CORRESPONDING_GLYPH_TO_UNICODE_CHARACTER - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
The constant COULD_NOT_FIND_CORRESPONDING_GLYPH_TO_UNICODE_CHARACTER.
create() - Method in class com.itextpdf.pdfocr.onnx.DefaultOrtSessionOptionsCreator
 
create() - Method in interface com.itextpdf.pdfocr.onnx.IOrtSessionOptionsCreator
Creates an instance of ONNX runtime session options.
CREATED_TEMPORARY_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
createPdf(List, PdfWriter) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR with set parameters using provided IOcrEngine and creates PDF using provided PdfWriter.
createPdf(List, PdfWriter, DocumentProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR with set parameters using provided IOcrEngine and creates PDF using provided PdfWriter.
createPdf(List, PdfWriter, DocumentProperties, IOcrProcessProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR with set parameters using provided IOcrEngine and creates PDF using provided PdfWriter.
createPdfA(List, PdfWriter, DocumentProperties, PdfOutputIntent) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR with set parameters using provided IOcrEngine and creates PDF using provided PdfWriter, DocumentProperties and PdfOutputIntent.
createPdfA(List, PdfWriter, DocumentProperties, PdfOutputIntent, IOcrProcessProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR with set parameters using provided IOcrEngine and creates PDF using provided PdfWriter, DocumentProperties and PdfOutputIntent.
createPdfA(List, PdfWriter, PdfOutputIntent) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR with set parameters using provided IOcrEngine and creates PDF using provided PdfWriter and PdfOutputIntent.
createPdfAFile(List, File, PdfOutputIntent) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR with set parameters using provided IOcrEngine and creates PDF using provided File and PdfOutputIntent.
createPdfFile(List, File) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR with set parameters using provided IOcrEngine and creates PDF using provided File.
createProcessImageEvent(SequenceId, IMetaInfo, EventConfirmationType) - Static method in class com.itextpdf.pdfocr.tesseract4.actions.events.PdfOcrTesseract4ProductEvent
Creates process-image event.
createProcessImageOnnxEvent(SequenceId, IMetaInfo, EventConfirmationType) - Static method in class com.itextpdf.pdfocr.onnx.actions.events.PdfOcrOnnxProductEvent
Creates process-image-onnx event.
createScoreCalculator() - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Creates a new score calculator for calculating score over a text contour.
createScoreCalculator() - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
Creates a new score calculator for calculating score over a text contour.
createStatisticsAggregatorFromName(String) - Method in class com.itextpdf.pdfocr.statistics.PdfOcrOutputTypeStatisticsEvent
createTxtFile(List, File) - Method in interface com.itextpdf.pdfocr.IOcrEngine
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path.
createTxtFile(List, File) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path.
createTxtFile(List, File) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path.
createTxtFile(List, File, OcrProcessContext) - Method in interface com.itextpdf.pdfocr.IOcrEngine
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path.
createTxtFile(List, File, OcrProcessContext) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path.
createTxtFile(List, File, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path.
crnnMobileNetV3(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained CRNN model with a MobileNet V3 backbone, stored on disk.
crnnMobileNetV3(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained CRNN models with a MobileNet V3 backbone, stored on disk.
crnnMobileNetV3(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained CRNN model with a MobileNet V3 backbone, stored on disk.
crnnMobileNetV3(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained CRNN models with a MobileNet V3 backbone, stored on disk.
CrnnPostProcessor - Class in com.itextpdf.pdfocr.onnx.recognition
Implementation of a text recognition predictor post-processor, used for OnnxTR CRNN model outputs.
CrnnPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.recognition.CrnnPostProcessor
Creates a new post-processor with the default vocabulary.
CrnnPostProcessor(Vocabulary) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.CrnnPostProcessor
Creates a new post-processor.
crnnVgg16(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained CRNN model with a VGG-16 backbone, stored on disk.
crnnVgg16(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained CRNN models with a VGG-16 backbone, stored on disk.
crnnVgg16(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained CRNN model with a VGG-16 backbone, stored on disk.
crnnVgg16(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained CRNN models with a VGG-16 backbone, stored on disk.
CROATIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
CtcLabelDecode - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
POJO for the CTCLabelDecode post-processor object under a PostProcess key in a config file.
CtcLabelDecode(String[]) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
Creates a new POJO for the config file object.
CtcLabelPostProcessor - Class in com.itextpdf.pdfocr.onnx.recognition
Implementation of a text recognition predictor post-processor, used for EasyOCR and PaddleOCR model outputs.
CtcLabelPostProcessor(IOutputLabelMapper) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.CtcLabelPostProcessor
Creates a new post-processor.
CURRENCY - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
CYRILLIC_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
CYRILLIC_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
CZECH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

D

DANISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
DATA - Enum constant in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
Processing of an image in the engine with data output
dbNet(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained DBNet model, stored on disk.
dbNet(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained DBNet models, stored on disk.
dbNet(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained DBNet model, stored on disk.
dbNet(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained DBNet models, stored on disk.
DbPostProcess - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
POJO for the DBPostProcess post-processor object under a PostProcess key in a config file.
DbPostProcess(float, float, float, int, boolean, ScoreMode, BoxType) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Creates a new POJO for the config file object.
DecodeImage - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
POJO for the DecodeImage transform operation within a PreProcess object in a config file.
DecodeImage(boolean, ImgMode) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
Creates a new POJO for the config file object.
DEFAULT_ORT_SESSION_CREATOR - Static variable in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
Default ONNX runtime session options creator.
DefaultOrientationMapper - Class in com.itextpdf.pdfocr.onnx.orientation
Default implementation for mapping output of a crop orientation model to TextOrientation values.
DefaultOrientationMapper() - Constructor for class com.itextpdf.pdfocr.onnx.orientation.DefaultOrientationMapper
Constructs a new DefaultOrientationMapper with default behavior.
DefaultOrtSessionOptionsCreator - Class in com.itextpdf.pdfocr.onnx
Default implementation of IOrtSessionOptionsCreator.
DefaultOrtSessionOptionsCreator() - Constructor for class com.itextpdf.pdfocr.onnx.DefaultOrtSessionOptionsCreator
 
deserializeFromStream(InputStream) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
Deserializes a content stream, which contains a single YAML document.
DetResizeForTest - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
POJO for the DetResizeForTest transform operation within a PreProcess object in a config file.
DetResizeForTest(int[], boolean) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
Creates a new POJO for the config file object.
DEVANAGARI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
DEVANAGARI_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
DEVANAGARI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
DEVANAGARI_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
DEVANAGARI_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
DEVANAGARI_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
DEVANAGARI_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
DEVANAGARI_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
DEVANAGARI_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
Dimensions2D - Class in com.itextpdf.pdfocr.onnx.util
A basic 2-element tuple with a width and a height.
Dimensions2D(int, int) - Constructor for class com.itextpdf.pdfocr.onnx.util.Dimensions2D
Creates new Dimensions2D instance.
doImageOcr(File) - Method in interface com.itextpdf.pdfocr.IOcrEngine
Reads data from the provided input image file and returns retrieved data in the format described below.
doImageOcr(File) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Reads data from the provided input image file and returns retrieved data in the format described below.
doImageOcr(File) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Reads data from the provided input image file and returns retrieved data in the format described below.
doImageOcr(File, OcrProcessContext) - Method in interface com.itextpdf.pdfocr.IOcrEngine
Reads data from the provided input image file and returns retrieved data in the format described below.
doImageOcr(File, OcrProcessContext) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Reads data from the provided input image file and returns retrieved data in the format described below.
doImageOcr(File, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Reads data from the provided input image file and returns retrieved data in the format described below.
doImageOcr(File, OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Reads data from the provided input image file and returns retrieved data as string.
doImageOcr(File, OutputFormat, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Reads data from the provided input image file and returns retrieved data as string.
doImageOcr(List) - Method in interface com.itextpdf.pdfocr.IOcrEngine
Reads data from the provided list of input image files and returns retrieved data in the format described below.
doImageOcr(List) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Reads data from the provided list of input image files and returns retrieved data in the format described below.
doImageOcr(List) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Reads data from the provided list of input image files and returns retrieved data in the format described below.
doImageOcr(List, OcrProcessContext) - Method in interface com.itextpdf.pdfocr.IOcrEngine
Reads data from the provided list of input image files and returns retrieved data in the format described below.
doImageOcr(List, OcrProcessContext) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Reads data from the provided list of input image files and returns retrieved data in the format described below.
doImageOcr(List, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Reads data from the provided list of input image files and returns retrieved data in the format described below.
doTesseractOcr(File, File, OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Performs tesseract OCR for the first (or for the only) image page.
doTesseractOcr(File, File, OutputFormat, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Performs tesseract OCR for the first (or for the only) image page.
duplicate() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Creates a new float buffer that shares this buffer's content.
DUTCH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

E

easyOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained EasyOCR CRAFT model, stored on disk.
easyOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for an existing pre-trained EasyOCR CRAFT model, stored on disk.
easyOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained EasyOCR CRAFT model, stored on disk.
easyOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for an existing pre-trained EasyOCR CRAFT model, stored on disk.
easyOcr(String, EasyOcrMapper) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained EasyOCR model, stored on disk.
easyOcr(String, EasyOcrMapper) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained EasyOCR models, stored on disk.
easyOcr(String, EasyOcrMapper, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained EasyOCR model, stored on disk.
easyOcr(String, EasyOcrMapper, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained EasyOCR models, stored on disk.
EasyOcrDetectionPostProcessor - Class in com.itextpdf.pdfocr.onnx.detection
Implementation of a text detection predictor post-processor, used for EasyOCR model outputs.
EasyOcrDetectionPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
Creates a new post-processor with the default parameters.
EasyOcrDetectionPostProcessor(float) - Constructor for class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
Creates a new post-processor.
EasyOcrMapper - Class in com.itextpdf.pdfocr.onnx.recognition
Label mapper for EasyOCR text recognition models.
EasyOcrTextBoxMerger - Class in com.itextpdf.pdfocr.onnx.merging
Text box merger, based on the algorithm used in EasyOCR.
EasyOcrTextBoxMerger() - Constructor for class com.itextpdf.pdfocr.onnx.merging.EasyOcrTextBoxMerger
Creates new EasyOcrTextBoxMerger instance.
ELEM_COUNT_DOES_NOT_MATCH_SHAPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
EndOfStringPostProcessor - Class in com.itextpdf.pdfocr.onnx.recognition
Implementation of a text recognition predictor post-processor, used for OnnxTR non-CRNN model outputs.
EndOfStringPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
Creates a new post-processor with the default vocabulary.
EndOfStringPostProcessor(Vocabulary) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
Creates a new post-processor without any additional tokens.
EndOfStringPostProcessor(Vocabulary, int) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
Creates a new post-processor.
ENGLISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ENGLISH_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.PreProcess
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
 
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
 
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
equals(Object) - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
 
ESPERANTO - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ESTONIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ETHIOPIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
euclideanModulo(float, float) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
Computes the Euclidean modulo (non-negative remainder) of x modulo y.
EXPECTED_SHAPE_SIZE - Static variable in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Expected shape size.
expit(float) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
Computes the sigmoid function, also known as the logistic function, for the given input.
extractBoxes(BufferedImage, Collection) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil
Extracts sub-images from an image, based on provided rotated 4-point boxes.

F

FAILED_TO_CLOSE_ONNX_RUNTIME_SESSION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
FAILED_TO_INIT_ONNX_RUNTIME_SESSION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
FAILED_TO_INIT_SESSION_OPTIONS - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
FAILED_TO_LOAD_ONNXRUNTIME - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
FAILED_TO_READ_IMAGE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
fast(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained FAST model, stored on disk.
fast(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained FAST models, stored on disk.
fast(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained FAST model, stored on disk.
fast(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained FAST models, stored on disk.
FAST - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ScoreMode
Value for fast config value.
fillPolyAtOffset(Mat, Mat, Scalar, int, int) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
Fill the polygon on the bitmap at a specific offset.
findTextContours(Mat) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Extracts text contours from the provided 0 - 255 mask.
findTextContours(Mat) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
Extracts text contours from the provided 0 - 255 mask.
FINNISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
FloatBufferMdArray - Class in com.itextpdf.pdfocr.onnx
Multidimensional array with a FloatBufferWrapper backing storage.
FloatBufferMdArray(FloatBufferWrapper, long[]) - Constructor for class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
Constructs a new FloatBufferMdArray with the specified data buffer and shape.
FloatBufferWrapper - Class in com.itextpdf.pdfocr.onnx
Wrapper class around FloatBuffer.
FloatBufferWrapper(FloatBuffer) - Constructor for class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Constructs FloatBufferWrapper on top of FloatBuffer.
FRENCH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
FRISIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
fromOutputBuffer(List, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Converts ONNX runtime model batched output MD-array buffer to a list of predictor outputs.
fromOutputBuffer(List, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
Converts ONNX runtime model batched output MD-array buffer to a list of predictor outputs.
fromOutputBuffer(List, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Converts ONNX runtime model batched output MD-array buffer to a list of predictor outputs.
fromOutputBuffer(List, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor
Converts ONNX runtime model batched output MD-array buffer to a list of predictor outputs.

G

GALICIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GENERIC_CYRILLIC_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
generifyWordBBoxesByLine(Map>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder
Sorts the provided IOcrEngine.doImageOcr(java.io.File) result by lines and updates line bboxes to match the largest words.
GEORGIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GERMAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
get() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Relative get method.
get(float[]) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Relative bulk get method.
get(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Absolute get method.
getAccessibilityProperties() - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
Retrieve structure tree element's properties.
getAllImages(File) - Static method in class com.itextpdf.pdfocr.util.TiffImageUtil
Retrieves all images from a TIFF file.
getArrayOffset() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
Gets internal offset of the provided float buffer array.
getArraySize() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
Gets number of available bytes for read from provided float buffer array.
getBatchSize() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns input batch size.
getBBoxRect() - Method in class com.itextpdf.pdfocr.TextInfo
Converts a text polygon to a bounding box.
getBlueChannelIndex() - Method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
Returns the index of the blue channel in the resulting ML input buffer.
getBlueMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns blue channel mean, used for normalization.
getBlueStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns blue channel standard deviation, used for normalization.
getBoxThresh() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Returns the value under the box_thresh key.
getBoxType() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Returns the value under the box_type key.
getChannelConfiguration() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Returns the image channel configuration, that will be used when passing the image to the ML model.
getChannelCount() - Method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
Returns the amount of channels used to store the image.
getChannelCount() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns input channel count.
getChannelFirst() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
Returns the value under the channel_first key.
getCharacterDict() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
Returns the value under the character_dict key.
getChildren() - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
Retrieve all child structure tree items.
getConfirmationType() - Method in class com.itextpdf.pdfocr.AbstractPdfOcrEventHelper
Returns the confirmation type of event.
getData() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
Returns a duplicate of the backing FloatBufferWrapper.
getDefaultFontFamily() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets preferred font family to be used when selecting font from FontProvider.
getDefaultFontFamily() - Method in class com.itextpdf.pdfocr.PdfOcrFontProvider
Gets default font family.
getDefaultLanguage() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets default language for ocr.
getDefaultUserWordsSuffix() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets default user words suffix.
getDimension(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
Returns the size of the specified dimension.
getDimensionCount() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
Returns the number of dimensions of this multidimensional array.
getEventType() - Method in class com.itextpdf.pdfocr.onnx.actions.events.PdfOcrOnnxProductEvent
getEventType() - Method in class com.itextpdf.pdfocr.tesseract4.actions.events.PdfOcrTesseract4ProductEvent
 
getFloatBuffer() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Returns FloatBuffer that backs this buffer.
getFontProvider() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Returns FontProvider that was set previously or if it is null a new instance of PdfOcrFontProvider is returned.
getGrayMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns gray channel mean, used for normalization.
getGrayStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns gray channel standard deviation, used for normalization.
getGreenChannelIndex() - Method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
Returns the index of the green channel in the resulting ML input buffer.
getGreenMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns green channel mean, used for normalization.
getGreenStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns green channel standard deviation, used for normalization.
getHeight() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns input minimum height.
getHeight() - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
Gets height of the Dimensions2D instance.
getHeightMultiple() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Returns the height multiple.
getImageLayerName() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets name of image layer.
getImagePreprocessingOptions() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets Tesseract4OcrEngineProperties.imagePreprocessingOptions.
getImageResizeOptions() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns image resize options for the input.
getImageRotationHandler() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets image rotation handler instance.
getImageShape() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
Returns the values under the image_shape key.
getImageShape() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
Returns the values under the image_shape key.
getImageType(File) - Static method in class com.itextpdf.pdfocr.util.TiffImageUtil
Gets the image type.
getImgMode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
Returns the value under the img_mode key.
getInputProperties() - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
Returns the ONNX model input properties.
getInstance() - Static method in class com.itextpdf.pdfocr.onnx.actions.data.PdfOcrOnnxProductData
Getter for an instance of ProductData related to iText pdfOcr Onnx module.
getInstance() - Static method in class com.itextpdf.pdfocr.structuretree.ArtifactItem
Retrieve an instance of ArtifactItem.
getInstance() - Static method in class com.itextpdf.pdfocr.tesseract4.actions.data.PdfOcrTesseract4ProductData
Getter for an instance of ProductData related to iText pdfOcr Tesseract4 module.
getKeepRatio() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
Returns the value under the keep_ratio key.
getLanguages() - Method in class com.itextpdf.pdfocr.OcrEngineProperties
Gets list of languages required for provided images.
getLanguagesAsString() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Gets list of languages concatenated with "+" symbol to a string in format required by tesseract.
getLogicalStructureTreeItem() - Method in class com.itextpdf.pdfocr.TextInfo
Retrieves structure tree item for the text item.
getLookUpString() - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
Returns the look-up string.
getMaskSourceArray(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Returns the array to be used, when building a mask for contour detection.
getMaskSourceArray(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
Returns the array to be used, when building a mask for contour detection.
getMaxCandidates() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Returns the value under the max_candidates key.
getMaxHeight() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Returns the maximum height the image should be after resizing.
getMaxWidth() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Returns the maximum width the image should be after resizing.
getMean() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
Returns the values under the mean key.
getMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns per-channel mean, used for normalization.
getMean(int) - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns channel-specific mean, used for normalization.
getMessage() - Method in exception com.itextpdf.pdfocr.exceptions.PdfOcrException
getMessageParams() - Method in exception com.itextpdf.pdfocr.exceptions.PdfOcrException
Gets additional params for Exception message.
getMetaInfoContainer() - Method in interface com.itextpdf.pdfocr.IProductAware
Gets the container with meta info.
getMetaInfoContainer() - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Gets the container with meta info.
getMetaInfoContainer() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Gets the container with meta info.
getMinHeight() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Returns the minimum height the image should be after resizing.
getMinimalConfidenceLevel() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets minimal confidence level for HOCR line to be considered as properly recognized.
getMinWidth() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Returns the minimum width the image should be after resizing.
getModelPath() - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
Returns the path to the ONNX model.
getName() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
Returns the value under the name key.
getName() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Returns the value under the name key.
getName() - Method in interface com.itextpdf.pdfocr.onnx.conf.paddle.model.PostProcess
Returns the value under the name key.
getOcrEngine() - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Gets used IOcrEngine reader object to perform OCR.
getOcrEventHelper() - Method in class com.itextpdf.pdfocr.OcrProcessContext
Returns helper for working with events.
getOcrPdfCreatorProperties() - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Gets properties for OcrPdfCreator.
getOcrProcessProperties() - Method in class com.itextpdf.pdfocr.OcrProcessContext
Get extra OCR process properties.
getOrtSessionOptionsCreator() - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
Returns the ONNX runtime session options creator.
getOutputMapper() - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
Returns the ONNX model output mapper.
getPaddingStrategy() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Returns the padding strategy.
getPaddingStrategy() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns the padding strategy for image inputs.
getPageSegMode() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets Page Segmentation Mode.
getPageSize() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets required size for output PDF document.
getParent() - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
Retrieve parent structure tree item.
getPathToExecutable() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
Gets path to tesseract executable.
getPathToTessData() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets path to directory with tess data.
getPdfLang() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets PDF language.
getPdfOcrStatisticsEventType() - Method in class com.itextpdf.pdfocr.statistics.PdfOcrOutputTypeStatisticsEvent
Gets the type of statistic event.
getPixelTextPoints(int) - Method in class com.itextpdf.pdfocr.TextInfo
Gets array of 4 Points describing text bbox (lower-left based relative to text) expressed in pixels.
getPostProcess() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
Returns the value under the PostProcess key.
getPostProcessor() - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Returns the ONNX model output post-processor.
getPostProcessor() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Returns the ONNX model output post-processor.
getPredsArray(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Returns the preds array from the output buffer.
getPreProcess() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
Returns the value under the PreProcess key.
getProductData() - Method in interface com.itextpdf.pdfocr.IProductAware
Gets object containing information about the product.
getProductData() - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Gets object containing information about the product.
getProductData() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
 
getProperties() - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Returns the text detection predictor properties.
getProperties() - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
Returns the crop orientation predictor properties.
getProperties() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Returns the text recognition predictor properties.
getRedChannelIndex() - Method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
Returns the index of the red channel in the resulting ML input buffer.
getRedMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns red channel mean, used for normalization.
getRedStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns red channel standard deviation, used for normalization.
getRotationAngle() - Method in class com.itextpdf.pdfocr.TextInfo
Returns the text rotation angle in radian for this TextInfo in the range of -pi to pi.
getScalar(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
Returns the scalar value at the specified index.
getScaleMode() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets scale mode for input images using available options from ScaleMode enumeration.
getScoreMode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Returns the value under the score_mode key.
getSequenceId() - Method in class com.itextpdf.pdfocr.AbstractPdfOcrEventHelper
Returns the sequence id
getShape() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
Returns a copy of the shape array that defines the dimensions of this multidimensional array.
getShape() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns target input shape.
getShape(int) - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns target input dimension value.
getSolidColor() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Returns the solid color used for padding.
getStatisticsNames() - Method in class com.itextpdf.pdfocr.statistics.PdfOcrOutputTypeStatisticsEvent
getStd() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
Returns the values under the std key.
getStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns per-channel standard deviation, used for normalization.
getStd(int) - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns channel-specific standard deviation, used for normalization.
getSubArray(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
Returns a sub-array representing the slice at the specified index of the first dimension.
getTempFilePath(String, String) - Static method in class com.itextpdf.pdfocr.util.PdfOcrFileUtil
Gets path to temp file in current system temporary directory.
getTesseract4OcrEngineProperties() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Gets properties for AbstractTesseract4OcrEngine.
getTesseractInstance() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
Gets tesseract instance.
getText() - Method in class com.itextpdf.pdfocr.TextInfo
Gets text element.
getTextBBoxColor() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets text bounding box color in the output PDF document.
getTextColor() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets text color in output PDF document.
getTextLayerName() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets name of text layer.
getTextPoints() - Method in class com.itextpdf.pdfocr.TextInfo
Gets array of 4 Points describing text bbox (lower-left based relative to text) expressed in points.
getTextPositioning() - Method in class com.itextpdf.pdfocr.onnx.OnnxEngineProperties
Gets the way text is retrieved from ocr engine output using TextPositioning.
getTextPositioning() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Defines the way text is retrieved from tesseract output using TextPositioning.
getThresh() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Returns the value under the thresh key.
getTileHeight() - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
Gets ImagePreprocessingOptions.tileHeight.
getTileWidth() - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
Gets ImagePreprocessingOptions.tileWidth.
getTitle() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Gets PDF document title.
getTransformOps() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.PreProcess
Returns the values under the transform_ops key.
getUnclipRatio() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Returns the value under the unclip_ratio key.
getUseDilation() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Returns the value under the use_dilation key.
getWidth() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns input minimum width.
getWidth() - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
Gets width of the Dimensions2D instance.
getWidthMultiple() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Returns the width multiple.
getWrappingKey() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
Returns the wrapping key for the transform operation object.
getWrappingKey() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
Returns the wrapping key for the transform operation object.
getWrappingKey() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
Returns the wrapping key for the transform operation object.
getWrappingKey() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
Returns the wrapping key for the transform operation object.
getWrappingKey() - Method in interface com.itextpdf.pdfocr.onnx.conf.paddle.model.TransformOp
Returns the wrapping key for the transform operation object.
GRAY - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
Value for GRAY config value.
GRAYSCALE - Enum constant in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
Image is represented with a single channel, which contains the grayscale version of the image.
GREEK - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GREEK_EXTENDED - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GUJARATI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GUJARATI_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GUJARATI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GUJARATI_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GUJARATI_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GUJARATI_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GUJARATI_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
GUJARATI_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

H

hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.PreProcess
hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
hashCode() - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
hashCode() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
 
hashCode() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
hashCode() - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
hashCode() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
hashCode() - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
 
hashCode() - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
hashCode() - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
 
hasNext() - Method in class com.itextpdf.pdfocr.onnx.util.BatchProcessingGenerator
HAUSA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HEBREW - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HEBREW_CANTILLATIONS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HEBREW_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HEBREW_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HEBREW_SPECIALS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HEBREW_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HEIGHT_MULTIPLE_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
HINDI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HINDI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HOCR - Enum constant in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
Reader will produce XHTML output compliant with the hOCR specification.
HORIZONTAL - Enum constant in enum com.itextpdf.pdfocr.TextOrientation
Horizontal text, non-rotated.
HORIZONTAL_ROTATED_180 - Enum constant in enum com.itextpdf.pdfocr.TextOrientation
Horizontal text, rotated 180 degrees counter-clockwise.
HORIZONTAL_ROTATED_270 - Enum constant in enum com.itextpdf.pdfocr.TextOrientation
Horizontal text, rotated 270 degrees counter-clockwise.
HORIZONTAL_ROTATED_90 - Enum constant in enum com.itextpdf.pdfocr.TextOrientation
Horizontal text, rotated 90 degrees counter-clockwise.
HUNGARIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
HW_ARRAY_SHOULD_BE_TWO_DIMENSIONAL - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 

I

IBatchProcessor<T, R> - Interface in com.itextpdf.pdfocr.onnx.util
Batch processor mapper interface.
ICELANDIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
identifyOsType() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Identifies type of current OS and return it (win, linux).
IDetectionPostProcessor - Interface in com.itextpdf.pdfocr.onnx.detection
Interface for post-processors, which convert raw output of an ML model and returns rotated boxes with the detected objects.
IDetectionPredictor - Interface in com.itextpdf.pdfocr.onnx.detection
Interface for predictors, which take a full image and find text boxes on them.
IImageRotationHandler - Interface in com.itextpdf.pdfocr
Rotation information may be stored in image metadata.
IMAGE_LAYER_NAME_IS_NOT_APPLIED - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
 
IMAGE_SHAPE_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
ImageChannelConfiguration - Enum in com.itextpdf.pdfocr.onnx
Enumeration of supported image channel configuration for buffers.
ImagePreprocessingOptions - Class in com.itextpdf.pdfocr.tesseract4
Additional options applied on image preprocessing step.
ImagePreprocessingOptions() - Constructor for class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
Creates ImagePreprocessingOptions instance.
ImagePreprocessingOptions(ImagePreprocessingOptions) - Constructor for class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
Creates a new ImagePreprocessingOptions instance based on another ImagePreprocessingOptions instance (copy constructor).
ImageResizeOptions - Class in com.itextpdf.pdfocr.onnx
Options, that describe the way an image will be resized before being converted to a tensor for an ML model input.
ImageResizeOptions(ImageChannelConfiguration, int, int) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Creates image resize options.
ImageResizeOptions(ImageChannelConfiguration, int, int, int, int) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Creates image resize options.
ImageResizeOptions(ImageChannelConfiguration, int, int, int, int, int, int, PaddingStrategy) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Creates image resize options.
ImageResizeOptions(ImageChannelConfiguration, int, int, int, int, PaddingStrategy) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Creates image resize options.
ImageResizeOptions(ImageChannelConfiguration, int, int, PaddingStrategy) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Creates image resize options.
imageShapeIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates an exception for cases, when the detection model uses an unsupported method for resizing input images.
ImgMode - Enum in com.itextpdf.pdfocr.onnx.conf.paddle.model
Enum for values under the img_mode key within a DecodeImage transform operation object in a config file.
INCORRECT_INPUT_IMAGE_FORMAT - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
INCORRECT_LANGUAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
INDEX_OUT_OF_BOUNDS - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
INDONESIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
InferenceConfig - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
POJO for the root object in a config file.
InferenceConfig(PreProcess, PostProcess) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
Creates a new POJO for the config file object.
InferenceConfigParser - Class in com.itextpdf.pdfocr.onnx.conf.paddle.parser
Static class with functions for parsing PaddleOCR YAML config files into a InferenceConfig POJO.
initializeTesseract(OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
Initializes instance of tesseract if it haven't been already initialized or it have been disposed and sets all the required properties.
inputProperties - Variable in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
Properties of the inputs of the ONNX model.
INVALID_NUMBER_OF_OUTPUTS - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
IO_EXCEPTION_OCCURRED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
IOcrEngine - Interface in com.itextpdf.pdfocr
IOcrEngine interface is used for instantiating new OcrReader objects.
IOcrProcessProperties - Interface in com.itextpdf.pdfocr
OCR properties passed to the OCR engine as part of OcrProcessContext.
IOrientationPredictor - Interface in com.itextpdf.pdfocr.onnx.orientation
Interface for predictors, which take a cropped image of text and determine its orientation.
IOrtSessionOptionsCreator - Interface in com.itextpdf.pdfocr.onnx
Interface for ONNX runtime session options creators.
IOutputLabelMapper<T> - Interface in com.itextpdf.pdfocr.onnx
Interface for mapping an integer index (continuous from 0) to output values.
IPredictor<T, R> - Interface in com.itextpdf.pdfocr.onnx
Interface of a generic predictor.
IProductAware - Interface in com.itextpdf.pdfocr
The interface that holds information about product data and meta info.
IRecognitionPostProcessor - Interface in com.itextpdf.pdfocr.onnx.recognition
Interface for post-processors, which convert raw output of an ML model and returns recognized characters as a string.
IRecognitionPredictor - Interface in com.itextpdf.pdfocr.onnx.recognition
Interface for predictors, which take a cropped image of text and recognize text characters on it.
IRISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
IScoreCalculator - Interface in com.itextpdf.pdfocr.onnx.detection.score
Interface for abstracting away score calculation over a text contour in the text detection post-processor.
isFixedSize() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
Returns whether the target size is fixed.
isPreprocessingImages() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Checks whether image preprocessing is needed.
isSmoothTiling() - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
Gets ImagePreprocessingOptions.smoothTiling.
isTagged() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Retrieve information on whether pdf document should be tagged or not.
isTaggingSupported() - Method in interface com.itextpdf.pdfocr.IOcrEngine
Checks whether tagging is supported by the OCR engine.
isTaggingSupported() - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Checks whether tagging is supported by the OCR engine.
isTaggingSupported() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
 
isTiffImage(File) - Static method in class com.itextpdf.pdfocr.util.TiffImageUtil
Checks whether image type is TIFF.
isUseTxtToImproveHocrParsing() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets Tesseract4OcrEngineProperties.useTxtToImproveHocrParsing.
isValidContour(Mat, Rect) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Returns whether the contour is good enough to be a text box.
isWindows() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Checks current os type.
ITALIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ITextBoxMerger - Interface in com.itextpdf.pdfocr.onnx.merging
Interface for a processing class, which handles merging text boxes, received from a text detection routine.

J

JAPANESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
JAPANESE_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
JAPANESE_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
JAVANESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
JAVANESE_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
JAVANESE_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
JAVANESE_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
JAVANESE_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
JAVANESE_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
JAVANESE_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

K

KANNADA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KANNADA_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KANNADA_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KANNADA_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
KANNADA_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KANNADA_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KANNADA_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KANNADA_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KANNADA_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KAZAKH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KHMER - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KHMER_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KHMER_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KHMER_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KHMER_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KHMER_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KHMER_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KHMER_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KOREAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KOREAN_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
KOREAN_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
KURDISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
KYRGYZ - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

L

labelDimension() - Method in class com.itextpdf.pdfocr.onnx.recognition.CrnnPostProcessor
Returns the size of the output character label vector.
labelDimension() - Method in class com.itextpdf.pdfocr.onnx.recognition.CtcLabelPostProcessor
Returns the size of the output character label vector.
labelDimension() - Method in class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
Returns the size of the output character label vector.
labelDimension() - Method in interface com.itextpdf.pdfocr.onnx.recognition.IRecognitionPostProcessor
Returns the size of the output character label vector.
LANGUAGE_IS_NOT_IN_THE_LIST - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
LAO - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
LATIN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
LATIN_EXTENDED - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
LATIN_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
LATIN_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
LATVIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
LEGACY_FRENCH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
LeptonicaImageRotationHandler - Class in com.itextpdf.pdfocr.tesseract4
Leptonica based implementation of IImageRotationHandler.
LeptonicaImageRotationHandler() - Constructor for class com.itextpdf.pdfocr.tesseract4.LeptonicaImageRotationHandler
 
limit() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Returns this buffer's limit.
limit(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Sets this buffer's limit.
linkNet(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained LinkNet model, stored on disk.
linkNet(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained LinkNet models, stored on disk.
linkNet(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained LinkNet model, stored on disk.
linkNet(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained LinkNet models, stored on disk.
LITHUANIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
LogicalStructureTreeItem - Class in com.itextpdf.pdfocr.structuretree
This class represents structure tree item of the text item put into the pdf document.
LogicalStructureTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
Instantiate a new LogicalStructureTreeItem instance.
LogicalStructureTreeItem(AccessibilityProperties) - Constructor for class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
Instantiate a new LogicalStructureTreeItem instance.
logit(double) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
Computes the logit function, which is the inverse of expit, for the given input.
logit(float) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
Computes the logit function, which is the inverse of expit, for the given input.
LOOK_UP_STRING_CONTAINS_2_CODE_UNITS_POINTS - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
LUXEMBOURGISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

M

MACEDONIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
makePdfSearchable(PdfDocument) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR of all images in an input PDF document and adds recognized text on top of the images.
makePdfSearchable(PdfDocument, IOcrProcessProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR of all images in an input PDF document and adds recognized text on top of the images.
makePdfSearchable(File, File) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR of all images in an input PDF file and generates searchable PDF.
makePdfSearchable(File, File, IOcrProcessProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Performs OCR of all images in an input PDF file and generates searchable PDF.
MALAGASY - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MALAY - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MALAYALAM - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MALAYALAM_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MALAYALAM_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MALAYALAM_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MALAYALAM_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MALAYALAM_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MALAYALAM_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MALTESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MAORI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
map(int) - Method in interface com.itextpdf.pdfocr.onnx.IOutputLabelMapper
Returns value, which is mapped to the specified index.
map(int) - Method in class com.itextpdf.pdfocr.onnx.orientation.DefaultOrientationMapper
Returns value, which is mapped to the specified index.
map(int) - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
Returns character, which is mapped to the specified index in the lookup string.
map(int) - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
Returns character, which is mapped to the specified index in the lookup string.
mapPredToSample(float) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Calculates the score sample value, based on a prediction value from the buffer.
mapPredToSample(float) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
Calculates the score sample value, based on a prediction value from the buffer.
MARATHI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
master(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained MASTER model, stored on disk.
master(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained MASTER models, stored on disk.
master(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained MASTER model, stored on disk.
master(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained MASTER models, stored on disk.
MathUtil - Class in com.itextpdf.pdfocr.onnx.util
Additional math functions.
MAX_HEIGHT_SHOULD_BE_A_MULTIPLE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
MAX_HEIGHT_SHOULD_NOT_BE_LESS_THAN_MIN - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
MAX_SHOULD_NOT_BE_LESS_THAN_MIN - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
MAX_WIDTH_SHOULD_BE_A_MULTIPLE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
MAX_WIDTH_SHOULD_NOT_BE_LESS_THAN_MIN - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
MaxScoreCalculator - Class in com.itextpdf.pdfocr.onnx.detection.score
Score calculator, which returns the biggest observed sample.
MaxScoreCalculator() - Constructor for class com.itextpdf.pdfocr.onnx.detection.score.MaxScoreCalculator
Creates a new score calculator.
MeanScoreCalculator - Class in com.itextpdf.pdfocr.onnx.detection.score
Score calculator, which calculates the mean values over the observed samples.
MeanScoreCalculator() - Constructor for class com.itextpdf.pdfocr.onnx.detection.score.MeanScoreCalculator
Creates a new score calculator.
MIN_HEIGHT_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
MIN_WIDTH_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
minAreaRectBoxPoints(Mat) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
Equivalent to calling OpenCV minAreaRect, followed by boxPoints, but with resource handling taken care of.
minAreaRectBoxPoly(Mat) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
Creates an OpenCV polygon, based on the results of a minAreaRectBoxPoints(points) call, but with resource handling taken care of.
mobileNetV3(String) - Static method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
Creates a new crop orientation predictor using an existing pre-trained MobileNetV3 model, stored on disk.
mobileNetV3(String) - Static method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
Creates a new crop orientation properties object for existing pre-trained MobileNetV3 models, stored on disk.
mobileNetV3(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
Creates a new crop orientation predictor using an existing pre-trained MobileNetV3 model, stored on disk.
mobileNetV3(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
Creates a new crop orientation properties object for existing pre-trained MobileNetV3 models, stored on disk.
MODEL_DID_NOT_PASS_VALIDATION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
MODEL_ONLY_SUPPORTS_RGB - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
modelPath - Variable in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
Path to the ONNX model to load.
MONGOLIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MONTENEGRIN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MULTI_LANG - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
MULTI_LANG_FULL - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

N

NAME - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
Expected name for the CTCLabelDecode post-processor.
NAME - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
Expected name for the CTCLabelDecode post-processor.
NEGATIVE_VALUE_IN_SHAPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
NEPALI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
next() - Method in class com.itextpdf.pdfocr.onnx.util.BatchProcessingGenerator
normalizedMinAreaRect(Mat) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
OpenCV minAreaRect, but returns the normalized rectangle immediately.
NormalizeImage - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
POJO for the NormalizeImage transform operation within a PreProcess object in a config file.
NormalizeImage(float[], float[]) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
Creates a new POJO for the config file object.
normalizeRotatedRect(RotatedRect) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
Normalizes RotatedRect, so that its angle is in the [-45; 45) range.
NORWEGIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
NUMBER_OF_PAGES_IN_IMAGE - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
The constant NUMBER_OF_PAGES_IN_IMAGE.

O

objToBool(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
Casts a parsed YAML object to a boolean, if it is a bool.
objToFloat(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
Casts a parsed YAML object to a double, if it is a float.
objToInt(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
Casts a parsed YAML object to an int32, if it is an int.
objToMapping(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
Casts a parsed YAML object to a map/dictionary, if it is a mapping.
objToSequence(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
Casts a parsed YAML object to a collection, if it is a sequence.
objToString(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
Casts a parsed YAML object to a string, if it is a string.
observe(float) - Method in interface com.itextpdf.pdfocr.onnx.detection.score.IScoreCalculator
Observe a sample value from the text contour.
observe(float) - Method in class com.itextpdf.pdfocr.onnx.detection.score.MaxScoreCalculator
Observe a sample value from the text contour.
observe(float) - Method in class com.itextpdf.pdfocr.onnx.detection.score.MeanScoreCalculator
Observe a sample value from the text contour.
OcrEngineProperties - Class in com.itextpdf.pdfocr
This class contains additional properties for ocr engine.
OcrEngineProperties() - Constructor for class com.itextpdf.pdfocr.OcrEngineProperties
Creates a new OcrEngineProperties instance.
OcrEngineProperties(OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.OcrEngineProperties
Creates a new OcrEngineProperties instance based on another OcrEngineProperties instance (copy constructor).
OcrPdfCreator - Class in com.itextpdf.pdfocr
OcrPdfCreator is the class that creates PDF documents containing input images and text that was recognized using provided IOcrEngine.
OcrPdfCreator(IOcrEngine) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreator
Creates a new OcrPdfCreator instance.
OcrPdfCreator(IOcrEngine, OcrPdfCreatorProperties) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreator
Creates a new OcrPdfCreator instance.
OcrPdfCreatorProperties - Class in com.itextpdf.pdfocr
Properties that will be used by the OcrPdfCreator.
OcrPdfCreatorProperties() - Constructor for class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Creates a new OcrPdfCreatorProperties instance.
OcrPdfCreatorProperties(OcrPdfCreatorProperties) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Creates a new OcrPdfCreatorProperties instance based on another OcrPdfCreatorProperties instance (copy constructor).
OcrProcessContext - Class in com.itextpdf.pdfocr
Class for storing ocr processing context.
OcrProcessContext(AbstractPdfOcrEventHelper) - Constructor for class com.itextpdf.pdfocr.OcrProcessContext
Creates an instance of ocr process context
ODIA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ODIA_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ODIA_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ODIA_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ODIA_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ODIA_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ODIA_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
ODIA_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
onEvent(AbstractProductITextEvent) - Method in class com.itextpdf.pdfocr.AbstractPdfOcrEventHelper
Handles the event.
ONNX_RUNTIME_OPERATION_FAILED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
OnnxDetectionPostProcessor - Class in com.itextpdf.pdfocr.onnx.detection
Implementation of a text detection predictor post-processor, used for OnnxTR model outputs.
OnnxDetectionPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
Creates a new post-processor with the default threshold values.
OnnxDetectionPostProcessor(float, float) - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
Creates a new post-processor.
OnnxDetectionPredictor - Class in com.itextpdf.pdfocr.onnx.detection
A text detection predictor implementation, which is using ONNX Runtime and its ML models to find, where text is located on an image.
OnnxDetectionPredictor(OnnxDetectionPredictorProperties) - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a text detection predictor with the specified properties.
OnnxDetectionPredictorProperties - Class in com.itextpdf.pdfocr.onnx.detection
Properties for configuring text detection ONNX models.
OnnxDetectionPredictorProperties(String, OnnxInputProperties, IDetectionPostProcessor) - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates new text detection predictor properties.
OnnxDetectionPredictorProperties(String, OnnxInputProperties, IDetectionPostProcessor, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates new text detection predictor properties.
OnnxEngineProperties - Class in com.itextpdf.pdfocr.onnx
Properties that are used by the OnnxOcrEngine.
OnnxEngineProperties() - Constructor for class com.itextpdf.pdfocr.onnx.OnnxEngineProperties
Creates a new OnnxEngineProperties instance.
OnnxInputProperties - Class in com.itextpdf.pdfocr.onnx
Properties of the input of an ONNX model, which expects an image.
OnnxInputProperties(ImageResizeOptions) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Creates model input properties.
OnnxInputProperties(ImageResizeOptions, float[], float[]) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Creates model input properties.
OnnxInputProperties(ImageResizeOptions, float[], float[], int) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Creates model input properties.
OnnxInputProperties(ImageResizeOptions, int) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Creates model input properties.
OnnxOcrEngine - Class in com.itextpdf.pdfocr.onnx
IOcrEngine implementation, based on OnnxTR/DocTR machine learning OCR projects.
OnnxOcrEngine(IDetectionPredictor, IOrientationPredictor, IRecognitionPredictor) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Create a new OCR engine with the provided predictors.
OnnxOcrEngine(IDetectionPredictor, IOrientationPredictor, IRecognitionPredictor, OnnxEngineProperties) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Create a new OCR engine with the provided predictors.
OnnxOcrEngine(IDetectionPredictor, IRecognitionPredictor) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
Create a new OCR engine with the provided predictors, without text orientation prediction.
OnnxOrientationPredictor - Class in com.itextpdf.pdfocr.onnx.orientation
A crop orientation predictor implementation, which is using ONNX Runtime and its ML models to figure out, how text is oriented in a cropped image of text.
OnnxOrientationPredictor(OnnxOrientationPredictorProperties) - Constructor for class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
Creates a crop orientation predictor with the specified properties.
OnnxOrientationPredictorProperties - Class in com.itextpdf.pdfocr.onnx.orientation
Properties for configuring crop orientation ONNX models.
OnnxOrientationPredictorProperties(String, OnnxInputProperties, IOutputLabelMapper) - Constructor for class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
Creates new crop orientation predictor properties.
OnnxOrientationPredictorProperties(String, OnnxInputProperties, IOutputLabelMapper, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
Creates new crop orientation predictor properties.
OnnxRecognitionPredictor - Class in com.itextpdf.pdfocr.onnx.recognition
A text recognition predictor implementation, which is using ONNX Runtime and its ML models to recognize text characters on an image.
OnnxRecognitionPredictor(OnnxRecognitionPredictorProperties) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a text recognition predictor with the specified properties.
OnnxRecognitionPredictorProperties - Class in com.itextpdf.pdfocr.onnx.recognition
Properties for configuring text recognition ONNX models.
OnnxRecognitionPredictorProperties(String, OnnxInputProperties, IRecognitionPostProcessor) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates new text recognition predictor properties.
OnnxRecognitionPredictorProperties(String, OnnxInputProperties, IRecognitionPostProcessor, boolean) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates new text recognition predictor properties.
OnnxRecognitionPredictorProperties(String, OnnxInputProperties, IRecognitionPostProcessor, boolean, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates new text recognition predictor properties.
OnnxRecognitionPredictorProperties(String, OnnxInputProperties, IRecognitionPostProcessor, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates new text recognition predictor properties.
OpenCvUtil - Class in com.itextpdf.pdfocr.onnx.util
Static class with OpenCV utility functions.
ortSessionOptionsCreator - Variable in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
ONNX runtime session options creator.
OutputFormat - Enum in com.itextpdf.pdfocr.tesseract4
Enumeration of the available output formats.

P

PaddingStrategy - Enum in com.itextpdf.pdfocr.onnx
Enumeration of implemented padding strategies for padding images.
paddleOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained PaddleOCR model, stored on disk.
paddleOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained PaddleOCR models, stored on disk.
paddleOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained PaddleOCR model, stored on disk.
paddleOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained PaddleOCR models, stored on disk.
paddleOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained PaddleOCR model, stored on disk.
paddleOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained PaddleOCR models, stored on disk.
paddleOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained PaddleOCR model, stored on disk.
paddleOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained PaddleOCR models, stored on disk.
paddleOcr(String, String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained PaddleOCR model, stored on disk.
paddleOcr(String, String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained PaddleOCR models, stored on disk.
paddleOcr(String, String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained PaddleOCR model, stored on disk.
paddleOcr(String, String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained PaddleOCR models, stored on disk.
paddleOcr(String, String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Creates a new text detection predictor using an existing pre-trained PaddleOCR model, stored on disk.
paddleOcr(String, String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
Creates a new text detection properties object for existing pre-trained PaddleOCR models, stored on disk.
paddleOcr(String, String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained PaddleOCR model, stored on disk.
paddleOcr(String, String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained PaddleOCR models, stored on disk.
PaddleOcrDetectionPostProcessor - Class in com.itextpdf.pdfocr.onnx.detection
Implementation of a text detection predictor post-processor, used for PaddleOCR model outputs.
PaddleOcrDetectionPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.detection.PaddleOcrDetectionPostProcessor
Creates a new post-processor with the default parameters.
PaddleOcrDetectionPostProcessor(float, float, float, int) - Constructor for class com.itextpdf.pdfocr.onnx.detection.PaddleOcrDetectionPostProcessor
Creates a new post-processor.
PaddleOcrInitException - Exception in com.itextpdf.pdfocr.onnx.exceptions
Exception class for exceptions during PaddleOCR initialization.
PaddleOcrInitException(String) - Constructor for exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates new PaddleOcrInitException instance.
PAGE_NUMBER_IS_INCORRECT - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
PAGE_SIZE_IS_NOT_APPLIED - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
 
ParagraphTreeItem - Class in com.itextpdf.pdfocr.structuretree
A convenience class to associate certain text items with the paragraph structure item.
ParagraphTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.ParagraphTreeItem
Instantiate a new ParagraphTreeItem instance.
parse(InputStream) - Static method in class com.itextpdf.pdfocr.onnx.conf.paddle.parser.InferenceConfigParser
Parses a PaddleOCR config file into a InferenceConfig POJO.
parSeq(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained PARSeq model, stored on disk.
parSeq(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained PARSeq models, stored on disk.
parSeq(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained PARSeq model, stored on disk.
parSeq(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained PARSeq models, stored on disk.
parSeq(String, Vocabulary, int) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained PARSeq model, stored on disk.
parSeq(String, Vocabulary, int) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained PARSeq models, stored on disk.
parSeq(String, Vocabulary, int, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained PARSeq model, stored on disk.
parSeq(String, Vocabulary, int, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained PARSeq models, stored on disk.
PASHTO - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PATH_TO_TESS_DATA_DIRECTORY_CONTAINS_NON_ASCII_CHARACTERS - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
PATH_TO_TESS_DATA_DIRECTORY_IS_INVALID - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
PATH_TO_TESS_DATA_IS_NOT_SET - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
PDF - Enum constant in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
Creating a PDF file
PDF_DOCUMENT_MUST_BE_OPENED_IN_STAMPING_MODE - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
PDF_LANGUAGE_PROPERTY_IS_NOT_SET - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
The constant PDF_LANGUAGE_PROPERTY_IS_NOT_SET.
PDFA - Enum constant in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
Creating a PDF-A file
PDFA_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
PdfOcrException - Exception in com.itextpdf.pdfocr.exceptions
Exception class for custom exceptions.
PdfOcrException(String) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrException
Creates a new PdfOcrException.
PdfOcrException(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrException
Creates a new PdfOcrException.
PdfOcrException(Throwable) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrException
Creates a new PdfOcrException.
PdfOcrExceptionMessageConstant - Class in com.itextpdf.pdfocr.exceptions
Class that bundles all the exception message templates as constants.
PdfOcrFileUtil - Class in com.itextpdf.pdfocr.util
Utility class for working with files.
PdfOcrFontProvider - Class in com.itextpdf.pdfocr
FontProvider extension for ocr engine.
PdfOcrFontProvider() - Constructor for class com.itextpdf.pdfocr.PdfOcrFontProvider
Creates a new PdfOcrFontProvider instance with the default font and the default font family.
PdfOcrFontProvider(FontSet, String) - Constructor for class com.itextpdf.pdfocr.PdfOcrFontProvider
Creates a new PdfOcrFontProvider instance based on provided FontSet instance and font family.
PdfOcrInputException - Exception in com.itextpdf.pdfocr.exceptions
Exception class for input related exceptions.
PdfOcrInputException(String) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrInputException
Creates a new PdfOcrInputException.
PdfOcrInputException(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrInputException
Creates a new PdfOcrInputException.
PdfOcrInputException(Throwable) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrInputException
Creates a new PdfOcrInputException.
PdfOcrInputTesseract4Exception - Exception in com.itextpdf.pdfocr.tesseract4.exceptions
Exception class for Tesseract4 input related exceptions.
PdfOcrInputTesseract4Exception(String) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrInputTesseract4Exception
PdfOcrInputTesseract4Exception(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrInputTesseract4Exception
PdfOcrInputTesseract4Exception(Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrInputTesseract4Exception
PdfOcrLogMessageConstant - Class in com.itextpdf.pdfocr.logs
Class that bundles all the log message templates as constants.
PdfOcrMetaInfoContainer - Class in com.itextpdf.pdfocr
Container to keep meta info.
PdfOcrMetaInfoContainer(IMetaInfo) - Constructor for class com.itextpdf.pdfocr.PdfOcrMetaInfoContainer
Creates instance of container to keep passed meta info.
PdfOcrOnnxExceptionMessageConstant - Class in com.itextpdf.pdfocr.onnx.exceptions
Class that bundles all the error message templates as constants.
PdfOcrOnnxProductData - Class in com.itextpdf.pdfocr.onnx.actions.data
Stores an instance of ProductData related to iText pdfOcr Onnx module.
PdfOcrOnnxProductEvent - Class in com.itextpdf.pdfocr.onnx.actions.events
Class represents events registered in iText pdfOcr Onnx module.
PdfOcrOutputType - Enum in com.itextpdf.pdfocr.statistics
pdfOcr output types for statistics.
PdfOcrOutputTypeStatisticsEvent - Class in com.itextpdf.pdfocr.statistics
Class which represents an event for specifying type of an ocr processing.
PdfOcrOutputTypeStatisticsEvent(PdfOcrOutputType, ProductData) - Constructor for class com.itextpdf.pdfocr.statistics.PdfOcrOutputTypeStatisticsEvent
Creates instance of pdfOcr statistics event.
PdfOcrTesseract4Exception - Exception in com.itextpdf.pdfocr.tesseract4.exceptions
Exception class for Tesseract4 exceptions.
PdfOcrTesseract4Exception(String) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4Exception
Creates a new PdfOcrTesseract4Exception.
PdfOcrTesseract4Exception(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4Exception
Creates a new PdfOcrTesseract4Exception.
PdfOcrTesseract4Exception(Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4Exception
Creates a new PdfOcrTesseract4Exception.
PdfOcrTesseract4ExceptionMessageConstant - Class in com.itextpdf.pdfocr.tesseract4.exceptions
Class that bundles all the error message templates as constants.
PdfOcrTesseract4ProductData - Class in com.itextpdf.pdfocr.tesseract4.actions.data
Stores an instance of ProductData related to iText pdfOcr Tesseract4 module.
PdfOcrTesseract4ProductData() - Constructor for class com.itextpdf.pdfocr.tesseract4.actions.data.PdfOcrTesseract4ProductData
 
PdfOcrTesseract4ProductEvent - Class in com.itextpdf.pdfocr.tesseract4.actions.events
Class represents events registered in iText pdfOcr Tesseract4 module.
PdfOcrTextBuilder - Class in com.itextpdf.pdfocr.util
Class to build text output from the provided image OCR result and write it to the TXT file.
PERSIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PERSIAN_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
POLISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
POLY - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.BoxType
Value for poly config value.
PORTUGUESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
position(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Sets this buffer's position.
PostProcess - Interface in com.itextpdf.pdfocr.onnx.conf.paddle.model
Interface for objects under the PostProcess key in a config file.
PRE_PROCESSOR_OPERATION_MISSING - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
predict(Iterable) - Method in interface com.itextpdf.pdfocr.onnx.IPredictor
Performs prediction on a sequence of input items provided as an Iterable.
predict(Iterator) - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Performs prediction on a sequence of input items.
predict(Iterator) - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor
Performs prediction on a sequence of input items.
predict(Iterator) - Method in interface com.itextpdf.pdfocr.onnx.IPredictor
Performs prediction on a sequence of input items.
PreProcess - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
POJO for the object under the PreProcess key in a config file.
PreProcess(TransformOp[]) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.PreProcess
Creates a new POJO for the config file object.
preProcessorOperationMissing(String) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates an exception for cases, when an expected pre-processing operation is missing in the configuration file.
process(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.recognition.BasicLabelPostProcessor
Process ML model output and return recognized characters as string.
process(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
Process ML model output and return recognized characters as string.
process(FloatBufferMdArray) - Method in interface com.itextpdf.pdfocr.onnx.recognition.IRecognitionPostProcessor
Process ML model output and return recognized characters as string.
process(BufferedImage, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
Process ML model output for a specified image and return a list of detected objects.
process(BufferedImage, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
Process ML model output for a specified image and return a list of detected objects.
process(BufferedImage, FloatBufferMdArray) - Method in interface com.itextpdf.pdfocr.onnx.detection.IDetectionPostProcessor
Process ML model output for a specified image and return a list of detected objects.
process(List) - Method in class com.itextpdf.pdfocr.onnx.merging.EasyOcrTextBoxMerger
Merges text boxes based on some set of rules.
process(List) - Method in interface com.itextpdf.pdfocr.onnx.merging.ITextBoxMerger
Merges text boxes based on some set of rules.
PROCESS_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.actions.events.PdfOcrTesseract4ProductEvent
Process image event type.
PROCESS_IMAGE_ONNX - Static variable in class com.itextpdf.pdfocr.onnx.actions.events.PdfOcrOnnxProductEvent
Process image event type.
processBatch(List) - Method in interface com.itextpdf.pdfocr.onnx.util.IBatchProcessor
Processes a batch of input items and produces a corresponding batch of output items.
PROVIDED_FONT_PROVIDER_IS_INVALID - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
The constant PROVIDED_FONT_PROVIDER_IS_INVALID.
PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PUNJABI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PUNJABI_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PUNJABI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PUNJABI_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PUNJABI_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PUNJABI_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PUNJABI_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
PUNJABI_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
put(float) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Relative put method.
put(float[], int, int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Relative bulk put method.

Q

QUAD - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.BoxType
Value for quad config value.
QUECHUA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

R

RecResizeImg - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
POJO for the RecResizeImg transform operation within a PreProcess object in a config file.
RecResizeImg(int[]) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
Creates a new POJO for the config file object.
remaining() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Returns the number of elements between the current position and the limit.
removeChild(LogicalStructureTreeItem) - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
Remove child structure tree item.
rewind() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Rewinds this buffer.
RGB - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
Value for RGB config value.
RGB - Enum constant in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
Image is represented with three channels: red, green, blue.
ROMANIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
rotate(BufferedImage, TextOrientation) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil
Rotates image based on text orientation.
RUSSIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
RUSSIAN_CYRILLIC_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
RUSSIAN_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

S

SANSKRIT - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
sar(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained SAR model, stored on disk.
sar(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained SAR models, stored on disk.
sar(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained SAR model, stored on disk.
sar(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained SAR models, stored on disk.
SCALE_HEIGHT - Enum constant in enum com.itextpdf.pdfocr.ScaleMode
Only height of the image will be proportionally scaled to fit required size that is set using OcrPdfCreatorProperties.setPageSize(Rectangle) method.
SCALE_TO_FIT - Enum constant in enum com.itextpdf.pdfocr.ScaleMode
The image will be scaled to fit within the page width and height dimensions that are set using OcrPdfCreatorProperties.setPageSize(Rectangle) method.
SCALE_WIDTH - Enum constant in enum com.itextpdf.pdfocr.ScaleMode
Only width of the image will be proportionally scaled to fit required size that is set using OcrPdfCreatorProperties.setPageSize(Rectangle) method.
ScaleMode - Enum in com.itextpdf.pdfocr
Enumeration of the possible scale modes for input images.
SCORE_MODE_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
ScoreMode - Enum in com.itextpdf.pdfocr.onnx.conf.paddle.model
Enum for values under the score_mode key within a DBPostProcess PostProcessor object in a config file.
scoreModeIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates an exception for cases, when the detection model uses an unsupported score calculation method.
SCOTTISH_GAELIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SERBIAN_CYRILLIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SERBIAN_LATIN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
setAccessibilityProperties(AccessibilityProperties) - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
Set structure tree element's properties.
setFontProvider(FontProvider) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets font provider.
setFontProvider(FontProvider, String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets font provider and default font family.
setImageLayerName(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets name for the image layer, null by default.
setImagePreprocessingOptions(ImagePreprocessingOptions) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets Tesseract4OcrEngineProperties.imagePreprocessingOptions.
setImageRotationHandler(IImageRotationHandler) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets image rotation handler instance.
setLanguages(List) - Method in class com.itextpdf.pdfocr.OcrEngineProperties
Sets list of languages to be recognized in provided images.
setLogicalStructureTreeItem(LogicalStructureTreeItem) - Method in class com.itextpdf.pdfocr.TextInfo
Sets logical structure tree parent item for the text info.
setMessageParams(String...) - Method in exception com.itextpdf.pdfocr.exceptions.PdfOcrException
Sets additional params for Exception message.
setMetaInfo(IMetaInfo) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Set meta info for this OcrPdfCreatorProperties.
setMinimalConfidenceLevel(int) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets minimal confidence level for HOCR line to be considered as properly recognized.
setOcrEngine(IOcrEngine) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Sets IOcrEngine reader object to perform OCR.
setOcrEventHelper(AbstractPdfOcrEventHelper) - Method in class com.itextpdf.pdfocr.OcrProcessContext
Sets ocr event helper.
setOcrPdfCreatorProperties(OcrPdfCreatorProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Sets properties for OcrPdfCreator.
setPageSegMode(Integer) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets Page Segmentation Mode.
setPageSize(Rectangle) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets required size for output PDF document.
setPathToExecutable(String) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
Sets path to tesseract executable.
setPathToTessData(File) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets path to directory with tess data.
setPdfLang(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Specify PDF natural language, and optionally locale.
setPixelTextPoints(Point[], int) - Method in class com.itextpdf.pdfocr.TextInfo
Sets an array of 4 Points describing text bbox (lower-left based relative to text) expressed in pixels.
setPreprocessingImages(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets true if image preprocessing is needed.
setScaleMode(ScaleMode) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets scale mode for input images using available options from ScaleMode enumeration.
setSmoothTiling(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
Sets ImagePreprocessingOptions.smoothTiling.
setTagged(boolean) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Defines whether pdf document should be tagged or not.
setTesseract4OcrEngineProperties(Tesseract4OcrEngineProperties) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Sets properties for AbstractTesseract4OcrEngine.
setText(String) - Method in class com.itextpdf.pdfocr.TextInfo
Sets text element.
setTextBBoxColor(Color) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets text bounding box color in the output PDF document.
setTextColor(Color) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets text color in output PDF document.
setTextLayerName(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets name for the text layer.
setTextPoints(Point[]) - Method in class com.itextpdf.pdfocr.TextInfo
Sets array of 4 Points describing text bbox (lower-left based relative to text) expressed in points.
setTextPositioning(TextPositioning) - Method in class com.itextpdf.pdfocr.onnx.OnnxEngineProperties
Defines the way text is retrieved from ocr engine output using TextPositioning.
setTextPositioning(TextPositioning) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Defines the way text is retrieved from tesseract output using TextPositioning.
setTileHeight(int) - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
Sets ImagePreprocessingOptions.tileHeight.
setTileWidth(int) - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
Sets ImagePreprocessingOptions.tileWidth.
setTitle(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
Sets PDF document title.
setUseTxtToImproveHocrParsing(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets Tesseract4OcrEngineProperties.useTxtToImproveHocrParsing.
SHAPE_IS_NOT_VALID - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
SHOULD_BE_AT_LEAST_ONE_IMAGE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
shouldSplitImages() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Returns whether input images should be split.
SIMPLIFIED_CHINESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SINDHI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SINHALA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SINHALA_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SINHALA_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SINHALA_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SINHALA_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SINHALA_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SINHALA_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SINHALA_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
size() - Method in interface com.itextpdf.pdfocr.onnx.IOutputLabelMapper
Returns a number of mappable values.
size() - Method in class com.itextpdf.pdfocr.onnx.orientation.DefaultOrientationMapper
Returns a number of mappable values.
size() - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
Returns the size of the string mapper.
size() - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
Returns the size of the vocabulary.
slice() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Creates a new float buffer whose content is a shared subsequence of this buffer's content.
SLOVAK - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SLOVENE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SLOW - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ScoreMode
Value for slow config value.
SOMALI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
sortTextInfosByLines(Map>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder
Sorts the provided IOcrEngine.doImageOcr(java.io.File) result by lines.
SPANISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SpanTreeItem - Class in com.itextpdf.pdfocr.structuretree
A convenience class to associate certain text items with the span structure item.
SpanTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.SpanTreeItem
Instantiate a new SpanTreeItem instance.
START_OCR_FOR_IMAGES - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
The constant START_OCR_FOR_IMAGES.
STATISTICS_EVENT_TYPE_CANT_BE_NULL - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
STATISTICS_EVENT_TYPE_IS_NOT_DETECTED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
StringMapper - Class in com.itextpdf.pdfocr.onnx.recognition
Look-up table for mapping text recognition model results to strings.
StringMapper(String) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.StringMapper
Creates a new string mapper based on a look-up string.
StringMapper(String[]) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.StringMapper
Creates a new string mapper based on a look-up table.
SUDANESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SUDANESE_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SUDANESE_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SUDANESE_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SUDANESE_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SWAHILI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SWEDISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
SYMMETRIC_BLACK - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Image will be put into the middle.
SYMMETRIC_EDGE - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Image will be put into the middle.
SYMMETRIC_GRAY - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Image will be put into the middle.
SYMMETRIC_WHITE - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Image will be put into the middle.

T

TableCellTreeItem - Class in com.itextpdf.pdfocr.structuretree
A convenience class to associate certain text items with the table cell structure item.
TableCellTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.TableCellTreeItem
Instantiate a new TableCellTreeItem instance.
TableRowTreeItem - Class in com.itextpdf.pdfocr.structuretree
A convenience class to associate certain text items with the table row structure item.
TableRowTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.TableRowTreeItem
Instantiate a new TableRowTreeItem instance.
TableTreeItem - Class in com.itextpdf.pdfocr.structuretree
A convenience class to associate certain text items with the table structure item.
TableTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.TableTreeItem
Instantiate a new TableTreeItem instance.
TAGALOG - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAGGED_PDF_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
TAGGING_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
 
TAJIK - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAMIL - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAMIL_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAMIL_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAMIL_FRACTIONS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAMIL_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAMIL_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAMIL_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAMIL_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TAMIL_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TATAR - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TELUGU - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TELUGU_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TELUGU_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TELUGU_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
TELUGU_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TELUGU_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TELUGU_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TELUGU_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TELUGU_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TESSERACT_FAILED - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
TESSERACT_FAILED - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
TESSERACT_LIB_NOT_INSTALLED - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
TESSERACT_LIB_NOT_INSTALLED_WIN - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
TESSERACT_NOT_FOUND - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
 
Tesseract4ExecutableOcrEngine - Class in com.itextpdf.pdfocr.tesseract4
The implementation of AbstractTesseract4OcrEngine for tesseract OCR.
Tesseract4ExecutableOcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
Creates a new Tesseract4ExecutableOcrEngine instance.
Tesseract4ExecutableOcrEngine(String, Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
Creates a new Tesseract4ExecutableOcrEngine instance.
Tesseract4LibOcrEngine - Class in com.itextpdf.pdfocr.tesseract4
The implementation of AbstractTesseract4OcrEngine for tesseract OCR.
Tesseract4LibOcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
Creates a new Tesseract4LibOcrEngine instance.
Tesseract4LogMessageConstant - Class in com.itextpdf.pdfocr.tesseract4.logs
Class that bundles all the log message templates as constants.
Tesseract4OcrEngineProperties - Class in com.itextpdf.pdfocr.tesseract4
Properties that will be used by the IOcrEngine.
Tesseract4OcrEngineProperties() - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Creates a new Tesseract4OcrEngineProperties instance.
Tesseract4OcrEngineProperties(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Creates a new Tesseract4OcrEngineProperties instance based on another Tesseract4OcrEngineProperties instance (copy constructor).
TesseractHelper - Class in com.itextpdf.pdfocr.tesseract4
Helper class.
TextInfo - Class in com.itextpdf.pdfocr
This class describes how recognized text is positioned on the image providing bbox for each text item (could be a line or a word).
TextInfo() - Constructor for class com.itextpdf.pdfocr.TextInfo
Creates a new TextInfo instance.
TextInfo(TextInfo) - Constructor for class com.itextpdf.pdfocr.TextInfo
Creates a new TextInfo instance from existing one.
TextInfo(String, Point[]) - Constructor for class com.itextpdf.pdfocr.TextInfo
Creates new TextInfo instance.
TextInfo(String, Rectangle) - Constructor for class com.itextpdf.pdfocr.TextInfo
Creates new TextInfo instance.
TextOrientation - Enum in com.itextpdf.pdfocr
Enumeration of supported text orientations.
TextPositioning - Enum in com.itextpdf.pdfocr.onnx.text
Enumeration of the possible types of text positioning.
TextPositioning - Enum in com.itextpdf.pdfocr.tesseract4
Enumeration of the possible types of text positioning.
THAI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
THAI_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
TiffImageUtil - Class in com.itextpdf.pdfocr.util
Utility class to handle tiff images.
toBchwInput(Collection, OnnxInputProperties) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil
Converts a collection of images to a batched ML model input in a BCHW format with 1 or 3 channels.
toInputBuffer(List) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
Converts predictor inputs to an ONNX runtime model batched input MD-array buffer.
toInputBuffer(List) - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
Converts predictor inputs to an ONNX runtime model batched input MD-array buffer.
toInputBuffer(List) - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Converts predictor inputs to an ONNX runtime model batched input MD-array buffer.
toInputBuffer(List) - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor
Converts predictor inputs to an ONNX runtime model batched input MD-array buffer.
TOO_MANY_IMAGES - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
toString() - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
toString() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
 
toString() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
toString() - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
toString() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
toString() - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
 
toString() - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
toString() - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
 
TransformOp - Interface in com.itextpdf.pdfocr.onnx.conf.paddle.model
Interface for objects inside the transform_ops array within a PreProcess object in a config file.
truncateToRatio(BufferedImage, double) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil
Truncates the input image, so that neither width/height, nor height/width ratios exceed the limit.
TURKISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
TXT - Enum constant in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
Reader will produce plain txt file.

U

UKRAINIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
UNEXPECTED_CHANNEL_CONFIGURATION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_CONFIG_KEY - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_DIMENSION_VALUE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_INPUT_SHAPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_INPUT_SIZE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_INPUT_TYPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_MAT_TYPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_MEAN_CHANNEL_COUNT - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_OUTPUT_SHAPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_OUTPUT_SIZE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_OUTPUT_TYPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_PADDING_STRATEGY - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_POST_PROCESSOR_TYPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_SHAPE_SIZE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_STD_CHANNEL_COUNT - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
UNEXPECTED_VALUE_FOR_CONFIG_KEY - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
unexpectedKey(String) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.ConfigParserException
Creates an exception for cases, when an unexpected key was found in a configuration file mapping.
unexpectedMeanChannelCount(int) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates an exception for cases, when the size of the array of means for normalization has an unexpected size.
unexpectedPostProcessorType(String) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates an exception for cases, when an unexpected post-processor is specified in the configuration file.
unexpectedStdChannelCount(int) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates an exception for cases, when the size of the array of standard deviations for normalization has an unexpected size.
unexpectedValueForKey(String) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.ConfigParserException
Creates an exception for cases, when an unexpected value was found in a configuration file mapping.
UNSUPPORTED_EXIF_ORIENTATION_VALUE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
 
URDU - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
USE_DILATION_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
useDilationIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
Creates an exception for cases, when the detection model uses an unsupported pre-processing step for input images.
usesBottomRightPadding() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Returns whether the strategy uses bottom-right padding.
usesImageEdge() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Returns whether the strategy uses the edge of the image for padding.
usesSolidColor() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Returns whether the strategy uses a solid color for padding.
usesSymmetricPadding() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Returns whether the strategy uses symmetric padding.
useSymmetricPad() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
Returns whether padding should be symmetrical during input resizing.
UYGHUR - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
UZBEK_CYRILLIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
UZBEK_LATIN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

V

validateInputPdfDocument(PdfDocument) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
Validates input PDF document.
validateLanguages(List) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Validates list of provided languages and checks if they all exist in given tess data directory.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.BoxType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ScoreMode
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.ScaleMode
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.TextOrientation
Returns the enum constant of this type with the specified name.
values() - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.BoxType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ScoreMode
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.ScaleMode
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.TextOrientation
Returns an array containing the constants of this enum type, in the order they are declared.
VALUES_SHOULD_BE_A_NON_EMPTY_ARRAY - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
VIETNAMESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
viTstr(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained ViTSTR model, stored on disk.
viTstr(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained ViTSTR models, stored on disk.
viTstr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
Creates a new text recognition predictor using an existing pre-trained ViTSTR model, stored on disk.
viTstr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
Creates a new text recognition properties object for existing pre-trained ViTSTR models, stored on disk.
Vocabulary - Class in com.itextpdf.pdfocr.onnx.recognition
A string-based LUT for mapping text recognition model results to characters.
Vocabulary(String) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
Creates a new vocabulary based on a look-up string.

W

WELSH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
WIDTH_MULTIPLE_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 
wrap(float[]) - Static method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
Wraps a float array into a buffer.
wrap(Iterator, int) - Static method in class com.itextpdf.pdfocr.onnx.util.Batching
Wraps an existing iterator into a new one, which output List-based batches,
WRAPPING_KEY - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
Expected wrapping key for the DecodeImage operation.
WRAPPING_KEY - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
Expected wrapping key for the DetResizeForTest operation.
WRAPPING_KEY - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
Expected wrapping key for the NormalizeImage operation.
WRAPPING_KEY - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
Expected wrapping key for the RecResizeImg operation.
writeToTextFile(String, String) - Static method in class com.itextpdf.pdfocr.util.PdfOcrFileUtil
Writes provided String to text file using provided path.

X

X_SHOULD_BE_IN_0_1_RANGE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
 

Y

YAKUT - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
YamlUtil - Class in com.itextpdf.pdfocr.onnx.util
Functions for working with YAML documents.
YORUBA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 

Z

ZH_SIM_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
ZH_SIM_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
ZH_TRA_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
 
ZULU - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
 
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form