Index
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form
A
-
AbstractOnnxPredictor<T,
R> - Class in com.itextpdf.pdfocr.onnx -
Abstract predictor, based on models running over ONNX runtime.
- AbstractOnnxPredictor(AbstractOnnxPredictorProperties, long[]) - Constructor for class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor
-
Creates a new abstract predictor.
- AbstractOnnxPredictorProperties - Class in com.itextpdf.pdfocr.onnx
-
Properties for configuring ONNX models.
- AbstractOnnxPredictorProperties(String, OnnxInputProperties, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
-
Creates new predictor properties.
- AbstractPdfOcrEventHelper - Class in com.itextpdf.pdfocr
-
Helper class for working with events.
- AbstractPdfOcrEventHelper() - Constructor for class com.itextpdf.pdfocr.AbstractPdfOcrEventHelper
- AbstractTesseract4OcrEngine - Class in com.itextpdf.pdfocr.tesseract4
-
The implementation of
IOcrEngine. - AbstractTesseract4OcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Creates a new
Tesseract4OcrEnginePropertiesinstance based on anotherTesseract4OcrEnginePropertiesinstance (copy constructor). - addCell(TableCellTreeItem) - Method in class com.itextpdf.pdfocr.structuretree.TableRowTreeItem
-
Add a new table cell structure tree item to the table row.
- addChild(LogicalStructureTreeItem) - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
-
Add child structure tree item.
- addRow(TableRowTreeItem) - Method in class com.itextpdf.pdfocr.structuretree.TableTreeItem
-
Add a new row structure tree item to the table.
- AFRIKAANS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ALBANIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- allocate(int) - Static method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Allocates a new float buffer.
- ANCIENT_GREEK - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- appendLabel(StringBuilder, int) - Method in class com.itextpdf.pdfocr.onnx.recognition.BasicLabelPostProcessor
-
Adds label to the string output, based on the label's index.
- appendLabel(StringBuilder, int) - Method in class com.itextpdf.pdfocr.onnx.recognition.CrnnPostProcessor
-
Adds label to the string output, based on the label's index.
- appendLabel(StringBuilder, int) - Method in class com.itextpdf.pdfocr.onnx.recognition.CtcLabelPostProcessor
-
Adds label to the string output, based on the label's index.
- applyRotation(ImageData) - Method in interface com.itextpdf.pdfocr.IImageRotationHandler
-
Apply rotation to image data.
- applyRotation(ImageData) - Method in class com.itextpdf.pdfocr.tesseract4.LeptonicaImageRotationHandler
-
applyTextBoxMerger(List
) - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor -
The text detection model from EasyOCR, for the most part, returns words or small groups of words.
- ARABIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ARABIC_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ARABIC_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ARABIC_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- ARABIC_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ARABIC_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- argmax(float[]) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
-
Returns the index of the maximum value in the given array.
- ARMENIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- array() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Returns the float array that backs this buffer.
- arrayOffset() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Returns the offset within this buffer's backing array of the first element of the buffer.
- ArtifactItem - Class in com.itextpdf.pdfocr.structuretree
-
This class represents artifact structure tree item.
- ASCII_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ASCII_LOWERCASE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ASCII_UPPERCASE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- AZERBAIJANI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
B
- BasicDetectionPostProcessor - Class in com.itextpdf.pdfocr.onnx.detection
-
Implementation of a text detection predictor post-processor, which is used as a basis for creating post-processors for handling OnnxTR, EasyOCR and PaddleOCR model outputs.
- BasicDetectionPostProcessor(float, float, int) - Constructor for class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Creates a new post-processor.
- BasicLabelPostProcessor - Class in com.itextpdf.pdfocr.onnx.recognition
-
Abstract Implementation of a basic text recognition predictor post-processor.
- BasicLabelPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.recognition.BasicLabelPostProcessor
- BASQUE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BATCH_SIZE_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- Batching - Class in com.itextpdf.pdfocr.onnx.util
-
Static utility class to help with batching.
-
BatchProcessingGenerator<T,
R> - Class in com.itextpdf.pdfocr.onnx.util -
Generator with batch processing.
-
BatchProcessingGenerator(Iterator
- >, IBatchProcessor
) -
Creates a new generator with the provided batch iterator and processor.
- BELARUSIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BENGALI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BENGALI_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BENGALI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BENGALI_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- BENGALI_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BENGALI_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BENGALI_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BENGALI_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BENGALI_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BGR - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
-
Value for
BGRconfig value. - BGR - Enum constant in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
-
Image is represented with three channels: blue, green, red.
- binarizeMdArray(FloatBufferMdArray, float) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
-
Takes a two-dimensional MD-array and returns a binarized version of it as an OpenCV Mat.
- BOSNIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BOTTOM_RIGHT_BLACK - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Image will be put into the top-left corner.
- BOTTOM_RIGHT_EDGE - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Image will be put into the top-left corner.
- BOTTOM_RIGHT_GRAY - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Image will be put into the top-left corner.
- BOTTOM_RIGHT_WHITE - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Image will be put into the top-left corner.
- BOX_TYPE_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- BoxType - Enum in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
Enum for values under the
box_typekey within a DBPostProcessPostProcessorobject in a config file. - boxTypeIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates an exception for cases, when the detection model does not return quads.
- BufferedImageUtil - Class in com.itextpdf.pdfocr.onnx.util
-
Additional algorithms for working with
BufferedImage. -
buildText(Map
>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder -
Constructs string output from the provided
IOcrEngine.doImageOcr(java.io.File)result. - buildTextContourPredictionMask(Mat, Rect) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Builds and return a mask for calculating prediction score for the provided contour.
- buildTextContourPredictionMask(Mat, Rect) - Method in class com.itextpdf.pdfocr.onnx.detection.PaddleOcrDetectionPostProcessor
-
Builds and return a mask for calculating prediction score for the provided contour.
- BULGARIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BURMESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BURMESE_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BURMESE_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BURMESE_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BURMESE_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BURMESE_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BURMESE_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- BY_LINES - Enum constant in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
-
Text will be grouped by lines.
- BY_LINES - Enum constant in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
-
Text will be located by lines retrieved from hocr file.
- BY_WORDS - Enum constant in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
-
Text will be grouped by words.
- BY_WORDS - Enum constant in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
-
Text will be located by words retrieved from hocr file.
- BY_WORDS_AND_LINES - Enum constant in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
-
Similar to BY_WORDS mode, but top and bottom of word BBox are inherited from line (default value).
- BY_WORDS_AND_LINES - Enum constant in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
-
Similar to BY_WORDS mode, but top and bottom of word BBox are inherited from line.
C
- calcOutputDimensions(BufferedImage, ImageResizeOptions) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil
-
Based on the provided ImageResizeOptions, calculates the dimensions of the output image, to where there original image should be scaled and placed with padding.
-
calcOutputDimensions(Collection
, ImageResizeOptions) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil -
Based on the provided ImageResizeOptions, calculates the dimensions to which a batch of images should be scaled and padded.
- calcTextBoxEnlargement(double, double) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Calculates by how much the dimensions of a text box should be enlarged compared to the ones gotten from the model output.
- calcTextBoxEnlargement(double, double) - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
-
Calculates by how much the dimensions of a text box should be enlarged compared to the ones gotten from the model output.
- calcTextBoxEnlargement(double, double) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
-
Calculates by how much the dimensions of a text box should be enlarged compared to the ones gotten from the model output.
- calcTextBoxEnlargement(double, double) - Method in class com.itextpdf.pdfocr.onnx.detection.PaddleOcrDetectionPostProcessor
-
Calculates by how much the dimensions of a text box should be enlarged compared to the ones gotten from the model output.
- calculate() - Method in interface com.itextpdf.pdfocr.onnx.detection.score.IScoreCalculator
-
Calculate the score based on the observed samples.
- calculate() - Method in class com.itextpdf.pdfocr.onnx.detection.score.MaxScoreCalculator
-
Calculate the score based on the observed samples.
- calculate() - Method in class com.itextpdf.pdfocr.onnx.detection.score.MeanScoreCalculator
-
Calculate the score based on the observed samples.
- calculateLevenshteinDistance(String, String) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
-
Calculates the Levenshtein distance between two input strings.
- CANNOT_ADD_DATA_TO_PDF_DOCUMENT - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
-
The constant CANNOT_ADD_DATA_TO_PDF_DOCUMENT.
- CANNOT_BINARIZE_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_CONVERT_IMAGE_TO_GRAYSCALE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_CREATE_BUFFERED_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_CREATE_PDF_DOCUMENT - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- CANNOT_DELETE_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_FIND_PATH_TO_TESSERACT_EXECUTABLE - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- CANNOT_GET_TEMPORARY_DIRECTORY - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_OCR_INPUT_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_PARSE_NODE_BBOX - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_PROCESS_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_READ_DEFAULT_FONT - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
-
The constant CANNOT_READ_DEFAULT_FONT.
- CANNOT_READ_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_READ_IMAGE_METADATA - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_READ_INPUT_IMAGE - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- CANNOT_READ_INPUT_IMAGE - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
-
The constant CANNOT_READ_INPUT_IMAGE.
- CANNOT_READ_INPUT_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_READ_INPUT_IMAGE_PARAMS - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- CANNOT_READ_PROVIDED_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- CANNOT_RESOLVE_PROVIDED_FONTS - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- CANNOT_RETRIEVE_PAGES_FROM_IMAGE - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
- CANNOT_START_ANOTHER_MAPPING_OPERATION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- CANNOT_USE_USER_WORDS - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- CANNOT_WRITE_TO_FILE - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- CATALAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- CHANNEL_FIRST_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- channelFirstIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates an exception to assert
channel_firstis set tofalse. - clamp(double, double, double) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
-
Clamps a value between a specified minimum and maximum range.
- close() - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor
- close() - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
-
collectWordsIntoLines(Map
>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder -
Merges the provided
IOcrEngine.doImageOcr(java.io.File)result into lines and updates line bounding boxes to match the largest words. - com.itextpdf.pdfocr - package com.itextpdf.pdfocr
- com.itextpdf.pdfocr.exceptions - package com.itextpdf.pdfocr.exceptions
- com.itextpdf.pdfocr.logs - package com.itextpdf.pdfocr.logs
- com.itextpdf.pdfocr.onnx - package com.itextpdf.pdfocr.onnx
- com.itextpdf.pdfocr.onnx.actions.data - package com.itextpdf.pdfocr.onnx.actions.data
- com.itextpdf.pdfocr.onnx.actions.events - package com.itextpdf.pdfocr.onnx.actions.events
- com.itextpdf.pdfocr.onnx.conf.paddle.model - package com.itextpdf.pdfocr.onnx.conf.paddle.model
- com.itextpdf.pdfocr.onnx.conf.paddle.parser - package com.itextpdf.pdfocr.onnx.conf.paddle.parser
- com.itextpdf.pdfocr.onnx.detection - package com.itextpdf.pdfocr.onnx.detection
- com.itextpdf.pdfocr.onnx.detection.score - package com.itextpdf.pdfocr.onnx.detection.score
- com.itextpdf.pdfocr.onnx.exceptions - package com.itextpdf.pdfocr.onnx.exceptions
- com.itextpdf.pdfocr.onnx.merging - package com.itextpdf.pdfocr.onnx.merging
- com.itextpdf.pdfocr.onnx.orientation - package com.itextpdf.pdfocr.onnx.orientation
- com.itextpdf.pdfocr.onnx.recognition - package com.itextpdf.pdfocr.onnx.recognition
- com.itextpdf.pdfocr.onnx.text - package com.itextpdf.pdfocr.onnx.text
- com.itextpdf.pdfocr.onnx.util - package com.itextpdf.pdfocr.onnx.util
- com.itextpdf.pdfocr.statistics - package com.itextpdf.pdfocr.statistics
- com.itextpdf.pdfocr.structuretree - package com.itextpdf.pdfocr.structuretree
- com.itextpdf.pdfocr.tesseract4 - package com.itextpdf.pdfocr.tesseract4
- com.itextpdf.pdfocr.tesseract4.actions.data - package com.itextpdf.pdfocr.tesseract4.actions.data
- com.itextpdf.pdfocr.tesseract4.actions.events - package com.itextpdf.pdfocr.tesseract4.actions.events
- com.itextpdf.pdfocr.tesseract4.exceptions - package com.itextpdf.pdfocr.tesseract4.exceptions
- com.itextpdf.pdfocr.tesseract4.logs - package com.itextpdf.pdfocr.tesseract4.logs
- com.itextpdf.pdfocr.util - package com.itextpdf.pdfocr.util
- COMMAND_FAILED - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- concat(Vocabulary...) - Static method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
-
Creates a new vocabulary by concatenating multiple ones.
- ConfigParserException - Exception in com.itextpdf.pdfocr.onnx.exceptions
-
Exception class for exceptions during configuration file parsing.
- ConfigParserException(String) - Constructor for exception com.itextpdf.pdfocr.onnx.exceptions.ConfigParserException
-
Creates new
ConfigParserExceptioninstance. -
correctRotationAngle(Map
>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder -
Processes all text infos to round the rotation angle to either 0, 90, 180 or 270 degrees.
- COULD_NOT_FIND_CORRESPONDING_GLYPH_TO_UNICODE_CHARACTER - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
-
The constant COULD_NOT_FIND_CORRESPONDING_GLYPH_TO_UNICODE_CHARACTER.
- create() - Method in class com.itextpdf.pdfocr.onnx.DefaultOrtSessionOptionsCreator
- create() - Method in interface com.itextpdf.pdfocr.onnx.IOrtSessionOptionsCreator
-
Creates an instance of ONNX runtime session options.
- CREATED_TEMPORARY_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
-
createPdf(List
, PdfWriter) - Method in class com.itextpdf.pdfocr.OcrPdfCreator -
Performs OCR with set parameters using provided
IOcrEngineand creates PDF using providedPdfWriter. -
createPdf(List
, PdfWriter, DocumentProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator -
Performs OCR with set parameters using provided
IOcrEngineand creates PDF using providedPdfWriter. -
createPdf(List
, PdfWriter, DocumentProperties, IOcrProcessProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator -
Performs OCR with set parameters using provided
IOcrEngineand creates PDF using providedPdfWriter. -
createPdfA(List
, PdfWriter, DocumentProperties, PdfOutputIntent) - Method in class com.itextpdf.pdfocr.OcrPdfCreator -
Performs OCR with set parameters using provided
IOcrEngineand creates PDF using providedPdfWriter,DocumentPropertiesandPdfOutputIntent. -
createPdfA(List
, PdfWriter, DocumentProperties, PdfOutputIntent, IOcrProcessProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator -
Performs OCR with set parameters using provided
IOcrEngineand creates PDF using providedPdfWriter,DocumentPropertiesandPdfOutputIntent. -
createPdfA(List
, PdfWriter, PdfOutputIntent) - Method in class com.itextpdf.pdfocr.OcrPdfCreator -
Performs OCR with set parameters using provided
IOcrEngineand creates PDF using providedPdfWriterandPdfOutputIntent. -
createPdfAFile(List
, File, PdfOutputIntent) - Method in class com.itextpdf.pdfocr.OcrPdfCreator -
Performs OCR with set parameters using provided
IOcrEngineand creates PDF using providedFileandPdfOutputIntent. -
createPdfFile(List
, File) - Method in class com.itextpdf.pdfocr.OcrPdfCreator -
Performs OCR with set parameters using provided
IOcrEngineand creates PDF using providedFile. - createProcessImageEvent(SequenceId, IMetaInfo, EventConfirmationType) - Static method in class com.itextpdf.pdfocr.tesseract4.actions.events.PdfOcrTesseract4ProductEvent
-
Creates process-image event.
- createProcessImageOnnxEvent(SequenceId, IMetaInfo, EventConfirmationType) - Static method in class com.itextpdf.pdfocr.onnx.actions.events.PdfOcrOnnxProductEvent
-
Creates process-image-onnx event.
- createScoreCalculator() - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Creates a new score calculator for calculating score over a text contour.
- createScoreCalculator() - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
-
Creates a new score calculator for calculating score over a text contour.
- createStatisticsAggregatorFromName(String) - Method in class com.itextpdf.pdfocr.statistics.PdfOcrOutputTypeStatisticsEvent
-
createTxtFile(List
, File) - Method in interface com.itextpdf.pdfocr.IOcrEngine -
Performs OCR using provided
IOcrEnginefor the given list of input images and saves output to a text file using provided path. -
createTxtFile(List
, File) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine -
Performs OCR using provided
IOcrEnginefor the given list of input images and saves output to a text file using provided path. -
createTxtFile(List
, File) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine -
Performs OCR using provided
IOcrEnginefor the given list of input images and saves output to a text file using provided path. -
createTxtFile(List
, File, OcrProcessContext) - Method in interface com.itextpdf.pdfocr.IOcrEngine -
Performs OCR using provided
IOcrEnginefor the given list of input images and saves output to a text file using provided path. -
createTxtFile(List
, File, OcrProcessContext) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine -
Performs OCR using provided
IOcrEnginefor the given list of input images and saves output to a text file using provided path. -
createTxtFile(List
, File, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine -
Performs OCR using provided
IOcrEnginefor the given list of input images and saves output to a text file using provided path. - crnnMobileNetV3(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained CRNN model with a MobileNet V3 backbone, stored on disk.
- crnnMobileNetV3(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained CRNN models with a MobileNet V3 backbone, stored on disk.
- crnnMobileNetV3(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained CRNN model with a MobileNet V3 backbone, stored on disk.
- crnnMobileNetV3(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained CRNN models with a MobileNet V3 backbone, stored on disk.
- CrnnPostProcessor - Class in com.itextpdf.pdfocr.onnx.recognition
-
Implementation of a text recognition predictor post-processor, used for OnnxTR CRNN model outputs.
- CrnnPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.recognition.CrnnPostProcessor
-
Creates a new post-processor with the default vocabulary.
- CrnnPostProcessor(Vocabulary) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.CrnnPostProcessor
-
Creates a new post-processor.
- crnnVgg16(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained CRNN model with a VGG-16 backbone, stored on disk.
- crnnVgg16(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained CRNN models with a VGG-16 backbone, stored on disk.
- crnnVgg16(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained CRNN model with a VGG-16 backbone, stored on disk.
- crnnVgg16(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained CRNN models with a VGG-16 backbone, stored on disk.
- CROATIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- CtcLabelDecode - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
POJO for the CTCLabelDecode post-processor object under a
PostProcesskey in a config file. - CtcLabelDecode(String[]) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
-
Creates a new POJO for the config file object.
- CtcLabelPostProcessor - Class in com.itextpdf.pdfocr.onnx.recognition
-
Implementation of a text recognition predictor post-processor, used for EasyOCR and PaddleOCR model outputs.
-
CtcLabelPostProcessor(IOutputLabelMapper
) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.CtcLabelPostProcessor -
Creates a new post-processor.
- CURRENCY - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- CYRILLIC_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- CYRILLIC_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- CZECH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
D
- DANISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- DATA - Enum constant in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
-
Processing of an image in the engine with data output
- dbNet(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained DBNet model, stored on disk.
- dbNet(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained DBNet models, stored on disk.
- dbNet(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained DBNet model, stored on disk.
- dbNet(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained DBNet models, stored on disk.
- DbPostProcess - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
POJO for the DBPostProcess post-processor object under a
PostProcesskey in a config file. - DbPostProcess(float, float, float, int, boolean, ScoreMode, BoxType) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Creates a new POJO for the config file object.
- DecodeImage - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
POJO for the DecodeImage transform operation within a
PreProcessobject in a config file. - DecodeImage(boolean, ImgMode) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
-
Creates a new POJO for the config file object.
- DEFAULT_ORT_SESSION_CREATOR - Static variable in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
-
Default ONNX runtime session options creator.
- DefaultOrientationMapper - Class in com.itextpdf.pdfocr.onnx.orientation
-
Default implementation for mapping output of a crop orientation model to
TextOrientationvalues. - DefaultOrientationMapper() - Constructor for class com.itextpdf.pdfocr.onnx.orientation.DefaultOrientationMapper
-
Constructs a new
DefaultOrientationMapperwith default behavior. - DefaultOrtSessionOptionsCreator - Class in com.itextpdf.pdfocr.onnx
-
Default implementation of
IOrtSessionOptionsCreator. - DefaultOrtSessionOptionsCreator() - Constructor for class com.itextpdf.pdfocr.onnx.DefaultOrtSessionOptionsCreator
- deserializeFromStream(InputStream) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
-
Deserializes a content stream, which contains a single YAML document.
- DetResizeForTest - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
POJO for the DetResizeForTest transform operation within a
PreProcessobject in a config file. - DetResizeForTest(int[], boolean) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
-
Creates a new POJO for the config file object.
- DEVANAGARI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- DEVANAGARI_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- DEVANAGARI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- DEVANAGARI_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- DEVANAGARI_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- DEVANAGARI_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- DEVANAGARI_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- DEVANAGARI_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- DEVANAGARI_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- Dimensions2D - Class in com.itextpdf.pdfocr.onnx.util
-
A basic 2-element tuple with a width and a height.
- Dimensions2D(int, int) - Constructor for class com.itextpdf.pdfocr.onnx.util.Dimensions2D
-
Creates new
Dimensions2Dinstance. - doImageOcr(File) - Method in interface com.itextpdf.pdfocr.IOcrEngine
-
Reads data from the provided input image file and returns retrieved data in the format described below.
- doImageOcr(File) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
-
Reads data from the provided input image file and returns retrieved data in the format described below.
- doImageOcr(File) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Reads data from the provided input image file and returns retrieved data in the format described below.
- doImageOcr(File, OcrProcessContext) - Method in interface com.itextpdf.pdfocr.IOcrEngine
-
Reads data from the provided input image file and returns retrieved data in the format described below.
- doImageOcr(File, OcrProcessContext) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
-
Reads data from the provided input image file and returns retrieved data in the format described below.
- doImageOcr(File, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Reads data from the provided input image file and returns retrieved data in the format described below.
- doImageOcr(File, OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Reads data from the provided input image file and returns retrieved data as string.
- doImageOcr(File, OutputFormat, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Reads data from the provided input image file and returns retrieved data as string.
-
doImageOcr(List
) - Method in interface com.itextpdf.pdfocr.IOcrEngine -
Reads data from the provided list of input image files and returns retrieved data in the format described below.
-
doImageOcr(List
) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine -
Reads data from the provided list of input image files and returns retrieved data in the format described below.
-
doImageOcr(List
) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine -
Reads data from the provided list of input image files and returns retrieved data in the format described below.
-
doImageOcr(List
, OcrProcessContext) - Method in interface com.itextpdf.pdfocr.IOcrEngine -
Reads data from the provided list of input image files and returns retrieved data in the format described below.
-
doImageOcr(List
, OcrProcessContext) - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine -
Reads data from the provided list of input image files and returns retrieved data in the format described below.
-
doImageOcr(List
, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine -
Reads data from the provided list of input image files and returns retrieved data in the format described below.
- doTesseractOcr(File, File, OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Performs tesseract OCR for the first (or for the only) image page.
- doTesseractOcr(File, File, OutputFormat, OcrProcessContext) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Performs tesseract OCR for the first (or for the only) image page.
- duplicate() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Creates a new float buffer that shares this buffer's content.
- DUTCH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
E
- easyOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained EasyOCR CRAFT model, stored on disk.
- easyOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for an existing pre-trained EasyOCR CRAFT model, stored on disk.
- easyOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained EasyOCR CRAFT model, stored on disk.
- easyOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for an existing pre-trained EasyOCR CRAFT model, stored on disk.
- easyOcr(String, EasyOcrMapper) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained EasyOCR model, stored on disk.
- easyOcr(String, EasyOcrMapper) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained EasyOCR models, stored on disk.
- easyOcr(String, EasyOcrMapper, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained EasyOCR model, stored on disk.
- easyOcr(String, EasyOcrMapper, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained EasyOCR models, stored on disk.
- EasyOcrDetectionPostProcessor - Class in com.itextpdf.pdfocr.onnx.detection
-
Implementation of a text detection predictor post-processor, used for EasyOCR model outputs.
- EasyOcrDetectionPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
-
Creates a new post-processor with the default parameters.
- EasyOcrDetectionPostProcessor(float) - Constructor for class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
-
Creates a new post-processor.
- EasyOcrMapper - Class in com.itextpdf.pdfocr.onnx.recognition
-
Label mapper for EasyOCR text recognition models.
- EasyOcrTextBoxMerger - Class in com.itextpdf.pdfocr.onnx.merging
-
Text box merger, based on the algorithm used in EasyOCR.
- EasyOcrTextBoxMerger() - Constructor for class com.itextpdf.pdfocr.onnx.merging.EasyOcrTextBoxMerger
-
Creates new
EasyOcrTextBoxMergerinstance. - ELEM_COUNT_DOES_NOT_MATCH_SHAPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- EndOfStringPostProcessor - Class in com.itextpdf.pdfocr.onnx.recognition
-
Implementation of a text recognition predictor post-processor, used for OnnxTR non-CRNN model outputs.
- EndOfStringPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
-
Creates a new post-processor with the default vocabulary.
- EndOfStringPostProcessor(Vocabulary) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
-
Creates a new post-processor without any additional tokens.
- EndOfStringPostProcessor(Vocabulary, int) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
-
Creates a new post-processor.
- ENGLISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ENGLISH_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.PreProcess
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- equals(Object) - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
- ESPERANTO - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ESTONIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ETHIOPIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- euclideanModulo(float, float) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
-
Computes the Euclidean modulo (non-negative remainder) of
xmoduloy. - EXPECTED_SHAPE_SIZE - Static variable in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Expected shape size.
- expit(float) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
-
Computes the sigmoid function, also known as the logistic function, for the given input.
-
extractBoxes(BufferedImage, Collection
) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil -
Extracts sub-images from an image, based on provided rotated 4-point boxes.
F
- FAILED_TO_CLOSE_ONNX_RUNTIME_SESSION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- FAILED_TO_INIT_ONNX_RUNTIME_SESSION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- FAILED_TO_INIT_SESSION_OPTIONS - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- FAILED_TO_LOAD_ONNXRUNTIME - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- FAILED_TO_READ_IMAGE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- fast(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained FAST model, stored on disk.
- fast(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained FAST models, stored on disk.
- fast(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained FAST model, stored on disk.
- fast(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained FAST models, stored on disk.
- FAST - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ScoreMode
-
Value for
fastconfig value. - fillPolyAtOffset(Mat, Mat, Scalar, int, int) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
-
Fill the polygon on the bitmap at a specific offset.
- findTextContours(Mat) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Extracts text contours from the provided 0 - 255 mask.
- findTextContours(Mat) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
-
Extracts text contours from the provided 0 - 255 mask.
- FINNISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- FloatBufferMdArray - Class in com.itextpdf.pdfocr.onnx
-
Multidimensional array with a
FloatBufferWrapperbacking storage. - FloatBufferMdArray(FloatBufferWrapper, long[]) - Constructor for class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
-
Constructs a new
FloatBufferMdArraywith the specified data buffer and shape. - FloatBufferWrapper - Class in com.itextpdf.pdfocr.onnx
-
Wrapper class around
FloatBuffer. - FloatBufferWrapper(FloatBuffer) - Constructor for class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Constructs
FloatBufferWrapperon top ofFloatBuffer. - FRENCH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- FRISIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
-
fromOutputBuffer(List
, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor -
Converts ONNX runtime model batched output MD-array buffer to a list of predictor outputs.
-
fromOutputBuffer(List
, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor -
Converts ONNX runtime model batched output MD-array buffer to a list of predictor outputs.
-
fromOutputBuffer(List
, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor -
Converts ONNX runtime model batched output MD-array buffer to a list of predictor outputs.
-
fromOutputBuffer(List
, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor -
Converts ONNX runtime model batched output MD-array buffer to a list of predictor outputs.
G
- GALICIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GENERIC_CYRILLIC_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
-
generifyWordBBoxesByLine(Map
>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder -
Sorts the provided
IOcrEngine.doImageOcr(java.io.File)result by lines and updates line bboxes to match the largest words. - GEORGIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GERMAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- get() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Relative get method.
- get(float[]) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Relative bulk get method.
- get(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Absolute get method.
- getAccessibilityProperties() - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
-
Retrieve structure tree element's properties.
- getAllImages(File) - Static method in class com.itextpdf.pdfocr.util.TiffImageUtil
-
Retrieves all images from a TIFF file.
- getArrayOffset() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
-
Gets internal offset of the provided float buffer array.
- getArraySize() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
-
Gets number of available bytes for read from provided float buffer array.
- getBatchSize() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns input batch size.
- getBBoxRect() - Method in class com.itextpdf.pdfocr.TextInfo
-
Converts a text polygon to a bounding box.
- getBlueChannelIndex() - Method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
-
Returns the index of the blue channel in the resulting ML input buffer.
- getBlueMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns blue channel mean, used for normalization.
- getBlueStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns blue channel standard deviation, used for normalization.
- getBoxThresh() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Returns the value under the
box_threshkey. - getBoxType() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Returns the value under the
box_typekey. - getChannelConfiguration() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Returns the image channel configuration, that will be used when passing the image to the ML model.
- getChannelCount() - Method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
-
Returns the amount of channels used to store the image.
- getChannelCount() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns input channel count.
- getChannelFirst() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
-
Returns the value under the
channel_firstkey. - getCharacterDict() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
-
Returns the value under the
character_dictkey. - getChildren() - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
-
Retrieve all child structure tree items.
- getConfirmationType() - Method in class com.itextpdf.pdfocr.AbstractPdfOcrEventHelper
-
Returns the confirmation type of event.
- getData() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
-
Returns a duplicate of the backing
FloatBufferWrapper. - getDefaultFontFamily() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets preferred font family to be used when selecting font from
FontProvider. - getDefaultFontFamily() - Method in class com.itextpdf.pdfocr.PdfOcrFontProvider
-
Gets default font family.
- getDefaultLanguage() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Gets default language for ocr.
- getDefaultUserWordsSuffix() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Gets default user words suffix.
- getDimension(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
-
Returns the size of the specified dimension.
- getDimensionCount() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
-
Returns the number of dimensions of this multidimensional array.
- getEventType() - Method in class com.itextpdf.pdfocr.onnx.actions.events.PdfOcrOnnxProductEvent
- getEventType() - Method in class com.itextpdf.pdfocr.tesseract4.actions.events.PdfOcrTesseract4ProductEvent
- getFloatBuffer() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Returns
FloatBufferthat backs this buffer. - getFontProvider() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Returns FontProvider that was set previously or if it is
nulla new instance ofPdfOcrFontProvideris returned. - getGrayMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns gray channel mean, used for normalization.
- getGrayStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns gray channel standard deviation, used for normalization.
- getGreenChannelIndex() - Method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
-
Returns the index of the green channel in the resulting ML input buffer.
- getGreenMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns green channel mean, used for normalization.
- getGreenStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns green channel standard deviation, used for normalization.
- getHeight() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns input minimum height.
- getHeight() - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
-
Gets height of the
Dimensions2Dinstance. - getHeightMultiple() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Returns the height multiple.
- getImageLayerName() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets name of image layer.
- getImagePreprocessingOptions() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Gets
Tesseract4OcrEngineProperties.imagePreprocessingOptions. - getImageResizeOptions() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns image resize options for the input.
- getImageRotationHandler() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets image rotation handler instance.
- getImageShape() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
-
Returns the values under the
image_shapekey. - getImageShape() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
-
Returns the values under the
image_shapekey. - getImageType(File) - Static method in class com.itextpdf.pdfocr.util.TiffImageUtil
-
Gets the image type.
- getImgMode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
-
Returns the value under the
img_modekey. - getInputProperties() - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
-
Returns the ONNX model input properties.
- getInstance() - Static method in class com.itextpdf.pdfocr.onnx.actions.data.PdfOcrOnnxProductData
-
Getter for an instance of
ProductDatarelated to iText pdfOcr Onnx module. - getInstance() - Static method in class com.itextpdf.pdfocr.structuretree.ArtifactItem
-
Retrieve an instance of
ArtifactItem. - getInstance() - Static method in class com.itextpdf.pdfocr.tesseract4.actions.data.PdfOcrTesseract4ProductData
-
Getter for an instance of
ProductDatarelated to iText pdfOcr Tesseract4 module. - getKeepRatio() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
-
Returns the value under the
keep_ratiokey. - getLanguages() - Method in class com.itextpdf.pdfocr.OcrEngineProperties
-
Gets list of languages required for provided images.
- getLanguagesAsString() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Gets list of languages concatenated with "+" symbol to a string in format required by tesseract.
- getLogicalStructureTreeItem() - Method in class com.itextpdf.pdfocr.TextInfo
-
Retrieves structure tree item for the text item.
- getLookUpString() - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
-
Returns the look-up string.
- getMaskSourceArray(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Returns the array to be used, when building a mask for contour detection.
- getMaskSourceArray(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
-
Returns the array to be used, when building a mask for contour detection.
- getMaxCandidates() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Returns the value under the
max_candidateskey. - getMaxHeight() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Returns the maximum height the image should be after resizing.
- getMaxWidth() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Returns the maximum width the image should be after resizing.
- getMean() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
-
Returns the values under the
meankey. - getMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns per-channel mean, used for normalization.
- getMean(int) - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns channel-specific mean, used for normalization.
- getMessage() - Method in exception com.itextpdf.pdfocr.exceptions.PdfOcrException
- getMessageParams() - Method in exception com.itextpdf.pdfocr.exceptions.PdfOcrException
-
Gets additional params for Exception message.
- getMetaInfoContainer() - Method in interface com.itextpdf.pdfocr.IProductAware
-
Gets the container with meta info.
- getMetaInfoContainer() - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
-
Gets the container with meta info.
- getMetaInfoContainer() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Gets the container with meta info.
- getMinHeight() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Returns the minimum height the image should be after resizing.
- getMinimalConfidenceLevel() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Gets minimal confidence level for HOCR line to be considered as properly recognized.
- getMinWidth() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Returns the minimum width the image should be after resizing.
- getModelPath() - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
-
Returns the path to the ONNX model.
- getName() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
-
Returns the value under the
namekey. - getName() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Returns the value under the
namekey. - getName() - Method in interface com.itextpdf.pdfocr.onnx.conf.paddle.model.PostProcess
-
Returns the value under the
namekey. - getOcrEngine() - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Gets used
IOcrEnginereader object to perform OCR. - getOcrEventHelper() - Method in class com.itextpdf.pdfocr.OcrProcessContext
-
Returns helper for working with events.
- getOcrPdfCreatorProperties() - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Gets properties for
OcrPdfCreator. - getOcrProcessProperties() - Method in class com.itextpdf.pdfocr.OcrProcessContext
-
Get extra OCR process properties.
- getOrtSessionOptionsCreator() - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
-
Returns the ONNX runtime session options creator.
- getOutputMapper() - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
-
Returns the ONNX model output mapper.
- getPaddingStrategy() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Returns the padding strategy.
- getPaddingStrategy() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns the padding strategy for image inputs.
- getPageSegMode() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Gets Page Segmentation Mode.
- getPageSize() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets required size for output PDF document.
- getParent() - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
-
Retrieve parent structure tree item.
- getPathToExecutable() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
-
Gets path to tesseract executable.
- getPathToTessData() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Gets path to directory with tess data.
- getPdfLang() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets PDF language.
- getPdfOcrStatisticsEventType() - Method in class com.itextpdf.pdfocr.statistics.PdfOcrOutputTypeStatisticsEvent
-
Gets the type of statistic event.
- getPixelTextPoints(int) - Method in class com.itextpdf.pdfocr.TextInfo
-
Gets array of 4
Points describing text bbox (lower-left based relative to text) expressed in pixels. - getPostProcess() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
-
Returns the value under the
PostProcesskey. - getPostProcessor() - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Returns the ONNX model output post-processor.
- getPostProcessor() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Returns the ONNX model output post-processor.
- getPredsArray(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Returns the preds array from the output buffer.
- getPreProcess() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
-
Returns the value under the
PreProcesskey. - getProductData() - Method in interface com.itextpdf.pdfocr.IProductAware
-
Gets object containing information about the product.
- getProductData() - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
-
Gets object containing information about the product.
- getProductData() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
- getProperties() - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Returns the text detection predictor properties.
- getProperties() - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
-
Returns the crop orientation predictor properties.
- getProperties() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Returns the text recognition predictor properties.
- getRedChannelIndex() - Method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
-
Returns the index of the red channel in the resulting ML input buffer.
- getRedMean() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns red channel mean, used for normalization.
- getRedStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns red channel standard deviation, used for normalization.
- getRotationAngle() - Method in class com.itextpdf.pdfocr.TextInfo
-
Returns the text rotation angle in radian for this
TextInfoin the range of -pi to pi. - getScalar(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
-
Returns the scalar value at the specified index.
- getScaleMode() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets scale mode for input images using available options from
ScaleModeenumeration. - getScoreMode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Returns the value under the
score_modekey. - getSequenceId() - Method in class com.itextpdf.pdfocr.AbstractPdfOcrEventHelper
-
Returns the sequence id
- getShape() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
-
Returns a copy of the shape array that defines the dimensions of this multidimensional array.
- getShape() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns target input shape.
- getShape(int) - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns target input dimension value.
- getSolidColor() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Returns the solid color used for padding.
- getStatisticsNames() - Method in class com.itextpdf.pdfocr.statistics.PdfOcrOutputTypeStatisticsEvent
- getStd() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
-
Returns the values under the
stdkey. - getStd() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns per-channel standard deviation, used for normalization.
- getStd(int) - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns channel-specific standard deviation, used for normalization.
- getSubArray(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferMdArray
-
Returns a sub-array representing the slice at the specified index of the first dimension.
- getTempFilePath(String, String) - Static method in class com.itextpdf.pdfocr.util.PdfOcrFileUtil
-
Gets path to temp file in current system temporary directory.
- getTesseract4OcrEngineProperties() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Gets properties for
AbstractTesseract4OcrEngine. - getTesseractInstance() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
-
Gets tesseract instance.
- getText() - Method in class com.itextpdf.pdfocr.TextInfo
-
Gets text element.
- getTextBBoxColor() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets text bounding box color in the output PDF document.
- getTextColor() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets text color in output PDF document.
- getTextLayerName() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets name of text layer.
- getTextPoints() - Method in class com.itextpdf.pdfocr.TextInfo
-
Gets array of 4
Points describing text bbox (lower-left based relative to text) expressed in points. - getTextPositioning() - Method in class com.itextpdf.pdfocr.onnx.OnnxEngineProperties
-
Gets the way text is retrieved from ocr engine output using
TextPositioning. - getTextPositioning() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Defines the way text is retrieved from tesseract output using
TextPositioning. - getThresh() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Returns the value under the
threshkey. - getTileHeight() - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
-
Gets
ImagePreprocessingOptions.tileHeight. - getTileWidth() - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
-
Gets
ImagePreprocessingOptions.tileWidth. - getTitle() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Gets PDF document title.
- getTransformOps() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.PreProcess
-
Returns the values under the
transform_opskey. - getUnclipRatio() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Returns the value under the
unclip_ratiokey. - getUseDilation() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Returns the value under the
use_dilationkey. - getWidth() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns input minimum width.
- getWidth() - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
-
Gets width of the
Dimensions2Dinstance. - getWidthMultiple() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Returns the width multiple.
- getWrappingKey() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
-
Returns the wrapping key for the transform operation object.
- getWrappingKey() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
-
Returns the wrapping key for the transform operation object.
- getWrappingKey() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
-
Returns the wrapping key for the transform operation object.
- getWrappingKey() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
-
Returns the wrapping key for the transform operation object.
- getWrappingKey() - Method in interface com.itextpdf.pdfocr.onnx.conf.paddle.model.TransformOp
-
Returns the wrapping key for the transform operation object.
- GRAY - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
-
Value for
GRAYconfig value. - GRAYSCALE - Enum constant in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
-
Image is represented with a single channel, which contains the grayscale version of the image.
- GREEK - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GREEK_EXTENDED - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GUJARATI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GUJARATI_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GUJARATI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GUJARATI_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GUJARATI_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GUJARATI_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GUJARATI_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- GUJARATI_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
H
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.PreProcess
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- hashCode() - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
- hasNext() - Method in class com.itextpdf.pdfocr.onnx.util.BatchProcessingGenerator
- HAUSA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HEBREW - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HEBREW_CANTILLATIONS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HEBREW_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HEBREW_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HEBREW_SPECIALS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HEBREW_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HEIGHT_MULTIPLE_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- HINDI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HINDI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HOCR - Enum constant in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
-
Reader will produce XHTML output compliant with the hOCR specification.
- HORIZONTAL - Enum constant in enum com.itextpdf.pdfocr.TextOrientation
-
Horizontal text, non-rotated.
- HORIZONTAL_ROTATED_180 - Enum constant in enum com.itextpdf.pdfocr.TextOrientation
-
Horizontal text, rotated 180 degrees counter-clockwise.
- HORIZONTAL_ROTATED_270 - Enum constant in enum com.itextpdf.pdfocr.TextOrientation
-
Horizontal text, rotated 270 degrees counter-clockwise.
- HORIZONTAL_ROTATED_90 - Enum constant in enum com.itextpdf.pdfocr.TextOrientation
-
Horizontal text, rotated 90 degrees counter-clockwise.
- HUNGARIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- HW_ARRAY_SHOULD_BE_TWO_DIMENSIONAL - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
I
-
IBatchProcessor<T,
R> - Interface in com.itextpdf.pdfocr.onnx.util -
Batch processor mapper interface.
- ICELANDIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- identifyOsType() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Identifies type of current OS and return it (win, linux).
- IDetectionPostProcessor - Interface in com.itextpdf.pdfocr.onnx.detection
-
Interface for post-processors, which convert raw output of an ML model and returns rotated boxes with the detected objects.
- IDetectionPredictor - Interface in com.itextpdf.pdfocr.onnx.detection
-
Interface for predictors, which take a full image and find text boxes on them.
- IImageRotationHandler - Interface in com.itextpdf.pdfocr
-
Rotation information may be stored in image metadata.
- IMAGE_LAYER_NAME_IS_NOT_APPLIED - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
- IMAGE_SHAPE_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- ImageChannelConfiguration - Enum in com.itextpdf.pdfocr.onnx
-
Enumeration of supported image channel configuration for buffers.
- ImagePreprocessingOptions - Class in com.itextpdf.pdfocr.tesseract4
-
Additional options applied on image preprocessing step.
- ImagePreprocessingOptions() - Constructor for class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
-
Creates
ImagePreprocessingOptionsinstance. - ImagePreprocessingOptions(ImagePreprocessingOptions) - Constructor for class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
-
Creates a new
ImagePreprocessingOptionsinstance based on anotherImagePreprocessingOptionsinstance (copy constructor). - ImageResizeOptions - Class in com.itextpdf.pdfocr.onnx
-
Options, that describe the way an image will be resized before being converted to a tensor for an ML model input.
- ImageResizeOptions(ImageChannelConfiguration, int, int) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Creates image resize options.
- ImageResizeOptions(ImageChannelConfiguration, int, int, int, int) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Creates image resize options.
- ImageResizeOptions(ImageChannelConfiguration, int, int, int, int, int, int, PaddingStrategy) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Creates image resize options.
- ImageResizeOptions(ImageChannelConfiguration, int, int, int, int, PaddingStrategy) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Creates image resize options.
- ImageResizeOptions(ImageChannelConfiguration, int, int, PaddingStrategy) - Constructor for class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Creates image resize options.
- imageShapeIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates an exception for cases, when the detection model uses an unsupported method for resizing input images.
- ImgMode - Enum in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
Enum for values under the
img_modekey within a DecodeImage transform operation object in a config file. - INCORRECT_INPUT_IMAGE_FORMAT - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- INCORRECT_LANGUAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- INDEX_OUT_OF_BOUNDS - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- INDONESIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- InferenceConfig - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
POJO for the root object in a config file.
- InferenceConfig(PreProcess, PostProcess) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.InferenceConfig
-
Creates a new POJO for the config file object.
- InferenceConfigParser - Class in com.itextpdf.pdfocr.onnx.conf.paddle.parser
-
Static class with functions for parsing PaddleOCR YAML config files into a
InferenceConfigPOJO. - initializeTesseract(OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
-
Initializes instance of tesseract if it haven't been already initialized or it have been disposed and sets all the required properties.
- inputProperties - Variable in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
-
Properties of the inputs of the ONNX model.
- INVALID_NUMBER_OF_OUTPUTS - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- IO_EXCEPTION_OCCURRED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- IOcrEngine - Interface in com.itextpdf.pdfocr
-
IOcrEngineinterface is used for instantiating new OcrReader objects. - IOcrProcessProperties - Interface in com.itextpdf.pdfocr
-
OCR properties passed to the OCR engine as part of
OcrProcessContext. - IOrientationPredictor - Interface in com.itextpdf.pdfocr.onnx.orientation
-
Interface for predictors, which take a cropped image of text and determine its orientation.
- IOrtSessionOptionsCreator - Interface in com.itextpdf.pdfocr.onnx
-
Interface for ONNX runtime session options creators.
- IOutputLabelMapper<T> - Interface in com.itextpdf.pdfocr.onnx
-
Interface for mapping an integer index (continuous from 0) to output values.
-
IPredictor<T,
R> - Interface in com.itextpdf.pdfocr.onnx -
Interface of a generic predictor.
- IProductAware - Interface in com.itextpdf.pdfocr
-
The interface that holds information about product data and meta info.
- IRecognitionPostProcessor - Interface in com.itextpdf.pdfocr.onnx.recognition
-
Interface for post-processors, which convert raw output of an ML model and returns recognized characters as a string.
- IRecognitionPredictor - Interface in com.itextpdf.pdfocr.onnx.recognition
-
Interface for predictors, which take a cropped image of text and recognize text characters on it.
- IRISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- IScoreCalculator - Interface in com.itextpdf.pdfocr.onnx.detection.score
-
Interface for abstracting away score calculation over a text contour in the text detection post-processor.
- isFixedSize() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
-
Returns whether the target size is fixed.
- isPreprocessingImages() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Checks whether image preprocessing is needed.
- isSmoothTiling() - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
-
Gets
ImagePreprocessingOptions.smoothTiling. - isTagged() - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Retrieve information on whether pdf document should be tagged or not.
- isTaggingSupported() - Method in interface com.itextpdf.pdfocr.IOcrEngine
-
Checks whether tagging is supported by the OCR engine.
- isTaggingSupported() - Method in class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
-
Checks whether tagging is supported by the OCR engine.
- isTaggingSupported() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
- isTiffImage(File) - Static method in class com.itextpdf.pdfocr.util.TiffImageUtil
-
Checks whether image type is TIFF.
- isUseTxtToImproveHocrParsing() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Gets
Tesseract4OcrEngineProperties.useTxtToImproveHocrParsing. - isValidContour(Mat, Rect) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Returns whether the contour is good enough to be a text box.
- isWindows() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Checks current os type.
- ITALIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ITextBoxMerger - Interface in com.itextpdf.pdfocr.onnx.merging
-
Interface for a processing class, which handles merging text boxes, received from a text detection routine.
J
- JAPANESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- JAPANESE_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- JAPANESE_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- JAVANESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- JAVANESE_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- JAVANESE_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- JAVANESE_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- JAVANESE_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- JAVANESE_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- JAVANESE_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
K
- KANNADA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KANNADA_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KANNADA_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KANNADA_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- KANNADA_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KANNADA_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KANNADA_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KANNADA_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KANNADA_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KAZAKH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KHMER - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KHMER_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KHMER_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KHMER_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KHMER_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KHMER_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KHMER_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KHMER_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KOREAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KOREAN_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- KOREAN_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- KURDISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- KYRGYZ - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
L
- labelDimension() - Method in class com.itextpdf.pdfocr.onnx.recognition.CrnnPostProcessor
-
Returns the size of the output character label vector.
- labelDimension() - Method in class com.itextpdf.pdfocr.onnx.recognition.CtcLabelPostProcessor
-
Returns the size of the output character label vector.
- labelDimension() - Method in class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
-
Returns the size of the output character label vector.
- labelDimension() - Method in interface com.itextpdf.pdfocr.onnx.recognition.IRecognitionPostProcessor
-
Returns the size of the output character label vector.
- LANGUAGE_IS_NOT_IN_THE_LIST - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- LAO - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- LATIN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- LATIN_EXTENDED - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- LATIN_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- LATIN_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- LATVIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- LEGACY_FRENCH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- LeptonicaImageRotationHandler - Class in com.itextpdf.pdfocr.tesseract4
-
Leptonica based implementation of
IImageRotationHandler. - LeptonicaImageRotationHandler() - Constructor for class com.itextpdf.pdfocr.tesseract4.LeptonicaImageRotationHandler
- limit() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Returns this buffer's limit.
- limit(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Sets this buffer's limit.
- linkNet(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained LinkNet model, stored on disk.
- linkNet(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained LinkNet models, stored on disk.
- linkNet(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained LinkNet model, stored on disk.
- linkNet(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained LinkNet models, stored on disk.
- LITHUANIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- LogicalStructureTreeItem - Class in com.itextpdf.pdfocr.structuretree
-
This class represents structure tree item of the text item put into the pdf document.
- LogicalStructureTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
-
Instantiate a new
LogicalStructureTreeIteminstance. - LogicalStructureTreeItem(AccessibilityProperties) - Constructor for class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
-
Instantiate a new
LogicalStructureTreeIteminstance. - logit(double) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
-
Computes the logit function, which is the inverse of expit, for the given input.
- logit(float) - Static method in class com.itextpdf.pdfocr.onnx.util.MathUtil
-
Computes the logit function, which is the inverse of expit, for the given input.
- LOOK_UP_STRING_CONTAINS_2_CODE_UNITS_POINTS - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- LUXEMBOURGISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
M
- MACEDONIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- makePdfSearchable(PdfDocument) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Performs OCR of all images in an input PDF document and adds recognized text on top of the images.
- makePdfSearchable(PdfDocument, IOcrProcessProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Performs OCR of all images in an input PDF document and adds recognized text on top of the images.
- makePdfSearchable(File, File) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Performs OCR of all images in an input PDF file and generates searchable PDF.
- makePdfSearchable(File, File, IOcrProcessProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Performs OCR of all images in an input PDF file and generates searchable PDF.
- MALAGASY - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MALAY - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MALAYALAM - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MALAYALAM_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MALAYALAM_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MALAYALAM_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MALAYALAM_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MALAYALAM_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MALAYALAM_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MALTESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MAORI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- map(int) - Method in interface com.itextpdf.pdfocr.onnx.IOutputLabelMapper
-
Returns value, which is mapped to the specified index.
- map(int) - Method in class com.itextpdf.pdfocr.onnx.orientation.DefaultOrientationMapper
-
Returns value, which is mapped to the specified index.
- map(int) - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
-
Returns character, which is mapped to the specified index in the lookup string.
- map(int) - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
-
Returns character, which is mapped to the specified index in the lookup string.
- mapPredToSample(float) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Calculates the score sample value, based on a prediction value from the buffer.
- mapPredToSample(float) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
-
Calculates the score sample value, based on a prediction value from the buffer.
- MARATHI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- master(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained MASTER model, stored on disk.
- master(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained MASTER models, stored on disk.
- master(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained MASTER model, stored on disk.
- master(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained MASTER models, stored on disk.
- MathUtil - Class in com.itextpdf.pdfocr.onnx.util
-
Additional math functions.
- MAX_HEIGHT_SHOULD_BE_A_MULTIPLE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- MAX_HEIGHT_SHOULD_NOT_BE_LESS_THAN_MIN - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- MAX_SHOULD_NOT_BE_LESS_THAN_MIN - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- MAX_WIDTH_SHOULD_BE_A_MULTIPLE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- MAX_WIDTH_SHOULD_NOT_BE_LESS_THAN_MIN - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- MaxScoreCalculator - Class in com.itextpdf.pdfocr.onnx.detection.score
-
Score calculator, which returns the biggest observed sample.
- MaxScoreCalculator() - Constructor for class com.itextpdf.pdfocr.onnx.detection.score.MaxScoreCalculator
-
Creates a new score calculator.
- MeanScoreCalculator - Class in com.itextpdf.pdfocr.onnx.detection.score
-
Score calculator, which calculates the mean values over the observed samples.
- MeanScoreCalculator() - Constructor for class com.itextpdf.pdfocr.onnx.detection.score.MeanScoreCalculator
-
Creates a new score calculator.
- MIN_HEIGHT_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- MIN_WIDTH_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- minAreaRectBoxPoints(Mat) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
-
Equivalent to calling OpenCV
minAreaRect, followed byboxPoints, but with resource handling taken care of. - minAreaRectBoxPoly(Mat) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
-
Creates an OpenCV polygon, based on the results of a
minAreaRectBoxPoints(points)call, but with resource handling taken care of. - mobileNetV3(String) - Static method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
-
Creates a new crop orientation predictor using an existing pre-trained MobileNetV3 model, stored on disk.
- mobileNetV3(String) - Static method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
-
Creates a new crop orientation properties object for existing pre-trained MobileNetV3 models, stored on disk.
- mobileNetV3(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
-
Creates a new crop orientation predictor using an existing pre-trained MobileNetV3 model, stored on disk.
- mobileNetV3(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
-
Creates a new crop orientation properties object for existing pre-trained MobileNetV3 models, stored on disk.
- MODEL_DID_NOT_PASS_VALIDATION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- MODEL_ONLY_SUPPORTS_RGB - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- modelPath - Variable in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
-
Path to the ONNX model to load.
- MONGOLIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MONTENEGRIN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MULTI_LANG - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- MULTI_LANG_FULL - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
N
- NAME - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.CtcLabelDecode
-
Expected name for the CTCLabelDecode post-processor.
- NAME - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DbPostProcess
-
Expected name for the CTCLabelDecode post-processor.
- NEGATIVE_VALUE_IN_SHAPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- NEPALI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- next() - Method in class com.itextpdf.pdfocr.onnx.util.BatchProcessingGenerator
- normalizedMinAreaRect(Mat) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
-
OpenCV minAreaRect, but returns the normalized rectangle immediately.
- NormalizeImage - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
POJO for the NormalizeImage transform operation within a
PreProcessobject in a config file. - NormalizeImage(float[], float[]) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
-
Creates a new POJO for the config file object.
- normalizeRotatedRect(RotatedRect) - Static method in class com.itextpdf.pdfocr.onnx.util.OpenCvUtil
-
Normalizes RotatedRect, so that its angle is in the [-45; 45) range.
- NORWEGIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- NUMBER_OF_PAGES_IN_IMAGE - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
-
The constant NUMBER_OF_PAGES_IN_IMAGE.
O
- objToBool(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
-
Casts a parsed YAML object to a boolean, if it is a bool.
- objToFloat(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
-
Casts a parsed YAML object to a double, if it is a float.
- objToInt(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
-
Casts a parsed YAML object to an int32, if it is an int.
- objToMapping(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
-
Casts a parsed YAML object to a map/dictionary, if it is a mapping.
- objToSequence(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
-
Casts a parsed YAML object to a collection, if it is a sequence.
- objToString(Object) - Static method in class com.itextpdf.pdfocr.onnx.util.YamlUtil
-
Casts a parsed YAML object to a string, if it is a string.
- observe(float) - Method in interface com.itextpdf.pdfocr.onnx.detection.score.IScoreCalculator
-
Observe a sample value from the text contour.
- observe(float) - Method in class com.itextpdf.pdfocr.onnx.detection.score.MaxScoreCalculator
-
Observe a sample value from the text contour.
- observe(float) - Method in class com.itextpdf.pdfocr.onnx.detection.score.MeanScoreCalculator
-
Observe a sample value from the text contour.
- OcrEngineProperties - Class in com.itextpdf.pdfocr
-
This class contains additional properties for ocr engine.
- OcrEngineProperties() - Constructor for class com.itextpdf.pdfocr.OcrEngineProperties
-
Creates a new
OcrEnginePropertiesinstance. - OcrEngineProperties(OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.OcrEngineProperties
-
Creates a new
OcrEnginePropertiesinstance based on anotherOcrEnginePropertiesinstance (copy constructor). - OcrPdfCreator - Class in com.itextpdf.pdfocr
-
OcrPdfCreatoris the class that creates PDF documents containing input images and text that was recognized using providedIOcrEngine. - OcrPdfCreator(IOcrEngine) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreator
-
Creates a new
OcrPdfCreatorinstance. - OcrPdfCreator(IOcrEngine, OcrPdfCreatorProperties) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreator
-
Creates a new
OcrPdfCreatorinstance. - OcrPdfCreatorProperties - Class in com.itextpdf.pdfocr
-
Properties that will be used by the
OcrPdfCreator. - OcrPdfCreatorProperties() - Constructor for class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Creates a new
OcrPdfCreatorPropertiesinstance. - OcrPdfCreatorProperties(OcrPdfCreatorProperties) - Constructor for class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Creates a new
OcrPdfCreatorPropertiesinstance based on anotherOcrPdfCreatorPropertiesinstance (copy constructor). - OcrProcessContext - Class in com.itextpdf.pdfocr
-
Class for storing ocr processing context.
- OcrProcessContext(AbstractPdfOcrEventHelper) - Constructor for class com.itextpdf.pdfocr.OcrProcessContext
-
Creates an instance of ocr process context
- ODIA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ODIA_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ODIA_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ODIA_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ODIA_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ODIA_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ODIA_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- ODIA_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- onEvent(AbstractProductITextEvent) - Method in class com.itextpdf.pdfocr.AbstractPdfOcrEventHelper
-
Handles the event.
- ONNX_RUNTIME_OPERATION_FAILED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- OnnxDetectionPostProcessor - Class in com.itextpdf.pdfocr.onnx.detection
-
Implementation of a text detection predictor post-processor, used for OnnxTR model outputs.
- OnnxDetectionPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
-
Creates a new post-processor with the default threshold values.
- OnnxDetectionPostProcessor(float, float) - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPostProcessor
-
Creates a new post-processor.
- OnnxDetectionPredictor - Class in com.itextpdf.pdfocr.onnx.detection
-
A text detection predictor implementation, which is using ONNX Runtime and its ML models to find, where text is located on an image.
- OnnxDetectionPredictor(OnnxDetectionPredictorProperties) - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a text detection predictor with the specified properties.
- OnnxDetectionPredictorProperties - Class in com.itextpdf.pdfocr.onnx.detection
-
Properties for configuring text detection ONNX models.
- OnnxDetectionPredictorProperties(String, OnnxInputProperties, IDetectionPostProcessor) - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates new text detection predictor properties.
- OnnxDetectionPredictorProperties(String, OnnxInputProperties, IDetectionPostProcessor, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates new text detection predictor properties.
- OnnxEngineProperties - Class in com.itextpdf.pdfocr.onnx
-
Properties that are used by the
OnnxOcrEngine. - OnnxEngineProperties() - Constructor for class com.itextpdf.pdfocr.onnx.OnnxEngineProperties
-
Creates a new
OnnxEnginePropertiesinstance. - OnnxInputProperties - Class in com.itextpdf.pdfocr.onnx
-
Properties of the input of an ONNX model, which expects an image.
- OnnxInputProperties(ImageResizeOptions) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Creates model input properties.
- OnnxInputProperties(ImageResizeOptions, float[], float[]) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Creates model input properties.
- OnnxInputProperties(ImageResizeOptions, float[], float[], int) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Creates model input properties.
- OnnxInputProperties(ImageResizeOptions, int) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Creates model input properties.
- OnnxOcrEngine - Class in com.itextpdf.pdfocr.onnx
-
IOcrEngineimplementation, based on OnnxTR/DocTR machine learning OCR projects. - OnnxOcrEngine(IDetectionPredictor, IOrientationPredictor, IRecognitionPredictor) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
-
Create a new OCR engine with the provided predictors.
- OnnxOcrEngine(IDetectionPredictor, IOrientationPredictor, IRecognitionPredictor, OnnxEngineProperties) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
-
Create a new OCR engine with the provided predictors.
- OnnxOcrEngine(IDetectionPredictor, IRecognitionPredictor) - Constructor for class com.itextpdf.pdfocr.onnx.OnnxOcrEngine
-
Create a new OCR engine with the provided predictors, without text orientation prediction.
- OnnxOrientationPredictor - Class in com.itextpdf.pdfocr.onnx.orientation
-
A crop orientation predictor implementation, which is using ONNX Runtime and its ML models to figure out, how text is oriented in a cropped image of text.
- OnnxOrientationPredictor(OnnxOrientationPredictorProperties) - Constructor for class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor
-
Creates a crop orientation predictor with the specified properties.
- OnnxOrientationPredictorProperties - Class in com.itextpdf.pdfocr.onnx.orientation
-
Properties for configuring crop orientation ONNX models.
-
OnnxOrientationPredictorProperties(String, OnnxInputProperties, IOutputLabelMapper
) - Constructor for class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties -
Creates new crop orientation predictor properties.
-
OnnxOrientationPredictorProperties(String, OnnxInputProperties, IOutputLabelMapper
, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties -
Creates new crop orientation predictor properties.
- OnnxRecognitionPredictor - Class in com.itextpdf.pdfocr.onnx.recognition
-
A text recognition predictor implementation, which is using ONNX Runtime and its ML models to recognize text characters on an image.
- OnnxRecognitionPredictor(OnnxRecognitionPredictorProperties) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a text recognition predictor with the specified properties.
- OnnxRecognitionPredictorProperties - Class in com.itextpdf.pdfocr.onnx.recognition
-
Properties for configuring text recognition ONNX models.
- OnnxRecognitionPredictorProperties(String, OnnxInputProperties, IRecognitionPostProcessor) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates new text recognition predictor properties.
- OnnxRecognitionPredictorProperties(String, OnnxInputProperties, IRecognitionPostProcessor, boolean) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates new text recognition predictor properties.
- OnnxRecognitionPredictorProperties(String, OnnxInputProperties, IRecognitionPostProcessor, boolean, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates new text recognition predictor properties.
- OnnxRecognitionPredictorProperties(String, OnnxInputProperties, IRecognitionPostProcessor, IOrtSessionOptionsCreator) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates new text recognition predictor properties.
- OpenCvUtil - Class in com.itextpdf.pdfocr.onnx.util
-
Static class with OpenCV utility functions.
- ortSessionOptionsCreator - Variable in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictorProperties
-
ONNX runtime session options creator.
- OutputFormat - Enum in com.itextpdf.pdfocr.tesseract4
-
Enumeration of the available output formats.
P
- PaddingStrategy - Enum in com.itextpdf.pdfocr.onnx
-
Enumeration of implemented padding strategies for padding images.
- paddleOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained PaddleOCR model, stored on disk.
- paddleOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained PaddleOCR models, stored on disk.
- paddleOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained PaddleOCR model, stored on disk.
- paddleOcr(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained PaddleOCR models, stored on disk.
- paddleOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained PaddleOCR model, stored on disk.
- paddleOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained PaddleOCR models, stored on disk.
- paddleOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained PaddleOCR model, stored on disk.
- paddleOcr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained PaddleOCR models, stored on disk.
- paddleOcr(String, String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained PaddleOCR model, stored on disk.
- paddleOcr(String, String) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained PaddleOCR models, stored on disk.
- paddleOcr(String, String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained PaddleOCR model, stored on disk.
- paddleOcr(String, String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained PaddleOCR models, stored on disk.
- paddleOcr(String, String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor
-
Creates a new text detection predictor using an existing pre-trained PaddleOCR model, stored on disk.
- paddleOcr(String, String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
-
Creates a new text detection properties object for existing pre-trained PaddleOCR models, stored on disk.
- paddleOcr(String, String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained PaddleOCR model, stored on disk.
- paddleOcr(String, String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained PaddleOCR models, stored on disk.
- PaddleOcrDetectionPostProcessor - Class in com.itextpdf.pdfocr.onnx.detection
-
Implementation of a text detection predictor post-processor, used for PaddleOCR model outputs.
- PaddleOcrDetectionPostProcessor() - Constructor for class com.itextpdf.pdfocr.onnx.detection.PaddleOcrDetectionPostProcessor
-
Creates a new post-processor with the default parameters.
- PaddleOcrDetectionPostProcessor(float, float, float, int) - Constructor for class com.itextpdf.pdfocr.onnx.detection.PaddleOcrDetectionPostProcessor
-
Creates a new post-processor.
- PaddleOcrInitException - Exception in com.itextpdf.pdfocr.onnx.exceptions
-
Exception class for exceptions during PaddleOCR initialization.
- PaddleOcrInitException(String) - Constructor for exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates new
PaddleOcrInitExceptioninstance. - PAGE_NUMBER_IS_INCORRECT - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- PAGE_SIZE_IS_NOT_APPLIED - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
- ParagraphTreeItem - Class in com.itextpdf.pdfocr.structuretree
-
A convenience class to associate certain text items with the paragraph structure item.
- ParagraphTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.ParagraphTreeItem
-
Instantiate a new
ParagraphTreeIteminstance. - parse(InputStream) - Static method in class com.itextpdf.pdfocr.onnx.conf.paddle.parser.InferenceConfigParser
-
Parses a PaddleOCR config file into a
InferenceConfigPOJO. - parSeq(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained PARSeq model, stored on disk.
- parSeq(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained PARSeq models, stored on disk.
- parSeq(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained PARSeq model, stored on disk.
- parSeq(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained PARSeq models, stored on disk.
- parSeq(String, Vocabulary, int) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained PARSeq model, stored on disk.
- parSeq(String, Vocabulary, int) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained PARSeq models, stored on disk.
- parSeq(String, Vocabulary, int, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained PARSeq model, stored on disk.
- parSeq(String, Vocabulary, int, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained PARSeq models, stored on disk.
- PASHTO - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PATH_TO_TESS_DATA_DIRECTORY_CONTAINS_NON_ASCII_CHARACTERS - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- PATH_TO_TESS_DATA_DIRECTORY_IS_INVALID - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- PATH_TO_TESS_DATA_IS_NOT_SET - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- PDF - Enum constant in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
-
Creating a PDF file
- PDF_DOCUMENT_MUST_BE_OPENED_IN_STAMPING_MODE - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- PDF_LANGUAGE_PROPERTY_IS_NOT_SET - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
-
The constant PDF_LANGUAGE_PROPERTY_IS_NOT_SET.
- PDFA - Enum constant in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
-
Creating a PDF-A file
- PDFA_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- PdfOcrException - Exception in com.itextpdf.pdfocr.exceptions
-
Exception class for custom exceptions.
- PdfOcrException(String) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrException
-
Creates a new
PdfOcrException. - PdfOcrException(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrException
-
Creates a new
PdfOcrException. - PdfOcrException(Throwable) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrException
-
Creates a new
PdfOcrException. - PdfOcrExceptionMessageConstant - Class in com.itextpdf.pdfocr.exceptions
-
Class that bundles all the exception message templates as constants.
- PdfOcrFileUtil - Class in com.itextpdf.pdfocr.util
-
Utility class for working with files.
- PdfOcrFontProvider - Class in com.itextpdf.pdfocr
-
FontProviderextension for ocr engine. - PdfOcrFontProvider() - Constructor for class com.itextpdf.pdfocr.PdfOcrFontProvider
-
Creates a new
PdfOcrFontProviderinstance with the default font and the default font family. - PdfOcrFontProvider(FontSet, String) - Constructor for class com.itextpdf.pdfocr.PdfOcrFontProvider
-
Creates a new
PdfOcrFontProviderinstance based on providedFontSetinstance and font family. - PdfOcrInputException - Exception in com.itextpdf.pdfocr.exceptions
-
Exception class for input related exceptions.
- PdfOcrInputException(String) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrInputException
-
Creates a new
PdfOcrInputException. - PdfOcrInputException(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrInputException
-
Creates a new
PdfOcrInputException. - PdfOcrInputException(Throwable) - Constructor for exception com.itextpdf.pdfocr.exceptions.PdfOcrInputException
-
Creates a new
PdfOcrInputException. - PdfOcrInputTesseract4Exception - Exception in com.itextpdf.pdfocr.tesseract4.exceptions
-
Exception class for Tesseract4 input related exceptions.
- PdfOcrInputTesseract4Exception(String) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrInputTesseract4Exception
-
Creates a new
PdfOcrInputTesseract4Exception. - PdfOcrInputTesseract4Exception(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrInputTesseract4Exception
-
Creates a new
PdfOcrInputTesseract4Exception. - PdfOcrInputTesseract4Exception(Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrInputTesseract4Exception
-
Creates a new
PdfOcrInputTesseract4Exception. - PdfOcrLogMessageConstant - Class in com.itextpdf.pdfocr.logs
-
Class that bundles all the log message templates as constants.
- PdfOcrMetaInfoContainer - Class in com.itextpdf.pdfocr
-
Container to keep meta info.
- PdfOcrMetaInfoContainer(IMetaInfo) - Constructor for class com.itextpdf.pdfocr.PdfOcrMetaInfoContainer
-
Creates instance of container to keep passed meta info.
- PdfOcrOnnxExceptionMessageConstant - Class in com.itextpdf.pdfocr.onnx.exceptions
-
Class that bundles all the error message templates as constants.
- PdfOcrOnnxProductData - Class in com.itextpdf.pdfocr.onnx.actions.data
-
Stores an instance of
ProductDatarelated to iText pdfOcr Onnx module. - PdfOcrOnnxProductEvent - Class in com.itextpdf.pdfocr.onnx.actions.events
-
Class represents events registered in iText pdfOcr Onnx module.
- PdfOcrOutputType - Enum in com.itextpdf.pdfocr.statistics
-
pdfOcr output types for statistics.
- PdfOcrOutputTypeStatisticsEvent - Class in com.itextpdf.pdfocr.statistics
-
Class which represents an event for specifying type of an ocr processing.
- PdfOcrOutputTypeStatisticsEvent(PdfOcrOutputType, ProductData) - Constructor for class com.itextpdf.pdfocr.statistics.PdfOcrOutputTypeStatisticsEvent
-
Creates instance of pdfOcr statistics event.
- PdfOcrTesseract4Exception - Exception in com.itextpdf.pdfocr.tesseract4.exceptions
-
Exception class for Tesseract4 exceptions.
- PdfOcrTesseract4Exception(String) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4Exception
-
Creates a new
PdfOcrTesseract4Exception. - PdfOcrTesseract4Exception(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4Exception
-
Creates a new
PdfOcrTesseract4Exception. - PdfOcrTesseract4Exception(Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4Exception
-
Creates a new
PdfOcrTesseract4Exception. - PdfOcrTesseract4ExceptionMessageConstant - Class in com.itextpdf.pdfocr.tesseract4.exceptions
-
Class that bundles all the error message templates as constants.
- PdfOcrTesseract4ProductData - Class in com.itextpdf.pdfocr.tesseract4.actions.data
-
Stores an instance of
ProductDatarelated to iText pdfOcr Tesseract4 module. - PdfOcrTesseract4ProductData() - Constructor for class com.itextpdf.pdfocr.tesseract4.actions.data.PdfOcrTesseract4ProductData
- PdfOcrTesseract4ProductEvent - Class in com.itextpdf.pdfocr.tesseract4.actions.events
-
Class represents events registered in iText pdfOcr Tesseract4 module.
- PdfOcrTextBuilder - Class in com.itextpdf.pdfocr.util
-
Class to build text output from the provided image OCR result and write it to the TXT file.
- PERSIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PERSIAN_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- POLISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- POLY - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.BoxType
-
Value for
polyconfig value. - PORTUGUESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- position(int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Sets this buffer's position.
- PostProcess - Interface in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
Interface for objects under the
PostProcesskey in a config file. - PRE_PROCESSOR_OPERATION_MISSING - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
-
predict(Iterable
) - Method in interface com.itextpdf.pdfocr.onnx.IPredictor -
Performs prediction on a sequence of input items provided as an
Iterable. -
predict(Iterator
) - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor -
Performs prediction on a sequence of input items.
-
predict(Iterator
) - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor -
Performs prediction on a sequence of input items.
-
predict(Iterator
) - Method in interface com.itextpdf.pdfocr.onnx.IPredictor -
Performs prediction on a sequence of input items.
- PreProcess - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
POJO for the object under the
PreProcesskey in a config file. - PreProcess(TransformOp[]) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.PreProcess
-
Creates a new POJO for the config file object.
- preProcessorOperationMissing(String) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates an exception for cases, when an expected pre-processing operation is missing in the configuration file.
- process(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.recognition.BasicLabelPostProcessor
-
Process ML model output and return recognized characters as string.
- process(FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.recognition.EndOfStringPostProcessor
-
Process ML model output and return recognized characters as string.
- process(FloatBufferMdArray) - Method in interface com.itextpdf.pdfocr.onnx.recognition.IRecognitionPostProcessor
-
Process ML model output and return recognized characters as string.
- process(BufferedImage, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.BasicDetectionPostProcessor
-
Process ML model output for a specified image and return a list of detected objects.
- process(BufferedImage, FloatBufferMdArray) - Method in class com.itextpdf.pdfocr.onnx.detection.EasyOcrDetectionPostProcessor
-
Process ML model output for a specified image and return a list of detected objects.
- process(BufferedImage, FloatBufferMdArray) - Method in interface com.itextpdf.pdfocr.onnx.detection.IDetectionPostProcessor
-
Process ML model output for a specified image and return a list of detected objects.
-
process(List
) - Method in class com.itextpdf.pdfocr.onnx.merging.EasyOcrTextBoxMerger -
Merges text boxes based on some set of rules.
-
process(List
) - Method in interface com.itextpdf.pdfocr.onnx.merging.ITextBoxMerger -
Merges text boxes based on some set of rules.
- PROCESS_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.actions.events.PdfOcrTesseract4ProductEvent
-
Process image event type.
- PROCESS_IMAGE_ONNX - Static variable in class com.itextpdf.pdfocr.onnx.actions.events.PdfOcrOnnxProductEvent
-
Process image event type.
-
processBatch(List
) - Method in interface com.itextpdf.pdfocr.onnx.util.IBatchProcessor -
Processes a batch of input items and produces a corresponding batch of output items.
- PROVIDED_FONT_PROVIDER_IS_INVALID - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
-
The constant PROVIDED_FONT_PROVIDER_IS_INVALID.
- PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PUNJABI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PUNJABI_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PUNJABI_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PUNJABI_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PUNJABI_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PUNJABI_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PUNJABI_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- PUNJABI_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- put(float) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Relative put method.
- put(float[], int, int) - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Relative bulk put method.
Q
- QUAD - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.BoxType
-
Value for
quadconfig value. - QUECHUA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
R
- RecResizeImg - Class in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
POJO for the RecResizeImg transform operation within a
PreProcessobject in a config file. - RecResizeImg(int[]) - Constructor for class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
-
Creates a new POJO for the config file object.
- remaining() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Returns the number of elements between the current position and the limit.
- removeChild(LogicalStructureTreeItem) - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
-
Remove child structure tree item.
- rewind() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Rewinds this buffer.
- RGB - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
-
Value for
RGBconfig value. - RGB - Enum constant in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
-
Image is represented with three channels: red, green, blue.
- ROMANIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- rotate(BufferedImage, TextOrientation) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil
-
Rotates image based on text orientation.
- RUSSIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- RUSSIAN_CYRILLIC_LETTERS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- RUSSIAN_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
S
- SANSKRIT - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- sar(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained SAR model, stored on disk.
- sar(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained SAR models, stored on disk.
- sar(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained SAR model, stored on disk.
- sar(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained SAR models, stored on disk.
- SCALE_HEIGHT - Enum constant in enum com.itextpdf.pdfocr.ScaleMode
-
Only height of the image will be proportionally scaled to fit required size that is set using
OcrPdfCreatorProperties.setPageSize(Rectangle)method. - SCALE_TO_FIT - Enum constant in enum com.itextpdf.pdfocr.ScaleMode
-
The image will be scaled to fit within the page width and height dimensions that are set using
OcrPdfCreatorProperties.setPageSize(Rectangle)method. - SCALE_WIDTH - Enum constant in enum com.itextpdf.pdfocr.ScaleMode
-
Only width of the image will be proportionally scaled to fit required size that is set using
OcrPdfCreatorProperties.setPageSize(Rectangle)method. - ScaleMode - Enum in com.itextpdf.pdfocr
-
Enumeration of the possible scale modes for input images.
- SCORE_MODE_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- ScoreMode - Enum in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
Enum for values under the
score_modekey within a DBPostProcessPostProcessorobject in a config file. - scoreModeIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates an exception for cases, when the detection model uses an unsupported score calculation method.
- SCOTTISH_GAELIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SERBIAN_CYRILLIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SERBIAN_LATIN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- setAccessibilityProperties(AccessibilityProperties) - Method in class com.itextpdf.pdfocr.structuretree.LogicalStructureTreeItem
-
Set structure tree element's properties.
- setFontProvider(FontProvider) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets font provider.
- setFontProvider(FontProvider, String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets font provider and default font family.
- setImageLayerName(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets name for the image layer,
nullby default. - setImagePreprocessingOptions(ImagePreprocessingOptions) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Sets
Tesseract4OcrEngineProperties.imagePreprocessingOptions. - setImageRotationHandler(IImageRotationHandler) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets image rotation handler instance.
-
setLanguages(List
) - Method in class com.itextpdf.pdfocr.OcrEngineProperties -
Sets list of languages to be recognized in provided images.
- setLogicalStructureTreeItem(LogicalStructureTreeItem) - Method in class com.itextpdf.pdfocr.TextInfo
-
Sets logical structure tree parent item for the text info.
- setMessageParams(String...) - Method in exception com.itextpdf.pdfocr.exceptions.PdfOcrException
-
Sets additional params for Exception message.
- setMetaInfo(IMetaInfo) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Set meta info for this
OcrPdfCreatorProperties. - setMinimalConfidenceLevel(int) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Sets minimal confidence level for HOCR line to be considered as properly recognized.
- setOcrEngine(IOcrEngine) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Sets
IOcrEnginereader object to perform OCR. - setOcrEventHelper(AbstractPdfOcrEventHelper) - Method in class com.itextpdf.pdfocr.OcrProcessContext
-
Sets ocr event helper.
- setOcrPdfCreatorProperties(OcrPdfCreatorProperties) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Sets properties for
OcrPdfCreator. - setPageSegMode(Integer) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Sets Page Segmentation Mode.
- setPageSize(Rectangle) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets required size for output PDF document.
- setPathToExecutable(String) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
-
Sets path to tesseract executable.
- setPathToTessData(File) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Sets path to directory with tess data.
- setPdfLang(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Specify PDF natural language, and optionally locale.
- setPixelTextPoints(Point[], int) - Method in class com.itextpdf.pdfocr.TextInfo
-
Sets an array of 4
Points describing text bbox (lower-left based relative to text) expressed in pixels. - setPreprocessingImages(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Sets true if image preprocessing is needed.
- setScaleMode(ScaleMode) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets scale mode for input images using available options from
ScaleModeenumeration. - setSmoothTiling(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
-
Sets
ImagePreprocessingOptions.smoothTiling. - setTagged(boolean) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Defines whether pdf document should be tagged or not.
- setTesseract4OcrEngineProperties(Tesseract4OcrEngineProperties) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
-
Sets properties for
AbstractTesseract4OcrEngine. - setText(String) - Method in class com.itextpdf.pdfocr.TextInfo
-
Sets text element.
- setTextBBoxColor(Color) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets text bounding box color in the output PDF document.
- setTextColor(Color) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets text color in output PDF document.
- setTextLayerName(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets name for the text layer.
- setTextPoints(Point[]) - Method in class com.itextpdf.pdfocr.TextInfo
-
Sets array of 4
Points describing text bbox (lower-left based relative to text) expressed in points. - setTextPositioning(TextPositioning) - Method in class com.itextpdf.pdfocr.onnx.OnnxEngineProperties
-
Defines the way text is retrieved from ocr engine output using
TextPositioning. - setTextPositioning(TextPositioning) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Defines the way text is retrieved from tesseract output using
TextPositioning. - setTileHeight(int) - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
-
Sets
ImagePreprocessingOptions.tileHeight. - setTileWidth(int) - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
-
Sets
ImagePreprocessingOptions.tileWidth. - setTitle(String) - Method in class com.itextpdf.pdfocr.OcrPdfCreatorProperties
-
Sets PDF document title.
- setUseTxtToImproveHocrParsing(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Sets
Tesseract4OcrEngineProperties.useTxtToImproveHocrParsing. - SHAPE_IS_NOT_VALID - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- SHOULD_BE_AT_LEAST_ONE_IMAGE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- shouldSplitImages() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Returns whether input images should be split.
- SIMPLIFIED_CHINESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SINDHI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SINHALA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SINHALA_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SINHALA_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SINHALA_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SINHALA_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SINHALA_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SINHALA_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SINHALA_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- size() - Method in interface com.itextpdf.pdfocr.onnx.IOutputLabelMapper
-
Returns a number of mappable values.
- size() - Method in class com.itextpdf.pdfocr.onnx.orientation.DefaultOrientationMapper
-
Returns a number of mappable values.
- size() - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
-
Returns the size of the string mapper.
- size() - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
-
Returns the size of the vocabulary.
- slice() - Method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Creates a new float buffer whose content is a shared subsequence of this buffer's content.
- SLOVAK - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SLOVENE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SLOW - Enum constant in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ScoreMode
-
Value for
slowconfig value. - SOMALI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
-
sortTextInfosByLines(Map
>) - Static method in class com.itextpdf.pdfocr.util.PdfOcrTextBuilder -
Sorts the provided
IOcrEngine.doImageOcr(java.io.File)result by lines. - SPANISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SpanTreeItem - Class in com.itextpdf.pdfocr.structuretree
-
A convenience class to associate certain text items with the span structure item.
- SpanTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.SpanTreeItem
-
Instantiate a new
SpanTreeIteminstance. - START_OCR_FOR_IMAGES - Static variable in class com.itextpdf.pdfocr.logs.PdfOcrLogMessageConstant
-
The constant START_OCR_FOR_IMAGES.
- STATISTICS_EVENT_TYPE_CANT_BE_NULL - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- STATISTICS_EVENT_TYPE_IS_NOT_DETECTED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- StringMapper - Class in com.itextpdf.pdfocr.onnx.recognition
-
Look-up table for mapping text recognition model results to strings.
- StringMapper(String) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.StringMapper
-
Creates a new string mapper based on a look-up string.
- StringMapper(String[]) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.StringMapper
-
Creates a new string mapper based on a look-up table.
- SUDANESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SUDANESE_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SUDANESE_DIACRITICS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SUDANESE_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SUDANESE_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SWAHILI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SWEDISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- SYMMETRIC_BLACK - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Image will be put into the middle.
- SYMMETRIC_EDGE - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Image will be put into the middle.
- SYMMETRIC_GRAY - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Image will be put into the middle.
- SYMMETRIC_WHITE - Enum constant in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Image will be put into the middle.
T
- TableCellTreeItem - Class in com.itextpdf.pdfocr.structuretree
-
A convenience class to associate certain text items with the table cell structure item.
- TableCellTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.TableCellTreeItem
-
Instantiate a new
TableCellTreeIteminstance. - TableRowTreeItem - Class in com.itextpdf.pdfocr.structuretree
-
A convenience class to associate certain text items with the table row structure item.
- TableRowTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.TableRowTreeItem
-
Instantiate a new
TableRowTreeIteminstance. - TableTreeItem - Class in com.itextpdf.pdfocr.structuretree
-
A convenience class to associate certain text items with the table structure item.
- TableTreeItem() - Constructor for class com.itextpdf.pdfocr.structuretree.TableTreeItem
-
Instantiate a new
TableTreeIteminstance. - TAGALOG - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAGGED_PDF_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- TAGGING_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.exceptions.PdfOcrExceptionMessageConstant
- TAJIK - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAMIL - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAMIL_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAMIL_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAMIL_FRACTIONS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAMIL_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAMIL_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAMIL_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAMIL_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TAMIL_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TATAR - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TELUGU - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TELUGU_CONSONANTS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TELUGU_DIGITS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TELUGU_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- TELUGU_MATRAS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TELUGU_PUNCTUATION - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TELUGU_SIGNS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TELUGU_VIRAMA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TELUGU_VOWELS - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TESSERACT_FAILED - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- TESSERACT_FAILED - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- TESSERACT_LIB_NOT_INSTALLED - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- TESSERACT_LIB_NOT_INSTALLED_WIN - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- TESSERACT_NOT_FOUND - Static variable in class com.itextpdf.pdfocr.tesseract4.exceptions.PdfOcrTesseract4ExceptionMessageConstant
- Tesseract4ExecutableOcrEngine - Class in com.itextpdf.pdfocr.tesseract4
-
The implementation of
AbstractTesseract4OcrEnginefor tesseract OCR. - Tesseract4ExecutableOcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
-
Creates a new
Tesseract4ExecutableOcrEngineinstance. - Tesseract4ExecutableOcrEngine(String, Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
-
Creates a new
Tesseract4ExecutableOcrEngineinstance. - Tesseract4LibOcrEngine - Class in com.itextpdf.pdfocr.tesseract4
-
The implementation of
AbstractTesseract4OcrEnginefor tesseract OCR. - Tesseract4LibOcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
-
Creates a new
Tesseract4LibOcrEngineinstance. - Tesseract4LogMessageConstant - Class in com.itextpdf.pdfocr.tesseract4.logs
-
Class that bundles all the log message templates as constants.
- Tesseract4OcrEngineProperties - Class in com.itextpdf.pdfocr.tesseract4
-
Properties that will be used by the
IOcrEngine. - Tesseract4OcrEngineProperties() - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Creates a new
Tesseract4OcrEnginePropertiesinstance. - Tesseract4OcrEngineProperties(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
-
Creates a new
Tesseract4OcrEnginePropertiesinstance based on anotherTesseract4OcrEnginePropertiesinstance (copy constructor). - TesseractHelper - Class in com.itextpdf.pdfocr.tesseract4
-
Helper class.
- TextInfo - Class in com.itextpdf.pdfocr
-
This class describes how recognized text is positioned on the image providing bbox for each text item (could be a line or a word).
- TextInfo() - Constructor for class com.itextpdf.pdfocr.TextInfo
-
Creates a new
TextInfoinstance. - TextInfo(TextInfo) - Constructor for class com.itextpdf.pdfocr.TextInfo
-
Creates a new
TextInfoinstance from existing one. - TextInfo(String, Point[]) - Constructor for class com.itextpdf.pdfocr.TextInfo
-
Creates new
TextInfoinstance. - TextInfo(String, Rectangle) - Constructor for class com.itextpdf.pdfocr.TextInfo
-
Creates new
TextInfoinstance. - TextOrientation - Enum in com.itextpdf.pdfocr
-
Enumeration of supported text orientations.
- TextPositioning - Enum in com.itextpdf.pdfocr.onnx.text
-
Enumeration of the possible types of text positioning.
- TextPositioning - Enum in com.itextpdf.pdfocr.tesseract4
-
Enumeration of the possible types of text positioning.
- THAI - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- THAI_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- TiffImageUtil - Class in com.itextpdf.pdfocr.util
-
Utility class to handle tiff images.
-
toBchwInput(Collection
, OnnxInputProperties) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil -
Converts a collection of images to a batched ML model input in a BCHW format with 1 or 3 channels.
-
toInputBuffer(List
) - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictor -
Converts predictor inputs to an ONNX runtime model batched input MD-array buffer.
-
toInputBuffer(List
) - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictor -
Converts predictor inputs to an ONNX runtime model batched input MD-array buffer.
-
toInputBuffer(List
) - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor -
Converts predictor inputs to an ONNX runtime model batched input MD-array buffer.
-
toInputBuffer(List
) - Method in class com.itextpdf.pdfocr.onnx.AbstractOnnxPredictor -
Converts predictor inputs to an ONNX runtime model batched input MD-array buffer.
- TOO_MANY_IMAGES - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- toString() - Method in class com.itextpdf.pdfocr.onnx.detection.OnnxDetectionPredictorProperties
- toString() - Method in class com.itextpdf.pdfocr.onnx.ImageResizeOptions
- toString() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
- toString() - Method in class com.itextpdf.pdfocr.onnx.orientation.OnnxOrientationPredictorProperties
- toString() - Method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
- toString() - Method in class com.itextpdf.pdfocr.onnx.recognition.StringMapper
- toString() - Method in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- toString() - Method in class com.itextpdf.pdfocr.onnx.util.Dimensions2D
- TransformOp - Interface in com.itextpdf.pdfocr.onnx.conf.paddle.model
-
Interface for objects inside the
transform_opsarray within aPreProcessobject in a config file. - truncateToRatio(BufferedImage, double) - Static method in class com.itextpdf.pdfocr.onnx.util.BufferedImageUtil
-
Truncates the input image, so that neither width/height, nor height/width ratios exceed the limit.
- TURKISH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- TXT - Enum constant in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
-
Reader will produce plain txt file.
U
- UKRAINIAN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- UNEXPECTED_CHANNEL_CONFIGURATION - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_CONFIG_KEY - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_DIMENSION_VALUE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_INPUT_SHAPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_INPUT_SIZE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_INPUT_TYPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_MAT_TYPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_MEAN_CHANNEL_COUNT - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_OUTPUT_SHAPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_OUTPUT_SIZE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_OUTPUT_TYPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_PADDING_STRATEGY - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_POST_PROCESSOR_TYPE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_SHAPE_SIZE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_STD_CHANNEL_COUNT - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- UNEXPECTED_VALUE_FOR_CONFIG_KEY - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- unexpectedKey(String) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.ConfigParserException
-
Creates an exception for cases, when an unexpected key was found in a configuration file mapping.
- unexpectedMeanChannelCount(int) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates an exception for cases, when the size of the array of means for normalization has an unexpected size.
- unexpectedPostProcessorType(String) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates an exception for cases, when an unexpected post-processor is specified in the configuration file.
- unexpectedStdChannelCount(int) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates an exception for cases, when the size of the array of standard deviations for normalization has an unexpected size.
- unexpectedValueForKey(String) - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.ConfigParserException
-
Creates an exception for cases, when an unexpected value was found in a configuration file mapping.
- UNSUPPORTED_EXIF_ORIENTATION_VALUE - Static variable in class com.itextpdf.pdfocr.tesseract4.logs.Tesseract4LogMessageConstant
- URDU - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- USE_DILATION_IS_NOT_SUPPORTED - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- useDilationIsNotSupported() - Static method in exception com.itextpdf.pdfocr.onnx.exceptions.PaddleOcrInitException
-
Creates an exception for cases, when the detection model uses an unsupported pre-processing step for input images.
- usesBottomRightPadding() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Returns whether the strategy uses bottom-right padding.
- usesImageEdge() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Returns whether the strategy uses the edge of the image for padding.
- usesSolidColor() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Returns whether the strategy uses a solid color for padding.
- usesSymmetricPadding() - Method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Returns whether the strategy uses symmetric padding.
- useSymmetricPad() - Method in class com.itextpdf.pdfocr.onnx.OnnxInputProperties
-
Returns whether padding should be symmetrical during input resizing.
- UYGHUR - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- UZBEK_CYRILLIC - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- UZBEK_LATIN - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
V
- validateInputPdfDocument(PdfDocument) - Method in class com.itextpdf.pdfocr.OcrPdfCreator
-
Validates input PDF document.
-
validateLanguages(List
) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine -
Validates list of provided languages and checks if they all exist in given tess data directory.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.BoxType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ScoreMode
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.ScaleMode
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.itextpdf.pdfocr.TextOrientation
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.BoxType
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ImgMode
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.onnx.conf.paddle.model.ScoreMode
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.onnx.ImageChannelConfiguration
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.onnx.PaddingStrategy
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.onnx.text.TextPositioning
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.ScaleMode
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.statistics.PdfOcrOutputType
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.itextpdf.pdfocr.TextOrientation
-
Returns an array containing the constants of this enum type, in the order they are declared.
- VALUES_SHOULD_BE_A_NON_EMPTY_ARRAY - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- VIETNAMESE - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- viTstr(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained ViTSTR model, stored on disk.
- viTstr(String) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained ViTSTR models, stored on disk.
- viTstr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictor
-
Creates a new text recognition predictor using an existing pre-trained ViTSTR model, stored on disk.
- viTstr(String, IOrtSessionOptionsCreator) - Static method in class com.itextpdf.pdfocr.onnx.recognition.OnnxRecognitionPredictorProperties
-
Creates a new text recognition properties object for existing pre-trained ViTSTR models, stored on disk.
- Vocabulary - Class in com.itextpdf.pdfocr.onnx.recognition
-
A string-based LUT for mapping text recognition model results to characters.
- Vocabulary(String) - Constructor for class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
-
Creates a new vocabulary based on a look-up string.
W
- WELSH - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- WIDTH_MULTIPLE_SHOULD_BE_POSITIVE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
- wrap(float[]) - Static method in class com.itextpdf.pdfocr.onnx.FloatBufferWrapper
-
Wraps a float array into a buffer.
-
wrap(Iterator
, int) - Static method in class com.itextpdf.pdfocr.onnx.util.Batching -
Wraps an existing iterator into a new one, which output List-based batches,
- WRAPPING_KEY - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DecodeImage
-
Expected wrapping key for the DecodeImage operation.
- WRAPPING_KEY - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.DetResizeForTest
-
Expected wrapping key for the DetResizeForTest operation.
- WRAPPING_KEY - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.NormalizeImage
-
Expected wrapping key for the NormalizeImage operation.
- WRAPPING_KEY - Static variable in class com.itextpdf.pdfocr.onnx.conf.paddle.model.RecResizeImg
-
Expected wrapping key for the RecResizeImg operation.
- writeToTextFile(String, String) - Static method in class com.itextpdf.pdfocr.util.PdfOcrFileUtil
-
Writes provided
Stringto text file using provided path.
X
- X_SHOULD_BE_IN_0_1_RANGE - Static variable in class com.itextpdf.pdfocr.onnx.exceptions.PdfOcrOnnxExceptionMessageConstant
Y
- YAKUT - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
- YamlUtil - Class in com.itextpdf.pdfocr.onnx.util
-
Functions for working with YAML documents.
- YORUBA - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
Z
- ZH_SIM_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- ZH_SIM_G2 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- ZH_TRA_G1 - Static variable in class com.itextpdf.pdfocr.onnx.recognition.EasyOcrMapper
- ZULU - Static variable in class com.itextpdf.pdfocr.onnx.recognition.Vocabulary
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form