pdfOCR 4.0.0 API
|
IOcrEngine interface is used for instantiating new OcrReader objects. More...
Public Member Functions |
|
IDictionary< int, IList< TextInfo > > | DoImageOcr (FileInfo input) |
Reads data from the provided input image file and returns retrieved data in the format described below. More... |
|
IDictionary< int, IList< TextInfo > > | DoImageOcr (FileInfo input, OcrProcessContext ocrProcessContext) |
Reads data from the provided input image file and returns retrieved data in the format described below. More... |
|
void | CreateTxtFile (IList< FileInfo > inputImages, FileInfo txtFile) |
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path. More... |
|
void | CreateTxtFile (IList< FileInfo > inputImages, FileInfo txtFile, OcrProcessContext ocrProcessContext) |
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path. More... |
|
bool | IsTaggingSupported () |
Checks whether tagging is supported by the OCR engine. More... |
|
IOcrEngine interface is used for instantiating new OcrReader objects.
IOcrEngine interface is used for instantiating new OcrReader objects. IOcrEngine interface provides possibility to perform OCR, to read data from input files and to return the contained text in the required format.
void iText.Pdfocr.IOcrEngine.CreateTxtFile | ( | IList< FileInfo > | inputImages, |
FileInfo | txtFile | ||
) |
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path.
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path. Note that a human reading order is not guaranteed due to possible specifics of input images (multi column layout, tables etc)
inputImages |
System.Collections.IList
txtFile | file to be created |
Implemented in iText.Pdfocr.Tesseract4.AbstractTesseract4OcrEngine.
void iText.Pdfocr.IOcrEngine.CreateTxtFile | ( | IList< FileInfo > | inputImages, |
FileInfo | txtFile, | ||
OcrProcessContext | ocrProcessContext | ||
) |
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path.
Performs OCR using provided IOcrEngine for the given list of input images and saves output to a text file using provided path. Note that a human reading order is not guaranteed due to possible specifics of input images (multi column layout, tables etc)
inputImages |
System.Collections.IList
txtFile | file to be created |
ocrProcessContext | ocr processing context |
Implemented in iText.Pdfocr.Tesseract4.AbstractTesseract4OcrEngine.
IDictionary |
( | FileInfo | input | ) |
Reads data from the provided input image file and returns retrieved data in the format described below.
input | input image System.IO.FileInfo |
System.Collections.IDictionary
Implemented in iText.Pdfocr.Tesseract4.AbstractTesseract4OcrEngine.
IDictionary |
( | FileInfo | input, |
OcrProcessContext | ocrProcessContext | ||
) |
Reads data from the provided input image file and returns retrieved data in the format described below.
input | input image System.IO.FileInfo |
ocrProcessContext | ocr processing context |
System.Collections.IDictionary
Implemented in iText.Pdfocr.Tesseract4.AbstractTesseract4OcrEngine.
bool iText.Pdfocr.IOcrEngine.IsTaggingSupported | ( | ) |
Checks whether tagging is supported by the OCR engine.
true
if tagging is supported by the engine, false
otherwise
Implemented in iText.Pdfocr.Tesseract4.AbstractTesseract4OcrEngine.