|
pdfOCR 1.0.3 API
|
The implementation of AbstractTesseract4OcrEngine for tesseract OCR. More...
Public Member Functions |
|
| Tesseract4LibOcrEngine (Tesseract4OcrEngineProperties tesseract4OcrEngineProperties) | |
| Creates a new Tesseract4LibOcrEngine instance. More... |
|
| virtual TesseractEngine | GetTesseractInstance () |
| Gets tesseract instance. More... |
|
| virtual void | InitializeTesseract (OutputFormat outputFormat) |
| Initializes instance of tesseract if it haven't been already initialized or it have been disposed and sets all the required properties. More... |
|
Public Member Functions inherited from iText.Pdfocr.Tesseract4.AbstractTesseract4OcrEngine |
|
| AbstractTesseract4OcrEngine (Tesseract4OcrEngineProperties tesseract4OcrEngineProperties) | |
| virtual void | DoTesseractOcr (FileInfo inputImage, FileInfo outputFile, OutputFormat outputFormat) |
| Performs tesseract OCR for the first (or for the only) image page. More... |
|
| virtual void | CreateTxtFile (IList< FileInfo > inputImages, FileInfo txtFile) |
| Performs OCR using provided iText.Pdfocr.IOcrEngine for the given list of input images and saves output to a text file using provided path. More... |
|
| Tesseract4OcrEngineProperties | GetTesseract4OcrEngineProperties () |
| Gets properties for AbstractTesseract4OcrEngine. More... |
|
| void | SetTesseract4OcrEngineProperties (Tesseract4OcrEngineProperties tesseract4OcrEngineProperties) |
| Sets properties for AbstractTesseract4OcrEngine. More... |
|
| String | GetLanguagesAsString () |
| Gets list of languages concatenated with "+" symbol to a string in format required by tesseract. More... |
|
| IDictionary< int, IList< TextInfo > > | DoImageOcr (FileInfo input) |
| Reads data from the provided input image file and returns retrieved data in the format described below. More... |
|
| String | DoImageOcr (FileInfo input, OutputFormat outputFormat) |
| Reads data from the provided input image file and returns retrieved data as string. More... |
|
| virtual bool | IsWindows () |
| Checks current os type. More... |
|
| virtual String | IdentifyOsType () |
| Identifies type of current OS and return it (win, linux). More... |
|
| virtual void | ValidateLanguages (IList< String > languagesList) |
| Validates list of provided languages and checks if they all exist in given tess data directory. More... |
|
| virtual IMetaInfo | GetThreadLocalMetaInfo () |
| Gets the meta info which is held by the interface. More... |
|
| virtual IThreadLocalMetaInfoAware | SetThreadLocalMetaInfo (IMetaInfo metaInfo) |
| Sets a thread local meta info. More... |
|
The implementation of AbstractTesseract4OcrEngine for tesseract OCR.
The implementation of AbstractTesseract4OcrEngine for tesseract OCR. This class provides possibilities to use features of "tesseract" using tess4j. Please note that this class is not thread-safe, in other words this Tesseract engine cannot be used for multithreaded processing. You should create one instance per thread
|
inline |
Creates a new Tesseract4LibOcrEngine instance.
| tesseract4OcrEngineProperties | set of properteis |
|
inlinevirtual |
Gets tesseract instance.
|
inlinevirtual |
Initializes instance of tesseract if it haven't been already initialized or it have been disposed and sets all the required properties.
| outputFormat | selected OutputFormat for tesseract |