public class Pdf2DataExtractor extends Object
Pdf2DataExtractor is a class for extracting templates and data from PDF files.
To create an instance of Pdf2DataExtractor, use create(File p2dFile).
To extract data from a PDF file, use the recognize(File) method.
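A minimal sketch of the create-then-recognize flow described above, applied to several files. So that it compiles without the pdf2data jar, the `FileRecognizer` interface is a hypothetical stand-in that only mirrors the documented shape of recognize(File); the class name, the file names, and the template path in the comment are illustrative assumptions as well.

```java
import java.io.File;
import java.io.IOException;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class BatchRecognition {

    /** Hypothetical stand-in with the same shape as the documented recognize(File). */
    @FunctionalInterface
    public interface FileRecognizer<R> {
        R recognize(File targetPdf) throws IOException;
    }

    /** Runs the recognizer over every file and keeps the results in input order. */
    public static <R> Map<File, R> recognizeAll(List<File> pdfs, FileRecognizer<R> recognizer)
            throws IOException {
        Map<File, R> results = new LinkedHashMap<>();
        for (File pdf : pdfs) {
            results.put(pdf, recognizer.recognize(pdf));
        }
        return results;
    }

    public static void main(String[] args) throws IOException {
        // With the real library on the classpath, the call would look roughly like this
        // (the template path is an assumption):
        //   Pdf2DataExtractor extractor = Pdf2DataExtractor.create(new File("path/to/template"));
        //   Map<File, ParsingResult> results = recognizeAll(pdfs, f -> extractor.recognize(f));
        List<File> pdfs = List.of(new File("a.pdf"), new File("b.pdf"));
        Map<File, String> results = recognizeAll(pdfs, f -> "parsed:" + f.getName());
        results.forEach((file, result) -> System.out.println(file + " -> " + result));
    }
}
```

Creating the extractor once and reusing it for every file keeps template loading out of the loop; whether an extractor instance is safe to share across threads is not stated in this reference.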
| Constructor and Description |
|---|
Pdf2DataExtractor(com.itextpdf.pdf2data.template.Template template)
Deprecated.
To create an instance use
create(File).
|
| Modifier and Type | Method and Description |
|---|---|
Map<String,Integer> |
check(File targetPDF)
Recognizes a PDF file and returns the number of recognition results.
|
Map<String,Integer> |
check(File targetPDF, RecognitionProperties properties)
Recognizes a PDF file and returns the number of recognition results.
|
Map<String,Integer> |
check(InputStream targetInputStream)
Recognizes a PDF file and returns the number of recognition results.
|
Map<String,Integer> |
check(InputStream targetInputStream, RecognitionProperties properties)
Recognizes a PDF file and returns the number of recognition results.
|
static Pdf2DataExtractor |
create(File p2dFile)
Creates an instance of Pdf2DataExtractor from a pdf2data template file.
|
com.itextpdf.pdf2data.template.Template |
getTemplate()
Gets the current template instance.
|
static com.itextpdf.pdf2data.template.Template |
parseTemplateFromPDF(InputStream templateInputStream)
Deprecated.
To work with the
Pdf2DataExtractor class, use create(File). If you have a PDF template, an XML template, or a P2DTA that is not ready for the SDK, first use the appropriate method(s) from Pdf2DataTemplateConverter.
|
static com.itextpdf.pdf2data.template.Template |
parseTemplateFromPDF(String templatePDF)
Deprecated.
To work with the
Pdf2DataExtractor class, use create(File). If you have a PDF template, an XML template, or a P2DTA that is not ready for the SDK, first use the appropriate method(s) from Pdf2DataTemplateConverter.
|
static com.itextpdf.pdf2data.template.Template |
parseTemplateFromXML(InputStream xmlInputStream)
Deprecated.
To parse an SDK-ready template, use
create(File). If you have a PDF template, an XML template, or a P2DTA that is not ready for the SDK, first use the appropriate method(s) from Pdf2DataTemplateConverter.
|
static com.itextpdf.pdf2data.template.Template |
parseTemplateFromXML(String templateXML)
Deprecated.
To work with the
Pdf2DataExtractor class, use create(File). If you have a PDF template, an XML template, or a P2DTA that is not ready for the SDK, first use the appropriate method(s) from Pdf2DataTemplateConverter.
|
ParsingResult |
recognize(File targetPDF)
Recognizes a PDF file.
|
ParsingResult |
recognize(File targetPDF, RecognitionProperties properties)
Recognizes a PDF file.
|
ParsingResult |
recognize(InputStream targetInputStream)
Recognizes a PDF file.
|
ParsingResult |
recognize(InputStream targetInputStream, OutputStream pdfOutputStream)
Deprecated.
use
recognize(File) instead.
|
ParsingResult |
recognize(InputStream targetInputStream, RecognitionProperties properties)
Recognizes a PDF file.
|
ParsingResult |
recognize(String targetPDF)
Deprecated.
use
recognize(File) instead.
|
ParsingResult |
recognize(String targetPDF, String outputPDF)
Deprecated.
use
recognize(File) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream)
Deprecated.
use
recognize(InputStream) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream)
Deprecated.
use
recognize(InputStream) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, RecognitionProperties properties)
Deprecated.
use
recognize(InputStream, RecognitionProperties) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, String targetPDF)
Deprecated.
use
recognize(File) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF)
Deprecated.
use
recognize(File) instead.
|
static void |
recognizeToJson(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream jsonOutputStream, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToJson(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputJson, RecognitionProperties properties)
Deprecated.
|
void |
recognizeToXML(InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream)
Deprecated.
use
recognize(InputStream) with ParsingResult.saveToXml(OutputStream) instead.
|
void |
recognizeToXML(InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream, RecognitionProperties properties)
Deprecated.
|
void |
recognizeToXML(String targetPDF, String outputPDF, String outputXML)
Deprecated.
use
recognize(File) with ParsingResult.saveToXml(File) instead.
|
void |
recognizeToXML(String targetPDF, String outputPDF, String outputXML, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream xmlOutputStream)
Deprecated.
use
recognize(InputStream) with ParsingResult.saveToXml(OutputStream) instead.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream)
Deprecated.
use
recognize(InputStream) with ParsingResult.saveToXml(OutputStream) instead.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream xmlOutputStream, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputXML)
Deprecated.
use
recognize(File) with ParsingResult.saveToXml(File) instead.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputXML, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF, String outputXML)
Deprecated.
use
recognize(File) with ParsingResult.saveToXml(File) instead.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF, String outputXML, RecognitionProperties properties)
Deprecated.
|
@Deprecated public Pdf2DataExtractor(com.itextpdf.pdf2data.template.Template template)
Deprecated. To create an instance, use create(File).
Creates a Pdf2DataExtractor with the passed template entity.
template - the template that will be used in recognition.
public static Pdf2DataExtractor create(File p2dFile) throws IOException
Creates an instance of Pdf2DataExtractor from a pdf2data template file. Note that the template should be processed.
p2dFile - pdf2data template archive
IOException - if any I/O exception occurs
com.itextpdf.pdf2data.exceptions.TemplateConversionException - if the template cannot be extracted from the passed archive
@Deprecated public static com.itextpdf.pdf2data.template.Template parseTemplateFromPDF(String templatePDF) throws IOException
Deprecated. To work with the Pdf2DataExtractor class, use create(File). If you have a PDF template, an XML template, or a P2DTA that is not ready for the SDK, first use the appropriate method(s) from Pdf2DataTemplateConverter.
templatePDF - a path to pdf file
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
@Deprecated public static com.itextpdf.pdf2data.template.Template parseTemplateFromPDF(InputStream templateInputStream) throws IOException
Deprecated. To work with the Pdf2DataExtractor class, use create(File). If you have a PDF template, an XML template, or a P2DTA that is not ready for the SDK, first use the appropriate method(s) from Pdf2DataTemplateConverter.
templateInputStream - pdf input stream with template
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
@Deprecated public static com.itextpdf.pdf2data.template.Template parseTemplateFromXML(String templateXML) throws IOException
Deprecated. To work with the Pdf2DataExtractor class, use create(File). If you have a PDF template, an XML template, or a P2DTA that is not ready for the SDK, first use the appropriate method(s) from Pdf2DataTemplateConverter.
templateXML - a path to xml file
IOException - if any I/O issue occurs
@Deprecated public static com.itextpdf.pdf2data.template.Template parseTemplateFromXML(InputStream xmlInputStream) throws IOException
Deprecated. To parse an SDK-ready template, use create(File). If you have a PDF template, an XML template, or a P2DTA that is not ready for the SDK, first use the appropriate method(s) from Pdf2DataTemplateConverter.
xmlInputStream - input stream that contains template in xml form
IOException - if any I/O issue occurs
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, String targetPDF) throws IOException
Deprecated. Use recognize(File) instead.
template - template instance
targetPDF - path to pdf file for recognition
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF) throws IOException
Deprecated. Use recognize(File) instead.
template - template instance
targetPDF - path to pdf file for recognition
outputPDF - path to pdf file with recognition results (annotation type)
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream) throws IOException
Deprecated. Use recognize(InputStream) instead.
template - template instance
targetInputStream - input stream from pdf file for recognition
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream) throws IOException
Deprecated. Use recognize(InputStream) instead.
template - template instance
targetInputStream - input stream from pdf file for recognition
pdfOutputStream - output stream for writing recognition results (pdf annotation type)
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, RecognitionProperties properties) throws IOException
Deprecated. Use recognize(InputStream, RecognitionProperties) instead.
template - template instance
targetInputStream - input stream from pdf file for recognition
pdfOutputStream - output stream for writing recognition results (pdf annotation type)
properties - a RecognitionProperties instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputXML) throws IOException
Deprecated. Use recognize(File) with ParsingResult.saveToXml(File) instead.
template - template instance
targetPDF - path to pdf file for recognition
outputXML - path to xml file with recognition results
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputXML, RecognitionProperties properties) throws IOException
Deprecated. Use recognize(File, RecognitionProperties) with ParsingResult.saveToXml(File, SerializationProperties) instead.
template - template instance
targetPDF - path to pdf file for recognition
outputXML - path to xml file with recognition results
properties - a RecognitionProperties instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF, String outputXML) throws IOException
Deprecated. Use recognize(File) with ParsingResult.saveToXml(File) instead.
template - template instance
targetPDF - path to pdf file for recognition
outputPDF - path to pdf file with recognition results (annotation type)
outputXML - path to xml file with recognition results
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF, String outputXML, RecognitionProperties properties) throws IOException
Deprecated. Use recognize(File, RecognitionProperties) with ParsingResult.saveToXml(File, SerializationProperties) instead.
template - template instance
targetPDF - path to pdf file for recognition
outputPDF - path to pdf file with recognition results (annotation type)
outputXML - path to xml file with recognition results
properties - a RecognitionProperties instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToJson(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputJson, RecognitionProperties properties) throws IOException
Deprecated. Use recognize(File, RecognitionProperties) with ParsingResult.saveToJson(OutputStream, SerializationProperties) instead.
template - template instance
targetPDF - path to pdf file for recognition
outputJson - path to json file with recognition results
properties - a RecognitionProperties instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream xmlOutputStream) throws IOException
Deprecated. Use recognize(InputStream) with ParsingResult.saveToXml(OutputStream) instead.
template - template instance
targetInputStream - input stream from pdf file for recognition
xmlOutputStream - output stream for writing recognition results (xml type)
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream xmlOutputStream, RecognitionProperties properties) throws IOException
Deprecated. Use recognize(InputStream, RecognitionProperties) with ParsingResult.saveToXml(OutputStream, SerializationProperties) instead.
template - template instance
targetInputStream - input stream from pdf file for recognition
xmlOutputStream - output stream for writing recognition results (xml type)
properties - a RecognitionProperties instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream) throws IOException
Deprecated. Use recognize(InputStream) with ParsingResult.saveToXml(OutputStream) instead.
template - template instance
targetInputStream - input stream from pdf file for recognition
pdfOutputStream - output stream for writing recognition results (pdf annotation type)
xmlOutputStream - output stream for writing recognition results (xml type)
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream, RecognitionProperties properties) throws IOException
Deprecated. Use recognize(InputStream, RecognitionProperties) with ParsingResult.saveToXml(OutputStream, SerializationProperties) instead.
template - template instance
targetInputStream - input stream from pdf file for recognition
pdfOutputStream - output stream for writing recognition results (pdf annotation type)
xmlOutputStream - output stream for writing recognition results (xml type)
properties - a RecognitionProperties instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToJson(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream jsonOutputStream, RecognitionProperties properties) throws IOException
Deprecated. Use recognize(InputStream, RecognitionProperties) with ParsingResult.saveToJson(OutputStream, SerializationProperties) instead.
template - template instance
targetInputStream - input stream from pdf file for recognition
jsonOutputStream - output stream for writing recognition results (json type)
properties - a RecognitionProperties instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public ParsingResult recognize(String targetPDF) throws IOException
Deprecated. Use recognize(File) instead.
targetPDF - path to pdf file for recognition
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
@Deprecated public ParsingResult recognize(String targetPDF, String outputPDF) throws IOException
Deprecated. Use recognize(File) instead.
targetPDF - path to pdf file for recognition
outputPDF - path to pdf file with recognition results (annotation type)
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public ParsingResult recognize(InputStream targetInputStream, OutputStream pdfOutputStream) throws IOException
Deprecated. Use recognize(File) instead.
targetInputStream - input stream from pdf file for recognition
pdfOutputStream - output stream for writing recognition results (pdf annotation type)
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public void recognizeToXML(String targetPDF, String outputPDF, String outputXML) throws IOException
Deprecated. Use recognize(File) with ParsingResult.saveToXml(File) instead.
targetPDF - path to pdf file for recognition
outputPDF - path to pdf file with recognition results (annotation type)
outputXML - path to xml file with recognition results
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public void recognizeToXML(String targetPDF, String outputPDF, String outputXML, RecognitionProperties properties) throws IOException
Deprecated. Use recognize(File, RecognitionProperties) with ParsingResult.saveToXml(File, SerializationProperties) instead.
targetPDF - path to pdf file for recognition
outputPDF - path to pdf file with recognition results (annotation type)
outputXML - path to xml file with recognition results
properties - a RecognitionProperties instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public void recognizeToXML(InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream) throws IOException
Deprecated. Use recognize(InputStream) with ParsingResult.saveToXml(OutputStream) instead.
targetInputStream - input stream from pdf file for recognition
pdfOutputStream - output stream for writing recognition results (pdf annotation type)
xmlOutputStream - output stream for writing recognition results (xml type)
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public void recognizeToXML(InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream, RecognitionProperties properties) throws IOException
Deprecated. Use recognize(InputStream, RecognitionProperties) with ParsingResult.saveToXml(OutputStream, SerializationProperties) instead.
targetInputStream - input stream from pdf file for recognition
pdfOutputStream - output stream for writing recognition results (pdf annotation type)
xmlOutputStream - output stream for writing recognition results (xml type)
properties - a RecognitionProperties instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException - if pdf document is encrypted and creating/modifying annotations is not permitted
public com.itextpdf.pdf2data.template.Template getTemplate()
Gets the current template instance.
public ParsingResult recognize(File targetPDF) throws IOException
targetPDF - pdf file for recognition
ParsingResult instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
public ParsingResult recognize(File targetPDF, RecognitionProperties properties) throws IOException
targetPDF - pdf file for recognition
properties - a RecognitionProperties instance
ParsingResult instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
public ParsingResult recognize(InputStream targetInputStream) throws IOException
targetInputStream - input stream from pdf file for recognition
ParsingResult instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
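When the target PDF is supplied as a stream, the caller typically owns that stream. The sketch below opens a file as an InputStream, hands it to a recognizer, and closes it with try-with-resources. `StreamRecognizer` is a hypothetical stand-in that mirrors the documented shape of recognize(InputStream) so the example runs without the pdf2data jar. This reference does not say whether the library closes the stream itself, so closing it in the caller is an assumption made here for safety.

```java
import java.io.BufferedInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;

public class StreamRecognition {

    /** Hypothetical stand-in with the same shape as the documented recognize(InputStream). */
    @FunctionalInterface
    public interface StreamRecognizer<R> {
        R recognize(InputStream targetInputStream) throws IOException;
    }

    /** Opens the file, passes the stream to the recognizer, and always closes the stream. */
    public static <R> R recognizeFromPath(Path pdf, StreamRecognizer<R> recognizer)
            throws IOException {
        try (InputStream in = new BufferedInputStream(Files.newInputStream(pdf))) {
            return recognizer.recognize(in);
        }
    }

    public static void main(String[] args) throws IOException {
        // A temporary file stands in for a real PDF; the lambda stands in for
        // something like in -> extractor.recognize(in) with the real library.
        Path tmp = Files.createTempFile("sample", ".pdf");
        try {
            Files.write(tmp, new byte[] {1, 2, 3});
            Integer size = recognizeFromPath(tmp, in -> in.readAllBytes().length);
            System.out.println("bytes seen by recognizer: " + size);
        } finally {
            Files.deleteIfExists(tmp);
        }
    }
}
```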
public ParsingResult recognize(InputStream targetInputStream, RecognitionProperties properties) throws IOException
targetInputStream - input stream from pdf file for recognition
properties - a RecognitionProperties instance
ParsingResult instance
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
public Map<String,Integer> check(File targetPDF) throws IOException
targetPDF - pdf file for recognition
Map containing the recognition results as key-value pairs of strings and integers.
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
public Map<String,Integer> check(File targetPDF, RecognitionProperties properties) throws IOException
targetPDF - pdf file for recognition
properties - a RecognitionProperties instance
Map containing the recognition results as key-value pairs of strings and integers.
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
public Map<String,Integer> check(InputStream targetInputStream) throws IOException
targetInputStream - input stream from pdf file for recognition
Map containing the recognition results as key-value pairs of strings and integers.
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
public Map<String,Integer> check(InputStream targetInputStream, RecognitionProperties properties) throws IOException
targetInputStream - input stream from pdf file for recognition
properties - a RecognitionProperties instance
Map containing the recognition results as key-value pairs of strings and integers.
IOException - if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException - if pdf document is encrypted and extracting text is not permitted
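The check(...) overloads return a Map<String,Integer>. As a rough illustration of consuming such a map, the helper below lists the keys with no results and sums the counts. It assumes that each key names a template entry and each value is the number of results found for it, which this reference implies but does not spell out. The class, method, and key names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class CheckReport {

    /** Returns, in sorted order, the keys whose count is zero or missing. */
    public static List<String> keysWithoutResults(Map<String, Integer> counts) {
        List<String> empty = new ArrayList<>();
        // TreeMap copy gives a stable, alphabetical iteration order (assumes no null keys).
        for (Map.Entry<String, Integer> entry : new TreeMap<>(counts).entrySet()) {
            Integer amount = entry.getValue();
            if (amount == null || amount == 0) {
                empty.add(entry.getKey());
            }
        }
        return empty;
    }

    /** Sums all counts, treating a null value as zero. */
    public static int totalResults(Map<String, Integer> counts) {
        int total = 0;
        for (Integer amount : counts.values()) {
            if (amount != null) {
                total += amount;
            }
        }
        return total;
    }

    public static void main(String[] args) {
        // Sample data standing in for the map returned by extractor.check(targetPdf).
        Map<String, Integer> counts = Map.of("invoiceNumber", 1, "total", 0, "date", 2);
        System.out.println("no results for: " + keysWithoutResults(counts));
        System.out.println("total results: " + totalResults(counts));
    }
}
```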
Copyright © 2023. All rights reserved.