public class Pdf2DataExtractor extends Object
Pdf2DataExtractor
is a class for extracting templates and data from PDF files.
To create instance of Pdf2DataExtractor
, use create(File p2dFile)
.
To extract data from PDF file use recognize(File)
method.
Constructor and Description |
---|
Pdf2DataExtractor(com.itextpdf.pdf2data.template.Template template)
Deprecated.
To create an instance use
create(File) .
|
Modifier and Type | Method and Description |
---|---|
Map<String,Integer> |
check(File targetPDF)
Recognize a pdf file and returns recognition results amount.
|
Map<String,Integer> |
check(File targetPDF, RecognitionProperties properties)
Recognize a pdf file and returns recognition results amount.
|
Map<String,Integer> |
check(InputStream targetInputStream)
Recognize a pdf file and returns recognition results amount.
|
Map<String,Integer> |
check(InputStream targetInputStream, RecognitionProperties properties)
Recognize a pdf file and returns recognition results amount.
|
static Pdf2DataExtractor |
create(File p2dFile)
Creates instance of Pdf2DataExtractor from pdf2data template file.
|
static Pdf2DataExtractor |
createFromTemplateContentJson(InputStream templateContentJsonStream)
Creates instance of Pdf2DataExtractor from stream which contants pdf2data template content in JSON format.
|
com.itextpdf.pdf2data.template.Template |
getTemplate()
Gets current template instance.
|
static com.itextpdf.pdf2data.template.Template |
parseTemplateFromPDF(InputStream templateInputStream)
Deprecated.
To work with
Pdf2DataExtractor class use create(File) . If you have PDF template, XML template or P2DTA which is not ready for sdk, then first use an appropriate method(s) from Pdf2DataTemplateConverter .
|
static com.itextpdf.pdf2data.template.Template |
parseTemplateFromPDF(String templatePDF)
Deprecated.
To work with
Pdf2DataExtractor class use create(File) . If you have PDF template, XML template or P2DTA which is not ready for sdk, then first use an appropriate method(s) from Pdf2DataTemplateConverter .
|
static com.itextpdf.pdf2data.template.Template |
parseTemplateFromXML(InputStream xmlInputStream)
Deprecated.
To parse ready for SDK template use
create(File) . If you have PDF template, XML template or P2DTA which is not ready for sdk, then first use an appropriate method(s) from Pdf2DataTemplateConverter .
|
static com.itextpdf.pdf2data.template.Template |
parseTemplateFromXML(String templateXML)
Deprecated.
To work with
Pdf2DataExtractor class use create(File) . If you have PDF template, XML template or P2DTA which is not ready for sdk, then first use an appropriate method(s) from Pdf2DataTemplateConverter .
|
ParsingResult |
recognize(File targetPDF)
Deprecated.
|
ParsingResult |
recognize(File targetPDF, RecognitionProperties properties)
Deprecated.
|
ParsingResult |
recognize(InputStream targetInputStream)
Deprecated.
|
ParsingResult |
recognize(InputStream targetInputStream, OutputStream pdfOutputStream)
Deprecated.
use
recognize(File) instead.
|
ParsingResult |
recognize(InputStream targetInputStream, RecognitionProperties properties)
Deprecated.
|
ParsingResult |
recognize(String targetPDF)
Deprecated.
use
recognize(File) instead.
|
ParsingResult |
recognize(String targetPDF, String outputPDF)
Deprecated.
use
recognize(File) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream)
Deprecated.
use
recognize(InputStream) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream)
Deprecated.
use
recognize(InputStream) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, RecognitionProperties properties)
Deprecated.
use
recognize(InputStream, RecognitionProperties) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, String targetPDF)
Deprecated.
use
recognize(File) instead.
|
static ParsingResult |
recognize(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF)
Deprecated.
use
recognize(File) instead.
|
RecognitionResultHolder |
recognizeOnPdf(File targetPDF)
Recognize a pdf file.
|
RecognitionResultHolder |
recognizeOnPdf(File targetPDF, RecognitionProperties properties)
Recognize a pdf file.
|
RecognitionResultHolder |
recognizeOnPdf(InputStream targetInputStream)
Recognize a pdf file.
|
RecognitionResultHolder |
recognizeOnPdf(InputStream targetInputStream, RecognitionProperties properties)
Recognize a pdf file.
|
static void |
recognizeToJson(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream jsonOutputStream, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToJson(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputJson, RecognitionProperties properties)
Deprecated.
|
void |
recognizeToXML(InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream)
Deprecated.
use
recognize(InputStream) with ParsingResult.saveToXml(OutputStream) instead.
|
void |
recognizeToXML(InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream, RecognitionProperties properties)
Deprecated.
|
void |
recognizeToXML(String targetPDF, String outputPDF, String outputXML)
Deprecated.
use
recognize(File) with ParsingResult.saveToXml(File) instead.
|
void |
recognizeToXML(String targetPDF, String outputPDF, String outputXML, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream xmlOutputStream)
Deprecated.
use
recognize(InputStream) with ParsingResult.saveToXml(OutputStream) instead.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream)
Deprecated.
use
recognize(InputStream) with ParsingResult.saveToXml(OutputStream) instead.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream xmlOutputStream, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputXML)
Deprecated.
use
recognize(File) with ParsingResult.saveToXml(File) instead.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputXML, RecognitionProperties properties)
Deprecated.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF, String outputXML)
Deprecated.
use
recognize(File) with ParsingResult.saveToXml(File) instead.
|
static void |
recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF, String outputXML, RecognitionProperties properties)
Deprecated.
|
@Deprecated public Pdf2DataExtractor(com.itextpdf.pdf2data.template.Template template)
create(File)
.
Pdf2DataExtractor
with passed template entity.
template
- template, which will be used in recognition.
public static Pdf2DataExtractor create(File p2dFile) throws IOException
Pdf2DataExtractor
from pdf2data template file. Note that template should be processed.
p2dFile
- pdf2data template archive
IOException
- if any I/O exception occurs
com.itextpdf.pdf2data.exceptions.TemplateConversionException
- if it's impossible to extract template from passed archive
public static Pdf2DataExtractor createFromTemplateContentJson(InputStream templateContentJsonStream)
Pdf2DataExtractor
from stream which contants pdf2data template content in JSON format. Note that template should be processed.
templateContentJsonStream
- processed template content stream
com.itextpdf.pdf2data.exceptions.TemplateConversionException
- if it's impossible to extract template from passed archive
@Deprecated public static com.itextpdf.pdf2data.template.Template parseTemplateFromPDF(String templatePDF) throws IOException
Pdf2DataExtractor
class use create(File)
. If you have PDF template, XML template or P2DTA which is not ready for sdk, then first use an appropriate method(s) from Pdf2DataTemplateConverter
.
templatePDF
- a path to pdf file
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public static com.itextpdf.pdf2data.template.Template parseTemplateFromPDF(InputStream templateInputStream) throws IOException
Pdf2DataExtractor
class use create(File)
. If you have PDF template, XML template or P2DTA which is not ready for sdk, then first use an appropriate method(s) from Pdf2DataTemplateConverter
.
templateInputStream
- pdf input stream with template
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public static com.itextpdf.pdf2data.template.Template parseTemplateFromXML(String templateXML) throws IOException
Pdf2DataExtractor
class use create(File)
. If you have PDF template, XML template or P2DTA which is not ready for sdk, then first use an appropriate method(s) from Pdf2DataTemplateConverter
.
templateXML
- a path to xml file
IOException
- if any I/O issue occurs
@Deprecated public static com.itextpdf.pdf2data.template.Template parseTemplateFromXML(InputStream xmlInputStream) throws IOException
create(File)
. If you have PDF template, XML template or P2DTA which is not ready for sdk, then first use an appropriate method(s) from Pdf2DataTemplateConverter
.
xmlInputStream
- input stream that contains template in xml form
IOException
- if any I/O issue occurs
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, String targetPDF) throws IOException
recognize(File)
instead.
template
- template instance
targetPDF
- path to pdf file for recognition
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF) throws IOException
recognize(File)
instead.
template
- template instance
targetPDF
- path to pdf file for recognition
outputPDF
- path to pdf file with recognition results (annotation type)
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream) throws IOException
recognize(InputStream)
instead.
template
- template instance
targetInputStream
- input stream from pdf file for recognition
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream) throws IOException
recognize(InputStream)
instead.
template
- template instance
targetInputStream
- input stream from pdf file for recognition
pdfOutputStream
- output stream for writing recognition results (pdf annotation type)
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static ParsingResult recognize(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, RecognitionProperties properties) throws IOException
recognize(InputStream, RecognitionProperties)
instead.
template
- template instance
targetInputStream
- input stream from pdf file for recognition
pdfOutputStream
- output stream for writing recognition results (pdf annotation type)
properties
- a RecognitionProperties
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputXML) throws IOException
recognize(File)
with ParsingResult.saveToXml(File)
instead.
template
- template instance
targetPDF
- path to pdf file for recognition
outputXML
- path to xml file with recognition results
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputXML, RecognitionProperties properties) throws IOException
recognize(File, RecognitionProperties)
with ParsingResult.saveToXml(File, SerializationProperties)
instead.
template
- template instance
targetPDF
- path to pdf file for recognition
outputXML
- path to xml file with recognition results
properties
- a RecognitionProperties
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF, String outputXML) throws IOException
recognize(File)
with ParsingResult.saveToXml(File)
instead.
template
- template instance
targetPDF
- path to pdf file for recognition
outputPDF
- path to pdf file with recognition results (annotation type)
outputXML
- path to xml file with recognition results
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputPDF, String outputXML, RecognitionProperties properties) throws IOException
recognize(File, RecognitionProperties)
with ParsingResult.saveToXml(File, SerializationProperties)
instead.
template
- template instance
targetPDF
- path to pdf file for recognition
outputPDF
- path to pdf file with recognition results (annotation type)
outputXML
- path to xml file with recognition results
properties
- a RecognitionProperties
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToJson(com.itextpdf.pdf2data.template.Template template, String targetPDF, String outputJson, RecognitionProperties properties) throws IOException
recognize(File, RecognitionProperties)
with ParsingResult.saveToJson(OutputStream, SerializationProperties)
instead.
template
- template instance
targetPDF
- path to pdf file for recognition
outputJson
- path to json file with recognition results
properties
- a RecognitionProperties
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream xmlOutputStream) throws IOException
recognize(InputStream)
with ParsingResult.saveToXml(OutputStream)
instead.
template
- template instance
targetInputStream
- input stream from pdf file for recognition
xmlOutputStream
- output stream for writing recognition results (xml type)
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream xmlOutputStream, RecognitionProperties properties) throws IOException
recognize(InputStream, RecognitionProperties)
with ParsingResult.saveToXml(OutputStream, SerializationProperties)
instead.
template
- template instance
targetInputStream
- input stream from pdf file for recognition
xmlOutputStream
- output stream for writing recognition results (xml type)
properties
- a RecognitionProperties
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream) throws IOException
recognize(InputStream)
with ParsingResult.saveToXml(OutputStream)
instead.
template
- template instance
targetInputStream
- input stream from pdf file for recognition
pdfOutputStream
- output stream for writing recognition results (pdf annotation type)
xmlOutputStream
- output stream for writing recognition results (xml type)
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToXML(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream, RecognitionProperties properties) throws IOException
recognize(InputStream, RecognitionProperties)
with ParsingResult.saveToXml(OutputStream, SerializationProperties)
instead.
template
- template instance
targetInputStream
- input stream from pdf file for recognition
pdfOutputStream
- output stream for writing recognition results (pdf annotation type)
xmlOutputStream
- output stream for writing recognition results (xml type)
properties
- a RecognitionProperties
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public static void recognizeToJson(com.itextpdf.pdf2data.template.Template template, InputStream targetInputStream, OutputStream jsonOutputStream, RecognitionProperties properties) throws IOException
recognize(InputStream, RecognitionProperties)
with ParsingResult.saveToJson(OutputStream, SerializationProperties)
instead.
template
- template instance
targetInputStream
- input stream from pdf file for recognition
jsonOutputStream
- output stream for writing recognition results (json type)
properties
- a RecognitionProperties
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public ParsingResult recognize(String targetPDF) throws IOException
recognize(File)
instead.
targetPDF
- path to pdf file for recognition
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public ParsingResult recognize(String targetPDF, String outputPDF) throws IOException
recognize(File)
instead.
targetPDF
- path to pdf file for recognition
outputPDF
- path to pdf file with recognition results (annotation type)
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public ParsingResult recognize(InputStream targetInputStream, OutputStream pdfOutputStream) throws IOException
recognize(File)
instead.
targetInputStream
- input stream from pdf file for recognition
pdfOutputStream
- output stream for writing recognition results (pdf annotation type)
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public void recognizeToXML(String targetPDF, String outputPDF, String outputXML) throws IOException
recognize(File)
with ParsingResult.saveToXml(File)
instead.
targetPDF
- path to pdf file for recognition
outputPDF
- path to pdf file with recognition results (annotation type)
outputXML
- path to xml file with recognition results
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public void recognizeToXML(String targetPDF, String outputPDF, String outputXML, RecognitionProperties properties) throws IOException
recognize(File, RecognitionProperties)
with ParsingResult.saveToXml(File, SerializationProperties)
instead.
targetPDF
- path to pdf file for recognition
outputPDF
- path to pdf file with recognition results (annotation type)
outputXML
- path to xml file with recognition results
properties
- a RecognitionProperties
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public void recognizeToXML(InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream) throws IOException
recognize(InputStream)
with ParsingResult.saveToXml(OutputStream)
instead.
targetInputStream
- input stream from pdf file for recognition
pdfOutputStream
- output stream for writing recognition results (pdf annotation type)
xmlOutputStream
- output stream for writing recognition results (xml type)
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
@Deprecated public void recognizeToXML(InputStream targetInputStream, OutputStream pdfOutputStream, OutputStream xmlOutputStream, RecognitionProperties properties) throws IOException
recognize(InputStream, RecognitionProperties)
with ParsingResult.saveToXml(OutputStream, SerializationProperties)
instead.
targetInputStream
- input stream from pdf file for recognition
pdfOutputStream
- output stream for writing recognition results (pdf annotation type)
xmlOutputStream
- output stream for writing recognition results (xml type)
properties
- a RecognitionProperties
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
com.itextpdf.pdf2data.exceptions.DocumentAnnotationsDeniedException
- if pdf document is encrypted and creating/modifying annotations is not permitted
public com.itextpdf.pdf2data.template.Template getTemplate()
@Deprecated public ParsingResult recognize(File targetPDF) throws IOException
targetPDF
- pdf file for recognition
ParsingResult
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public ParsingResult recognize(File targetPDF, RecognitionProperties properties) throws IOException
targetPDF
- pdf file for recognition
properties
- a RecognitionProperties
instance
ParsingResult
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public ParsingResult recognize(InputStream targetInputStream) throws IOException
targetInputStream
- input stream from pdf file for recognition
ParsingResult
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
@Deprecated public ParsingResult recognize(InputStream targetInputStream, RecognitionProperties properties) throws IOException
targetInputStream
- input stream from pdf file for recognition
properties
- a RecognitionProperties
instance
ParsingResult
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
public RecognitionResultHolder recognizeOnPdf(File targetPDF) throws IOException
targetPDF
- pdf file for recognition
RecognitionResultHolder
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
public RecognitionResultHolder recognizeOnPdf(File targetPDF, RecognitionProperties properties) throws IOException
targetPDF
- pdf file for recognition
properties
- a RecognitionProperties
instance
RecognitionResultHolder
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
public RecognitionResultHolder recognizeOnPdf(InputStream targetInputStream) throws IOException
targetInputStream
- input stream from pdf file for recognition
RecognitionResultHolder
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
public RecognitionResultHolder recognizeOnPdf(InputStream targetInputStream, RecognitionProperties properties) throws IOException
targetInputStream
- input stream from pdf file for recognition
properties
- a RecognitionProperties
instance
RecognitionResultHolder
instance
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
public Map<String,Integer> check(File targetPDF) throws IOException
targetPDF
- pdf file for recognition
Map
containing the recognition results as key-value pairs of strings and integers.
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
public Map<String,Integer> check(File targetPDF, RecognitionProperties properties) throws IOException
targetPDF
- pdf file for recognition
properties
- a RecognitionProperties
instance
Map
containing the recognition results as key-value pairs of strings and integers.
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
public Map<String,Integer> check(InputStream targetInputStream) throws IOException
targetInputStream
- input stream from pdf file for recognition
Map
containing the recognition results as key-value pairs of strings and integers.
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
public Map<String,Integer> check(InputStream targetInputStream, RecognitionProperties properties) throws IOException
targetInputStream
- input stream from pdf file for recognition
properties
- a RecognitionProperties
instance
Map
containing the recognition results as key-value pairs of strings and integers.
IOException
- if any I/O issue occurs
com.itextpdf.pdf2data.exceptions.DocumentExtractionDeniedException
- if pdf document is encrypted and extracting text is not permitted
Copyright © 2023. All rights reserved.