public class TaggedPdfReaderTool extends Object
| Modifier and Type | Field and Description |
|---|---|
protected PrintWriter |
out
The writer object to which the XML will be written
|
protected PdfReader |
reader
The reader object from which the content streams are read.
|
| Constructor and Description |
|---|
TaggedPdfReaderTool() |
| Modifier and Type | Method and Description |
|---|---|
void |
convertToXml(PdfReader reader, OutputStream os)
Parses a string with structured content.
|
void |
convertToXml(PdfReader reader, OutputStream os, String charset)
Parses a string with structured content.
|
void |
inspectChild(PdfObject k)
Inspects a child of a structured element.
|
void |
inspectChildArray(PdfArray k)
If the child of a structured element is an array, we need to loop over the elements.
|
void |
inspectChildDictionary(PdfDictionary k)
If the child of a structured element is a dictionary, we inspect the child; we may also draw a tag.
|
void |
inspectChildDictionary(PdfDictionary k, boolean inspectAttributes)
If the child of a structured element is a dictionary, we inspect the child; we may also draw a tag.
|
void |
parseTag(String tag, PdfObject object, PdfDictionary page)
Searches for a tag in a page.
|
protected String |
xmlName(PdfName name) |
protected PdfReader reader
protected PrintWriter out
public void convertToXml(PdfReader reader, OutputStream os, String charset) throws IOException
reader - the PdfReader that has access to the PDF file
os - the OutputStream to which the resulting xml will be written
charset - the charset to encode the data
IOException
public void convertToXml(PdfReader reader, OutputStream os) throws IOException
reader - the PdfReader that has access to the PDF file
os - the OutputStream to which the resulting xml will be written
IOException
public void inspectChild(PdfObject k) throws IOException
k - the child to inspect
IOException
public void inspectChildArray(PdfArray k) throws IOException
k - the child array to inspect
IOException
public void inspectChildDictionary(PdfDictionary k) throws IOException
k - the child dictionary to inspect
IOException
public void inspectChildDictionary(PdfDictionary k, boolean inspectAttributes) throws IOException
k - the child dictionary to inspect
IOException
public void parseTag(String tag, PdfObject object, PdfDictionary page) throws IOException
tag - the name of the tag
object - an identifier to find the marked content
page - a page dictionary
IOException
Copyright © 2016. All rights reserved.