Package com.itextpdf.pdfocr
Class TextInfo
java.lang.Object
com.itextpdf.pdfocr.TextInfo
This class describes how recognized text is positioned on the image by providing a bounding box (bbox) for each text item, which may be a line or a word.
Constructor Summary
Method Summary
Modifier and Type    Method    Description
com.itextpdf.kernel.geom.Rectangle    getBboxRect()    Gets bbox coordinates.
LogicalStructureTreeItem    getLogicalStructureTreeItem()    Retrieves structure tree item for the text item.
String    getText()    Gets text element.
void    setBboxRect(com.itextpdf.kernel.geom.Rectangle bbox)    Sets text bbox.
void    setLogicalStructureTreeItem(LogicalStructureTreeItem logicalStructureTreeItem)    Sets logical structure tree parent item for the text info.
void    setText(String newText)    Sets text element.

Constructor Details
Method Details
getText
public String getText()
Gets text element.
Returns:
String

setText
public void setText(String newText)
Sets text element.
Parameters:
newText - retrieved text

getBboxRect
public com.itextpdf.kernel.geom.Rectangle getBboxRect()
Gets bbox coordinates.
Returns:
Rectangle describing text bbox

setBboxRect
public void setBboxRect(com.itextpdf.kernel.geom.Rectangle bbox)
Sets text bbox.
Parameters:
bbox - Rectangle describing text bbox

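As a rough illustration of how the bbox accessors above might be used, the sketch below fills a few word-level items and computes the rectangle enclosing them, as one might when merging words into a line. So that it compiles without iText on the classpath, Rectangle and TextInfo here are simplified local stand-ins: they mirror only the accessors listed on this page plus the (x, y, width, height) constructor and edge getters of com.itextpdf.kernel.geom.Rectangle. The no-argument TextInfo constructor, the class name BboxSketch, the helpers word and enclosingBox, and the coordinate values are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;

public class BboxSketch {

    // Local stand-in for com.itextpdf.kernel.geom.Rectangle; (x, y) is the lower-left corner.
    static class Rectangle {
        private final float x, y, width, height;

        Rectangle(float x, float y, float width, float height) {
            this.x = x;
            this.y = y;
            this.width = width;
            this.height = height;
        }

        float getLeft() { return x; }
        float getBottom() { return y; }
        float getRight() { return x + width; }
        float getTop() { return y + height; }
        float getWidth() { return width; }
        float getHeight() { return height; }
    }

    // Local stand-in for com.itextpdf.pdfocr.TextInfo, limited to the text and bbox accessors.
    static class TextInfo {
        private String text;
        private Rectangle bboxRect;

        String getText() { return text; }
        void setText(String newText) { this.text = newText; }
        Rectangle getBboxRect() { return bboxRect; }
        void setBboxRect(Rectangle bbox) { this.bboxRect = bbox; }
    }

    // Hypothetical helper: builds one text item from a string and its box.
    static TextInfo word(String text, float x, float y, float w, float h) {
        TextInfo info = new TextInfo();
        info.setText(text);
        info.setBboxRect(new Rectangle(x, y, w, h));
        return info;
    }

    // Hypothetical helper: smallest rectangle that contains every item's bbox.
    static Rectangle enclosingBox(List<TextInfo> items) {
        if (items.isEmpty()) {
            throw new IllegalArgumentException("at least one text item is required");
        }
        float left = Float.MAX_VALUE, bottom = Float.MAX_VALUE;
        float right = -Float.MAX_VALUE, top = -Float.MAX_VALUE;
        for (TextInfo item : items) {
            Rectangle r = item.getBboxRect();
            left = Math.min(left, r.getLeft());
            bottom = Math.min(bottom, r.getBottom());
            right = Math.max(right, r.getRight());
            top = Math.max(top, r.getTop());
        }
        return new Rectangle(left, bottom, right - left, top - bottom);
    }

    public static void main(String[] args) {
        List<TextInfo> words = new ArrayList<>();
        words.add(word("Hello", 10f, 100f, 40f, 12f));
        words.add(word("world", 55f, 98f, 42f, 14f));

        Rectangle line = enclosingBox(words);
        System.out.println(line.getLeft() + " " + line.getBottom() + " "
                + line.getWidth() + " " + line.getHeight());
    }
}
```

With the real library, the two stand-in classes would be replaced by the corresponding imports; the merging logic itself relies only on the getters shown.
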
getLogicalStructureTreeItem
public LogicalStructureTreeItem getLogicalStructureTreeItem()
Retrieves structure tree item for the text item.
Returns:
structure tree item.

setLogicalStructureTreeItem
public void setLogicalStructureTreeItem(LogicalStructureTreeItem logicalStructureTreeItem)
Sets the logical structure tree parent item for the text info. This allows text chunks to be organized into a logical hierarchy, e.g. to specify document paragraphs, tables, etc. If a LogicalStructureTreeItem is set, then the list of TextInfo objects in the return value of IOcrEngine.doImageOcr(java.io.File) is expected to be in logical order.
Parameters:
logicalStructureTreeItem - structure tree item.

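To make the logical-order expectation more concrete, here is a minimal sketch that tags text items with a parent structure item and then groups a list, assumed to be already in logical order, by that parent. TextInfo and LogicalStructureTreeItem are simplified local stand-ins so the example compiles without the library. The label field, the no-argument TextInfo constructor, the class name LogicalOrderSketch, and the helpers chunk and groupByParent are hypothetical and exist only for this illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class LogicalOrderSketch {

    // Stand-in for the library's LogicalStructureTreeItem; "label" is a hypothetical field.
    static class LogicalStructureTreeItem {
        final String label;

        LogicalStructureTreeItem(String label) {
            this.label = label;
        }
    }

    // Stand-in for TextInfo, limited to the text and structure tree accessors.
    static class TextInfo {
        private String text;
        private LogicalStructureTreeItem logicalStructureTreeItem;

        String getText() { return text; }
        void setText(String newText) { this.text = newText; }
        LogicalStructureTreeItem getLogicalStructureTreeItem() { return logicalStructureTreeItem; }
        void setLogicalStructureTreeItem(LogicalStructureTreeItem item) {
            this.logicalStructureTreeItem = item;
        }
    }

    // Hypothetical helper: creates a text chunk attached to a parent structure item.
    static TextInfo chunk(String text, LogicalStructureTreeItem parent) {
        TextInfo info = new TextInfo();
        info.setText(text);
        info.setLogicalStructureTreeItem(parent);
        return info;
    }

    // Groups chunks by parent item. LinkedHashMap keeps first-seen order, so an
    // input list in logical order yields groups and texts in that same order.
    static Map<LogicalStructureTreeItem, List<String>> groupByParent(List<TextInfo> items) {
        Map<LogicalStructureTreeItem, List<String>> groups = new LinkedHashMap<>();
        for (TextInfo item : items) {
            groups.computeIfAbsent(item.getLogicalStructureTreeItem(), k -> new ArrayList<>())
                    .add(item.getText());
        }
        return groups;
    }

    public static void main(String[] args) {
        LogicalStructureTreeItem first = new LogicalStructureTreeItem("paragraph 1");
        LogicalStructureTreeItem second = new LogicalStructureTreeItem("paragraph 2");

        List<TextInfo> ordered = new ArrayList<>();
        ordered.add(chunk("Hello", first));
        ordered.add(chunk("world", first));
        ordered.add(chunk("Next", second));

        for (Map.Entry<LogicalStructureTreeItem, List<String>> e : groupByParent(ordered).entrySet()) {
            System.out.println(e.getKey().label + ": " + String.join(" ", e.getValue()));
        }
    }
}
```

The grouping only preserves whatever order the input list has, which is why the list is expected to arrive in logical order once structure items are set.
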