Object
  PDFToString
public class PDFToString
This class provides a single static convert() method that converts the
text in a PDF file into an XML string. The string can then be
pre-filtered and added to a Lucene index by the
XMLTextProcessor class.
Internally, the text of the PDF file is extracted using the PDFBox library.
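The XML layout that convert() produces is not documented on this page. As a rough illustration of the general idea only, the sketch below wraps already-extracted page text in XML and escapes reserved characters so the result stays well-formed for later filtering. The class name PdfTextToXmlSketch, the helper methods, and the pdfDocument and page element names are all hypothetical and are not taken from PDFToString.

```java
import java.util.Arrays;
import java.util.List;

public class PdfTextToXmlSketch {
    // Escape the five XML-reserved characters so raw text is safe inside an element.
    static String escapeXml(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            switch (c) {
                case '&':  out.append("&amp;");  break;
                case '<':  out.append("&lt;");   break;
                case '>':  out.append("&gt;");   break;
                case '"':  out.append("&quot;"); break;
                case '\'': out.append("&apos;"); break;
                default:   out.append(c);
            }
        }
        return out.toString();
    }

    // Hypothetical layout (an assumption): one <page> element per page of text.
    static String toXml(List<String> pageTexts) {
        StringBuilder xml = new StringBuilder("<pdfDocument>");
        for (int i = 0; i < pageTexts.size(); i++) {
            xml.append("<page number=\"").append(i + 1).append("\">")
               .append(escapeXml(pageTexts.get(i)))
               .append("</page>");
        }
        return xml.append("</pdfDocument>").toString();
    }

    public static void main(String[] args) {
        System.out.println(toXml(Arrays.asList("Profit & loss", "x < y")));
    }
}
```

Escaping before wrapping matters because text extracted from a PDF can contain characters such as & or < that would otherwise break any XML parser used downstream.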
| Field Summary | |
|---|---|
| (package private) static boolean | mustConfigureLogger |
| (package private) static PDFTextStripper | stripper PDFBox text stripper. |
| Constructor Summary | |
|---|---|
| PDFToString() | |
| Method Summary | |
|---|---|
| (package private) static String | convert(InputStream PDFInputStream) Convert a PDF file into an XML string. |
| Methods inherited from class Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
static boolean mustConfigureLogger
static PDFTextStripper stripper
| Constructor Detail |
|---|
public PDFToString()
| Method Detail |
|---|
static String convert(InputStream PDFInputStream) throws IOException

Convert a PDF file into an XML string.

Parameters:
PDFInputStream - The stream of PDF data to convert to an XML string.

Returns:
… null.

Throws:
IOException
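A caller of a method with this shape has to handle both a thrown IOException and, judging by the return description, a possible null result. The sketch below shows one way to do that. Because convert() is package private and depends on PDFBox, the sketch uses a hypothetical stand-in interface, PdfConverter, with the same signature. ConvertCallerSketch and convertOrEmpty are illustrative names as well, not part of this API.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.Optional;

public class ConvertCallerSketch {
    // Hypothetical stand-in with the same shape as PDFToString.convert(InputStream).
    interface PdfConverter {
        String convert(InputStream pdfInputStream) throws IOException;
    }

    // Returns the XML if conversion succeeded, or an empty Optional when the
    // converter returned null or threw an IOException. The stream is always closed.
    static Optional<String> convertOrEmpty(PdfConverter converter, InputStream in) {
        try (InputStream stream = in) {
            return Optional.ofNullable(converter.convert(stream));
        } catch (IOException e) {
            System.err.println("PDF conversion failed: " + e.getMessage());
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        InputStream data = new ByteArrayInputStream(new byte[0]);
        // Stub converter that simulates a null result.
        Optional<String> xml = convertOrEmpty(s -> null, data);
        System.out.println(xml.isPresent());
    }
}
```

Wrapping the result in an Optional keeps the null check in one place, so code that later hands the XML to an indexer does not need to repeat it.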