public interface I_CmsTextExtractor
限定符和类型 | 方法和说明 |
---|---|
I_CmsExtractionResult |
extractText(byte[] content)
Extracts the text and meta information from the given binary document.
|
I_CmsExtractionResult |
extractText(byte[] content,
java.lang.String encoding)
Extracts the text and meta information from the given binary document, using the specified content encoding.
|
I_CmsExtractionResult |
extractText(java.io.InputStream in)
Extracts the text and meta information from the document on the input stream.
|
I_CmsExtractionResult |
extractText(java.io.InputStream in,
java.lang.String encoding)
Extracts the text and meta information from the document on the input stream, using the specified content encoding.
|
I_CmsExtractionResult extractText(byte[] content) throws java.lang.Exception
The encoding of the input stream is either not required (the document type may have one common default encoding) or the extractor is able to divine the encoding from the provided binary array automatically.
Delivers is the same result as calling
when extractText(byte[], String)
String == null
.
content
- the binary content of the document to extract the text fromjava.lang.Exception
- if the text extration failsI_CmsExtractionResult extractText(byte[] content, java.lang.String encoding) throws java.lang.Exception
The encoding is a hint for the text extractor, if the value given is null
then
the text extractor should try to figure out the encoding itself.
content
- the binary content of the document to extract the text fromencoding
- the encoding to usejava.lang.Exception
- if the text extration failsI_CmsExtractionResult extractText(java.io.InputStream in) throws java.lang.Exception
The encoding of the input stream is either not required (the document type may have one common default encoding) or the extractor is able to divine the encoding from the provided input stream automatically.
Delivers is the same result as calling
when extractText(InputStream, String)
String == null
.
in
- the input stream for the document to extract the text fromjava.lang.Exception
- if the text extration failsI_CmsExtractionResult extractText(java.io.InputStream in, java.lang.String encoding) throws java.lang.Exception
The encoding is a hint for the text extractor, if the value given is null
then
the text extractor should try to figure out the encoding itself.
in
- the input stream for the document to extract the text fromencoding
- the encoding to usejava.lang.Exception
- if the text extration fails