public final class CmsExtractorOpenOffice extends A_CmsTextExtractor
Modifier and Type | Method and Description |
---|---|
I_CmsExtractionResult | extractText(java.io.InputStream in, java.lang.String encoding) Extracts the text and meta information from the document on the input stream, using the specified content encoding. |
static I_CmsTextExtractor | getExtractor() Returns an instance of this text extractor. |
combineContentItem, extractText, extractText, extractText, extractText, removeControlChars
public static I_CmsTextExtractor getExtractor()
public I_CmsExtractionResult extractText(java.io.InputStream in, java.lang.String encoding) throws java.lang.Exception
I_CmsTextExtractor
The encoding is a hint for the text extractor; if the value given is null, the text extractor should try to figure out the encoding itself.
extractText in interface I_CmsTextExtractor
extractText in class A_CmsTextExtractor
in - the input stream for the document to extract the text from
encoding - the encoding to use
java.lang.Exception - if the text extraction fails
A_CmsTextExtractor.extractText(java.io.InputStream, java.lang.String)
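
The description above treats the encoding argument as a hint: a non-null value is used as given, and a null value leaves the extractor to work out the encoding itself. The following is a minimal sketch of one way such a contract could be honored. It is not taken from the OpenCms sources; the class `EncodingHintSketch`, the method `resolveEncoding`, the byte-order-mark check, and the UTF-8 fallback are all assumptions made for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of OpenCms: illustrates "encoding as a hint".
public class EncodingHintSketch {

    /**
     * Returns the charset named by the hint if one is given. With a null hint,
     * inspects the leading bytes for a byte-order mark and otherwise falls back
     * to UTF-8 (an assumed default, chosen only for this sketch).
     */
    public static Charset resolveEncoding(String hint, byte[] head) {
        if (hint != null) {
            // The caller supplied an encoding, so it takes precedence.
            return Charset.forName(hint);
        }
        if (startsWith(head, 0xEF, 0xBB, 0xBF)) {
            return StandardCharsets.UTF_8;
        }
        if (startsWith(head, 0xFE, 0xFF)) {
            return StandardCharsets.UTF_16BE;
        }
        if (startsWith(head, 0xFF, 0xFE)) {
            return StandardCharsets.UTF_16LE;
        }
        // No byte-order mark found: use the assumed default.
        return StandardCharsets.UTF_8;
    }

    // Compares the first bytes of data against the given unsigned byte values.
    private static boolean startsWith(byte[] data, int... prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if ((data[i] & 0xFF) != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        byte[] utf16le = {(byte) 0xFF, (byte) 0xFE, 0x41, 0x00};
        // Null hint: the charset is inferred from the byte-order mark.
        System.out.println(resolveEncoding(null, utf16le));
        // Explicit hint: the supplied encoding is used regardless of the bytes.
        System.out.println(resolveEncoding("ISO-8859-1", utf16le));
    }
}
```

In caller terms, this corresponds to passing `null` as the second argument of `extractText` when the document encoding is unknown, and a charset name such as `"UTF-8"` when it is known.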