Package | Description |
---|---|
bixo.parser | |
Modifier and Type | Class and Description |
---|---|
class | BoilerpipeContentExtractor: content extractor that returns content cleaned by Boilerpipe |
class | HtmlContentExtractor |
class | SimpleContentExtractor |
Modifier and Type | Field and Description |
---|---|
protected BaseContentExtractor | SimpleParser._contentExtractor |
Constructor and Description |
---|
SimpleParser(BaseContentExtractor contentExtractor, BaseLinkExtractor linkExtractor, ParserPolicy parserPolicy) |
SimpleParser(BaseContentExtractor contentExtractor, BaseLinkExtractor linkExtractor, ParserPolicy parserPolicy, boolean includeMarkup) |
SimpleParser(BaseContentExtractor contentExtractor, BaseLinkExtractor linkExtractor, ParserPolicy parserPolicy, org.apache.tika.parser.ParseContext parseContext) |
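
Each constructor listed above receives the content extractor as an argument and SimpleParser keeps it in a protected field, which suggests the extraction strategy is meant to be swapped by the caller. The following is a minimal, self-contained sketch of that constructor-injection pattern only. `ContentExtractor`, `KeepAllExtractor`, `LongBlockExtractor`, and `DemoParser` are hypothetical stand-ins invented for illustration; they are not Bixo classes, and the actual methods of BaseContentExtractor and SimpleParser may differ.

```java
import java.util.Arrays;
import java.util.List;

public class ExtractorInjectionSketch {

    /** Hypothetical stand-in for an extractor base class; not the Bixo type. */
    public abstract static class ContentExtractor {
        public abstract void reset();
        public abstract void addText(String text);
        public abstract String getContent();
    }

    /** Keeps every non-empty fragment, joined by single spaces. */
    public static class KeepAllExtractor extends ContentExtractor {
        private final StringBuilder content = new StringBuilder();

        @Override public void reset() { content.setLength(0); }

        @Override public void addText(String text) {
            String trimmed = text.trim();
            if (trimmed.isEmpty()) {
                return;
            }
            if (content.length() > 0) {
                content.append(' ');
            }
            content.append(trimmed);
        }

        @Override public String getContent() { return content.toString(); }
    }

    /** Keeps only fragments with at least minWords words (a crude boilerplate filter). */
    public static class LongBlockExtractor extends KeepAllExtractor {
        private final int minWords;

        public LongBlockExtractor(int minWords) { this.minWords = minWords; }

        @Override public void addText(String text) {
            if (text.trim().split("\\s+").length >= minWords) {
                super.addText(text);
            }
        }
    }

    /** Hypothetical parser that receives its extractor through the constructor. */
    public static class DemoParser {
        protected final ContentExtractor contentExtractor;

        public DemoParser(ContentExtractor contentExtractor) {
            this.contentExtractor = contentExtractor;
        }

        public String parse(List<String> fragments) {
            contentExtractor.reset(); // clear state left over from a previous document
            for (String fragment : fragments) {
                contentExtractor.addText(fragment);
            }
            return contentExtractor.getContent();
        }
    }

    public static void main(String[] args) {
        List<String> fragments =
                Arrays.asList("Home", "This is the main article text.", "Login");
        System.out.println(new DemoParser(new KeepAllExtractor()).parse(fragments));
        System.out.println(new DemoParser(new LongBlockExtractor(3)).parse(fragments));
    }
}
```

In this sketch the parser never needs to know which extractor it holds; choosing between the keep-everything and the filtering behavior happens entirely at construction time, which is the same shape as passing a different BaseContentExtractor subclass to SimpleParser.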
Copyright © 2012 Bixo Labs