Class | Description |
---|---|
ConfigUtils | |
CrawlDirUtils | |
DiskQueue<E extends java.io.Serializable> |
A queue that writes extra elements to disk, and reads them in as needed.
|
DmozLinks | |
DomainInfo | |
DomainNames |
Utilities to extract the PLD (paid-level domain, as per the IRLbot paper)
from a hostname and perform similar hostname analysis.
|
EncodingUtils | |
EncodingUtils.ExpandedResult | |
FieldUtils | |
GroupingKey | |
HtmlUtils | |
HttpUtils | |
IoUtils | |
StringUtils | |
ThreadedExecutor |
A wrapper for ThreadPoolExecutor that implements a specific behavior we need in Bixo.
|
TimeStampUtils | |
UrlUtils |
Copyright © 2012 Bixo Labs