libboilerpipe-java - 1.2.0-2 main

The boilerpipe library provides algorithms to detect and remove the surplus
"clutter" (boilerplate, templates) around the main textual content of a web
page.
.
The library already provides specific strategies for common tasks (for example:
news article extraction) and may also be easily extended for individual problem
settings.
.
Extracting content is very fast (milliseconds), just needs the input document
(no global or site-level information required) and is usually quite accurate.

Priority: optional
Section: java
Suites: amber byzantium crimson dawn landing 
Maintainer: Debian Java Maintainers <pkg-java-maintainers [꩜] lists.alioth.debian.org>
 
Homepage Source Package
 

Dependencies

Installed Size: 135.2 kB
Architectures: all 

 

Versions

1.2.0-2 all