HtmlCleaner can be used in java code, as command line tool or as Ant task.
It is designed to be small, independent (no runtime dependencies except
JRE 1.5+), fast and flexible (its behavior is configurable through number of
parameters). Although the main motive was to prepare ordinary HTML for XML
processing with XPath, XQuery and XSLT, structured data produced by
HtmlCleaner may be consumed and handled in many other ways.
.
This package contains de library itself.
Installed Size: 212.0 kB
Architectures: all