This is an allowlist-based and very opinionated HTML sanitizer
that can be used both for untrusted and trusted sources.
It attempts to clean up the mess
made by various rich text editors and or copy-pasting
to make styling of webpages simpler and more consistent.
It builds on the excellent HTML cleaner in lxml
to make the result both valid and safe.
.
HTML sanitizer goes further than e.g. bleach
in that it not only ensures that content is safe
and tags and attributes conform to a given allowlist,
but also applies additional transforms to HTML fragments.
Installed Size: 69.6 kB
Architectures: all