tagsoup
1.2.1_2SAX-compliant parser written in Java
TagSoup - Just Keep On Truckin' TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: poor, nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. TagSoup also includes a command-line processor that reads HTML files and can generate either clean HTML or well-formed XML that is a close approximation to XHTML.
Origin: textproc/tagsoup
Category: textproc
Size: 98.8KiB
License: APACHE20
Maintainer: java@FreeBSD.org
Dependencies: 1 packages
Required by: 0 packages
Website: vrici.lojban.org/~cowan/tagsoup
$
pkg install tagsoupDependencies (1)
More in textproc
libxml22.15.2
XML parser library for GNOMEexpat2.7.4
XML 1.0 parser written in Cqt5-xml5.15.18p109
Qt SAX and DOM implementations (KDE patched)kf6-kcodecs6.22.0
String encoding librarylibxslt1.1.45
XML stylesheet transformation libraryrubygem-nokogiri1.19.1
HTML, XML, SAX, and Reader parseraspell0.60.8.1_1,1
Spelling checker with better suggestion logic than ispellphp84-xml8.4.16
The xml shared extension for phpkf6-sonnet6.22.0
Multi-language spell checkerp5-XML-LibXML2.0210_1,1
Interface to Gnome libxml2 library