py311-html5-parser
0.4.12_4Fast implementation of the HTML 5 parsing spec for Python
A fast implementation of the HTML 5 parsing spec for Python. Parsing is done in C using a variant of the gumbo parser. The gumbo parse tree is then transformed into an lxml tree, also in C, yielding parse times that can be a thirtieth of the html5lib parse times. That is a speedup of 30x. This differs, for instance, from the gumbo python bindings, where the initial parsing is done in C but the transformation into the final tree is done in python.
Origin: www/py-html5-parser
Category: www
Size: 544KiB
License: APACHE20
Maintainer: madpilot@FreeBSD.org
Dependencies: 2 packages
Required by: 2 packages
Website: html5-parser.readthedocs.io
$
pkg install py311-html5-parserDependencies (2)
Required By (2 packages)
More in www
py311-requests2.32.5
Python HTTP for Humansp5-libwww6.81
Perl5 library for WWW accessp5-HTML-Parser3.83
Perl5 module for parsing HTML documentsphp84-session8.4.16
The session shared extension for phpp5-Catalyst-Runtime5.90132_1
Elegant MVC Web Application Framework (Runtime)py311-django424.2.29
High-level Python Web Frameworkapache242.4.66
Version 2.4.x of Apache web serverp5-HTTP-Message7.01
Representation of HTTP style messagesp5-Template-Toolkit3.102
Extensible template processing systemp5-Plack1.0051
Perl extension of PSGI reference implementation and utilities