Unicode Standard tokenization routines
The segments package provides Unicode Standard tokenization routines and orthography segmentation. It implements the linear algorithm described in the orthography profile specification from The Unicode Cookbook for Linguists.
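The idea behind orthography segmentation can be illustrated with a minimal sketch. It uses only the Python standard library and is not the segments package's own code or API. It assumes the profile is a plain list of multi-character graphemes and that matching is greedy, left to right, preferring the longest entry. The names `segment` and `PROFILE` and the replacement marker for unknown characters are hypothetical choices made for this illustration.

```python
def segment(text, graphemes, unknown="\ufffd"):
    """Split text into graphemes by greedy left-to-right longest match.

    `graphemes` is an illustrative stand-in for an orthography profile:
    a collection of strings that each count as one unit.
    Characters not covered by the profile are replaced by `unknown`.
    """
    known = set(graphemes)
    longest = max((len(g) for g in known), default=0)
    out = []
    i = 0
    while i < len(text):
        # Try the longest possible candidate first, then shrink.
        for size in range(min(longest, len(text) - i), 0, -1):
            chunk = text[i:i + size]
            if chunk in known:
                out.append(chunk)
                i += size
                break
        else:
            # No profile entry starts here: mark one character as unknown.
            out.append(unknown)
            i += 1
    return out


# Hypothetical profile: "sch" and "aa" are each treated as a single grapheme.
PROFILE = ["a", "aa", "ch", "n", "sch", "t"]

print(" ".join(segment("tschaan", PROFILE)))  # t sch aa n
```

Because the candidate length is bounded by the longest profile entry, each position is examined a constant number of times, so the running time grows linearly with the input length.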
$ pkg install py311-segments
Origin: textproc/py-segments
Size: 126KiB
License: APACHE20
Maintainer: yuri@FreeBSD.org
Dependencies: 3 packages
Required by: 0 packages