Trained module to guess a document's language
Text::Language::Guess guesses a document's language. Its implementation is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN, it determines how many of the known stopwords the document contains for each language supported by "Lingua::StopWords". Each word in the document recognized as stopword of a particular language scores one point for this language. The "language_guess()" function takes a document as a parameter and returns the abbreviation of the language that it is most likely written in.
$
pkg install p5-Text-Language-GuessOrigin
textproc/p5-Text-Language-Guess
Size
12.3KiB
License
not specified
Maintainer
markun@onohara.to
Dependencies
4 packages
Required by
0 packages