Rubygem-whatlanguage

Jul 20, 2023

Natural language detection for text samples

WhatLanguage, written in pure-Ruby, detects the human language of supplied text. It uses Bloom filters, so it is fast and memory efficient. It works well on text of over 10 words in length e.g. blog posts or comments and very poorly on short or Twitter-esque text.

It works with Arabic, Dutch, English, Farsi, Finnish, French, German, Greek, Hebrew, Hungarian, Italian, Korean, Norwegian, Pinyin, Polish, Portuguese, Russian, Spanish, and Swedish out of the box.



Checkout these related ports:
  • Zxing-cpp - ZXing C++ Library for QR code recognition
  • Zu-hunspell - Zulu hunspell dictionaries
  • Zu-aspell - Aspell Zulu dictionary
  • Zq - Easier and faster alternative to jq
  • Zorba - General purpose C++ XQuery processor
  • Zenxml - Simple C++ XML Processing
  • Zed - Command-line tool to manage and query Zed data lakes
  • Yq - Command-line YAML and XML processor, jq wrapper for YAML/XML documents
  • Yould - Pronounceable word generator
  • Yodl - Easy to use but powerful document formatting/preparation language
  • Yi-hunspell - Yiddish hunspell dictionaries
  • Yi-aspell - Aspell Yiddish dictionary
  • Yelp-xsl - DocBook XSLT stylesheets for yelp
  • Yelp-tools - Utilities to help manage documentation for Yelp and the web
  • Ydiff - Diff readability enhancer for color terminals