Jul 20, 2023

Yet Another Part-of-Speech and Morphological Analyzer

MeCab is an open source Japanese morphological analyzer developed through a joint research project between the Graduate School of Informatics, Kyoto University and NTT (Nippon Telegraph and Telephone) Communication Science Laboratories. It has the following features:

  • General-purpose design independent of language, dictionary and corpus.
  • High precision of analysis based on Conditional Random Fields.
  • Faster than ChaSen, Juman and KAKASI.
  • Library is reentrant.
  • Scripting language bindings such as Perl/Ruby/Python/Java/C#.
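
As a rough illustration of the Python binding listed above, the sketch below assumes the binding is importable as `MeCab` and exposes `Tagger().parse()`, as in the widely used mecab-python3 package, and that a dictionary is installed. The helper names `parse_mecab_output` and `analyze` are hypothetical, and the sample text follows the IPADIC-style feature layout; other dictionaries use different columns.

```python
from typing import List, Tuple


def parse_mecab_output(raw: str) -> List[Tuple[str, List[str]]]:
    """Split MeCab's default text output into (surface, features) pairs."""
    tokens = []
    for line in raw.splitlines():
        if not line or line == "EOS":
            continue  # "EOS" marks the end of a sentence
        # Each token line is "<surface>\t<comma-separated features>"
        surface, _, feature_str = line.partition("\t")
        tokens.append((surface, feature_str.split(",")))
    return tokens


def analyze(text: str) -> List[Tuple[str, List[str]]]:
    """Run MeCab on text. Needs the Python binding and a dictionary installed."""
    import MeCab  # imported lazily so the parser above works without it

    tagger = MeCab.Tagger()
    return parse_mecab_output(tagger.parse(text))


# Illustrative output in the IPADIC feature layout (an assumption; other
# dictionaries emit a different number and order of feature columns).
SAMPLE = (
    "すもも\t名詞,一般,*,*,*,*,すもも,スモモ,スモモ\n"
    "も\t助詞,係助詞,*,*,*,*,も,モ,モ\n"
    "EOS\n"
)

if __name__ == "__main__":
    for surface, features in parse_mecab_output(SAMPLE):
        # features[0] is the coarse part of speech in this layout
        print(surface, features[0])
```

Keeping the output parser separate from the `MeCab` import means the parsing logic can be exercised on saved output even on a machine where the port is not installed.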

Check out these related ports:
  • Zipcodes - Japanese zipcode tables, including both the 3/5-digit and 7-digit forms
  • Zinnia - Simple, customizable, and portable online handwriting recognition system
  • Zinnia-tomoe - Handwriting recognition files for Zinnia (Tomoe data)
  • Yc.el - Yet another Canna client for Emacs
  • Xv - X11 program that displays images of various formats with japanization
  • Xtr - Japanese text formatting processor
  • Xshodou - Japanese shodou program for X based on Tcl/Tk
  • Xpdf - Japanese font support for xpdf
  • Xdtp - XML document transfer program
  • Wwasw-fpw - Biographical dictionary (EPWING V1 format)
  • Wordpress -
  • Wordnet-fpw - English - English Dictionary (EPWING V1 format)
  • Wnn7egg - Wnn7 elisp client
  • Webalizer -
  • Web1913-fpw - Webster's Revised Unabridged Dictionary (1913) (EPWING V1 format)