P5-lingua-en-namedentity

Jul 20, 2023

Basic Named Entity Extraction algorithm

“Named entities” is the NLP jargon for proper nouns which represent people, places, organisations, and so on.

This module provides a very simple way of extracting these from a text. If we run the “extract_entities” routine on a piece of news coverage of recent UK political events, we should expect to see it return a list of hash references looking like this

entity => ‘Mr Howard’, class => ‘person’, scores => … , , entity => ‘Ministry of Defence’, class => ‘organisation’, … , entity => ‘Oxfordshire’, class => ‘place’, … ,

The additional “scores” hash reference in there breaks down the various possible classes for this entity in an open-ended scale.



Checkout these related ports:
  • Zxing-cpp - ZXing C++ Library for QR code recognition
  • Zu-hunspell - Zulu hunspell dictionaries
  • Zu-aspell - Aspell Zulu dictionary
  • Zq - Easier and faster alternative to jq
  • Zorba - General purpose C++ XQuery processor
  • Zenxml - Simple C++ XML Processing
  • Zed - Command-line tool to manage and query Zed data lakes
  • Yq - Command-line YAML and XML processor, jq wrapper for YAML/XML documents
  • Yould - Pronounceable word generator
  • Yodl - Easy to use but powerful document formatting/preparation language
  • Yi-hunspell - Yiddish hunspell dictionaries
  • Yi-aspell - Aspell Yiddish dictionary
  • Yelp-xsl - DocBook XSLT stylesheets for yelp
  • Yelp-tools - Utilities to help manage documentation for Yelp and the web
  • Ydiff - Diff readability enhancer for color terminals