Wondering what’s next for npm?Check out our public roadmap! »

    en-lexicon
    TypeScript icon, indicating that this package has built-in type declarations

    1.0.11 • Public • Published

    English Lexicon

    Extensible English language lexicon for POS tagging with Emojis and around 110K words

    Installation

    npm install en-lexicon --save
    

    Usage

     
    const lexicon = require("en-lexicon");
     
    console.log(lexicon.lexicon.faraway);
    // "JJ"
     
    // multiple POS tags are separated by "|"
    console.log(lexicon.lexicon.acquired);
    // "VBN|JJ|VBD"
     

    Extending

    One of the main reason that I had to write my own lexicon module is that I needed it to be extensible.

    To extend the lexion with medical terms for example:

    const lexicon = require("en-lexicon");
    lexicon.extend({
        lactate:"VB",
        serum:"NN"
    });
     
    // Now that you've extended the lexicon with your own terms
    // you won't only get the terms you entered
    // The lexicon will (try) to be smart and
    // apply some inflections on those terms
     
    // the term you entered
    console.log(lexicon.lexicon.lactate);
    // "VB"
    console.log(lexicon.lexicon.lactated);
    // "VBD|VBN"
    console.log(lexicon.lexicon.lactating);
    // "VBG"
     

    Credits

    I've used Eric Brill's lexicon as starting point for this project, manually corrected some cases, and expanded it using various corpora, this one and this one for example.

    License

    License: The MIT License (MIT) - Copyright (c) 2017 Alex Corvi

    Install

    npm i en-lexicon

    DownloadsWeekly Downloads

    3,924

    Version

    1.0.11

    License

    MIT

    Last publish

    Collaborators

    • avatar