28 packages found

    Description

    A corpus of technical books written in Japanese

    Keywords

    Publisher

    published 2.0.1 a year ago

    Description

    A wrapper for CETEMPúblico, a European Portuguese corpus of news extracts from the newspaper Público, with 180 million words tagged automatically using PALAVRAS.

    Keywords

    Publisher

    published 1.4.0 10 months ago

    Description

    Corpus representation stored in JSON and wrapped in a Corpus CRUD API

    Keywords

    Publisher

    published 1.0.2 7 years ago

    Description

    Corpus CRUD API wrapper

    Keywords

    Publisher

    published 2.0.3 6 years ago

    Description

    Translates between languages using a statistical model

    Keywords

    Publisher

    published 0.8.3 a year ago

    Description

    A JavaScript (Node.js) library that converts a tagged (monolinear) text to DLx JSON format

    Keywords

    Publisher

    published 0.4.0 2 years ago

    Description

    Corpus CRUD API wrapper

    Keywords

    Publisher

    published 1.0.0 7 years ago

    Description

    A Node.js library for concordancing a corpus formatted according to the Data Format for Digital Linguistics (DaFoDiL)

    Keywords

    Publisher

    published 0.4.0 2 years ago

    Description

    Text corpora from Project Gutenberg used by NLTK.

    Keywords

    Publisher

    published 1.0.1 5 years ago

    Description

    State of the Union addresses by U.S. Presidents as a UMD bundle.

    Keywords

    Publisher

    published 0.0.93 4 months ago

    Description

    Spam Assassin public mail corpus as a UMD bundle.

    Keywords

    Publisher

    published 0.0.93 4 months ago

    Description

    The text of Moby Dick by Herman Melville as a UMD bundle.

    Keywords

    Publisher

    published 0.0.93 4 months ago

    Description

    Text mining library

    Keywords

    Publisher

    published 1.1.2 5 years ago

    Description

    List of ~636,000 Spanish words

    Keywords

    Publisher

    published 2.0.0 a year ago

    Description

    A core type to handle CoNLL-U format

    Keywords

    Publisher

    published 0.2.0 5 months ago
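
For context on the CoNLL-U entry above: a CoNLL-U token line carries ten tab-separated fields (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC), and comment lines start with `#`. The sketch below is a minimal illustration of reading one sentence block into plain objects. The names `CONLLU_FIELDS` and `parseConlluSentence` are hypothetical and are not taken from the listed package, whose API is not shown here.

```javascript
// Hypothetical helper for illustration; not the API of the package listed above.
// Field names follow the ten columns defined by the CoNLL-U format.
const CONLLU_FIELDS = [
  "id", "form", "lemma", "upos", "xpos",
  "feats", "head", "deprel", "deps", "misc",
];

// Parse one sentence block (comment lines plus token lines) into plain objects.
// All values are kept as strings, because IDs may be ranges ("1-2") or
// decimals ("1.1") rather than plain integers.
function parseConlluSentence(block) {
  const comments = [];
  const tokens = [];
  for (const line of block.split("\n")) {
    if (line.trim() === "") continue; // skip blank lines
    if (line.startsWith("#")) {
      comments.push(line.slice(1).trim());
      continue;
    }
    const cols = line.split("\t");
    if (cols.length !== CONLLU_FIELDS.length) {
      throw new Error(`Expected 10 columns, got ${cols.length}: ${line}`);
    }
    const token = {};
    CONLLU_FIELDS.forEach((name, i) => {
      token[name] = cols[i];
    });
    tokens.push(token);
  }
  return { comments, tokens };
}

// A small made-up sentence used only to exercise the parser.
const sample = [
  "# text = Dogs bark.",
  "1\tDogs\tdog\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_",
  "2\tbark\tbark\tVERB\t_\t_\t0\troot\t_\tSpaceAfter=No",
  "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
].join("\n");

const parsed = parseConlluSentence(sample);
console.log(parsed.tokens.map((t) => `${t.form}/${t.upos}`).join(" "));
```

Keeping every column as a string is a deliberate simplification; a fuller implementation would likely split FEATS and MISC into key/value pairs.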

    Description

    Some classes to represent elements in a text corpus.

    Keywords

    Publisher

    published 0.0.2 10 months ago

    Description

    List of ~336,000 French words

    Keywords

    Publisher

    published 2.0.0 a year ago

    Description

    A dashboard to visualize a synthesis on a structured corpus, using several charts (pie, histogram, ...)

    Keywords

    Publisher

    published 6.8.5 5 years ago

    Description

    Calculate how many documents contain a certain term, within a list (`Array`) of text documents.

    Keywords

    Publisher

    published 0.0.1 7 years ago
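
The document-frequency calculation described in the entry above can be sketched in a few lines. This is only an illustration of the idea, assuming case-insensitive matching on whole words; the function name `documentFrequency` and the tokenization rule are assumptions, not the listed package's actual interface.

```javascript
// Hypothetical helper for illustration; not the API of the package listed above.
// Counts how many documents in the array contain the term at least once.
function documentFrequency(term, documents) {
  const needle = term.toLowerCase();
  let count = 0;
  for (const doc of documents) {
    // Split on any run of characters that are neither letters nor digits
    // (Unicode-aware), so "cat." matches "cat" but "concat" does not.
    const tokens = doc
      .toLowerCase()
      .split(/[^\p{L}\p{N}]+/u)
      .filter(Boolean);
    if (tokens.includes(needle)) count += 1;
  }
  return count;
}

const docs = [
  "The cat sat.",
  "A dog barked at the cat.",
  "Birds sing.",
];

console.log(documentFrequency("cat", docs)); // 2
```

Each document contributes at most one to the count, regardless of how often the term repeats inside it, which is what distinguishes document frequency from raw term frequency.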

    Description

    A CJK text tokenizer

    Keywords

    Publisher

    published 0.1.0 4 years ago