Package: cleanNLP 3.1.0

cleanNLP: A Tidy Data Model for Natural Language Processing

Provides a set of fast tools for converting a textual corpus into a set of normalized tables. Users may make use of the 'udpipe' back end with no external dependencies, or a Python back ends with 'spaCy' <https://spacy.io>. Exposed annotation tasks include tokenization, part of speech tagging, named entity recognition, and dependency parsing.

Authors:Taylor B. Arnold [aut, cre]

cleanNLP_3.1.0.tar.gz
cleanNLP_3.1.0.zip(r-4.7)cleanNLP_3.1.0.zip(r-4.6)cleanNLP_3.1.0.zip(r-4.5)
cleanNLP_3.1.0.tgz(r-4.6-any)cleanNLP_3.1.0.tgz(r-4.5-any)
cleanNLP_3.1.0.tar.gz(r-4.6-any)cleanNLP_3.1.0.tar.gz(r-4.5-any)
cleanNLP_3.1.0.tgz(r-4.5-emscripten)
cleanNLP.pdf |cleanNLP.html
cleanNLP/json (API)
NEWS

# Install 'cleanNLP' in R:
install.packages('cleanNLP', repos = c('https://taylor-arnold.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/statsmaths/cleannlp/issues

Datasets:
  • un - Universal Declaration of Human Rights
  • word_frequency - Most frequent English words

On CRAN:

Conda:

algorithmsspatial-analysistext-analysis

8.46 score 218 stars 267 scripts 364 downloads 8 exports 15 dependencies

Last updated from:fef7c1b376. Checks:9 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-x86_64OK140
source / vignettesOK192
linux-release-x86_64OK136
macos-release-arm64OK115
macos-oldrel-arm64OK88
windows-develOK72
windows-releaseOK72
windows-oldrelOK67
wasm-releaseOK123

Exports:cnlp_annotatecnlp_download_spacycnlp_init_spacycnlp_init_stringicnlp_init_udpipecnlp_utils_pcacnlp_utils_tfcnlp_utils_tfidf

Dependencies:data.tableherejsonlitelatticeMatrixpngrappdirsRcppRcppTOMLreticulaterlangrprojrootstringiudpipewithr

Creating Text Visualizations with Wikipedia Data

Rendered fromwikipedia.Rmdusingknitr::rmarkdownon Apr 05 2026.

Last update: 2025-06-08
Started: 2025-06-08

Exploring the State of the Union Addresses: A Case Study with cleanNLP

Rendered fromstate-of-union.Rmdusingknitr::rmarkdownon Apr 05 2026.

Last update: 2025-06-08
Started: 2025-06-08