Package: cleanNLP 3.1.0
cleanNLP: A Tidy Data Model for Natural Language Processing
Provides a set of fast tools for converting a textual corpus into a set of normalized tables. Users may use the 'udpipe' back end, which has no external dependencies, or a Python back end with 'spaCy' <https://spacy.io>. Exposed annotation tasks include tokenization, part-of-speech tagging, named entity recognition, and dependency parsing.
Authors:
cleanNLP_3.1.0.tar.gz
cleanNLP_3.1.0.zip (r-4.7), cleanNLP_3.1.0.zip (r-4.6), cleanNLP_3.1.0.zip (r-4.5)
cleanNLP_3.1.0.tgz (r-4.6-any), cleanNLP_3.1.0.tgz (r-4.5-any)
cleanNLP_3.1.0.tar.gz (r-4.6-any), cleanNLP_3.1.0.tar.gz (r-4.5-any)
cleanNLP_3.1.0.tgz (r-4.5-emscripten)
cleanNLP.pdf | cleanNLP.html
cleanNLP/json (API)
NEWS
    # Install 'cleanNLP' in R:
    install.packages('cleanNLP', repos = c('https://taylor-arnold.r-universe.dev', 'https://cloud.r-project.org'))
Bug tracker: https://github.com/statsmaths/cleannlp/issues
Datasets:
- un - Universal Declaration of Human Rights
- word_frequency - Most frequent English words
algorithms, spatial-analysis, text-analysis
Last updated from: fef7c1b376. Checks: 9 OK. Indexed: yes.
| Target | Result | Time |
|---|---|---|
| linux-devel-x86_64 | OK | 140 |
| source / vignettes | OK | 192 |
| linux-release-x86_64 | OK | 136 |
| macos-release-arm64 | OK | 115 |
| macos-oldrel-arm64 | OK | 88 |
| windows-devel | OK | 72 |
| windows-release | OK | 72 |
| windows-oldrel | OK | 67 |
| wasm-release | OK | 123 |
Exports: cnlp_annotate, cnlp_download_spacy, cnlp_init_spacy, cnlp_init_stringi, cnlp_init_udpipe, cnlp_utils_pca, cnlp_utils_tf, cnlp_utils_tfidf
Dependencies: data.table, here, jsonlite, lattice, Matrix, png, rappdirs, Rcpp, RcppTOML, reticulate, rlang, rprojroot, stringi, udpipe, withr
Creating Text Visualizations with Wikipedia Data
Rendered from wikipedia.Rmd using knitr::rmarkdown on Apr 05 2026. Last update: 2025-06-08
Started: 2025-06-08
Exploring the State of the Union Addresses: A Case Study with cleanNLP
Rendered from state-of-union.Rmd using knitr::rmarkdown on Apr 05 2026. Last update: 2025-06-08
Started: 2025-06-08
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| cleanNLP: A Tidy Data Model for Natural Language Processing | cleanNLP-package cleanNLP |
| Run the annotation pipeline on a set of documents | cnlp_annotate |
| Download model files needed for spacy | cnlp_download_spacy |
| Interface for initializing the spacy backend | cnlp_init_spacy |
| Interface for initializing the standard R backend | cnlp_init_stringi |
| Interface for initializing the udpipe backend | cnlp_init_udpipe |
| Compute Principal Components and store as a Data Frame | cnlp_utils_pca |
| Construct the TF-IDF Matrix from Annotation or Data Frame | cnlp_utils_tf cnlp_utils_tfidf |
| Universal Declaration of Human Rights | un |
| Most frequent English words | word_frequency |
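
The help topics above can be combined into a short annotation run. The following is a minimal sketch rather than the package's canonical workflow: it assumes cleanNLP is already installed, uses the stringi backend (so no model files are downloaded), and annotates the bundled `un` data set. Passing `token_var = "token"` to `cnlp_utils_tfidf` is a choice made here for illustration, not something this page prescribes.

```r
# Minimal sketch; assumes the cleanNLP package is installed.
library(cleanNLP)

# Initialize the stringi backend (listed above as the standard R backend).
cnlp_init_stringi()

# Annotate the bundled Universal Declaration of Human Rights data set.
# verbose = FALSE turns off the progress messages.
anno <- cnlp_annotate(un, verbose = FALSE)

# The result is a named list of tables; inspect the token table.
names(anno)
head(anno$token)

# Build a TF-IDF matrix from the token table, one row per document.
# token_var = "token" is an assumption made for this sketch.
tfidf <- cnlp_utils_tfidf(anno$token, token_var = "token")
dim(tfidf)
```

Swapping `cnlp_init_stringi()` for `cnlp_init_udpipe()` or `cnlp_init_spacy()` should select one of the other backends listed above, although those need model files to be available first.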
