Package: quanteda 4.4
quanteda: Quantitative Analysis of Textual Data
A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and n-grams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.
Authors:
quanteda_4.4.tar.gz
quanteda_4.4.zip(r-4.7)quanteda_4.4.zip(r-4.6)quanteda_4.4.zip(r-4.5)
quanteda_4.4.tgz(r-4.6-x86_64)quanteda_4.4.tgz(r-4.6-arm64)quanteda_4.4.tgz(r-4.5-x86_64)quanteda_4.4.tgz(r-4.5-arm64)
quanteda_4.4.tar.gz(r-4.6-arm64)quanteda_4.4.tar.gz(r-4.6-x86_64)quanteda_4.4.tar.gz(r-4.5-arm64)quanteda_4.4.tar.gz(r-4.5-x86_64)
quanteda_4.4.tgz(r-4.5-emscripten)
quanteda.pdf |quanteda.html✨
quanteda/json (API)
NEWS
| # Install 'quanteda' in R: |
| install.packages('quanteda', repos = c('https://quanteda.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/quanteda/quanteda/issues
Pkgdown/docs site:https://quanteda.io
- data_char_sampletext - A paragraph of text for testing various text-based functions
- data_char_ukimmig2010 - Immigration-related sections of 2010 UK party manifestos
- data_corpus_inaugural - US presidential inaugural address texts
- data_dfm_lbgexample - Dfm from data in Table 1 of Laver, Benoit, and Garry
- data_dictionary_LSD2015 - Lexicoder Sentiment Dictionary
corpusnatural-language-processingquantedatext-analyticsonetbbcpp
Last updated from:115799382e. Checks:13 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-arm64 | OK | 325 | ||
| linux-devel-x86_64 | OK | 343 | ||
| source / vignettes | OK | 379 | ||
| linux-release-arm64 | OK | 296 | ||
| linux-release-x86_64 | OK | 322 | ||
| macos-release-arm64 | OK | 270 | ||
| macos-release-x86_64 | OK | 532 | ||
| macos-oldrel-arm64 | OK | 233 | ||
| macos-oldrel-x86_64 | OK | 609 | ||
| windows-devel | OK | 393 | ||
| windows-release | OK | 378 | ||
| windows-oldrel | OK | 449 | ||
| wasm-release | OK | 195 |
Exports:%>%as.corpusas.dfmas.dictionaryas.fcmas.listas.phraseas.tensoras.tokensas.tokens_xptras.yamlbootstrap_dfmbreakrules_getbreakrules_resetbreakrules_setchar_keepchar_ngramschar_removechar_segmentchar_selectchar_tolowerchar_toupperchar_trimchar_wordstemcheck_charactercheck_doublecheck_integercheck_logicalcolMeanscolSumsCompareconcatconcatenatorconvertcorpuscorpus_chunkcorpus_groupcorpus_reshapecorpus_samplecorpus_segmentcorpus_subsetcorpus_trimdfmdfm_compressdfm_groupdfm_keepdfm_lookupdfm_matchdfm_removedfm_replacedfm_sampledfm_selectdfm_smoothdfm_sortdfm_subsetdfm_tfidfdfm_tolowerdfm_toupperdfm_trimdfm_weightdfm_wordstemdictionarydocfreqdociddocnamesdocnames<-docvarsdocvars<-fcmfcm_compressfcm_keepfcm_removefcm_selectfcm_sortfcm_tolowerfcm_toupperfeatfreqfeatnamesflatten_dictionaryindexinfo_tbbis.collocationsis.corpusis.dfmis.dictionaryis.fcmis.indexis.kwicis.phraseis.tokensis.tokens_xptrkwicmetameta<-ndocnfeatnormalize_charactersnsentencentokenntypeobject2fixedobject2idpattern2fixedpattern2idphraseprintquanteda_optionsrowMeansrownames<-rowSumssegidsparsitystopwordsttextstexts<-tokenize_charactertokenize_customtokenize_dictionarytokenize_fasterwordtokenize_fastestwordtokenize_paragraphtokenize_sentencetokenize_word1tokenize_word2tokenize_word3tokenize_word4tokenstokens_annotatetokens_chunktokens_compoundtokens_grouptokens_keeptokens_lookuptokens_ngramstokens_removetokens_replacetokens_restoretokens_sampletokens_segmenttokens_selecttokens_skipgramstokens_splittokens_subsettokens_tolowertokens_touppertokens_trimtokens_wordstemtopfeaturestypes
Dependencies:clifastmatchISOcodesjsonlitelatticelifecyclemagrittrMatrixRcpprlangSnowballCstopwordsstringixml2yaml
