Package: text2vec 0.6.6
text2vec: Modern Text Mining Framework for R
Fast and memory-friendly tools for text vectorization, topic modeling (LDA, LSA), word embeddings (GloVe), similarities. This package provides a source-agnostic streaming API, which allows researchers to perform analysis of collections of documents which are larger than available RAM. All core functions are parallelized to benefit from multicore machines.
Authors:
text2vec_0.6.6.tar.gz
text2vec_0.6.6.zip(r-4.7)text2vec_0.6.6.zip(r-4.6)text2vec_0.6.6.zip(r-4.5)
text2vec_0.6.6.tgz(r-4.6-x86_64)text2vec_0.6.6.tgz(r-4.6-arm64)text2vec_0.6.6.tgz(r-4.5-x86_64)text2vec_0.6.6.tgz(r-4.5-arm64)
text2vec_0.6.6.tar.gz(r-4.7-arm64)text2vec_0.6.6.tar.gz(r-4.7-x86_64)text2vec_0.6.6.tar.gz(r-4.6-arm64)text2vec_0.6.6.tar.gz(r-4.6-x86_64)
text2vec_0.6.6.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
card.svg |card.png
text2vec/json (API)
NEWS
| # Install 'text2vec' in R: |
| install.packages('text2vec', repos = c('https://dselivanov.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/dselivanov/text2vec/issues
- movie_review - IMDB movie reviews
glovelatent-dirichlet-allocationnatural-language-processingtext-miningtopic-modelingvectorizationword-embeddingsword2veccpp
Last updated from:0b31bdd81f. Checks:13 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-arm64 | OK | 249 | ||
| linux-devel-x86_64 | OK | 204 | ||
| source / vignettes | OK | 269 | ||
| linux-release-arm64 | OK | 176 | ||
| linux-release-x86_64 | OK | 242 | ||
| macos-release-arm64 | OK | 115 | ||
| macos-release-x86_64 | OK | 426 | ||
| macos-oldrel-arm64 | OK | 115 | ||
| macos-oldrel-x86_64 | OK | 230 | ||
| windows-devel | OK | 205 | ||
| windows-release | OK | 170 | ||
| windows-oldrel | OK | 188 | ||
| wasm-release | OK | 118 |
Exports:as.lda_cBNSchar_tokenizercheck_analogy_accuracycoherenceCollocationscombine_vocabulariescreate_dtmcreate_tcmcreate_vocabularydist2fitfit_transformGlobalVectorsGloVehash_vectorizeridirifilesifiles_parallelitokenitoken_paralleljsPCA_robustLatentDirichletAllocationLatentSemanticAnalysisLDALSAnormalizepdist2perplexitypostag_lemma_tokenizerprepare_analogy_questionsprune_vocabularypsim2RelaxedWordMoversDistanceRWMDsim2space_tokenizersplit_intoTfIdfvocab_vectorizervocabularyword_tokenizer
Dependencies:data.tabledigestfloatlatticelgrMatrixMatrixExtramlapiR6RcppRcppArmadilloRhpcBLASctlrsparsestringi
