Package: text2vec 0.6.4
text2vec: Modern Text Mining Framework for R
Fast and memory-friendly tools for text vectorization, topic modeling (LDA, LSA), word embeddings (GloVe), similarities. This package provides a source-agnostic streaming API, which allows researchers to perform analysis of collections of documents which are larger than available RAM. All core functions are parallelized to benefit from multicore machines.
Authors:
text2vec_0.6.4.tar.gz
text2vec_0.6.4.zip (r-4.5), text2vec_0.6.4.zip (r-4.4), text2vec_0.6.4.zip (r-4.3)
text2vec_0.6.4.tgz (r-4.4-x86_64), text2vec_0.6.4.tgz (r-4.4-arm64), text2vec_0.6.4.tgz (r-4.3-x86_64), text2vec_0.6.4.tgz (r-4.3-arm64)
text2vec_0.6.4.tar.gz (r-4.5-noble), text2vec_0.6.4.tar.gz (r-4.4-noble)
text2vec_0.6.4.tgz (r-4.4-emscripten), text2vec_0.6.4.tgz (r-4.3-emscripten)
text2vec.pdf | text2vec.html
text2vec/json (API)
NEWS
# Install 'text2vec' in R:
install.packages('text2vec', repos = c('https://dselivanov.r-universe.dev', 'https://cloud.r-project.org'))
Bug tracker: https://github.com/dselivanov/text2vec/issues
- movie_review - IMDB movie reviews
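
As a rough illustration of the vectorization workflow described above, the sketch below builds a document-term matrix from the bundled movie_review dataset, using functions from the package's export list. It assumes movie_review has `id` and `review` columns. The preprocessing choices (lowercasing, word tokenization) are illustrative, not a recommended configuration. The data here fits in memory; for larger collections the same pipeline would be fed by an iterator over files rather than an in-memory vector.

```r
library(text2vec)

# movie_review ships with text2vec (assumed columns: id, review).
data("movie_review")

# itoken() creates an iterator over tokenized documents, so later steps
# consume the collection chunk by chunk instead of all at once.
it <- itoken(movie_review$review,
             preprocessor = tolower,       # illustrative choice
             tokenizer = word_tokenizer,   # illustrative choice
             ids = movie_review$id,
             progressbar = FALSE)

# First pass over the iterator: collect the vocabulary.
vocab <- create_vocabulary(it)

# Second pass: map each document onto the vocabulary, producing a sparse
# document-term matrix with one row per document and one column per term.
vectorizer <- vocab_vectorizer(vocab)
dtm <- create_dtm(it, vectorizer)

dim(dtm)
```

The resulting `dtm` is a sparse matrix that can be passed on to the package's modeling tools, for example TfIdf, LSA, or LDA.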
glove, latent-dirichlet-allocation, natural-language-processing, text-mining, topic-modeling, vectorization, word-embeddings, word2vec
Last updated 3 months ago from:bea438b105. Checks: OK: 7, NOTE: 2. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 14 2024 |
R-4.5-win-x86_64 | NOTE | Nov 14 2024 |
R-4.5-linux-x86_64 | NOTE | Nov 14 2024 |
R-4.4-win-x86_64 | OK | Nov 14 2024 |
R-4.4-mac-x86_64 | OK | Nov 14 2024 |
R-4.4-mac-aarch64 | OK | Nov 14 2024 |
R-4.3-win-x86_64 | OK | Nov 14 2024 |
R-4.3-mac-x86_64 | OK | Nov 14 2024 |
R-4.3-mac-aarch64 | OK | Nov 14 2024 |
Exports: as.lda_c, BNS, char_tokenizer, check_analogy_accuracy, coherence, Collocations, combine_vocabularies, create_dtm, create_tcm, create_vocabulary, dist2, fit, fit_transform, GlobalVectors, GloVe, hash_vectorizer, idir, ifiles, ifiles_parallel, itoken, itoken_parallel, jsPCA_robust, LatentDirichletAllocation, LatentSemanticAnalysis, LDA, LSA, normalize, pdist2, perplexity, postag_lemma_tokenizer, prepare_analogy_questions, prune_vocabulary, psim2, RelaxedWordMoversDistance, RWMD, sim2, space_tokenizer, split_into, TfIdf, vocab_vectorizer, vocabulary, word_tokenizer
Dependencies: data.table, digest, float, lattice, lgr, Matrix, MatrixExtra, mlapi, R6, Rcpp, RcppArmadillo, RhpcBLASctl, rsparse, stringi