Generates dense vector embeddings using pre-trained word vectors
Public fields
dimensionEmbedding dimension
model_typeType of model being used
languageLanguage setting ("en" or "ml")
Methods
Method new()
Create a new DenseEmbedder
Usage
DenseEmbedder$new(
dimension = 100,
model_path = NULL,
model_type = "tfidf",
sentence_embedder = NULL,
auto_download = FALSE,
language = "en"
)Arguments
dimensionVector dimension (default: 100 for word2vec, 50/100/200/300 for GloVe)
model_pathOptional path to pre-trained model file
model_typeType: "word2vec", "glove", "glove-pretrained", or "tfidf"
sentence_embedderOptional SentenceEmbedder object to use
auto_downloadAuto-download GloVe vectors if model_type is glove-pretrained
languageLanguage behavior ("en" = ASCII-focused, "ml" = Unicode-aware)
Method embed()
Embed texts to vectors