Embeddings
Embeddings convert text or other data into vector representations for machine learning and natural language processing tasks.
embeddings​
The embeddings section in your configuration specifies one or more embedding models for your datasets.
Example:
embeddings:
- from: huggingface:huggingface.co/sentence-transformers/all-MiniLM-L6-v2:latest
name: text_embedder
params:
max_length: '128'
datasets:
- my_text_dataset
from​
The from field specifies the source of the embedding model. It supports the following prefixes:
huggingface:huggingface.co- Models from Hugging Facefile:- Local file pathsopenai- OpenAI models
Follows the same convention as models.from.
name​
A unique identifier for this embedding component.
files​
Optional. A list of files associated with this model. Each file has:
path: The path to the filename: Optional. A name for the filetype: Optional. The type of the file (automatically determined if not specified)
Follows the same convention as models.files.
params​
Optional. A map of key-value pairs for additional parameters specific to the embedding model.
dependsOn​
Optional. A list of dependencies that must be loaded and available before this embedding model.
