Skip to main content
Go to documentation:
⌘U
Weaviate Database

Develop AI applications using Weaviate's APIs and tools

Deploy

Deploy, configure, and maintain Weaviate Database

Query Agent

Run agentic search over your Weaviate Cloud collections

Weaviate Cloud

Manage and scale Weaviate in the cloud

Engram

Persistent memory for LLM agents and applications

Additional resources

Integrations
Weaviate Academy

Need help?

Weaviate LogoAsk AI Assistant⌘K
Support
Community Forum
Contributor guide

Choose a model

On this page, you can find a list of pre-trained models designed specifically for enterprise retrieval tasks in English and other languages. Additional models and features will be added in the future, so please check back regularly for updates.

How to choose the right model?

Here are some simple recommendations on when you should use a specific model:

Text embedding models

Multimodal model

  • ModernVBERT/colmodernvbert Best for visual document retrieval where you want to search document images (PDFs, slides, invoices) using text queries. This model embeds documents directly as images, eliminating the need for OCR or text extraction pipelines.

Below, you can find a complete list of all available models.


Available models

Text embedding models

Snowflake/snowflake-arctic-embed-l-v2.0 (default)

  • A 568M parameter, 1024-dimensional model for multilingual enterprise retrieval tasks.
  • Trained with Matryoshka Representation Learning to allow vector truncation with minimal loss.
  • Quantization-friendly: Using scalar quantization and 256 dimensions provides 99% of unquantized, full-precision performance.
  • Read more at the Snowflake blog, and the Hugging Face model card
  • Allowable dimensions: 1024 (default), 256

Snowflake/snowflake-arctic-embed-m-v1.5

  • A 109M parameter, 768-dimensional model for enterprise retrieval tasks in English.
  • Trained with Matryoshka Representation Learning to allow vector truncation with minimal loss.
  • Quantization-friendly: Using scalar quantization and 256 dimensions provides 99% of unquantized, full-precision performance.
  • Read more at the Snowflake blog, and the Hugging Face model card
  • Allowable dimensions: 768 (default), 256
Input truncation

Currently, input exceeding the model's context windows is truncated from the right (i.e. the end of the input).

Multimodal embedding models

Weaviate Embeddings also offers multimodal models for visual document retrieval tasks. These models generate embeddings from document images (PDFs, slides, invoices converted to images) that can be searched with text queries.

ModernVBERT/colmodernvbert

  • A 250M parameter late-interaction vision-language encoder, fine-tuned for visual document retrieval tasks.
  • Generates multi-vector embeddings (ColBERT-style late-interaction) from document images and text queries.
  • Ideal for getting documents directly into Weaviate without heavy preprocessing - no OCR or text extraction required.
  • State-of-the-art performance in its size class, matching models up to 10x larger.
  • Query token limit: 8,092 tokens
  • Read more at the Hugging Face model card
  • For integration details, see Weaviate Embeddings: Multimodal
MUVERA encoding recommended

Enable MUVERA encoding to reduce memory usage while preserving retrieval quality.


Vectorizer parameters

  • model (optional): The name of the model to use for embedding generation.
  • dimensions (optional): The number of dimensions to use for the generated embeddings.
  • base_url (optional): The base URL for the Weaviate Embeddings service. (Not required in most cases.)

The following examples show how to configure Weaviate Embeddings-specific options.

py docs  API docs
More infoCode snippets in the documentation reflect the latest client library and Weaviate Database version. Check the Release notes for specific versions.

If a snippet doesn't work or you have feedback, please open a GitHub issue.
from weaviate.classes.config import Configure

client.collections.create(
"DemoCollection",
vector_config=[
Configure.Vectors.text2vec_weaviate(
name="title_vector",
source_properties=["title"],
model="Snowflake/snowflake-arctic-embed-m-v1.5",
# Further options
# dimensions=256
# base_url="<custom_weaviate_embeddings_url>",
)
],
# Additional parameters not shown
)

Additional resources

Support

If you use Weaviate Cloud (Database cluster(s) or Weaviate product in the cloud) or have a self-hosted support package, open a ticket in the Support Portal or email Weaviate support directly. To add a support plan, contact Weaviate sales.

Use the Support Portal for direct help from the Weaviate team: open and track tickets, and we'll respond in line with your support plan. The Community Forum is open to everyone, and a great place to ask questions, get help with your cluster, and connect with other developers. For all the ways to get help, see the Support overview.