Skip to main content
Go to documentation:
⌘U
Weaviate Database

Develop AI applications using Weaviate's APIs and tools

Deploy

Deploy, configure, and maintain Weaviate Database

Weaviate Agents

Build and deploy intelligent agents with Weaviate

Weaviate Cloud

Manage and scale Weaviate in the cloud

Additional resources

Academy
Integrations
Contributor guide

Need help?

Weaviate LogoAsk AI Assistant⌘K
Community Forum

Choose a model

On this page, you can find a list of pre-trained models designed specifically for enterprise retrieval tasks in English and other languages. Additional models and features will be added in the future, so please check back regularly for updates.

How to choose the right model?

Here are some simple recommendations on when you should use a specific model:

Below, you can find a complete list of all available models.


Available models

Snowflake/snowflake-arctic-embed-l-v2.0 (default)

  • A 568M parameter, 1024-dimensional model for multilingual enterprise retrieval tasks.
  • Trained with Matryoshka Representation Learning to allow vector truncation with minimal loss.
  • Quantization-friendly: Using scalar quantization and 256 dimensions provides 99% of unquantized, full-precision performance.
  • Read more at the Snowflake blog, and the Hugging Face model card
  • Allowable dimensions: 1024 (default), 256

Snowflake/snowflake-arctic-embed-m-v1.5

  • A 109M parameter, 768-dimensional model for enterprise retrieval tasks in English.
  • Trained with Matryoshka Representation Learning to allow vector truncation with minimal loss.
  • Quantization-friendly: Using scalar quantization and 256 dimensions provides 99% of unquantized, full-precision performance.
  • Read more at the Snowflake blog, and the Hugging Face model card
  • Allowable dimensions: 768 (default), 256
Input truncation

Currently, input exceeding the model's context windows is truncated from the right (i.e. the end of the input).

Vectorizer parameters

  • model (optional): The name of the model to use for embedding generation.
  • dimensions (optional): The number of dimensions to use for the generated embeddings.
  • base_url (optional): The base URL for the Weaviate Embeddings service. (Not required in most cases.)

The following examples show how to configure Weaviate Embeddings-specific options.

from weaviate.classes.config import Configure

client.collections.create(
"DemoCollection",
vectorizer_config=[
Configure.NamedVectors.text2vec_weaviate(
name="title_vector",
source_properties=["title"],
model="Snowflake/snowflake-arctic-embed-m-v1.5",
# Further options
# dimensions=256
# base_url="<custom_weaviate_embeddings_url>",
)
],
# Additional parameters not shown
)

Additional resources

Support & feedback

For help with Serverless Cloud, Enterprise Cloud, and Bring Your Own Cloud accounts, contact Weaviate support directly to open a support ticket. To add a support plan, contact Weaviate sales.

If you have any questions or feedback, let us know in the user forum.