Uncompressed vector embeddings

Compression by Default

Starting with v1.33, you can set a default quantization for new collections using the DEFAULT_QUANTIZATION environment variable. This variable is not set by default, meaning no quantization is applied unless you explicitly configure it. When set (e.g., to 8-bit RQ quantization), all newly created collections will use that quantization setting. Note that once set on a collection, quantization can't be disabled. Default quantization won't be applied to a collection if the index type isn't supported (for example PQ and SQ aren't supported for the flat index).

You can opt-out of using vector quantization to compress your vector data.

Disable compression for new collection

When creating the collection, you can choose not to use quantization through the collection definition:

API docs

More info

from weaviate.classes.config import Configure, Property, DataType

client.collections.create(
    name="MyCollection",
    vector_config=Configure.Vectors.text2vec_openai(
        quantizer=Configure.VectorIndex.Quantizer.none()
    ),
    properties=[
        Property(name="title", data_type=DataType.TEXT),
    ],
)

Additional considerations

Multiple vector embeddings (named vectors)

Collections can have multiple named vectors. The vectors in a collection can have their own configurations, and compression must be enabled independently for each vector. Every vector is independent and can use PQ, BQ, RQ, SQ, or no compression.

Multi-vector embeddings (ColBERT, ColPali, etc.)

Added in v1.30

Multi-vector embeddings (implemented through models like ColBERT, ColPali, or ColQwen) represent each object or query using multiple vectors instead of a single vector. Just like with single vectors, multi-vectors support PQ, BQ, RQ, SQ, or no compression.

During the initial search phase, compressed vectors are used for efficiency. However, when computing the MaxSim operation, uncompressed vectors are utilized to ensure more precise similarity calculations. This approach balances the benefits of compression for search efficiency with the accuracy of uncompressed vectors during final scoring.

Multi-vector performance

RQ supports multi-vector embeddings. Each token vector is rounded up to a multiple of 64 dimensions, which may result in less than 4x compression for very short vectors. This is a technical limitation that may be addressed in future versions.

Further resources

Questions and feedback

If you have any questions or feedback, let us know in the user forum.

Technical questions

If you have questions feel free to post on our Community forum.

Documentation feedback

Leave feedback by opening a GitHub issue.

Additional resources

Need help?

Uncompressed vector embeddings

Disable compression for new collection

Additional considerations

Multiple vector embeddings (named vectors)

Multi-vector embeddings (ColBERT, ColPali, etc.)

Further resources

Questions and feedback

Additional resources

Need help?

Disable compression for new collection​

Additional considerations​

Multiple vector embeddings (named vectors)​

Multi-vector embeddings (ColBERT, ColPali, etc.)​

Further resources​

Questions and feedback​

Disable compression for new collection

Additional considerations

Multiple vector embeddings (named vectors)

Multi-vector embeddings (ColBERT, ColPali, etc.)

Further resources

Questions and feedback