Query Agent
The Weaviate Cloud (WCD) Query Agent provides and interactive console for using the Weaviate Query Agent, a pre-built agentic service designed to answer natural language queries based on the data stored in Weaviate Cloud.
The user simply provides a prompt/question in natural language, and the Query Agent takes care of all intervening steps to provide an answer.
Query a collection
In order to use the Query Agent, log into your Weaviate Cloud account and select Agents
in the sidebar panel.
Additional parameters for querying
In addition to selecting one or more collections to query, you can also:
- Specify API keys for external model providers (like OpenAI, Anthropic, etc.)
- Select which vectors in a collection to query (if there are multiple vectors in the collection)
- Specify a system prompt that provides additional information or instructions to the agent (for example, setting the tone, language, etc.)
Generate code snippets
After executing a query, you can also generate a code snippet that performs the same task through one of the available Weaviate Agents client libraries, either Python or TypeScript.
Usage and limitations
Usage limits
The current usage limit is 500 Query Agent queries per day per Weaviate Cloud organization for free. This limit may change in the near future.
Custom collection descriptions
The Query Agent makes use of each collection's description
metadata as well as individual property descriptions in deciding what collection to query.
Both collection descriptions and property descriptions can be updated after the collection has been created. For detailed instructions on updating collection and property descriptions, see the update collection definition documentation.
We are investigating an ability to specify a custom collection description at runtime.
Execution times
The Query Agent performs multiple operations to translate a natural language query into Weaviate queries, and to process the response.
This typically requires multiple calls to generative models (e.g. LLMs) and multiple queries to Weaviate.
As a result, each Query Agent run may take some time to complete. Depending on the query complexity, it may not be uncommon to see execution times of ~10 seconds.
Multi-tenancy
The WCD Query Agent does not support multi-tenancy. In order to utilize multi-tenancy with the Query Agent, use a client library.
Further resources
Support
For help with Serverless Cloud, Enterprise Cloud, and Bring Your Own Cloud accounts, contact Weaviate support directly to open a support ticket. To add a support plan, contact Weaviate sales.
If you have any questions or feedback, let us know in the user forum.