Skip to content

Hybrid Search

Search, Filter, and Discover in Real Time. All in One Query.

CrateDB unifies full-text, vector, and structured search, enabling real-time hybrid search across any data, with millisecond response times. Modern search isn’t just about keywords, it’s about meaning, context, and relevance. CrateDB combines full-text search with vector similarity and classic SQL filters in one unified engine. Whether you’re powering intelligent applications, IoT monitoring, or AI-enhanced analytics, CrateDB delivers instant, precise results from any data type.

Full-text search with SQL simplicity

CrateDB supports full-text indexing and search directly within SQL queries. Use the MATCH() function to find relevant documents, log entries, or descriptions instantly. No need for an external search system.

Example use cases:

  • Search sensor logs for specific error messages
  • Filter IoT events containing certain keywords
  • Combine text search with numerical or temporal filters
Why it matters: You get the power of a search engine, with the simplicity of SQL and the scalability of a distributed database.
cr-quote-image

Vector search for semantic relevance

CrateDB brings vector similarity search natively into its query engine. Using the KNN_MATCH() function, you can find semantically similar results based on embeddings, ideal for AI and machine learning use cases.

Use cases include:

  • Recommendation engines based on user embeddings
  • Semantic document or image retrieval
  • Intelligent anomaly detection in multidimensional data

Highlights:

  • Store and query vector data alongside relational or JSON data
  • Combine similarity search with SQL filters for full context
  • Query results in milliseconds, even at high dimensionality
cr-quote-image

True hybrid search

What sets CrateDB apart is the ability to combine MATCH() and KNN_MATCH() in the same query, alongside standard SQL filters, joins, or aggregations. This enables complex, hybrid search scenarios where textual, numerical, and semantic dimensions interact seamlessly.

SELECT id, title, score
FROM articles
WHERE MATCH(content, 'renewable energy')
ORDER BY KNN_MATCH(embedding, [0.23, 0.41, 0.67]) LIMIT 10;

This example blends keyword relevance with semantic similarity to surface the most meaningful results; instantly and at scale.

cr-quote-image

Search meets real-time data

Because CrateDB continuously indexes incoming data, your search results always reflect the latest state, even as new records are ingested. No manual refreshes, no sync lag between systems.

Key benefits:

  • Real-time visibility into logs, documents, or sensor data
  • Fresh and accurate search results within seconds of ingestion
  • Unified query layer for analytics, AI, and discovery
cr-quote-image

Why choose CrateDB for hybrid search

Traditional search systems CrateDB unified engine
Separate engines for text, vector, and SQL One system for all search types
Complex data synchronization Automatic indexing and consistency
Minutes to update search indexes Query fresh data in milliseconds
cr-quote-image

CrateDB architecture guide

This comprehensive guide covers all the key concepts you need to know about CrateDB's architecture. It will help you gain a deeper understanding of what makes it performant, scalable, flexible and easy to use. Armed with this knowledge, you will be better equipped to make informed decisions about when to leverage CrateDB for your data projects. 

CrateDB-Architecture-Guide-Cover

Additional resources

Want to learn more?