Distributed Database
CrateDB is a distributed database: data is stored on multiple nodes in a network (see also shared-nothing architecture). In a CrateDB cluster, data is distributed evenly across nodes through automatic rebalancing, and the distributed SQL query engine runs aggregations, JOINs, sub-selects, and ad-hoc queries at in-memory speed. CrateDB also integrates native full-text search, so you can store and query structured and unstructured data together. As a result, you no longer need separate SQL and search databases to manage tabular and non-tabular data.
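To make the idea of even data distribution concrete, here is a minimal Python sketch of hash-based sharding. It is an illustration under stated assumptions, not CrateDB's actual routing code: the node names, the shard count, the `shard_for` and `assign_shards_to_nodes` helpers, and the choice of SHA-256 are all hypothetical.

```python
import hashlib
from collections import Counter
from typing import Dict, List


def shard_for(routing_value: str, num_shards: int) -> int:
    """Map a routing value (for example a primary key) to a shard id.

    A stable hash is used so that the same value always lands on the
    same shard, no matter which node computes it.
    """
    digest = hashlib.sha256(routing_value.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


def assign_shards_to_nodes(num_shards: int, nodes: List[str]) -> Dict[int, str]:
    """Place shards on nodes round-robin so each node holds a similar share."""
    return {shard: nodes[shard % len(nodes)] for shard in range(num_shards)}


# Hypothetical three-node cluster holding a table split into six shards.
NODES = ["node-1", "node-2", "node-3"]
NUM_SHARDS = 6

placement = assign_shards_to_nodes(NUM_SHARDS, NODES)

# Route 9,000 made-up row keys and count how many rows each node stores.
rows_per_node = Counter(
    placement[shard_for(f"sensor-{i}", NUM_SHARDS)] for i in range(9000)
)

if __name__ == "__main__":
    for node in NODES:
        print(node, rows_per_node[node])
```

Because the hash spreads keys roughly uniformly over the shards, and the shards are spread evenly over the nodes, each node ends up with about a third of the rows. Rebalancing after a node joins or leaves would amount to recomputing the shard placement and moving the affected shards, which this sketch does not model.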
Benefits of a distributed database
- Performance and availability
- Cost-effectiveness
- Scalability
- Fault tolerance
- Data consistency
- Flexibility
Distributed SQL queries
CrateDB uses native SQL as its query language for data querying and manipulation. This reduces the learning curve and lets users focus on query logic rather than on the details of a distributed system or a proprietary query language. A key feature of CrateDB is its ability to handle a high volume of concurrent reads and writes efficiently, which is crucial in a distributed system.
Users can also write user-defined functions to manipulate data. Each SQL statement is translated into a series of processing steps that are optimized for efficiency. Execution is driven by logical and physical plans that guide data retrieval from the distributed nodes, and the execution layer distributes these plans across nodes for parallel processing.
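The parallel execution described above can be pictured as a scatter-gather pattern: each node aggregates its own rows, and a coordinating step merges the partial results. The Python sketch below illustrates that pattern for a query along the lines of `SELECT city, AVG(temperature) ... GROUP BY city`. It is a simplified model, not CrateDB's execution engine; the `Partial` type, the function names, and the sample rows are assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Dict, List, Tuple

Row = Tuple[str, float]  # (group key, value), e.g. (city, temperature)


@dataclass
class Partial:
    """Mergeable aggregate state: AVG is carried as (count, total)."""
    count: int = 0
    total: float = 0.0


def local_aggregate(rows: List[Row]) -> Dict[str, Partial]:
    """Node-local step: group the node's own rows and keep count and sum."""
    partials: Dict[str, Partial] = {}
    for key, value in rows:
        state = partials.setdefault(key, Partial())
        state.count += 1
        state.total += value
    return partials


def merge(partials_per_node: List[Dict[str, Partial]]) -> Dict[str, float]:
    """Coordinator step: combine partial states, then compute the averages."""
    merged: Dict[str, Partial] = {}
    for partials in partials_per_node:
        for key, state in partials.items():
            target = merged.setdefault(key, Partial())
            target.count += state.count
            target.total += state.total
    return {key: state.total / state.count for key, state in merged.items()}


def distributed_avg(node_rows: List[List[Row]]) -> Dict[str, float]:
    """Run the local step for every node in parallel, then merge the results."""
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(local_aggregate, node_rows))
    return merge(partials)


# Made-up rows, already spread over three hypothetical nodes.
NODE_ROWS: List[List[Row]] = [
    [("berlin", 20.0), ("vienna", 18.0)],
    [("berlin", 22.0)],
    [("vienna", 16.0), ("berlin", 24.0)],
]

averages = distributed_avg(NODE_ROWS)

if __name__ == "__main__":
    print(averages)
```

Each node returns a count and a sum rather than a finished average, because averages of averages are wrong when nodes hold different numbers of rows per group. Only the small partial states travel to the coordinator, not the raw rows.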
CrateDB’s query engine was designed from the outset to optimize data throughput and query performance, especially as the number of concurrent operations grows. This keeps query execution efficient and scalable in a distributed environment, so users can extract insights from, and act on, very large datasets quickly.
The engine's indexing techniques, real-time data ingestion, and real-time querying work together to deliver consistently high performance.
On-demand Workshop 2023
Introduction to CrateDB and its Architecture
Timestamp: 14:01–16:40
CrateDB at Berlin Buzzwords 2023
When milliseconds matter: maximizing query performance in CrateDB.
Timestamp: 1:00–1:28
CrateDB Architecture Guide
This comprehensive guide covers all the key concepts you need to know about CrateDB's architecture. It will help you gain a deeper understanding of what makes it performant, scalable, flexible and easy to use. Armed with this knowledge, you will be better equipped to make informed decisions about when to leverage CrateDB for your data projects.