Long-term store¶
Never retire data just because your other systems can’t handle the cardinality.
CrateDB stores large volumes of data, keeping it accessible for querying and insightful analysis, even considering historic data records.
Many organizations need to retain data for years or decades to meet regulatory requirements, support historical analysis, or preserve valuable insights for future use. However, traditional storage systems force you to choose between accessibility and affordability, often leading to data exports, archival systems, or downsampling that sacrifice query capabilities.
CrateDB eliminates this trade-off by storing large volumes of data efficiently while keeping it fully accessible for querying and analysis. Unlike systems that struggle with high cardinality or require expensive tiered architectures, CrateDB handles billions of unique records in a single platform, maintaining fast query performance even on historic datasets spanning years.
By keeping all your data in one place, you avoid the complexity and costs of exporting to specialized long-term storage systems, data lakes, or cold storage tiers. Your historical data remains as queryable as your recent data, enabling seamless analysis across any time range without data movement, ETL pipelines, or rehydration processes.
With CrateDB, compatible with PostgreSQL, you can do all of that using plain SQL. Other than integrating well with commodity systems using standard database access interfaces like ODBC or JDBC, it provides a proprietary HTTP interface on top.
Use cases¶
Metrics and monitoring
Prometheus and similar monitoring systems excel at real-time alerting but face challenges with long-term metric retention due to storage costs and query performance at scale. CrateDB addresses these challenges by providing:
Scalable long-term storage: Store years of metrics without compromising query performance.
High cardinality support: Handle millions of unique label combinations that would overwhelm traditional TSDBs.
Rich SQL analytics: Perform complex analytical queries on historic metrics using standard SQL.
Seamless integration: Use CrateDB’s Prometheus Adapter for transparent remote write/read operations.
OpenTelemetry and similar observability frameworks excel at generating rich telemetry data but face challenges with long-term retention due to storage scale and query complexity. CrateDB addresses these challenges by providing:
Scalable long-term storage: Store large volumes of telemetry through CrateDB’s distributed architecture.
Vendor-neutral ingestion: Use OpenTelemetry SDKs/agents and Telegraf to send telemetry into your CrateDB observability pipeline.
Rich SQL analytics: Run SQL/time-series queries, aggregations and joins on telemetry data for troubleshooting and analytics.
Flexible attribute mapping: Customize which span/log/profile attributes become columns/tags for dimensional queries.