A few years ago, data lakes became an early standard for storing large volumes of data and running business analytics on them. Today, requirements have grown, and real-time access at scale is the new normal. This is where CrateDB comes in.
In essence, data lakes built on cloud object stores are optimized for storing large data volumes but struggle with real-time analytics at scale. CrateDB is a modern component that enhances and accelerates analytical performance for Hadoop, Azure Data Lake, AWS S3, and more.
In a traditional data lake architecture:
Such a data lake infrastructure can be simplified and accelerated for analytics by adding a database that offers fast, scalable data ingestion and sub-second queries on large data sets, with the benefits and simplicity of standard SQL.
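To make the "standard SQL" point concrete, here is a minimal sketch of how an application might send an analytical query to CrateDB over its HTTP SQL endpoint. It assumes a node reachable at `localhost:4200` (the default HTTP port). The table `sensor_readings` and its columns are hypothetical names chosen for illustration, not part of any real deployment.

```python
import json
from urllib import request

# Assumption: a CrateDB node is reachable locally on its default HTTP port.
CRATE_SQL_URL = "http://localhost:4200/_sql"

# Hypothetical table and column names, used only for illustration.
TOP_SENSORS = (
    "SELECT sensor_id, AVG(reading) AS avg_reading "
    "FROM sensor_readings "
    "WHERE ts > ? "
    "GROUP BY sensor_id "
    "ORDER BY avg_reading DESC "
    "LIMIT 10"
)


def build_sql_request(stmt, args=None, url=CRATE_SQL_URL):
    """Build (but do not send) a POST request carrying a SQL statement."""
    payload = {"stmt": stmt}
    if args is not None:
        # Positional parameters are bound to the `?` placeholders in order.
        payload["args"] = list(args)
    return request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_sql(stmt, args=None, url=CRATE_SQL_URL):
    """Send the statement to the endpoint and return the decoded JSON reply."""
    with request.urlopen(build_sql_request(stmt, args, url)) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With a running cluster, `run_sql(TOP_SENSORS, ["2024-01-01T00:00:00Z"])` would return the JSON response for the aggregation. Keeping request construction separate from sending makes the payload easy to inspect without a live database.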
We are introducing a modern architecture in which CrateDB augments existing data stores, tools, and applications while simplifying the stack and greatly expanding data accessibility and interoperability with surrounding systems.
In addition, CrateDB can be integrated with existing legacy architecture for archiving (e.g., a cold store) and data science processing (e.g., Spark) housed in legacy data lakes.
CrateDB's modern architecture and database solution give users real-time analytical performance across several data sources through scalable SQL, on a cost-effective platform integrated into Microsoft Azure or other hyperscaler cloud environments.