StreamSets

About

The StreamSets Data Collector is a lightweight, powerful engine for building streaming, batch, and change data capture (CDC) pipelines that ingest and transform data from various sources.

Use it to run pipelines from sources such as Kafka, Oracle, Salesforce, JDBC, and Hive to destinations including Snowflake, Databricks, Amazon S3, and Azure Data Lake Storage (ADLS). It runs on-premises or in any cloud.

Learn

Use StreamSets with CrateDB

Learn how to create data streaming pipelines using CrateDB and the StreamSets Data Collector.

Data Stream Pipelines with CrateDB and StreamSets Data Collector