StreamSetsΒΆ
About
The StreamSets Data Collector is a lightweight, powerful engine for building streaming, batch, and change data capture (CDC) pipelines that ingest and transform data from various sources.
Use it to run pipelines from sources such as Kafka, Oracle, Salesforce, JDBC, and Hive to destinations including Snowflake, Databricks, Amazon S3, and Azure Data Lake Storage (ADLS). It runs on-premises or in any cloud.
Learn
Use StreamSet with CrateDB
Learn how to create data streaming pipelines using CrateDB and the StreamSets Data Collector.
