#java #apache_flink #cdc #change_data_capture #database #flink_cdc #flink_connectors
https://github.com/ververica/flink-cdc-connectors
https://github.com/ververica/flink-cdc-connectors
GitHub
GitHub - apache/flink-cdc: Flink CDC is a streaming data integration tool
Flink CDC is a streaming data integration tool. Contribute to apache/flink-cdc development by creating an account on GitHub.
#java #airflow #azkaban #dataworks #davinci #etl #flink #governance #griffin #hadoop #hive #hue #kettle #linkis #scriptis #spark #supperset #tableau #visualis #workflow #zeppelin
https://github.com/WeBankFinTech/DataSphereStudio
https://github.com/WeBankFinTech/DataSphereStudio
GitHub
GitHub - WeBankFinTech/DataSphereStudio: DataSphereStudio is a one stop data application development& management portal, covering…
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, ...
#scala #etl_pipeline #flink #one_stop_solution #spark #streaming #streaming_warehouse #streamx
https://github.com/streamxhub/streamx
https://github.com/streamxhub/streamx
GitHub
GitHub - apache/incubator-streampark: StreamPark, Make stream processing easier! easy-to-use streaming application development…
StreamPark, Make stream processing easier! easy-to-use streaming application development framework and operation platform - GitHub - apache/incubator-streampark: StreamPark, Make stream processing ...
#java #big_data #data_integration #data_lake #data_pipeline #data_synchronization #flink #high_performance #real_time
https://github.com/bytedance/bitsail
https://github.com/bytedance/bitsail
GitHub
GitHub - bytedance/bitsail: BitSail is a distributed high-performance data integration engine which supports batch, streaming and…
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever...
#java #batch #cdc #change_data_capture #data_integration #data_pipeline #distributed #elt #etl #flink #kafka #mysql #paimon #postgresql #real_time #schema_evolution
Flink CDC is a tool that helps you move and transform data in real-time or in batches. It makes data integration simple by using YAML files to describe how data should be moved and transformed. This tool offers features like full database synchronization, table sharding, schema evolution, and data transformation. To use it, you need to set up an Apache Flink cluster, download Flink CDC, create a YAML file to define your data sources and sinks, and then run the job. This benefits you by making it easier to manage and integrate your data efficiently across different databases.
https://github.com/apache/flink-cdc
Flink CDC is a tool that helps you move and transform data in real-time or in batches. It makes data integration simple by using YAML files to describe how data should be moved and transformed. This tool offers features like full database synchronization, table sharding, schema evolution, and data transformation. To use it, you need to set up an Apache Flink cluster, download Flink CDC, create a YAML file to define your data sources and sinks, and then run the job. This benefits you by making it easier to manage and integrate your data efficiently across different databases.
https://github.com/apache/flink-cdc
GitHub
GitHub - apache/flink-cdc: Flink CDC is a streaming data integration tool
Flink CDC is a streaming data integration tool. Contribute to apache/flink-cdc development by creating an account on GitHub.