#python #airflow #airflow_operators #aws #aws_ec2 #aws_s3 #aws_sdk #cassandra #cassandra_database #cloudformation #cluster #data #data_engineering #data_engineering_pipeline #data_lake #data_modeling #data_warehouse #etl_pipeline #infrastructure #postgres #postgresql_database
https://github.com/san089/Udacity-Data-Engineering-Projects
san089/Udacity-Data-Engineering-Projects: Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
#scala #etl_pipeline #flink #one_stop_solution #spark #streaming #streaming_warehouse #streamx
https://github.com/streamxhub/streamx
apache/incubator-streampark: StreamPark, Make stream processing easier! An easy-to-use streaming application development framework and operation platform.
#python #etl_pipeline #llm_platform #unstructured_data
Unstract is a tool that helps you extract data from unstructured documents using large language models (LLMs). It has a no-code platform where you can develop and test prompts to get the data you need. Here’s how it benefits you:
- **No-Code Extraction**: You can automate the extraction of data from complex documents without needing to write code.
- **Prompt Studio**: You can set up workflows in three simple steps to deploy APIs or ETL pipelines, automating critical business processes.
- **Integration with Various Tools**: Unstract supports multiple LLM providers, vector databases, embedding models, and text extractors, making it versatile and compatible with many systems.
Overall, Unstract saves time and effort by simplifying the process of extracting valuable data from unstructured documents.
https://github.com/Zipstack/unstract
Zipstack/unstract: No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents