GitHub Trends

#java #big_data #data_science #database #databases #datalake #distributed_database #distributed_databases #distributed_systems #hadoop #hive #jdbc #presto #prestodb #query_engine #sql #trino

https://github.com/trinodb/trino

GitHub

GitHub - trinodb/trino: Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL…

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io) - trinodb/trino

1.26K views12:05

GitHub Trends

#java #analytics #big_data #cloudnative #database #datalake #delta_lake #distributed_database #hudi #iceberg #join #lakehouse #lakehouse_platform #mpp #olap #real_time_analytics #real_time_updates #realtime_database #sql #star_schema #vectorized

StarRocks is a very fast query engine for analyzing data quickly, even in just a second. It works 3 times faster than other similar tools and doesn't require you to move or change your data. Here are some key benefits:
- It uses advanced technology to speed up queries.
- It supports standard SQL and works with various clients and BI software.
- It optimizes complex queries efficiently.
- It allows real-time updates and direct access to data from different sources.
- It manages resources well and is easy to maintain and scale.

Using StarRocks can help you analyze data much faster and more efficiently, making your work easier and quicker.

https://github.com/StarRocks/starrocks

GitHub

GitHub - StarRocks/starrocks: The world's fastest open query engine for sub-second analytics both on and off the data lakehouse.…

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class perf...

436 views00:00

GitHub Trends

#python #ai #csv #data #data_analysis #data_science #data_visualization #database #datalake #gpt_4 #llm #pandas #sql #text_to_sql

PandaAI is a tool that lets you ask questions about your data using natural language. It's helpful for both non-technical and technical users. Non-technical users can interact with data more easily, while technical users can save time and effort. You can load your data, save it as a dataframe, and then ask questions like "Which are the top 5 countries by sales?" or "What is the total sales for the top 3 countries?" PandaAI also allows you to visualize charts and work with multiple datasets. It's easy to install using pip or poetry and can be used in Jupyter notebooks, Streamlit apps, or even a secure Docker sandbox. This makes it simpler and more efficient to analyze your data.

https://github.com/sinaptik-ai/pandas-ai

GitHub

GitHub - sinaptik-ai/pandas-ai: Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational…

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG. - sinaptik-ai/pandas-ai

535 views12:30

GitHub Trends

#java #ai_catalog #data_catalog #datalake #federated_query #lakehouse #metadata #metalake #model_catalog #opendatacatalog #skycomputing #stratosphere

Apache Gravitino is a powerful tool for managing metadata across different sources and regions. It's available under the Apache 2.0 license, which means you can use it freely for any purpose, including commercial projects. You can modify and distribute the software as needed. This flexibility allows businesses to integrate Gravitino into their systems without worrying about royalties or strict usage restrictions. The benefit to users is that they can easily manage complex data environments while having full control over how they use and customize the software.

https://github.com/apache/gravitino

GitHub

GitHub - apache/gravitino: World's most powerful open data catalog for building a high-performance, geo-distributed and federated…

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake. - apache/gravitino

421 views13:00

About

Blog

Apps

Platform