GitHub Trends
10.1K subscribers
15.3K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
Download Telegram
#go #distributed_tracing #grafana

Grafana Tempo is a free, easy-to-use tool for tracking how different parts of your system interact. It works well with other tools like Grafana, Prometheus, and Loki, and it can handle data from various formats. Tempo is cost-efficient because it only needs object storage to run, making it cheap and robust. It also has a powerful query language called TraceQL that lets you easily find specific data. You can get started quickly with guides and examples, and there are community forums and Slack channels if you need help. This makes it easier for you to monitor and understand your system's performance.

https://github.com/grafana/tempo
#java #bigdata #data_encryption #data_pipeline #database #database_cluster #database_gateway #database_middleware #distributed_database #distributed_sql_database #distributed_transaction #encrypt #mysql #postgresql #read_write_splitting #shard #sql

Apache ShardingSphere is a powerful tool that helps manage and scale databases. It allows you to break down large databases into smaller pieces (sharding), handle more data traffic (scaling), and secure your data with encryption. This tool works with any database and provides a unified way for applications to interact with multiple databases as if they were one.

The benefits include Your database can handle more data and users without slowing down.
- **Improved Security** Applications only need to communicate with one standardized service, making it simpler to manage.
- **Flexibility**: You can customize the tool to fit your needs using its pluggable architecture.

Overall, Apache ShardingSphere makes managing and scaling databases much easier and more efficient.

https://github.com/apache/shardingsphere
#cplusplus #ai #api #audio_generation #distributed #gemma #gpt4all #image_generation #kubernetes #llama #llama3 #llm #mamba #mistral #musicgen #p2p #rerank #rwkv #stable_diffusion #text_generation #tts

LocalAI is a free, open-source alternative to OpenAI that you can run on your own computer or server. It allows you to generate text, images, and audio locally without needing a GPU. You can use it with various models and it supports multiple functionalities like text-to-audio, audio-to-text, and image generation. LocalAI is easy to set up using an installer script or Docker, and it has a user-friendly web interface. This tool is beneficial because it saves you money by not requiring cloud services and gives you full control over your data privacy. Plus, it's community-driven, so there are many resources and integrations available to help you get started and customize it to your needs.

https://github.com/mudler/LocalAI
#go #bigdata #cloud_native #distributed_systems #filesystem #go #golang #hdfs #object_storage #posix #redis #s3 #storage

JuiceFS is a powerful file system designed for cloud environments. It allows you to use massive cloud storage as if it were local storage, without changing your code. Here are the key benefits JuiceFS offers low latency and high throughput, making it suitable for big data, machine learning, and AI applications.
- **POSIX Compatibility** Supports Kubernetes and various object storage services like Amazon S3, Google Cloud Storage, and more.
- **Strong Consistency** Ensures data security and efficiency.
- **Shared Access**: Multiple clients can read and write files simultaneously.

Using JuiceFS, you can efficiently manage large amounts of data in the cloud, making it easier to integrate with various platforms and applications.

https://github.com/juicedata/juicefs
👍1👎1
#c_lang #ai #big_data #c #cloudberry #data_analysis #data_warehouse #database #distributed_database #greenplum #mpp #olap #postgres #postgresql #sql

Apache Cloudberry is a powerful, open-source database designed for large-scale data processing and analytics. It is built by the creators of Greenplum Database and uses a newer PostgreSQL kernel, making it suitable for data warehouses and AI/ML workloads. You can easily try it out using a Docker-based sandbox or build it from source on Linux or macOS. The community is active, with many channels for support, discussions, and contributions. This means you can get help quickly, share ideas, and even contribute to the project yourself. It's licensed under the Apache License, Version 2.0, making it free to use and modify. Overall, Apache Cloudberry offers advanced database capabilities and a supportive community, which can greatly benefit users needing robust data management solutions.

https://github.com/apache/cloudberry
#go #cloud #cloud_computing #cloud_native #distributed_systems #go #golang #message_bus #message_queue #messaging #microservices_architecture #nats_server

NATS is a simple, secure, and fast communication system for digital systems, services, and devices. It can run anywhere, from your own server to the cloud or even a small device like a Raspberry Pi. NATS helps make complex systems easier to design and operate. With over 40 different language implementations, it's versatile and widely supported. Using NATS benefits you by providing a reliable and secure way to communicate between different parts of your system, making your work more efficient and stable.

https://github.com/nats-io/nats-server
#elixir #blockchain #consensus #cryptography #distributed_systems #elixir #erlang #operating_system #p2p #protocol

Anoma is a protocol that allows you to run a node for a decentralized network. To use it, you need to install several dependencies depending on your operating system (Mac, Windows, or Linux). You can either download pre-built binaries or compile the code yourself. Here are the key steps Download the release for your platform, extract it, and run the `bin/anoma` command.
- **Compilation**: Install necessary tools like Git, CMake, Rust, and Elixir, then follow the compilation instructions.

Following development is easy through GitHub issues, project overviews, and research forums. This helps you stay updated on what's being worked on and how the project is progressing.

The benefit to you is that you can participate in and contribute to a cutting-edge decentralized network, staying informed about its development and helping shape its future.

https://github.com/anoma/anoma
#java #batch #cdc #change_data_capture #data_integration #data_pipeline #distributed #elt #etl #flink #kafka #mysql #paimon #postgresql #real_time #schema_evolution

Flink CDC is a tool that helps you move and transform data in real-time or in batches. It makes data integration simple by using YAML files to describe how data should be moved and transformed. This tool offers features like full database synchronization, table sharding, schema evolution, and data transformation. To use it, you need to set up an Apache Flink cluster, download Flink CDC, create a YAML file to define your data sources and sinks, and then run the job. This benefits you by making it easier to manage and integrate your data efficiently across different databases.

https://github.com/apache/flink-cdc
#go #cncf #distributed_tracing #hacktoberfest #jaeger #observability #opentelemetry #tracing

Jaeger is a tool that helps you understand how different parts of your software work together. It's like a map that shows where data goes and how long it takes to get there. This helps you find and fix problems faster. Jaeger is free and open source, meaning anyone can use and improve it. It's supported by a big community and has clear guides on how to get started and contribute. Using Jaeger can make your software run more smoothly and efficiently.

https://github.com/jaegertracing/jaeger
#java #analytics #big_data #cloudnative #database #datalake #delta_lake #distributed_database #hudi #iceberg #join #lakehouse #lakehouse_platform #mpp #olap #real_time_analytics #real_time_updates #realtime_database #sql #star_schema #vectorized

StarRocks is a very fast query engine for analyzing data quickly, even in just a second. It works 3 times faster than other similar tools and doesn't require you to move or change your data. Here are some key benefits:
- It uses advanced technology to speed up queries.
- It supports standard SQL and works with various clients and BI software.
- It optimizes complex queries efficiently.
- It allows real-time updates and direct access to data from different sources.
- It manages resources well and is easy to maintain and scale.

Using StarRocks can help you analyze data much faster and more efficiently, making your work easier and quicker.

https://github.com/StarRocks/starrocks