GitHub Trends
10.1K subscribers
15.3K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
Download Telegram
#python #ai4science #document_analysis #extract_data #layout_analysis #ocr #parser #pdf #pdf_converter #pdf_extractor_llm #pdf_extractor_pretrain #pdf_extractor_rag #pdf_parser #python

MinerU is a tool that converts PDFs into machine-readable formats like markdown or JSON. Here are the key benefits and features MinerU removes headers, footers, and other unnecessary elements to ensure the text is semantically coherent and in human-readable order, even for complex layouts.
- **Structure Preservation** It extracts images, image descriptions, tables, and table titles.
- **Formula Conversion** Recognizes tables and converts them to LaTeX or HTML format.
- **OCR Support** Supports multiple output formats and various visualization results.
- **GPU and CPU Compatibility**: Works on both CPU and GPU environments, compatible with Windows, Linux, and Mac.

You can try MinerU through an online demo, a quick CPU demo, or by using a GPU for faster processing. For detailed usage, refer to the command line options, API integration, and deployment guides provided.

https://github.com/opendatalab/MinerU
#typescript

The es-toolkit is a powerful and efficient JavaScript library that provides many useful functions like debounce, delay, chunk, sum, and pick. It is designed for high performance and has a very small bundle size, reducing your JavaScript code by up to 97%. The library also supports TypeScript with strong type annotations and is thoroughly tested for reliability. Using es-toolkit can make your coding faster and more efficient, helping you write cleaner and more performant code.

https://github.com/toss/es-toolkit
#powershell #dotnet #dotnet_core

The .NET release notes provide important information about different versions of .NET, such as their release dates, support status, and end-of-support dates. For example, .NET 9 will be released on November 12, 2024, and will be supported until May 12, 2026. You can also find the latest patch versions for each release. This information helps users plan and manage their projects by knowing when to update or switch to a new version, ensuring they have the latest security patches and features. Additionally, you can follow GitHub discussions for the latest news and updates on .NET releases.

https://github.com/dotnet/core
#typescript #design_systems #material_design #react #react_components

Material UI is a powerful library of React components that follows Google's Material Design system. It has been developed and tested by thousands of contributors over more than a decade, making it reliable and trustworthy. This library helps you build user interfaces quickly and consistently, with a wide range of pre-built components. It also has extensions like MUI X for advanced use cases and Toolpad for building full-stack applications. Using Material UI benefits you by providing a well-tested and maintained set of components, saving you time and effort in developing your projects.

https://github.com/mui/material-ui
#go

Lazydocker is a simple terminal UI for managing Docker and Docker Compose. It helps you manage your containers and services in one window, making it easier to view logs, metrics, and perform common tasks like restarting or removing containers. You can install it using Homebrew, Scoop, Chocolatey, or by downloading a binary release. The tool simplifies managing multiple terminal windows and memorizing Docker commands, allowing you to focus on your work with keybindings and custom commands at your fingertips. This makes your development process faster and more efficient.

https://github.com/jesseduffield/lazydocker
#typescript #animation #animation_js #motion #react #reactanimations #spring

Motion is a free and open-source library that helps you create animations in JavaScript and React. It combines the power of JavaScript animations with the performance of native browser APIs, making your animations smooth and efficient. You can easily install it using `npm install motion` and start animating elements with simple code. For example, in JavaScript, you can animate an element by using `animate("#box", { x 100 }} />`. This makes it easy to add dynamic and engaging animations to your web projects, enhancing the user experience.

https://github.com/motiondivision/motion
#python #build_tools #hpc #hpsf #linux #macos #package_manager #python #radiuss #scientific_computing #spack

Spack is a tool that helps you install and manage different versions of software on your computer. It works on many operating systems like Linux, macOS, and Windows, and even on supercomputers. The best part is that it doesn't break existing installations, so you can have multiple versions of the same software running at the same time. Spack uses a simple syntax to specify what you want to install, and it's easy to contribute to the project if you want to add new features or packages. This makes it very flexible and useful for managing complex software setups.

https://github.com/spack/spack
#go #containers #event_driven #kubernetes #microservice #microservices #pubsub #serverless #sidecar #state_management

Dapr is a tool that helps developers build strong, scalable, and reliable microservices easily. It works with any programming language and framework, making it very flexible. Dapr adds a small side-car to your application that handles complex tasks like state management, messaging, and event triggers, so you don't have to worry about these details. This means you can focus on writing your code without getting bogged down in low-level details. Dapr also runs on various platforms, including cloud, edge devices, and local machines, making it very versatile. Overall, Dapr simplifies building distributed applications and makes them more consistent and portable.

https://github.com/dapr/dapr
#csharp #dotnet #hacktoberfest #help_wanted

The .NET Runtime repository contains the code for building the .NET runtime, libraries, and installers for various platforms. This project allows you to contribute by reporting issues, testing nightly builds, and even fixing bugs. You can learn how to use .NET through tutorials and documentation, and there are resources for deploying apps and getting support. Reporting security issues is also straightforward through a private email process. The project is open-source under the MIT license, making it free to use and modify. By participating, you can help improve .NET and be part of a global community of developers.

https://github.com/dotnet/runtime
#java #data_catalog #data_discovery #data_governance #datahub #metadata

DataHub is a free, open-source platform that helps you find and understand your data. It acts like a catalog for all your data, making it easier to discover, manage, and use. Built by Acryl Data and LinkedIn, DataHub supports modern data stacks and offers features like real-time metadata graphs, integration with various tools, and a user-friendly interface. You can try it out with a hosted demo or follow the quickstart guide to set it up locally. Joining the community through Slack or attending town hall meetings can also help you stay updated and connected with other users. This tool is beneficial because it simplifies data management, enhances collaboration, and improves overall data visibility within your organization.

https://github.com/datahub-project/datahub
#css #angular #charts #components #datagrid #datatable #mit #typescript #ui

PrimeNG is a useful tool for developers. You can check its latest version and download statistics on npm. It also has a continuous integration process to ensure it works well. If you have questions, you can join the Discord chat or participate in GitHub discussions. The PrimeNG website offers demos and detailed documentation to help you get started easily. This makes it easier for you to use PrimeNG effectively and stay updated with the community.

https://github.com/primefaces/primeng
👍2
#shell #atproto #bluesky #self_hosted

You can host your own Personal Data Server (PDS) for the Bluesky social media app. This lets you control your data and connect to the larger Bluesky network. To set it up, you need a server with a public IP address and DNS configuration. Follow the steps to open firewall ports, configure DNS, and install the necessary software. Once set up, you can use the Bluesky app with your PDS, create accounts, and even set up email verification. Keeping your PDS updated is important to avoid issues. This gives you more control over your data and how you interact with the Bluesky network.

https://github.com/bluesky-social/pds
#go #certificates #chrome #firefox #https #ios #linux #local_development #localhost #macos #root_ca #tls #windows

mkcert is a simple tool that helps you create and use trusted certificates for local development without any complicated setup. It automatically installs a local Certificate Authority (CA) on your system, which allows you to generate certificates that your browser and other tools will trust. This is especially useful because using real certificates for development can be risky or impossible for certain hosts like `localhost` or `example.test`. With mkcert, you can avoid trust errors and manage your own CA easily, making your development process smoother and more secure.

https://github.com/FiloSottile/mkcert
#typescript #backstage #cncf #developer_experience #developer_portal #dx #infrastructure #microservices #self_service_portal

Backstage is a free and open-source tool that helps developers manage their software and infrastructure more efficiently. It centralizes all your tools, services, and documentation into one place, making it easier to develop and deploy high-quality code quickly. With Backstage, you can manage microservices, libraries, and other software projects, use pre-made templates for new projects, and easily create and find technical documentation. It also has many customizable plugins to fit your needs. Created by Spotify and now supported by the Cloud Native Computing Foundation, Backstage simplifies your development environment and enhances team productivity.

https://github.com/backstage/backstage
1
#jupyter_notebook #gemini #gemini_api #generative_ai #google #google_cloud #google_gemini #langchain #llm #palm_api #vertex_ai #vertex_ai_gemini_api #vertexai

This repository helps you use and develop generative AI with Google Cloud. It includes notebooks, code samples, and apps for various tasks like image generation, chatbots, and language models. You can find resources for building search engines, conversational AI, and more using Vertex AI. The repository also provides setup instructions and learning materials. This helps you quickly start and manage generative AI projects, making it easier to create innovative solutions.

https://github.com/GoogleCloudPlatform/generative-ai
1
#go #api_management #circuit_breaker #consul #enforce_policies #envoy #fault_injection #kubernetes #lyft_envoy #microservice #microservices #nomad #polyglot_microservices #proxies #request_routing #resiliency #service_mesh

Istio is a tool that helps manage and connect different parts of your application easily. It adds security, load balancing, and monitoring without needing to change much of your existing code. Istio uses components like Envoy proxies to handle traffic between services and Istiod for service discovery and configuration. This makes it easier to integrate microservices, enforce policies, and collect data. Using Istio benefits you by simplifying how your services communicate, enhancing security, and improving overall efficiency.

https://github.com/istio/istio
#c_lang #c #embedded #graphics #gui #mcu #microcontroller #tft

LVGL (Light and Versatile Graphics Library) is a popular, free, and open-source library for creating beautiful user interfaces (UIs) on various microcontrollers and display types. Here’s what you need to know LVGL offers over 30 built-in widgets, a powerful style system, web-inspired layout managers, and support for multiple languages.
- **Portable and Efficient** Detailed documentation with over 100 examples helps you get started quickly. You can also try out demos directly in your browser.
- **Community Support** LVGL can be integrated into various platforms like Arduino, PlatformIO, Zephyr, ESP-IDF, and more.

Using LVGL benefits you by providing a robust and flexible tool to create modern GUIs efficiently, with extensive community support and resources to help you every step of the way.

https://github.com/lvgl/lvgl
#typescript #asana #collaboration #goal_tracking #golang #hacktoberfest #kanban_board #notion #project #project_management #trello

Focalboard is a free, open-source tool for managing projects, similar to Trello, Notion, and Asana. It helps you organize and track work for yourself or your team. You can use it as a standalone desktop app for personal projects on Windows, Mac, or Linux, or set up a personal server for multiple users. To get started, you can download the app from app stores or follow the installation guides on the website. This tool is beneficial because it allows you to define, organize, and manage your work efficiently, all while being customizable and self-hosted.

https://github.com/mattermost-community/focalboard
#other #nas #python

This tool helps you manage your media library on a NAS (Network Attached Storage) device. You can install it using Docker, download an executable file, or use a package for Synology devices. The tool is open-source and has a wiki for guidance. Using this tool makes managing your media files easier and more organized, saving you time and effort. It also supports different platforms like amd64 and arm64, making it versatile for various users.

https://github.com/NAStool/nas-tools
#cplusplus #cpp #debug #debugger #debugger_gdb #debugger_visualizer #debugger_visualizers #debuggers #debugging #debugging_tool #debugging_tools #gdb #gdb_frontend #gdb_mi #gdb_ui #gdbserver #gnu_debugger #qt #qt5 #qt6

Seer is a user-friendly graphical interface (GUI) for the debugger gdb on Linux. It makes debugging easier with a simple and pleasing GUI. You can install Seer using package managers or build it from source, ensuring you have the necessary requirements like Linux, C++17, gdb, CMake, and QT6. Seer offers various features such as viewing source code, variables, registers, breakpoints, and assembly code. It also includes tools like a memory visualizer, array visualizer, and struct visualizer to help you understand your program's data better. This tool simplifies the debugging process, making it more efficient and intuitive for users.

https://github.com/epasveer/seer