GitHub Trends
10.1K subscribers
15.3K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
#python #text_to_speech #tts #vits #voice_clone #voice_cloneai #voice_cloning

GPT-SoVITS-WebUI is a powerful tool for converting text to speech and changing voices. Here’s what it offers:
- **Zero-shot TTS:** You can convert text to speech instantly with just a 5-second vocal sample.
- **Few-shot TTS:** You can fine-tune the model with about 1 minute of training data for a closer voice match.
- **Cross-lingual support:** It works in several languages including English, Japanese, Korean, Cantonese, and Chinese.
- **WebUI Tools:** It includes tools like voice separation, automatic training set segmentation, and text labeling, making it easier to create and use the models.

Using GPT-SoVITS-WebUI benefits you by allowing quick and easy voice conversions and text-to-speech functions with high quality and flexibility.

https://github.com/RVC-Boss/GPT-SoVITS
#python #applicant_tracking_system #ats #hacktoberfest #machine_learning #natural_language_processing #nextjs #python #resume #resume_builder #resume_parser #text_similarity #typescript #vector_search #word_embeddings

Resume Matcher is a free and open-source tool that helps you tailor your resume to a job description. It uses AI to extract important keywords from the job description and matches them with your resume, improving its readability and making it more likely to pass through applicant tracking systems (ATS). Here’s how it benefits you: it analyzes your resume and job descriptions, identifies key terms, and suggests improvements to increase your chances of getting noticed by employers. This tool is easy to install and use, and it's available for free, making it a valuable resource for anyone looking to enhance their job application process.
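
To make the keyword-matching idea concrete, here is a minimal sketch using only the Python standard library. It is not Resume Matcher's actual code: the stop-word list, the function names, and the sample texts are all hypothetical, and plain word counts stand in for the word embeddings and vector search mentioned in the tags.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = {"a", "an", "and", "the", "to", "of", "in", "for",
              "with", "on", "is", "are", "we", "you", "our"}


def keywords(text: str) -> Counter:
    """Lowercase the text, split it into words, and count the non-stop-words."""
    words = re.findall(r"[a-z0-9+#]+", text.lower())
    return Counter(w for w in words if w not in STOP_WORDS)


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def missing_keywords(job: Counter, resume: Counter) -> list[str]:
    """Job-description terms that never appear in the resume."""
    return sorted(w for w in job if w not in resume)


job = keywords("We are looking for a Python developer with SQL and Docker experience")
resume = keywords("Python developer experienced in SQL and data analysis")

score = cosine_similarity(job, resume)   # overlap between the two texts
gaps = missing_keywords(job, resume)     # candidate terms to add to the resume
print(f"match score: {score:.2f}, missing: {gaps}")
```

Exact word matching treats "experience" and "experienced" as different terms, which is one reason a real tool would likely lean on embeddings or stemming instead.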

https://github.com/srbhr/Resume-Matcher
#typescript #agent #bigquery #charts #duckdb #genbi #gpt #llm #openai #postgresql #rag #reporting #spreadsheets #sql #sqlai #text_to_sql #text2sql

Wren AI is a free, open-source tool that helps you get insights from your data easily. You can ask questions in any language, and it will generate the necessary SQL queries to get the answers. It integrates well with tools like Excel and Google Sheets, making it easy to analyze and visualize your data. Wren AI also suggests follow-up questions to help you dig deeper into your data without needing to write code. This makes it simpler for anyone, regardless of their technical skills, to understand and use their data effectively.

https://github.com/Canner/WrenAI
#cplusplus #aarch64 #android #arm32 #asr #cpp #csharp #dotnet #ios #lazarus #linux #macos #mfc #object_pascal #onnx #raspberry_pi #risc_v #speech_to_text #text_to_speech #vits #windows

sherpa-onnx supports speech functions such as speech recognition, text-to-speech, speaker identification, and more. It works on multiple platforms including Android, iOS, Windows, macOS, and Linux, and supports several programming languages such as C++, Python, JavaScript, and others. You can run it locally or in the browser through WebAssembly, making it versatile and convenient. This lets you integrate advanced speech capabilities into your projects easily, regardless of the platform or programming language you use.

https://github.com/k2-fsa/sherpa-onnx
#python #ai #csv #data #data_analysis #data_science #data_visualization #database #datalake #gpt_4 #llm #pandas #sql #text_to_sql

PandaAI is a tool that lets you ask questions about your data using natural language. It's helpful for both non-technical and technical users. Non-technical users can interact with data more easily, while technical users can save time and effort. You can load your data, save it as a dataframe, and then ask questions like "Which are the top 5 countries by sales?" or "What is the total sales for the top 3 countries?" PandaAI also allows you to visualize charts and work with multiple datasets. It's easy to install using pip or poetry and can be used in Jupyter notebooks, Streamlit apps, or even a secure Docker sandbox. This makes it simpler and more efficient to analyze your data.
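
As a rough illustration of what answering those two sample questions involves, the sketch below writes by hand the kind of SQL a text-to-SQL layer might produce and runs it with Python's built-in `sqlite3` module. This is not PandaAI's API or its generated output: the `sales` table, its columns, and the figures are made up for the example.

```python
import sqlite3

# Hypothetical sales data held in an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (country TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [
        ("United States", 5000),
        ("United Kingdom", 3200),
        ("France", 2900),
        ("Germany", 4100),
        ("Italy", 2300),
        ("Spain", 2100),
        ("Japan", 2600),
    ],
)

# "Which are the top 5 countries by sales?"
top5 = conn.execute(
    """
    SELECT country, SUM(amount) AS total
    FROM sales
    GROUP BY country
    ORDER BY total DESC
    LIMIT 5
    """
).fetchall()

# "What is the total sales for the top 3 countries?"
top3_total = conn.execute(
    """
    SELECT SUM(total) FROM (
        SELECT SUM(amount) AS total
        FROM sales
        GROUP BY country
        ORDER BY total DESC
        LIMIT 3
    )
    """
).fetchone()[0]

print(top5)
print(top3_total)
```

The point of a natural-language layer is that the user types only the question and never sees or writes the query.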

https://github.com/sinaptik-ai/pandas-ai
#python #artificial_intelligence #llm #python #real_time #speech_to_text #text_to_speech

FastRTC is a Python library that helps you create real-time audio and video streams using WebRTC or WebSockets. It allows you to turn any Python function into a live stream, making it useful for applications like voice chats or video conferencing. Key features include automatic voice detection, built-in UI support with Gradio, and integration with FastAPI for custom frontends. This library simplifies the process of handling real-time communication, allowing developers to focus on their application logic rather than complex streaming setups.
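
The idea of wrapping an ordinary function so that it replies whenever the speaker pauses can be sketched with the standard library alone. This is not FastRTC's implementation or API: `respond_after_pause`, the energy threshold, and the frame format are assumptions made for illustration, and a simple loudness check stands in for real voice detection.

```python
import math
from typing import Callable, Iterable, Iterator, List

Frame = List[float]  # one short chunk of audio samples in the range [-1.0, 1.0]


def rms(frame: Frame) -> float:
    """Root-mean-square energy of a frame (0.0 for an empty frame)."""
    return math.sqrt(sum(s * s for s in frame) / len(frame)) if frame else 0.0


def respond_after_pause(
    frames: Iterable[Frame],
    handler: Callable[[Frame], Frame],
    threshold: float = 0.1,   # assumed energy level that counts as speech
    pause_frames: int = 2,    # assumed number of quiet frames that end a turn
) -> Iterator[Frame]:
    """Buffer loud frames and call `handler` once the speaker goes quiet."""
    speech: Frame = []
    quiet = 0
    for frame in frames:
        if rms(frame) >= threshold:
            speech.extend(frame)
            quiet = 0
        elif speech:
            quiet += 1
            if quiet >= pause_frames:
                yield handler(speech)
                speech, quiet = [], 0
    if speech:  # flush whatever is left when the stream ends
        yield handler(speech)


def echo(audio: Frame) -> Frame:
    """Example handler: send back exactly what was heard."""
    return audio


loud, silent = [0.5, -0.5], [0.0, 0.0]
incoming = [silent, loud, loud, silent, silent, loud, silent]
replies = list(respond_after_pause(incoming, echo))
print([len(r) for r in replies])
```

Here the handler is a plain Python function, and the wrapper decides when to call it, which is the division of labor the description above attributes to the library.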

https://github.com/freddyaboulton/fastrtc
#python #ai #artificial_intelligence #cython #data_science #deep_learning #entity_linking #machine_learning #named_entity_recognition #natural_language_processing #neural_network #neural_networks #nlp #nlp_library #python #spacy #text_classification #tokenization

spaCy is a powerful tool for understanding and processing human language. It helps computers analyze text by breaking it into parts like words, sentences, and entities (like names or places). This makes it useful for tasks such as identifying who is doing what in a sentence or finding specific information from large texts. Using spaCy can save time and improve accuracy compared to manual analysis. It supports many languages and integrates well with advanced models like BERT, making it ideal for real-world applications.

https://github.com/explosion/spaCy
#cplusplus #desktop_environment #multiplexer #terminal #text_based #windows_console

**vtm** is a special program that turns your terminal into a text-based desktop. It lets you run many console applications in separate windows, which can be moved and resized like regular windows. This makes it easier to organize and use multiple tools at once. **vtm** works on Windows, Linux, macOS, and other Unix-like systems. Using **vtm**, users can have a more organized workspace with floating windows for different tasks, making it simpler to manage multiple applications simultaneously.

https://github.com/directvt/vtm
#rust #code_editor #developer_tools #rust #text_editor #vim

Lapce is a very fast and powerful code editor written in Rust. It offers smart features like code completion and diagnostics thanks to its built-in Language Server Protocol (LSP) support. Users can enjoy a Vim-like editing experience and work on remote projects as if they were local. Lapce also includes a built-in terminal for running commands without leaving the editor. This makes it easy for developers to work efficiently and stay focused on their projects.

https://github.com/lapce/lapce
#python #3d #3d_aigc #3d_generation #diffusion_models #hunyuan3d #image_to_3d #shape #shape_generation #text_to_3d #texture_generation

Hunyuan3D 2.0 is a powerful tool that creates detailed 3D models with textures in two steps: first building the shape, then adding colors and materials. It works efficiently on standard computers (as low as 5GB VRAM for basic models) and offers multiple ways to use it, like coding, Blender plugins, or online demos, making it accessible for creating game-ready 3D assets, VR/AR content, or custom designs without needing advanced hardware.

https://github.com/Tencent/Hunyuan3D-2
#rust #gpui #rust_lang #text_editor #zed

Zed is a fast and powerful code editor that helps you work better with your team. It's built using Rust, which makes it very efficient, especially with large files. Zed allows real-time collaboration, so you can work on code with others at the same time. It also supports AI tools like GitHub Copilot and has features like a built-in terminal and a REPL for testing code quickly. This means you can code faster and more smoothly, and it's easy to customize to fit your needs.

https://github.com/zed-industries/zed
#python #apple_silicon #audio_processing #mlx #multimodal #speech_recognition #speech_synthesis #speech_to_text #text_to_speech #transformers

MLX-Audio is a powerful tool for converting text into speech and speech into new audio. It works well on Apple Silicon devices, like M-series chips, making it fast and efficient. You can choose from different languages and voices, and even adjust how fast the speech is. It also includes a web interface where you can see audio in 3D and play your own files. This tool is helpful for making audiobooks, interactive media, and personal projects because it's easy to use and provides high-quality audio quickly.

https://github.com/Blaizzy/mlx-audio
#python #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation

LTX-Video is a powerful AI model that creates high-quality, realistic videos in real time, running faster than you can watch them. It can generate videos from text descriptions, images, or existing videos, and supports advanced features like keyframe animation and video extension. You can use it online or run it locally with easy setup. It offers great control over video details, smooth motion, and works well even on consumer hardware. This helps you quickly create custom videos for storytelling, social media, or prototyping, saving time and boosting creativity with detailed, lifelike results[2][4][5].

https://github.com/Lightricks/LTX-Video
#python #comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation

ComfyUI-LTXVideo is a tool that helps create high-quality videos from images using AI. It offers features like key frame control, improved video quality, and faster generation speeds. This means you can make smooth videos with fewer errors and more control over how they look. It also supports commercial use, so you can use the videos for business projects. The tool is designed to work well with consumer-grade GPUs, making it accessible to more users. Overall, it helps you create professional-looking videos quickly and easily.

https://github.com/Lightricks/ComfyUI-LTXVideo
#python #ai #ai_art #art #asset_generator #chatbot #deep_learning #desktop_app #image_generation #mistral #multimodal #privacy #pygame #pyside6 #python #self_hosted #speech_to_text #stable_diffusion #text_to_image #text_to_speech #text_to_speech_app

AI Runner is a tool that lets you use AI on your own computer without needing the internet. It can do many things like **voice chatbots**, **text-to-image** generation, and **image editing**. You can also make AI personalities for more interesting conversations. Because everything runs locally, your data stays private. To use AI Runner, you need a good computer with a strong GPU, like an NVIDIA RTX 3060 or better, which keeps AI tasks fast.

https://github.com/Capsize-Games/airunner
#rust #editor #rust #terminal #text_editor

Microsoft Edit is a simple, modern text editor designed for easy use, especially for those not familiar with command-line tools. It combines the classic feel of the old MS-DOS Editor with a fresh interface like Visual Studio Code, supporting mouse use, multiple files, find and replace (including regex), and word wrap. It’s lightweight (under 250KB) and built in Rust, making it fast and efficient. You can install it easily on Windows using WinGet. This editor is great for quick, basic text editing tasks without needing complex setups or third-party tools, making editing straightforward and accessible for everyone[1].

https://github.com/microsoft/edit
#python #audiobook #audiobooks #content_creation #content_creator #epub_converter #kokoro #kokoro_82m #kokoro_tts #media_generation #narrator #speech_synthesis #subtitles #text_to_audio #text_to_speech #tts #voice_synthesis

Abogen is a user-friendly tool that quickly converts ePub, PDF, or text files into natural-sounding audio with synchronized subtitles, perfect for creating audiobooks or voiceovers for social media and other projects. You can customize speech speed, choose or mix voices, generate subtitles by sentence or word, and select various audio and subtitle formats. It supports batch processing with queue mode and lets you save chapters separately or merged. Installation is straightforward on Windows, Mac, and Linux, with options for GPU acceleration. This saves you time and effort in producing high-quality audio content from text files efficiently.

https://github.com/denizsafak/abogen
#python #chatbi #deepseek #llm #nl2sql #rag #sqlbot #text_to_sql #text2sql

SQLBot is an easy-to-use intelligent system that turns natural language questions into SQL queries using advanced AI models and retrieval-augmented generation (RAG). You just need to set up your AI model and data source to start asking questions about your data. It integrates smoothly with other business systems and AI platforms, making it simple to add smart data querying to your apps. It also ensures data security with workspace-based resource isolation and fine-grained access control. You can quickly deploy it on a Linux server using Docker, enabling fast, secure, and intelligent data interaction without needing deep SQL knowledge. This saves you time and improves data accessibility.

https://github.com/dataease/SQLBot
#python #text_to_speech #tts #voice_clone #zero_shot_tts

OpenVoice is a free, open-source tool that lets you clone any voice using just a short audio sample, then generate speech in that voice across many languages and accents[1][5][8]. You can fine-tune how the voice sounds—adjusting emotion, accent, rhythm, pauses, and intonation—to match your needs[1][3][5]. A major benefit is “zero-shot” cloning: you can make the cloned voice speak languages it was never trained on, which is rare in voice AI[1][3][4]. The latest version, OpenVoice V2, offers even better sound quality, supports six major languages natively, and is free for both personal and commercial use[1]. This makes it easy and affordable for anyone to create realistic, customizable voice content without needing technical expertise or expensive software.

https://github.com/myshell-ai/OpenVoice
#typescript #ai #cloudflare_workers #coding_agent #durable_objects #text_to_app #vibe_coding

Cloudflare VibeSDK lets you create AI-powered web apps by simply describing what you want in natural language. It’s open source and runs on Cloudflare’s platform, providing secure, isolated environments to build, preview, and deploy apps quickly with one click. You can customize AI behavior, control code patterns, and keep data private. It’s great for companies, startups, or teams who want to build apps without deep coding skills, speeding up development and enabling non-technical users to create tools easily. The platform supports live previews, chat-based interaction, and integrates with GitHub for smooth workflows.

https://github.com/cloudflare/vibesdk