GitHub Trends
10.1K subscribers
15.3K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
Download Telegram
#python #ai_translation #dubbing #localization #video_translation #voice_cloning

VideoLingo is a powerful tool that helps translate, localize, and dub videos, making them understandable across different languages. It uses advanced technologies like WhisperX for accurate subtitle recognition and GPT for high-quality translations. The tool ensures single-line subtitles, similar to those on Netflix, and offers dubbing alignment for a more natural viewing experience. You can use it online, in Google Colab, or install it locally on your computer. This makes it easier to share videos globally without language barriers, enhancing global knowledge sharing and communication.

https://github.com/Huanshere/VideoLingo
#python #deep_learning #glow_tts #hifigan #melgan #multi_speaker_tts #python #pytorch #speaker_encoder #speaker_encodings #speech #speech_synthesis #tacotron #text_to_speech #tts #tts_model #vocoder #voice_cloning #voice_conversion #voice_synthesis

The new version of TTS (Text-to-Speech) from Coqui.ai, called TTSv2, is now available with several improvements. It supports 16 languages and has better performance overall. You can fine-tune the models using the provided code and examples. The TTS system can now stream audio with less than 200ms latency, making it very responsive. Additionally, you can use over 1,100 Fairseq models and new features like voice cloning and voice conversion. This update also includes faster inference with the Tortoise model and support for multiple speakers and languages. These enhancements make it easier and more efficient to generate high-quality speech from text.

https://github.com/coqui-ai/TTS
#python #audiobooks #chinese #docker #english #epub #gradio #linux #mac #multilingual #tts #voice_cloning #windows #xtts

This tool converts eBooks into audiobooks with chapters and metadata, supporting 1124 languages and optional voice cloning. Here’s how it benefits you It converts eBooks in various formats (like `.epub`, `.pdf`, `.mobi`) into audiobooks with high-quality text-to-speech using tools like Calibre, ffmpeg, and XTTSv2.
- **Multilingual Support** You can clone your own voice or use default voices for the audiobook.
- **User-Friendly Interface** You can run it on your local machine or use Docker for consistent results across different environments.
- **Free Resources**: There are options to use free resources like Google Colab or rent a GPU for faster processing.

Make sure to use this tool responsibly with non-DRM, legally acquired eBooks.

https://github.com/DrewThomasson/ebook2audiobook
#python #text_to_speech #tts #vits #voice_clone #voice_cloneai #voice_cloning

GPT-SoVITS-WebUI is a powerful tool for converting text to speech and changing voices. Here’s what it offers** You can convert text to speech instantly with just a 5-second vocal sample.
- **Few-shot TTS** It works in several languages including English, Japanese, Korean, Cantonese, and Chinese.
- **WebUI Tools:** It includes tools like voice separation, automatic training set segmentation, and text labeling, making it easier to create and use the models.

Using GPT-SoVITS-WebUI benefits you by allowing quick and easy voice conversions and text-to-speech functions with high quality and flexibility.

https://github.com/RVC-Boss/GPT-SoVITS