GitHub repos

enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
Language: Python
#audio_lm #pytorch #text_to_speech #tts #vall_e #valle
Stars: 212 Issues: 2 Forks: 32
https://github.com/enhuiz/vall-e

GitHub

GitHub - enhuiz/vall-e: An unofficial PyTorch implementation of the audio LM VALL-E

An unofficial PyTorch implementation of the audio LM VALL-E - GitHub - enhuiz/vall-e: An unofficial PyTorch implementation of the audio LM VALL-E

👍5😐1

2.67K views11:04

GitHub repos

netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice

GitHub

GitHub - netease-youdao/EmotiVoice: EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine - netease-youdao/EmotiVoice

👍1

2.15K views11:21

GitHub repos

jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python
#acoustic #audio_representation #codec #dac #encodec #gpt4o #music_representation_learning #semantic #soundstream #speech_language_model #speech_representation #text_to_speech
Stars: 332 Issues: 6 Forks: 20
https://github.com/jishengpeng/WavTokenizer

GitHub

GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language…

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling - GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 4...

2.39K views04:00

GitHub repos

lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx

GitHub

GitHub - lucasnewman/f5-tts-mlx: Implementation of F5-TTS in MLX

Implementation of F5-TTS in MLX. Contribute to lucasnewman/f5-tts-mlx development by creating an account on GitHub.

1.76K views16:00

GitHub repos

edwko/OuteTTS
Interface for OuteTTS models.
Language: Python
#gguf #llama #text_to_speech #transformers #tts
Stars: 278 Issues: 6 Forks: 13
https://github.com/edwko/OuteTTS

GitHub

GitHub - edwko/OuteTTS: Interface for OuteTTS models.

Interface for OuteTTS models. Contribute to edwko/OuteTTS development by creating an account on GitHub.

1.79K views11:00

GitHub repos

isaiahbjork/orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
Language: Python
#ai #python #text_to_speech #tts
Stars: 263 Issues: 15 Forks: 50
https://github.com/isaiahbjork/orpheus-tts-local

GitHub

GitHub - isaiahbjork/orpheus-tts-local: Run Orpheus 3B Locally With LM Studio

Run Orpheus 3B Locally With LM Studio. Contribute to isaiahbjork/orpheus-tts-local development by creating an account on GitHub.

👍1

1.71K views16:00

GitHub repos

nari-labs/dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
Language: Python
#ai #open_weight #text_to_speech
Stars: 2047 Issues: 8 Forks: 92
https://github.com/nari-labs/dia

GitHub

GitHub - nari-labs/dia: A TTS model capable of generating ultra-realistic dialogue in one pass.

A TTS model capable of generating ultra-realistic dialogue in one pass. - nari-labs/dia

1.66K views10:00

GitHub repos

superstarryeyes/lue
Terminal eBook Reader with Text-to-Speech
Language: Python
#book #cli #doc #docx #ebook #epub #modular #pdf #reader #terminal #text_to_speech #tts #txt #voice
Stars: 325 Issues: 2 Forks: 9
https://github.com/superstarryeyes/lue

GitHub

GitHub - superstarryeyes/lue: Terminal eBook Reader with Text-to-Speech

Terminal eBook Reader with Text-to-Speech. Contribute to superstarryeyes/lue development by creating an account on GitHub.

1.59K views04:00

GitHub repos

High-Logic/Genie
GPT-SoVITS ONNX Inference Engine & Model Converter
Language: Python
#gpt_sovits #text_to_speech #tts #vits #voice_clone #voice_cloning
Stars: 212 Issues: 1 Forks: 10
https://github.com/High-Logic/Genie

GitHub

GitHub - High-Logic/Genie-TTS: GPT-SoVITS ONNX Inference Engine & Model Converter

GPT-SoVITS ONNX Inference Engine & Model Converter - High-Logic/Genie-TTS

❤1

1.42K views10:00

GitHub repos

wildminder/ComfyUI-VoxCPM
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
Language: Python
#ai_voice #audio #comfyui_node #t2s #text_to_speech #tts #voice_cloning #voice_generation
Stars: 198 Issues: 2 Forks: 21
https://github.com/wildminder/ComfyUI-VoxCPM

GitHub

GitHub - wildminder/ComfyUI-VoxCPM: ComfyUI node for highly expressive speech and realistic zero-shot voice cloning

ComfyUI node for highly expressive speech and realistic zero-shot voice cloning - wildminder/ComfyUI-VoxCPM

❤2

1.48K views16:00

GitHub repos

supertone-inc/supertonic
Lightning-fast, on-device TTS — running natively via ONNX.
Language: Swift
#cpp #csharp #go #ios #java #lightweight #nodejs #on_device #python #rust #swift #text_to_speech #tt #tts #web
Stars: 263 Issues: 5 Forks: 18
https://github.com/supertone-inc/supertonic

GitHub

GitHub - supertone-inc/supertonic: Lightning-Fast, On-Device TTS — running natively via ONNX.

Lightning-Fast, On-Device TTS — running natively via ONNX. - supertone-inc/supertonic

1.35K views17:00

GitHub repos

nari-labs/dia2
TTS model capable of streaming conversational audio in realtime.
Language: Python
#open_weight #text_to_speech
Stars: 245 Issues: 3 Forks: 25
https://github.com/nari-labs/dia2

GitHub

GitHub - nari-labs/dia2: TTS model capable of streaming conversational audio in realtime.

TTS model capable of streaming conversational audio in realtime. - nari-labs/dia2

❤1

1.3K views05:00

About

Blog

Apps

Platform