enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
Language: Python
#audio_lm #pytorch #text_to_speech #tts #vall_e #valle
Stars: 212 Issues: 2 Forks: 32
https://github.com/enhuiz/vall-e
  
  An unofficial PyTorch implementation of the audio LM VALL-E, WIP
Language: Python
#audio_lm #pytorch #text_to_speech #tts #vall_e #valle
Stars: 212 Issues: 2 Forks: 32
https://github.com/enhuiz/vall-e
GitHub
  
  GitHub - enhuiz/vall-e: An unofficial PyTorch implementation of the audio LM VALL-E
  An unofficial PyTorch implementation of the audio LM VALL-E  - GitHub - enhuiz/vall-e: An unofficial PyTorch implementation of the audio LM VALL-E
π5π1
  netease-youdao/EmotiVoice
EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
  
  EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
GitHub
  
  GitHub - netease-youdao/EmotiVoice: EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine
  EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine - netease-youdao/EmotiVoice
π1
  jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python
#acoustic #audio_representation #codec #dac #encodec #gpt4o #music_representation_learning #semantic #soundstream #speech_language_model #speech_representation #text_to_speech
Stars: 332 Issues: 6 Forks: 20
https://github.com/jishengpeng/WavTokenizer
  
  SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python
#acoustic #audio_representation #codec #dac #encodec #gpt4o #music_representation_learning #semantic #soundstream #speech_language_model #speech_representation #text_to_speech
Stars: 332 Issues: 6 Forks: 20
https://github.com/jishengpeng/WavTokenizer
GitHub
  
  GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio languageβ¦
  [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling  - GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 4...
  lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
  
  Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
GitHub
  
  GitHub - lucasnewman/f5-tts-mlx: Implementation of F5-TTS in MLX
  Implementation of F5-TTS in MLX. Contribute to lucasnewman/f5-tts-mlx development by creating an account on GitHub.
  edwko/OuteTTS
Interface for OuteTTS models.
Language: Python
#gguf #llama #text_to_speech #transformers #tts
Stars: 278 Issues: 6 Forks: 13
https://github.com/edwko/OuteTTS
  
  Interface for OuteTTS models.
Language: Python
#gguf #llama #text_to_speech #transformers #tts
Stars: 278 Issues: 6 Forks: 13
https://github.com/edwko/OuteTTS
GitHub
  
  GitHub - edwko/OuteTTS: Interface for OuteTTS models.
  Interface for OuteTTS models. Contribute to edwko/OuteTTS development by creating an account on GitHub.
  isaiahbjork/orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
Language: Python
#ai #python #text_to_speech #tts
Stars: 263 Issues: 15 Forks: 50
https://github.com/isaiahbjork/orpheus-tts-local
  
  Run Orpheus 3B Locally With LM Studio
Language: Python
#ai #python #text_to_speech #tts
Stars: 263 Issues: 15 Forks: 50
https://github.com/isaiahbjork/orpheus-tts-local
GitHub
  
  GitHub - isaiahbjork/orpheus-tts-local: Run Orpheus 3B Locally With LM Studio
  Run Orpheus 3B Locally With LM Studio. Contribute to isaiahbjork/orpheus-tts-local development by creating an account on GitHub.
π1
  nari-labs/dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
Language: Python
#ai #open_weight #text_to_speech
Stars: 2047 Issues: 8 Forks: 92
https://github.com/nari-labs/dia
  
  A TTS model capable of generating ultra-realistic dialogue in one pass.
Language: Python
#ai #open_weight #text_to_speech
Stars: 2047 Issues: 8 Forks: 92
https://github.com/nari-labs/dia
GitHub
  
  GitHub - nari-labs/dia: A TTS model capable of generating ultra-realistic dialogue in one pass.
  A TTS model capable of generating ultra-realistic dialogue in one pass. - nari-labs/dia
  superstarryeyes/lue
Terminal eBook Reader with Text-to-Speech
Language: Python
#book #cli #doc #docx #ebook #epub #modular #pdf #reader #terminal #text_to_speech #tts #txt #voice
Stars: 325 Issues: 2 Forks: 9
https://github.com/superstarryeyes/lue
  
  Terminal eBook Reader with Text-to-Speech
Language: Python
#book #cli #doc #docx #ebook #epub #modular #pdf #reader #terminal #text_to_speech #tts #txt #voice
Stars: 325 Issues: 2 Forks: 9
https://github.com/superstarryeyes/lue
GitHub
  
  GitHub - superstarryeyes/lue: Terminal eBook Reader with Text-to-Speech
  Terminal eBook Reader with Text-to-Speech. Contribute to superstarryeyes/lue development by creating an account on GitHub.
  High-Logic/Genie
GPT-SoVITS ONNX Inference Engine & Model Converter
Language: Python
#gpt_sovits #text_to_speech #tts #vits #voice_clone #voice_cloning
Stars: 212 Issues: 1 Forks: 10
https://github.com/High-Logic/Genie
  
  GPT-SoVITS ONNX Inference Engine & Model Converter
Language: Python
#gpt_sovits #text_to_speech #tts #vits #voice_clone #voice_cloning
Stars: 212 Issues: 1 Forks: 10
https://github.com/High-Logic/Genie
GitHub
  
  GitHub - High-Logic/Genie: GPT-SoVITS ONNX Inference Engine & Model Converter
  GPT-SoVITS ONNX Inference Engine & Model Converter - High-Logic/Genie
β€1
  wildminder/ComfyUI-VoxCPM
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
Language: Python
#ai_voice #audio #comfyui_node #t2s #text_to_speech #tts #voice_cloning #voice_generation
Stars: 198 Issues: 2 Forks: 21
https://github.com/wildminder/ComfyUI-VoxCPM
  
  ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
Language: Python
#ai_voice #audio #comfyui_node #t2s #text_to_speech #tts #voice_cloning #voice_generation
Stars: 198 Issues: 2 Forks: 21
https://github.com/wildminder/ComfyUI-VoxCPM
GitHub
  
  GitHub - wildminder/ComfyUI-VoxCPM: ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
  ComfyUI node for highly expressive speech and realistic zero-shot voice cloning - wildminder/ComfyUI-VoxCPM
β€2