audio-tokenizer

Here are 6 public repositories matching this topic...

OpenMOSS / MOSS-TTS-Nano

MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run directly on CPU without a GPU, and keeps the deployment stack simple enough for local demos, web serving, and lightweight product integration.

multilingual realtime tts english chinese streaming-audio multi-modality voice-clone audio-tokenizer

Updated Apr 17, 2026
Python

OpenMOSS / MOSS-TTS

MOSS-TTS Family is an open-source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high-fidelity, high-expressiveness, and complex real-world scenarios, covering stable long-form speech, multi-speaker dialogue, voice/character design, environmental sound effects, and real-time streaming TTS.

audio text-to-speech multimodal voice-cloning llm audio-tokenizer

Updated Apr 13, 2026
Python

alibaba / unified-audio

An Open-Source Project to Unify Audio Processing and Generation

codec voice-conversion audio-processing text-to-audio audio-restoration audio-generation audio-tokenizer audio-llm

Updated Apr 7, 2026
Python

qiuk2 / AAR

[Official Implementation] Acoustic Autoregressive Modeling 🔥

efficient-inference audio-tokenizer autoregressive-generation next-scale-prediction

Updated Aug 24, 2024
Python

pujariaditya / HiggsAudiov2TokenizerUnofficial

Unofficial PyTorch implementation of Higgs Audio V2 Tokenizer with HuBERT semantic features. Complete training pipeline for semantic-acoustic audio tokenization with 960x downsampling and 8-layer RVQ.

pytorch audio-synthesis speech-processing audio-processing vector-quantization dac semantic-features hubert audio-generation neural-audio-codec rvq audio-tokenizer neural-codec higgs-audio speech-tokenization

Updated Oct 8, 2025
Python

madderangelfoodcake950 / OpenMOSS

Build and manage self-organizing multi-agent systems for OpenClaw to improve automation and coordination in complex workflows.

audio text-to-speech practice robotics introduction humanoid-robots multimodal voice-cloning llm vision-language-model embodied-intelligence audio-tokenizer

Updated Apr 20, 2026
Vue
