FunASR Ecosystem
Open-source projects and integrations powered by FunASR, SenseVoice, and Paraformer.
Video & Media Tools
5.7K stars
ASR-powered video clipping. Automatic subtitles, keyword and speaker-based clip extraction.
Video, Official
17.6K stars
Video translation tool. Uses FunASR for Chinese speech recognition with subtitle generation.
Video, Translation
10.4K stars
Gradio WebUI for TTS, voice cloning, and audio processing with ASR capabilities.
Audio, TTS
3.2K stars
AI video dubbing toolkit. Automatic speech recognition, translation, and voice cloning for multilingual dubbing.
Dubbing, Video
Voice Input & Desktop Apps
5.5K stars
PC voice input tool with offline recognition. Hold CapsLock to speak, release to paste. Powered by FunASR Paraformer.
Desktop, Voice Input
1.8K stars
Voice input for macOS and Windows. Hold a key, speak, and release; the text appears at the cursor. Uses SenseVoice via Sherpa ONNX.
Desktop, macOS
2.2K stars
Open-source Wispr Flow alternative. Desktop voice workflow integrating FunASR local models with configurable LLMs.
Desktop, Voice Input
1.2K stars
Multi-function desktop app with audio/video processing, image editing, and AI-enhanced speech transcription.
Desktop, Toolkit
704 stars
Privacy-first local voice input tool. Converts speech to text via hotkey and auto-types into any app. Supports MCP integration.
Voice Input, Privacy
323 stars
Real-time audio translation. Captures system audio and microphone input, runs SenseVoice for ASR, then streams the translation from an LLM.
Translation, Real-time
139 stars
High-performance offline Chinese voice input for Linux. Based on FunASR, with text displayed in about 0.1 s and IBus/Fcitx5 support.
Linux, Input Method
Voice Assistants & Agents
12.8K stars
Digital human agent framework connecting 2.5D/3D avatars with LLMs. Uses FunASR for real-time speech recognition.
Digital Human, Agent
7.1K stars
Chinese voice assistant / smart speaker on Raspberry Pi. Supports ChatGPT and brain-computer interaction. Uses FunASR as its ASR engine.
IoT, Assistant
6.9K stars
Coding agent you can drive from your phone, desktop, and CLI. Uses Paraformer and SenseVoice for speech recognition via Sherpa ONNX.
Coding, Agent
3.3K stars
Digital avatar conversational system. Combines ASR, LLM, and TTS for natural dialogue with virtual characters. Uses FunASR.
Digital Avatar, Dialogue
2.1K stars
Extract audio/video content into structured markdown notes. Uses FunASR for accurate transcription.
Notes, Productivity
1.7K stars
GPT-4o-style voice chatbot. Full ASR + LLM + TTS pipeline for natural voice conversations. Powered by FunASR.
Voice Chat, GPT-4o
AI Platforms & Frameworks
58K stars
TTS from 1 minute of voice data. Uses FunASR for training data annotation (automatic speech-to-text labeling).
TTS, Training
33K stars
Self-hosted OpenAI alternative. FunASR integration as speech-to-text backend (PR in review).
LLM, Self-hosted
12.5K stars
Voice and multimodal conversational AI framework. FunASR is available as a community STT integration.
Conversational AI
9.8K stars
Open-source audio, music, and speech generation toolkit from OpenMMLab. Uses FunASR for TTS evaluation and data processing.
Audio Toolkit, OpenMMLab
143K stars
LLM application development platform. FunASR is available as a speech-to-text provider via an OpenAI-compatible API.
LLM Platform
9.3K stars
Distributed inference framework with built-in FunASR speech recognition backend. Deploy ASR models as API services.
Inference, Distributed
5K stars
All-in-one AI digital human system for video synthesis, voice synthesis and cloning. Integrates FunASR for speech recognition.
Digital Human, AIGC
1.6K stars
Lightweight multimodal model combining vision, audio, and language understanding. Uses FunASR for its speech recognition module.
Multimodal, LLM
SenseVoice Community Extensions
894 stars
Enhanced SenseVoice with high-accuracy word-level timestamps. Same speed as original model.
Timestamps, SenseVoice
541 stars
API and WebSocket server for SenseVoice. Supports voice activity detection (VAD), real-time streaming, and speaker verification.
API, WebSocket
451 stars
Pseudo-streaming SenseVoice with hotword boosting. Low-latency near-realtime speech recognition.
Streaming, Hotwords
Cross-Platform Inference
5K+ stars
Cross-platform speech processing with ONNX. Runs SenseVoice and Paraformer on iOS, Android, Raspberry Pi, and browsers.
Mobile, Edge
608 stars
Cross-platform ASR inference library based on ONNX Runtime and FunASR. Ready to use, supports Chinese-English mixed recognition.
ONNX, Cross-Platform
550 stars
C/C++ implementation of SenseVoice model. Pure C++ inference with no Python dependency.
C++, Embedded
142K stars
Hugging Face Transformers library. Fun-ASR-Nano integration (PR in review): use FunASR models with the familiar HF API.
ML Framework
211 stars
OpenAI-compatible speech server supporting FunASR, Whisper, Bark, and CosyVoice backends.
API Server, OpenAI-compat
113 stars
C++ inference engine based on GGML. CPU/CUDA support, real-time mic streaming, single GGUF file deployment.
GGML, C++
Get Started
The fastest way to try FunASR:
    pip install funasr

    # Python API
    from funasr import AutoModel
    model = AutoModel(model="iic/SenseVoiceSmall")
    result = model.generate(input="audio.wav")

    # Or start an OpenAI-compatible server
    pip install vllm fastapi uvicorn python-multipart
    funasr-server --device cuda
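Since the server is described as OpenAI-compatible, a client can in principle talk to it the way it would talk to OpenAI's transcription endpoint. The following is a minimal sketch using only the Python standard library, not the documented funasr-server interface. The address `http://localhost:8000/v1`, the `/audio/transcriptions` route, the `model` and `file` form fields, and the `text` key in the JSON response are all assumptions modelled on OpenAI's API convention; the helper names `build_transcription_request` and `transcribe` are hypothetical. Check the server's own documentation for the actual port and routes.

```python
import json
import os
import uuid
import urllib.request

# Assumption: host, port, and route are illustrative and follow OpenAI's
# /v1/audio/transcriptions convention; adjust to what your server exposes.
BASE_URL = "http://localhost:8000/v1"


def build_transcription_request(audio_bytes, filename="audio.wav",
                                model="iic/SenseVoiceSmall", base_url=BASE_URL):
    """Build (but do not send) a multipart/form-data transcription request."""
    boundary = uuid.uuid4().hex
    # Plain form field carrying the model name (assumed field name: "model").
    model_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
    ).encode()
    # File field header; the raw audio bytes follow it (assumed field name: "file").
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    closing = f"\r\n--{boundary}--\r\n".encode()
    body = model_part + file_header + audio_bytes + closing
    return urllib.request.Request(
        f"{base_url}/audio/transcriptions",
        data=body,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


def transcribe(path, base_url=BASE_URL):
    """Send the audio file and return the 'text' field (assumed response shape)."""
    with open(path, "rb") as f:
        request = build_transcription_request(
            f.read(), filename=os.path.basename(path), base_url=base_url
        )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["text"]
```

With a server running, `print(transcribe("audio.wav"))` would print the recognized text if the assumptions above hold. Building the request separately from sending it keeps the multipart encoding easy to inspect without a live server.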