FunASR Ecosystem

Open-source projects and integrations powered by FunASR, SenseVoice, and Paraformer.

50+ Integrations · 50+ Languages · 16K+ GitHub Stars · 1M+ pip installs/month

Video & Media Tools

FunClip

5.7K stars

ASR-powered video clipping. Automatic subtitles, plus keyword- and speaker-based clip extraction.

Video · Official

pyVideoTrans

17.6K stars

Video translation tool. Uses FunASR for Chinese speech recognition with subtitle generation.

Video · Translation

Voice-Pro

10.4K stars

Gradio WebUI for TTS, voice cloning, and audio processing with ASR capabilities.

Audio · TTS

Linly-Dubbing

3.2K stars

AI video dubbing toolkit. Automatic speech recognition, translation, and voice cloning for multilingual dubbing.

Dubbing · Video

Voice Input & Desktop Apps

CapsWriter-Offline

5.5K stars

PC voice input tool with offline recognition. Hold CapsLock to speak, release to paste. Powered by FunASR Paraformer.

Desktop · Voice Input

OpenLess

1.8K stars

Voice input for macOS & Windows. Hold a key, speak, release — text appears at cursor. Uses SenseVoice via Sherpa ONNX.

Desktop · macOS

ququ

2.2K stars

Open-source Wispr Flow alternative. Desktop voice workflow integrating FunASR local models with configurable LLMs.

Desktop · Voice Input

MTools

1.2K stars

Multi-function desktop app with audio/video processing, image editing, and AI-enhanced speech transcription.

Desktop · Toolkit

VocoType

704 stars

Privacy-first local voice input tool. Converts speech to text via hotkey and auto-types into any app. Supports MCP integration.

Voice Input · Privacy

LiveTranslate

325 stars

Real-time audio translation. Captures system audio + mic, uses SenseVoice for ASR, then LLM streaming translation.

Translation · Real-time

VocoType-linux

139 stars

High-performance offline Chinese voice input for Linux. Based on FunASR, displays recognized text in about 0.1 s, and supports IBus and Fcitx5.

Linux · Input Method

Murmur

103 stars

Free offline voice-to-text for macOS. Push-to-talk, works in any app. Fully local processing with SenseVoice.

macOS · Voice Input

AriaType

74 stars

Voice-driven writing, input, and cross-app work for your desktop. Speech-to-text with AI refinement.

Desktop · Writing

VoiceSnap

73 stars

Open-source offline voice dictation — a free Typeless alternative. 100% local, SenseVoice + DirectML, ideal for air-gapped environments.

Offline · Security

Voice Assistants & Agents

Fay

12.8K stars

Digital human agent framework connecting 2.5D/3D avatars with LLMs. Uses FunASR for real-time speech recognition.

Digital Human · Agent

Duix Avatar

13.4K stars

Open-source AI digital human toolkit. Offline video generation and real-time interaction. Uses FunASR for speech recognition.

Digital Human · Video

Wukong Robot

7.1K stars

Chinese voice assistant and smart speaker for Raspberry Pi. Supports ChatGPT and brain-computer interaction, with FunASR as the ASR engine.

IoT · Assistant

Paseo

6.9K stars

Coding agent from your phone, desktop, and CLI. Uses Paraformer and SenseVoice for speech recognition via Sherpa ONNX.

Coding · Agent

Linly-Talker

3.3K stars

Digital avatar conversational system. Combines ASR, LLM, and TTS for natural dialogue with virtual characters. Uses FunASR.

Digital Avatar · Dialogue

AudioNotes

2.1K stars

Extract audio/video content into structured markdown notes. Uses FunASR for accurate transcription.

Notes · Productivity

Bailing

1.7K stars

GPT-4o-style voice chatbot. Full ASR + LLM + TTS pipeline for natural voice conversations. Powered by FunASR.

Voice Chat · GPT-4o

AI Platforms & Frameworks

GPT-SoVITS

58K stars

TTS trained from as little as 1 minute of voice data. Uses FunASR for training data annotation (automatic speech-to-text labeling).

TTS · Training

LocalAI

33K stars

Self-hosted OpenAI alternative. FunASR integration as speech-to-text backend (PR in review).

LLM · Self-hosted

Pipecat

12.5K stars

Voice and multimodal conversational AI framework. FunASR is available as a community STT integration.

Conversational AI

Amphion

9.8K stars

Open-source audio, music, and speech generation toolkit from OpenMMLab. Uses FunASR for TTS evaluation and data processing.

Audio Toolkit · OpenMMLab

Dify

143K stars

LLM application development platform. FunASR available as speech-to-text provider via OpenAI-compatible API.

LLM Platform

Xinference

9.3K stars

Distributed inference framework. Built-in FunASR speech recognition backend with one-click ASR model deployment.

Inference · Distributed

AIGCPanel

5K stars

All-in-one AI digital human system with video synthesis and voice cloning. Integrates FunASR for speech recognition.

Digital Human · AIGC

MiniMind-O

1.6K stars

Lightweight multimodal model combining vision, audio, and language understanding. Uses FunASR for its speech recognition module.

Multimodal · LLM

ComfyUI-FunAudioLLM

95 stars

ComfyUI custom nodes for SenseVoice and CosyVoice. Visual workflow builder for speech recognition and synthesis.

ComfyUI · Workflow

SenseVoice Community Extensions

OmniSenseVoice

894 stars

Enhanced SenseVoice with high-accuracy word-level timestamps. Same speed as original model.

Timestamps · SenseVoice

api4sensevoice

541 stars

API and WebSocket server for SenseVoice. Supports voice activity detection (VAD), real-time streaming, and speaker verification.

API · WebSocket

streaming-sensevoice

451 stars

Pseudo-streaming SenseVoice with hotword boosting. Low-latency near-realtime speech recognition.

Streaming · Hotwords

SenseVoice-python

111 stars

Enterprise-grade SenseVoice inference with ONNX Runtime. No PyTorch dependency, production-ready deployment.

ONNX · Deployment

SenseVoice-Api

109 stars

FastAPI wrapper for SenseVoice with ONNX inference. Smaller footprint, quantized models, GPU acceleration.

FastAPI · Quantized

SenseVoice-OneApi

93 stars

SenseVoice API service compatible with OneAPI. Unified interface for managing multiple speech recognition models.

OneAPI · API

Cross-Platform Inference

Sherpa-ONNX

5K+ stars

Cross-platform speech processing with ONNX. Runs SenseVoice and Paraformer on iOS, Android, Raspberry Pi, and browsers.

Mobile · Edge

RapidASR

608 stars

Cross-platform ASR inference library based on ONNX Runtime and FunASR. Ready to use, supports Chinese-English mixed recognition.

ONNX · Cross-Platform

SenseVoice.cpp

550 stars

C/C++ implementation of SenseVoice model. Pure C++ inference with no Python dependency.

C++ · Embedded

Transformers

142K stars

Hugging Face Transformers library. Fun-ASR-Nano integration (PR in review) — use FunASR models with the familiar HF API.

ML Framework

Vox-Box

211 stars

OpenAI-compatible speech server supporting FunASR, Whisper, Bark, and CosyVoice backends.

API Server · OpenAI-compat

FunSpeech

136 stars

Out-of-the-box local speech service. Microservice architecture, compatible with Alibaba Cloud Speech API and OpenAI TTS API.

API Server · Self-hosted

FunASR-GGML

113 stars

C++ inference engine based on GGML. CPU/CUDA support, real-time mic streaming, single GGUF file deployment.

GGML · C++

ManySpeech

79 stars

Multi-model ASR inference solution supporting Paraformer, SenseVoice, Whisper, and more. ONNX-based, multi-scenario ready.

Multi-model · ONNX

Get Started

The fastest way to try FunASR:

# Install
pip install funasr

# Python API
from funasr import AutoModel
model = AutoModel(model="iic/SenseVoiceSmall")
result = model.generate(input="audio.wav")
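
SenseVoice-style transcripts can carry inline markers such as `<|en|>` or `<|NEUTRAL|>` ahead of the recognized text. The sketch below shows one way to pull clean text out of the result, assuming `generate` returns a list of dicts with a `text` field and that markers follow the `<|...|>` pattern. The helper name `clean_text` and the sample result are hypothetical, and FunASR's own post-processing utilities are likely a better choice in practice.

```python
import re

# Assumption: markers look like <|zh|>, <|NEUTRAL|>, <|Speech|>, and so on.
TAG_PATTERN = re.compile(r"<\|[^|]*\|>")


def clean_text(results):
    """Return the transcript of each result item with <|...|> markers removed."""
    texts = []
    for item in results:
        raw = item.get("text", "")
        texts.append(TAG_PATTERN.sub("", raw).strip())
    return texts


# Hypothetical result, shaped like a list of dicts with "key" and "text".
sample = [{"key": "audio", "text": "<|en|><|NEUTRAL|><|Speech|><|woitn|>hello world"}]
print(clean_text(sample))
```

With a real model, the same helper would be called as `clean_text(result)`.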

# Or start an OpenAI-compatible server
pip install vllm fastapi uvicorn python-multipart
funasr-server --device cuda
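
Once a server like this is running, a client can in principle talk to it the same way it would talk to any OpenAI-style transcription endpoint. The sketch below uses only the Python standard library and rests on several assumptions not stated above: that the server exposes the usual `/v1/audio/transcriptions` route, that it listens on `http://localhost:8000`, and that it accepts `iic/SenseVoiceSmall` as the model name. The function names are hypothetical; check the server's own documentation for the real values.

```python
import json
import os
import urllib.request
import uuid


def build_transcription_request(base_url, audio_bytes, filename="audio.wav",
                                model="iic/SenseVoiceSmall"):
    """Build a multipart POST for an OpenAI-style transcription route."""
    boundary = uuid.uuid4().hex
    crlf = b"\r\n"
    delimiter = b"--" + boundary.encode()
    body = b"".join([
        # Form field: model name (assumed to be accepted by the server).
        delimiter + crlf,
        b'Content-Disposition: form-data; name="model"' + crlf + crlf,
        model.encode() + crlf,
        # Form field: the audio file itself.
        delimiter + crlf,
        b'Content-Disposition: form-data; name="file"; filename="'
        + filename.encode() + b'"' + crlf,
        b"Content-Type: application/octet-stream" + crlf + crlf,
        audio_bytes + crlf,
        # Closing delimiter.
        delimiter + b"--" + crlf,
    ])
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/transcriptions",  # assumed route
        data=body,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


def transcribe(base_url, path):
    """Send a local audio file and return the "text" field of the JSON reply."""
    with open(path, "rb") as f:
        request = build_transcription_request(
            base_url, f.read(), filename=os.path.basename(path)
        )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["text"]


# Example call (needs a running server; the port is an assumption):
# print(transcribe("http://localhost:8000", "audio.wav"))
```

Keeping request construction separate from the network call makes it possible to inspect the payload without a live server.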

Built something with FunASR? We'd love to list it here.

GitHub · Open an Issue