Which FunASR model? SenseVoice vs Paraformer vs Fun-ASR-Nano

FunASR ships three main ASR models. In one line: multilingual + emotion/events and fast → SenseVoice; Chinese production + word timestamps/hotwords → Paraformer; highest accuracy + context/hotwords across 31 languages → Fun-ASR-Nano. Details below.

Pick in one table

ModelLanguagesChinese CER ↓Arch / speedHighlightsBest for
SenseVoice50+ (zh/yue/en/ja/ko…)7.81%non-AR CTC, ~170xemotion + audio events + language IDmultilingual, emotion, real-time/low latency
ParaformerChinese (+ English variant)10.18%non-AR CIF, ~120xword timestamps, hotwords (SeACo), streamingChinese production, subtitles/timestamps, hotwords
Fun-ASR-Nano318.06%LLM (Qwen3-0.6B), vLLM 340xcontext/hotword prompting, LLM decodinghighest accuracy, context-aware, broad languages

(Chinese CER on the same 184-file set, micro-average + normalize_zh; speed = realtime factor on GPU.)

SenseVoice — the all-rounder, default pick

One non-autoregressive pass gives transcript + language + emotion + audio events, 50+ languages, lowest Chinese CER, and high speed. The default for most use cases.

from funasr import AutoModel
m = AutoModel(model="iic/SenseVoiceSmall", vad_model="fsmn-vad")
res = m.generate(input="audio.wav", language="auto", use_itn=True)

Paraformer — Chinese production + timestamps/hotwords

Industrial Chinese ASR with word-level timestamps (for subtitles), hotword customization (SeACo-Paraformer), and a low-latency streaming variant (paraformer-zh-streaming). Choose it when you need timestamps or hotwords.

m = AutoModel(model="paraformer-zh", vad_model="fsmn-vad", punc_model="ct-punc")
res = m.generate(input="audio.wav")

Fun-ASR-Nano — LLM-ASR, highest accuracy + context

A Qwen3-0.6B-based LLM-ASR across 31 languages, with context/hotword prompting and strong offline accuracy; vLLM acceleration reaches 340x. Choose it for top quality and context-awareness.

m = AutoModel(model="FunAudioLLM/Fun-ASR-Nano-2512", trust_remote_code=True, hub="hf")
res = m.generate(input="audio.wav", language="中文", hotwords=["开放时间"])

Quick decision

FunASR is open-source & commercial-friendly. A Star really helps 👇

⭐ Star FunASR

Also: SenseVoice · Fun-ASR · FunClip

Related posts