Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
Updated Jun 25, 2026 - Python
Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.
Open-source, accurate and easy-to-use video speech recognition & clipping tool. LLM-based AI clipping integrated.
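Local-first, all-in-one desktop subtitle tool with 6 built-in ASR engines, GPU acceleration on all platforms, and 17+ translation providers. Covers the full workflow of audio/video transcription, translation, proofreading, and subtitle burn-in and muxing. Runs on Windows, macOS, and Linux.
Free, open-source alternative to Wispr Flow | A next-generation Chinese desktop voice workflow that integrates local FunASR models with configurable large language models.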
End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.
Real-time audio translation for Windows: captures system audio and mic input, runs ASR (Whisper/SenseVoice), and translates via an LLM API with streaming display. Perfect for VTubers, livestreamers, and watching foreign content.
Out-of-the-box voice service for local, private deployment: quickly set up Qwen3ASR/FunASR and Qwen3TTS/CosyVoice backends.
妙语 (Miaoyu) - Intelligent voice input, where well-chosen words can blossom too.
AI-native, cross-platform offline voice input method.
SenseVoice-based local speech-to-text tool for Windows, with text polishing via OpenAI-format APIs. Low latency, high accuracy.
A speech recognition API with speaker diarization, built on FunASR.
Real-time FunASR speech recognition: transcribes both microphone input and audio playing on the computer. A voice typing tool for PCs.