A global guide to China's AI model ecosystem. Discover, compare, and access leading AI models developed in China.
Chinese AI labs — DeepSeek, Qwen, GLM, Kimi, MiniMax, ERNIE, Hunyuan — now ship some of the most capable and cost-efficient models available, and many are open-weight. But for developers outside China, the ecosystem is hard to navigate: scattered docs, unclear pricing, uncertain global availability.
This repository is a curated, continuously updated entry point. It covers:
- Model comparison — capabilities, context windows, open weights
- How to access — official APIs, routing gateways, and self-hosting
- Best-for guidance — which model fits which workload
- Global availability — where these models can be reached, and how to pay
- Benchmarks — aggregated from established leaderboards
- Daily updates — releases, API changes, and ecosystem news
中文:一个持续更新的中国 AI 模型索引——对比、接入方式、全球可用性、benchmark 和每日生态新闻。 Maintained by TheRouter.ai for the global developer community.
- Model Comparison
- How to Access
- Best For
- Global Availability
- Benchmarks
- Latest Updates
- Model Providers
- Featured Models
- Collections
- Machine-readable Data
- Contributing
A quick map of the major Chinese AI model families. DeepSeek is a popular low-cost alternative for coding workloads; Qwen ships strong multilingual open-weight models; GLM and Kimi focus on reasoning and long context.
| Model | Best For | Context | Vision | Reasoning | API | Open Weights |
|---|---|---|---|---|---|---|
| DeepSeek | Low-cost coding and reasoning | 128K | — | ✅ | ✅ | ✅ |
| Qwen | Multilingual and open-weight deployment | 256K | ✅ | ✅ | ✅ | ✅ |
| GLM | Chinese reasoning and agent workflows | 128K | ✅ | ✅ | ✅ | ✅ |
| Kimi | Long-context document workloads | 256K | — | ✅ | ✅ | — |
| MiniMax | Long-context, audio, and video generation | 1M | ✅ | ✅ | ✅ | ✅ |
| ERNIE | Enterprise deployment on Baidu Cloud | 128K | ✅ | ✅ | ✅ | ✅ |
| Hunyuan | Multimodal generation including 3D world models | 256K | ✅ | ✅ | ✅ | ✅ |
| Doubao | Multimodal agents on Volcengine Ark | 256K | ✅ | ✅ | ✅ | — |
Context windows and capabilities are best-effort and change often. Always verify with the official provider.
There are several ways to use Chinese AI models. This section stays vendor-neutral — official APIs, third-party aggregators, and self-hosting are listed side by side so you can choose what fits your project.
- Official API — Call the provider's own API directly.
- Best for: Production workloads that need first-party SLAs, billing, and the newest model versions.
- Each provider has its own endpoint, pricing, and regional availability. Most expose an OpenAI-compatible mode.
- TheRouter — An OpenAI-compatible routing gateway with fallback and cost controls.
- Best for: Teams that want one endpoint across multiple Chinese and global models, with routing and accounting.
- Maintained by TheRouter.ai, the maintainer of this repository. Listed as one option among several.
- OpenRouter — A third-party aggregator that exposes many models behind one API.
- Best for: Quick experimentation across providers without separate accounts.
- Independent third party. Coverage and pricing of Chinese models varies.
- Self-host — Run open-weight models on your own infrastructure.
- Best for: Data residency requirements, offline use, and full control over inference.
- Only available for models that release open weights. Inference stacks include vLLM, SGLang, and llama.cpp.
Developers new to Chinese AI models often ask which one to start with. This mapping is a starting point, not a ranking — benchmark on your own workload before committing.
- DeepSeek
- Low-cost coding workloads
- A cost-efficient alternative for reasoning tasks
- Qwen
- Multilingual applications
- Open-weight self-hosting and fine-tuning
- Terminal coding agents (Qwen-Code)
- GLM
- Chinese-language reasoning
- Agent and tool-use workflows
- Kimi
- Long-context document analysis
- Retrieval-heavy workflows
- MiniMax
- Very long context windows
- Audio and video generation
- ERNIE
- Enterprise integration on Baidu Cloud
- PaddlePaddle-based deployment
- Hunyuan
- Multimodal generation, including 3D assets
- Integration with Tencent Cloud
- Doubao
- Multimodal agents via Volcengine Ark
- High-throughput production workloads
Where can developers outside China actually reach these models, and how can they pay? This is one of the hardest things to find out — and one of the most useful.
| Provider | Global API | Russia | Middle East | Southeast Asia | Crypto Payment |
|---|---|---|---|---|---|
| DeepSeek | ? | ? | ? | ? | — |
| Qwen | ? | ? | ? | ? | — |
| GLM | ? | ◐ | ◐ | ? | — |
| Kimi | ◐ | ? | ? | ◐ | — |
| MiniMax | ? | ◐ | ◐ | ? | — |
| ERNIE | ◐ | ? | ? | ◐ | — |
| Hunyuan | ◐ | ? | ? | ◐ | — |
| Doubao | ◐ | ? | ◐ | ? | — |
Legend: ✅ available · ◐ partial · — not available · ? unverified. Availability changes often. Always verify with the provider.
This repo does not invent benchmarks. It points to established leaderboards and reports, with notes on how each covers Chinese AI models.
- LMArena (Chatbot Arena) — Human-preference Elo rankings; includes major Chinese models.
- Crowd-voted; useful for general quality signal, not task-specific performance.
- Artificial Analysis — Quality, speed, and price comparisons across providers.
- Good for cost and latency trade-offs; tracks Chinese models alongside global ones.
- OpenCompass — Open-source evaluation suite with strong Chinese-language task coverage.
- Maintained by Shanghai AI Lab; broad academic benchmark coverage.
- SuperCLUE — Chinese-language general benchmark.
- Chinese-language focus; verify test version and date.
- Hugging Face Leaderboards — Task-specific community leaderboards (coding, reasoning, embeddings).
- Coverage varies by leaderboard; check maintainer and methodology.
- 2026-06-30
open_sourceQwenLM/qwen-code release Release v0.19.3-nightly.20260630.e00fe6a27 — ## What's Changed * docs(daemon): refresh daemon docs for recent PRs (wave 2) by @doudouOUC in https://github.com/Qwe - 2026-06-30
open_sourceGitHub repository update: Tencent-Hunyuan/Rosetta-inference — Open-source native multimodal pretraining — without catastrophic forgetting. - 2026-06-30
open_sourceGitHub repository update: QwenLM/Qwen-RobotNav — Official Repo for Qwen-RobotNav - 2026-06-29
open_sourceGitHub repository update: Tencent-Hunyuan/HunyuanVideo — HunyuanVideo: A Systematic Framework For Large Video Generation Model - 2026-06-28
open_sourceQwenLM/qwen-code release Release v0.19.2-nightly.20260628.714513df2 — ## What's Changed * fix(core): allow web_fetch JSON fallback by @tt-a1i in QwenLM/qwen-code#5 - 2026-06-28
open_sourceQwenLM/qwen-code release Release v0.19.3 — ## What's Changed * fix(core): allow web_fetch JSON fallback by @tt-a1i in QwenLM/qwen-code#5660 * chore(release): v0.19 - 2026-06-28
open_sourceGitHub repository update: InternLM/VSR — Repository tracked from an official GitHub organization. - 2026-06-28
model_updateHugging Face model update: deepseek-ai/eagle3_qwen3_8b_ttt7 — Model card/update on Hugging Face. Tags: safetensors, qwen3, region:us - 2026-06-28
model_updateHugging Face model update: deepseek-ai/eagle3_qwen3_4b_ttt7 — Model card/update on Hugging Face. Tags: safetensors, qwen3, region:us - 2026-06-28
model_updateHugging Face model update: deepseek-ai/eagle3_qwen3_14b_ttt7 — Model card/update on Hugging Face. Tags: safetensors, qwen3, region:us
See newsletters/latest.md for the full daily digest.
- DeepSeek / 深度求索 — llm, reasoning, coding, open-weights.
- Alibaba Qwen / 通义千问 — llm, multimodal, coding, open-weights.
- Zhipu AI / 智谱 AI — llm, multimodal, coding.
- Moonshot AI / 月之暗面 — llm, long-context.
- MiniMax — llm, audio, video, agent.
- Baidu ERNIE / 百度文心 — llm, enterprise, paddlepaddle.
- Tencent Hunyuan / 腾讯混元 — llm, multimodal, 3d.
- ByteDance Doubao / Volcengine Ark / 字节豆包 / 火山方舟 — llm, multimodal, agent.
- OpenBMB (MiniCPM) / 清华 & 面壁智能 — llm, multimodal, on-device, open-weights.
- InternLM (Shanghai AI Lab) / 书生·浦语 / 上海人工智能实验室 — llm, multimodal, science, open-weights.
Entries from the machine-readable registry in data/models.yaml:
| Model | Provider | Type | Access | Open Weights | Official |
|---|---|---|---|---|---|
| DeepSeek-V3 | DeepSeek | chat, reasoning, coding | api / web / weights / local-deploy / openai_compatible | Yes | Source |
| Qwen3 | Alibaba Qwen | chat, reasoning, coding | api / web / weights / local-deploy / openai_compatible | Yes | Source |
| GLM-4 | Zhipu AI | chat, reasoning, coding | api / web / openai_compatible | No | Source |
| Kimi | Moonshot AI | chat, long-context | api / web / openai_compatible | No | Source |
| Qwen-Code | Alibaba Qwen | coding, agent-tool-use | weights / local-deploy | Yes | Source |
| HY-World 2.0 | Tencent Hunyuan | image-generation, video-generation | weights / local-deploy | Yes | Source |
| MiniCPM-V 4 | OpenBMB (MiniCPM) | chat, vision-language, reasoning | weights / local-deploy | Yes | Source |
| Intern-S2-Preview | InternLM (Shanghai AI Lab) | chat, vision-language, reasoning | weights / local-deploy | Yes | Source |
| Qwen3.6 | Alibaba Qwen | chat, reasoning, coding | weights / local-deploy | Yes | Source |
| VoxCPM | OpenBMB (MiniCPM) | audio-speech | weights / local-deploy | Yes | Source |
| Hy-MT2 | Tencent Hunyuan | translation | weights / local-deploy | Yes | Source |
| HunyuanOCR | Tencent Hunyuan | ocr | weights / local-deploy | Yes | Source |
collections/open-source-models.md— open-weight Chinese AI models.collections/api-accessible-models.md— models available through APIs.collections/multimodal-models.md— VLM, image, video, audio, and 3D models.collections/coding-models.md— code generation and software engineering models.collections/agent-models.md— tool-use, agent, browser, and workflow models.collections/embedding-rerank.md— embedding and reranker models.collections/therouter-supported.md— Chinese models available or tracked for TheRouter access.
The canonical registry lives in data/:
data/models.yaml— model indexdata/providers.yaml— provider indexdata/comparison.yaml— model comparison matrixdata/access_methods.yaml— how-to-access optionsdata/best_for.yaml— best-for mappingdata/availability.yaml— global availability matrixdata/benchmarks.yaml— aggregated benchmark sourcesdata/faq.yaml— frequently asked questionsdata/news.yaml— daily updates and news eventsdata/resources.yaml— official resources, docs, model cards, tutorialsdata/capabilities.yaml— normalized capability taxonomydata/use_cases.yaml— reproducible playbooks and workflows
Run local validation:
python3 scripts/validate_data.py
python3 scripts/generate_readme.py --checkAI models developed by organizations based in China, such as DeepSeek, Qwen (Alibaba), GLM (Zhipu AI), Kimi (Moonshot AI), MiniMax, ERNIE (Baidu), and Hunyuan (Tencent). Many are released with open weights and expose OpenAI-compatible APIs.
DeepSeek and Qwen are commonly used for coding workloads, often as a lower-cost alternative to closed models. Compare them on a coding benchmark and on your own tasks before committing.
Many are open-weight: DeepSeek, Qwen, GLM, and MiniMax publish weights on Hugging Face and ModelScope. Some, such as Kimi and Doubao, are API-only. Check the license of each specific model before deployment.
Three common paths: call the provider's official API, use an OpenAI-compatible aggregator such as TheRouter or OpenRouter, or self-host open-weight models. See the "How to Access" section.
Most major providers expose an OpenAI-compatible API mode, so existing OpenAI SDK code usually works with a changed base URL and API key. Confirm supported parameters in each provider's documentation.
Several Chinese models are priced below comparable closed models, which is why developers evaluate them as cost-efficient alternatives. Pricing changes frequently — verify current rates on the provider's pricing page.
Please read CONTRIBUTING.md before opening an issue or PR.
Minimum requirements:
- Prefer official sources: websites, docs, GitHub, Hugging Face, ModelScope, papers, or changelogs.
- Include source URL, category, date, and why the entry belongs here.
- Keep descriptions factual and neutral. Avoid unverified claims such as “最强”, “吊打”, or “全球第一”.
- Do not submit random mirrors, SEO spam, unverifiable screenshots, cracked products, or gray-market workflows.
TheRouter.ai provides an OpenAI-compatible access layer for multiple AI models, with routing, fallback, and cost-control workflows. It is listed in How to Access as one option among several. This repo stays source-driven and vendor-neutral; entries must remain community-useful.
This is a community-maintained index. Always verify model availability, pricing, licensing, and benchmark claims with the official provider.