DataLearner 标志DataLearnerAI
最新AI资讯
大模型排行榜
大模型评测基准
大模型列表
大模型对比
资源中心
工具
语言中文
DataLearner 标志DataLearner AI

专注大模型评测、数据资源与实践教学的知识平台,持续更新可落地的 AI 能力图谱。

产品

  • 评测榜单
  • 模型对比
  • 数据资源

资源

  • 部署教程
  • 原创内容
  • 工具导航

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner 持续整合行业数据与案例,为科研、企业与开发者提供可靠的大模型情报与实践指南。

隐私政策服务条款
首页综合排行榜Text Generation Arena 文本生成模型排行榜

LMArena 评测赛道

文本生成代码数学图像编辑文字生成视频图生视频文生图

Text Generation Arena 文本生成模型排行榜

基于 Text Generation Arena 用户匿名投票的最新AI文本生成模型排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。

榜首模型

Claude Opus 4.6 (thinking)

最高得分

1,502

模型数量

360

数据版本

2026年05月28日

数据来源: LM Arena

关于本排行榜

本排行榜展示了当前最强 AI 大模型在文本生成任务中的综合实力排名。数据来源于 LMArena(前身为 LMSYS Chatbot Arena),这是目前全球最大的 AI 模型众包评测平台。用户在平台上与两个匿名模型同时对话,并投票选出更好的回答——排名完全由真实用户的偏好决定,而非实验室基准测试。

评测方法概要

匿名盲测:用户同时与两个"隐藏身份"的模型对话,根据回答质量投票,排除品牌偏见。

Elo 评分:基于国际象棋领域的 Elo Rating 体系(Bradley-Terry 模型),通过对战结果计算每个模型的实力分数。分数越高,说明模型在真实对话中被用户选中的概率越大。

场景覆盖广泛:涵盖编程、创意写作、数学推理、知识问答、角色扮演等高频真实场景。

DataLearner 在原始数据基础上提供中文解读与深度分析,并将排行榜模型关联至 DataLearner 模型库,方便您一键查看模型详情、API 定价、评测得分等完整信息。

来源:全部国产模型
榜单历史快照月份:

排名总表

排名模型名称得分95% CI投票数机构许可证
AnthropicClaude Opus 4.6 (thinking)Anthropic1,502+/-434,186AnthropicProprietary
AnthropicOpus 4.7 (thinking)Anthropic1,500+/-519,973AnthropicProprietary
AnthropicClaude Opus 4.6Anthropic1,498+/-436,512AnthropicProprietary
4AnthropicOpus 4.7Anthropic1,494+/-520,724AnthropicProprietary
5FAMuse SparkFacebook AI研究实验室1,489+/-612,228Facebook AI研究实验室Proprietary
6Google Deep MindGemini 3.1 Pro PreviewGoogle Deep Mind1,487+/-443,742Google Deep MindProprietary
7Google Deep MindGemini 3.0 Pro (Preview 11-2025)Google Deep Mind1,486+/-441,332Google Deep MindProprietary
8OpenAIgpt-5.5-highOpenAI1,482+/-616,573OpenAIProprietary
9OpenAIgpt-5.4-highOpenAI1,480+/-528,246OpenAIProprietary
10Googlegemini-3.5-flashGoogle1,479+/-79,045GoogleProprietary
11OpenAIGPT-5.5OpenAI1,476+/-616,852OpenAIProprietary
12OpenAIgpt-5.2-chat-latest-20260210OpenAI1,476+/-432,280OpenAIProprietary
13xAIgrok-4.20-beta1xAI1,476+/-524,468xAIProprietary
14xAIgrok-4.20-beta-0309-reasoningxAI1,475+/-529,068xAIProprietary
15Alibabaqwen3.7-max-previewAlibaba1,475+/-103,755AlibabaProprietary
16智谱GLM 5.1智谱AI1,474+/-613,957智谱AIMIT
17OpenAIgpt-5.5-instantOpenAI1,474+/-524,925OpenAIProprietary
18Google Deep MindGemini 3.0 FlashGoogle Deep Mind1,473+/-430,732Google Deep MindProprietary
19AnthropicClaude Opus 4 (thinking-32k)Anthropic1,473+/-437,130AnthropicProprietary
20xAIgrok-4.20-multi-agent-beta-0309xAI1,472+/-528,630xAIProprietary
21Baiduernie-5.1Baidu1,470+/-614,675BaiduProprietary
22AnthropicClaude Sonnet 4.6Anthropic1,470+/-527,474AnthropicProprietary
23OpenAIgpt-5.4OpenAI1,469+/-529,672OpenAIProprietary
24AnthropicClaude Opus 4Anthropic1,469+/-366,107AnthropicProprietary
25xAIGrok 4.1 ThinkingxAI1,466+/-363,569xAIProprietary
26Alibabaqwen3.5-max-previewAlibaba1,466+/-520,212AlibabaProprietary
27XImimo-v2.5-proXiaomi1,465+/-615,722XiaomiMIT
28Moonshotkimi-k2.6Moonshot1,462+/-615,765MoonshotModified MIT
29Google Deep MindGemini 3.0 Flash (minimal)Google Deep Mind1,461+/-452,876Google Deep MindProprietary
30xAIGrok 4.1xAI1,460+/-365,655xAIProprietary
31Alibabaqwen3.6-max-previewAlibaba1,459+/-94,648AlibabaProprietary
32DeepSeekdeepseek-v4-pro-thinkingDeepSeek1,458+/-615,852DeepSeekMIT
33智谱GLM-5智谱AI1,457+/-521,930智谱AIMIT
34Bytedancedola-seed-2.0-proBytedance1,456+/-437,742BytedanceProprietary
35AnthropicClaude Sonnet 4.5Anthropic1,455+/-376,121AnthropicProprietary
36AnthropicClaude Sonnet 4.5 (thinking-32k)Anthropic1,455+/-377,813AnthropicProprietary
37OpenAIGPT-5.1 Pro (high)OpenAI1,455+/-440,856OpenAIProprietary
38DeepSeekdeepseek-v4-proDeepSeek1,454+/-616,920DeepSeekMIT
39Googlegemma-4-31bGoogle1,452+/-85,855GoogleApache 2.0
40OpenAIgpt-5.4-mini-highOpenAI1,451+/-526,397OpenAIProprietary
41Moonshot AIKimi K2 ThinkingMoonshot AI1,449+/-436,795Moonshot AIModified MIT
42百度ERNIE 5.0百度1,449+/-79,752百度Proprietary
43AnthropicOpus 4.1 (thinking-16k)Anthropic1,449+/-349,833AnthropicProprietary
44OpenAIgpt-5.3-chat-latestOpenAI1,449+/-430,882OpenAIProprietary
45XImimo-v2-proXiaomi1,448+/-522,638XiaomiProprietary
46百度ERNIE 5.0百度1,448+/-434,159百度Proprietary
47AnthropicOpus 4.1Anthropic1,447+/-377,373AnthropicProprietary
48xAIgrok-4.3xAI1,447+/-615,773xAIProprietary
49Google Deep MindGemini 2.5 Pro Experimental 03-25Google Deep Mind1,446+/-3122,636Google Deep MindProprietary
50阿里Qwen3.5-397B-A17B阿里巴巴1,445+/-431,970阿里巴巴Apache 2.0
51OpenAIGPT-4.5OpenAI1,445+/-614,547OpenAIProprietary
52Alibabaqwen3.6-plusAlibaba1,444+/-518,202AlibabaProprietary
53OpenAIchatgpt-4o-latest-20250326OpenAI1,443+/-382,471OpenAIProprietary
54智谱GLM-4.7智谱AI1,443+/-612,133智谱AIMIT
55OpenAIGPT-5.1 InstantOpenAI1,439+/-443,501OpenAIProprietary
56Googlegemma-4-26b-a4bGoogle1,439+/-85,789GoogleApache 2.0
57OpenAIGPT-5.2 Pro (high)OpenAI1,438+/-446,111OpenAIProprietary
58DeepSeekdeepseek-v4-flash-thinkingDeepSeek1,437+/-616,545DeepSeekMIT
59Meituanlongcat-flash-chat-2602-expMeituan1,436+/-523,731MeituanProprietary
60阿里Qwen3 Max (Preview)阿里巴巴1,435+/-527,736阿里巴巴Proprietary
61OpenAIGPT-5.2OpenAI1,435+/-446,492OpenAIProprietary
62XImimo-v2.5Xiaomi1,434+/-615,979XiaomiMIT
63OpenAIGPT-5-Pro (high)OpenAI1,434+/-531,947OpenAIProprietary
64Googlegemini-3.1-flash-lite-previewGoogle1,433+/-435,135GoogleProprietary
65DeepSeekdeepseek-v4-flashDeepSeek1,433+/-616,725DeepSeekMIT
66Moonshotkimi-k2.5-instantMoonshot1,432+/-78,197MoonshotModified MIT
67OpenAIOpenAI o3OpenAI1,431+/-459,775OpenAIProprietary
68xAIgrok-4-1-fast-reasoningxAI1,431+/-354,616xAIProprietary
69Moonshotkimi-k2-thinking-turboMoonshot1,430+/-360,235MoonshotModified MIT
70Amazonamazon-nova-experimental-chat-26-02-10Amazon1,427+/-103,418AmazonProprietary
71OpenAIGPT-5OpenAI1,427+/-431,595OpenAIProprietary
72智谱GLM-4.6智谱AI1,426+/-435,661智谱AIMIT
73DeepSeek-AIDeepSeek V3.2-Exp (thinking)DeepSeek-AI1,425+/-79,064DeepSeek-AIMIT
74DeepSeek-AIDeepSeek V3.2DeepSeek-AI1,424+/-446,204DeepSeek-AIMIT
75AnthropicClaude Opus 4 (thinking-16k)Anthropic1,424+/-436,900AnthropicProprietary
76Alibabaqwen3-max-2025-09-23Alibaba1,424+/-69,158AlibabaProprietary
77DeepSeek-AIDeepSeek V3.2-ExpDeepSeek-AI1,423+/-611,941DeepSeek-AIMIT
78阿里Qwen3-235B-A22B-2507阿里巴巴1,423+/-395,473阿里巴巴Apache 2.0
79DeepSeek-AIDeepSeek-R1-0528DeepSeek-AI1,422+/-618,467DeepSeek-AIMIT
80DeepSeek-AIDeepSeek V3.2 (thinking)DeepSeek-AI1,422+/-440,111DeepSeek-AIMIT
81xAIGrok 4 FastxAI1,421+/-86,820xAIProprietary
82百度ERNIE 5.0百度1,420+/-94,708百度Proprietary
83Moonshotkimi-k2-0905-previewMoonshot1,418+/-611,795MoonshotModified MIT
84DeepSeek-AIDeepSeek-V3.1DeepSeek-AI1,418+/-614,969DeepSeek-AIMIT
85DeepSeekdeepseek-v3.1-terminus-thinkingDeepSeek1,418+/-103,468DeepSeekMIT
86Moonshot AIKimi K2Moonshot AI1,417+/-527,643Moonshot AIModified MIT
87Alibabaqwen3.5-122b-a10bAlibaba1,417+/-426,670AlibabaApache 2.0
88DeepSeek-AIDeepSeek-V3.1 (thinking)DeepSeek-AI1,417+/-711,746DeepSeek-AIMIT
89DeepSeek-AIDeepSeek-V3.1 TerminusDeepSeek-AI1,416+/-103,705DeepSeek-AIMIT
90Amazonamazon-nova-experimental-chat-26-01-10Amazon1,416+/-103,414AmazonProprietary
91Tencenthunyuan-hy3-previewTencent1,416+/-85,812Tencenttencent-hunyuan-community
92阿里Qwen3-VL-235B-A22B-Instruct阿里巴巴1,415+/-611,515阿里巴巴Apache 2.0
93MistralAIMistral Large 3MistralAI1,415+/-442,553MistralAIApache 2.0
94XImimo-v2-omniXiaomi1,414+/-112,968XiaomiProprietary
95OpenAIgpt-4.1-2025-04-14OpenAI1,413+/-450,997OpenAIProprietary
96MiniMaxAIMiniMax-M2.7MiniMaxAI1,413+/-523,278MiniMaxAIModified MIT
97AnthropicClaude Opus 4Anthropic1,412+/-444,223AnthropicProprietary
98xAIGrok 3xAI1,412+/-432,909xAIProprietary
99智谱GLM-4.5智谱AI1,411+/-524,322智谱AIMIT
100Google Deep MindGemini 2.5 FlashGoogle Deep Mind1,411+/-3122,458Google Deep MindProprietary
101Anthropicclaude-haiku-4-5-20251001Anthropic1,411+/-378,134AnthropicProprietary
102MistralAIMagistral-Medium-2506MistralAI1,410+/-392,031MistralAIProprietary
103xAIgrok-4-0709xAI1,410+/-441,413xAIProprietary
104Alibabaqwen3.5-27bAlibaba1,408+/-525,772AlibabaApache 2.0
105Googlegemini-2.5-flash-preview-09-2025Google1,405+/-432,925GoogleProprietary
106xAIgrok-4-fast-reasoningxAI1,404+/-518,729xAIProprietary
107Alibabaqwen3-235b-a22b-no-thinkingAlibaba1,403+/-538,226AlibabaApache 2.0
108OpenAIgpt-5.4-nano-highOpenAI1,403+/-525,617OpenAIProprietary
109Alibabaqwen3-next-80b-a3b-instructAlibaba1,402+/-522,881AlibabaApache 2.0
110OpenAIo1-2024-12-17OpenAI1,402+/-427,807OpenAIProprietary
111Meituanlongcat-flash-chatMeituan1,401+/-611,405MeituanMIT
112Alibabaqwen3-235b-a22b-thinking-2507Alibaba1,400+/-78,993AlibabaApache 2.0
113AnthropicClaude Sonnet 4 (thinking-32k)Anthropic1,399+/-435,127AnthropicProprietary
114DeepSeek-AIDeepSeek-R1DeepSeek-AI1,398+/-518,524DeepSeek-AIMIT
115Alibabaqwen3.5-35b-a3bAlibaba1,396+/-427,304AlibabaApache 2.0
116Alibabaqwen3.5-flashAlibaba1,396+/-429,647AlibabaProprietary
117Alibabaqwen3-vl-235b-a22b-thinkingAlibaba1,396+/-77,947AlibabaApache 2.0
118Tencenthunyuan-vision-1.5-thinkingTencent1,396+/-122,220TencentProprietary
119DeepSeek-AIDeepSeek-V3-0324DeepSeek-AI1,395+/-445,518DeepSeek-AIMIT
120Amazonamazon-nova-experimental-chat-12-10Amazon1,395+/-103,681AmazonProprietary
121StepFunAIStep 3.5 FlashStepFunAI1,394+/-434,466StepFunAIApache 2.0
122XImimo-v2-flash (non-thinking)Xiaomi1,393+/-444,619XiaomiMIT
123MiniMaxAIMiniMax M2.5MiniMaxAI1,391+/-436,265MiniMaxAIModified MIT
124OpenAIgpt-5-mini-highOpenAI1,390+/-527,039OpenAIProprietary
125OpenAIo4-mini-2025-04-16OpenAI1,390+/-445,452OpenAIProprietary
126AnthropicClaude Sonnet 4Anthropic1,389+/-440,323AnthropicProprietary
127OpenAIo1-previewOpenAI1,388+/-531,122OpenAIProprietary
128Alibabaqwen3-coder-480b-a35b-instructAlibaba1,388+/-525,741AlibabaApache 2.0
129Tencenthunyuan-t1-20250711Tencent1,387+/-94,711TencentProprietary
130XImimo-v2-flash (thinking)Xiaomi1,387+/-610,974XiaomiMIT
131AnthropicClaude Sonnet 3.7 (thinking-32k)Anthropic1,387+/-438,827AnthropicProprietary
132Mistralmistral-medium-2505Mistral1,387+/-533,230MistralProprietary
133MiniMaxminimax-m2.1-previewMiniMax1,385+/-517,138MiniMaxMIT
134Alibabaqwen3-30b-a3b-instruct-2507Alibaba1,384+/-523,746AlibabaApache 2.0
135OpenAIgpt-4.1-mini-2025-04-14OpenAI1,382+/-439,339OpenAIProprietary
136Tencenthunyuan-turbos-20250416Tencent1,382+/-610,725TencentProprietary
137Googlegemini-2.5-flash-lite-preview-09-2025-no-thinkingGoogle1,380+/-347,246GoogleProprietary
138ARtrinity-large-previewArcee AI1,378+/-428,284Arcee AIApache 2.0
139智谱GLM-4.6V智谱AI1,378+/-112,808智谱AIMIT
140Alibabaqwen3-235b-a22bAlibaba1,375+/-526,268AlibabaApache 2.0
141Googlegemini-2.5-flash-lite-preview-06-17-thinkingGoogle1,375+/-532,907GoogleProprietary
142Alibabaqwen2.5-maxAlibaba1,374+/-432,623AlibabaProprietary
143Z.aiglm-4.5-airZ.ai1,373+/-431,095Z.aiMIT
144Anthropicclaude-3-5-sonnet-20241022Anthropic1,372+/-388,350AnthropicProprietary
145ARtrinity-large-thinkingArcee AI1,371+/-523,918Arcee AIApache 2.0
146AnthropicClaude Sonnet 3.7Anthropic1,371+/-443,194AnthropicProprietary
147Alibabaqwen3-next-80b-a3b-thinkingAlibaba1,370+/-613,700AlibabaApache 2.0
148Z.aiglm-4.7-flashZ.ai1,368+/-611,736Z.aiMIT
149Amazonamazon-nova-experimental-chat-11-10Amazon1,367+/-425,407AmazonProprietary
150Googlegemma-3-27b-itGoogle1,366+/-447,545GoogleGemma
151MiniMaxminimax-m1MiniMax1,364+/-435,214MiniMaxApache 2.0
152OpenAIo3-mini-highOpenAI1,363+/-518,589OpenAIProprietary
153xAIgrok-3-mini-highxAI1,362+/-516,968xAIProprietary
154Nvidianvidia-nemotron-3-super-120b-a12bNvidia1,361+/-77,458NvidiaNVIDIA Open Model
155Googlegemini-2.0-flash-001Google1,360+/-443,762GoogleProprietary
156DeepSeekdeepseek-v3DeepSeek1,358+/-521,770DeepSeekDeepSeek
157Mistralmistral-small-2506Mistral1,357+/-517,712MistralApache 2.0
158xAIgrok-3-mini-betaxAI1,357+/-522,724xAIProprietary
159PRintellect-3Prime Intellect1,356+/-85,329Prime IntellectMIT
160Coherecommand-a-03-2025Cohere1,354+/-356,283CohereCC-BY-NC-4.0
161Z.aiglm-4.5vZ.ai1,353+/-84,958Z.aiMIT
162Googlegemini-2.0-flash-lite-preview-02-05Google1,353+/-424,955GoogleProprietary
163OpenAIgpt-oss-120bOpenAI1,353+/-430,639OpenAIApache 2.0
164Googlegemini-1.5-pro-002Google1,351+/-355,606GoogleProprietary
165Amazonamazon-nova-experimental-chat-10-20Amazon1,350+/-611,474AmazonProprietary
166Tencenthunyuan-turbos-20250226Tencent1,349+/-122,220TencentProprietary
167StepFunstep-3StepFun1,348+/-76,545StepFunApache 2.0
168Amazonamazon-nova-experimental-chat-10-09Amazon1,348+/-112,839AmazonProprietary
169OpenAIo3-miniOpenAI1,348+/-457,344OpenAIProprietary
170Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia1,347+/-122,549NvidiaNvidia Open Model
171Alibabaqwen3-32bAlibaba1,347+/-93,926AlibabaApache 2.0
172INmercury-2Inception AI1,347+/-113,123Inception AIProprietary
173INling-flash-2.0InclusionAI1,346+/-77,010InclusionAIMIT
174MiniMaxminimax-m2MiniMax1,346+/-86,875MiniMaxApache 2.0
175Alibabaqwen-plus-0125Alibaba1,346+/-85,819AlibabaProprietary
176OpenAIgpt-4o-2024-05-13OpenAI1,346+/-3112,881OpenAIProprietary
177Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia1,343+/-103,345NvidiaNvidia Open
178ZHglm-4-plus-0111Zhipu1,343+/-85,760ZhipuProprietary
179Anthropicclaude-3-5-sonnet-20240620Anthropic1,342+/-382,419AnthropicProprietary
180Googlegemma-3-12b-itGoogle1,342+/-103,829GoogleGemma
181Tencenthunyuan-turbo-0110Tencent1,341+/-122,290TencentProprietary
182Amazonnova-2-liteAmazon1,338+/-612,246AmazonProprietary
183OpenAIgpt-5-nano-highOpenAI1,337+/-78,270OpenAIProprietary
184OpenAIo1-miniOpenAI1,337+/-451,981OpenAIProprietary
185Alibabaqwq-32bAlibaba1,336+/-425,402AlibabaApache 2.0
186xAIgrok-2-2024-08-13xAI1,335+/-463,498xAIProprietary
187Googlegemini-advanced-0514Google1,335+/-550,148GoogleProprietary
188OpenAIgpt-4o-2024-08-06OpenAI1,335+/-445,499OpenAIProprietary
189Metallama-3.1-405b-instruct-bf16Meta1,335+/-441,375MetaLlama 3.1 Community
190StepFunstep-2-16k-exp-202412StepFun1,334+/-94,833StepFunProprietary
191Metallama-3.1-405b-instruct-fp8Meta1,333+/-459,656MetaLlama 3.1 Community
192AIolmo-3.1-32b-instructAi21,330+/-612,225Ai2Apache 2.0
19301yi-lightning01 AI1,328+/-527,33201 AIProprietary
194AImolmo-2-8bAi21,328+/-21805Ai2Apache 2.0
195Nvidiallama-3.3-nemotron-49b-super-v1Nvidia1,328+/-122,218NvidiaNvidia
196Alibabaqwen3-30b-a3bAlibaba1,327+/-526,495AlibabaApache 2.0
197Metallama-4-maverick-17b-128e-instructMeta1,327+/-439,987MetaLlama 4
198Tencenthunyuan-large-2025-02-10Tencent1,326+/-103,738TencentProprietary
199OpenAIgpt-4-turbo-2024-04-09OpenAI1,324+/-498,114OpenAIProprietary
200DeepSeekdeepseek-v2.5-1210DeepSeek1,323+/-86,795DeepSeekDeepSeek
201Anthropicclaude-3-5-haiku-20241022Anthropic1,323+/-369,993AnthropicProprietary
202Googlegemini-1.5-pro-001Google1,323+/-479,138GoogleProprietary
203Metallama-4-scout-17b-16e-instructMeta1,323+/-530,299MetaLlama
204OpenAIgpt-4.1-nano-2025-04-14OpenAI1,322+/-86,103OpenAIProprietary
205AnthropicClaude3-OpusAnthropic1,321+/-3194,909AnthropicProprietary
206INring-flash-2.0InclusionAI1,321+/-77,148InclusionAIMIT
207StepFunstep-1o-turbo-202506StepFun1,320+/-79,038StepFunProprietary
208ZHglm-4-plusZhipu AI1,319+/-526,126Zhipu AIProprietary
209Metallama-3.3-70b-instructMeta1,318+/-354,745MetaLlama-3.3
210Googlegemma-3n-e4b-itGoogle1,318+/-522,600GoogleGemma
211Alibabaqwen-max-0919Alibaba1,318+/-616,478AlibabaQwen
212OpenAIgpt-oss-20bOpenAI1,318+/-610,633OpenAIApache 2.0
213OpenAIgpt-4o-mini-2024-07-18OpenAI1,318+/-468,709OpenAIProprietary
214Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia1,317+/-615,513NvidiaNVIDIA Open Model
215Alibabaqwen2.5-plus-1127Alibaba1,315+/-610,187AlibabaProprietary
216NEathene-v2-chatNexusFlow1,314+/-524,739NexusFlowNexusFlow
217Mistralmistral-large-2407Mistral1,314+/-445,459MistralMistral Research
218OpenAIgpt-4-0125-previewOpenAI1,313+/-493,439OpenAIProprietary
219IBgranite-4.1-8bIBM1,312+/-103,614IBMApache 2.0
220OpenAIgpt-4-1106-previewOpenAI1,312+/-4100,105OpenAIProprietary
221Tencenthunyuan-standard-2025-02-10Tencent1,311+/-103,904TencentProprietary
222Googlegemini-1.5-flash-002Google1,309+/-434,902GoogleProprietary
223xAIgrok-2-mini-2024-08-13xAI1,308+/-452,567xAIProprietary
224DeepSeekdeepseek-v2.5DeepSeek1,307+/-524,572DeepSeekDeepSeek
225INmercuryInception AI1,306+/-141,957Inception AIProprietary
226NEathene-70b-0725NexusFlow1,306+/-619,621NexusFlowCC-BY-NC-4.0
227AIolmo-3-32b-thinkAi21,305+/-85,947Ai2Apache 2.0
228Mistralmistral-large-2411Mistral1,305+/-428,073MistralMRL
229Mistralmagistral-medium-2506Mistral1,304+/-611,641MistralProprietary
230Mistralmistral-small-3.1-24b-instruct-2503Mistral1,303+/-533,220MistralApache 2.0
231Googlegemma-3-4b-itGoogle1,303+/-94,171GoogleGemma
232Alibabaqwen2.5-72b-instructAlibaba1,303+/-439,406AlibabaQwen
233Nvidiallama-3.1-nemotron-70b-instructNvidia1,299+/-87,140NvidiaLlama 3.1
234Tencenthunyuan-large-visionTencent1,294+/-95,374TencentProprietary
235Metallama-3.1-70b-instructMeta1,293+/-455,240MetaLlama 3.1 Community
236Amazonamazon-nova-pro-v1.0Amazon1,290+/-524,745AmazonProprietary
237AIjamba-1.5-largeAI21 Labs1,289+/-78,662AI21 LabsJamba Open
238Googlegemma-2-27b-itGoogle1,288+/-375,754GoogleGemma license
239REreka-core-20240904Reka AI1,288+/-77,312Reka AIProprietary
240IBibm-granite-h-smallIBM1,287+/-85,677IBMApache 2.0
241OpenAIgpt-4-0314OpenAI1,286+/-554,173OpenAIProprietary
242AIllama-3.1-tulu-3-70bAi21,286+/-102,846Ai2Llama 3.1
243Googlegemini-1.5-flash-001Google1,286+/-562,833GoogleProprietary
244Nvidiallama-3.1-nemotron-51b-instructNvidia1,286+/-103,749NvidiaLlama 3.1
245AIolmo-3.1-32b-thinkAi21,285+/-78,505Ai2Apache 2.0
246Anthropicclaude-3-sonnet-20240229Anthropic1,280+/-4109,284AnthropicProprietary
247PRgemma-2-9b-it-simpoPrinceton1,279+/-710,072PrincetonMIT
248Nvidianemotron-4-340b-instructNvidia1,276+/-519,659NvidiaNVIDIA Open Model
249Coherecommand-r-plus-08-2024Cohere1,276+/-79,866CohereCC-BY-NC-4.0
250Metallama-3-70b-instructMeta1,276+/-4156,876MetaLlama 3 Community
251OpenAIgpt-4-0613OpenAI1,274+/-488,723OpenAIProprietary
252Mistralmistral-small-24b-instruct-2501Mistral1,274+/-614,681MistralApache 2.0
253Z.aiglm-4-0520Z.ai1,273+/-79,788Z.aiProprietary
254REreka-flash-20240904Reka AI1,272+/-77,536Reka AIProprietary
255Alibabaqwen2.5-coder-32b-instructAlibaba1,270+/-85,432AlibabaApache 2.0
256Coherec4ai-aya-expanse-32bCohere1,267+/-527,124CohereCC-BY-NC-4.0
257Googlegemma-2-9b-itGoogle1,266+/-454,611GoogleGemma license
258DeepSeekdeepseek-coder-v2DeepSeek1,264+/-615,147DeepSeekDeepSeek License
259Coherecommand-r-plusCohere1,261+/-477,554CohereCC-BY-NC-4.0
260Alibabaqwen2-72b-instructAlibaba1,261+/-537,325AlibabaQianwen LICENSE
261Anthropicclaude-3-haiku-20240307Anthropic1,260+/-4117,701AnthropicProprietary
262Amazonamazon-nova-lite-v1.0Amazon1,260+/-519,372AmazonProprietary
263Googlegemini-1.5-flash-8b-001Google1,258+/-435,558GoogleProprietary
264Microsoft AzurePhi 4 - 14BMicrosoft Azure1,256+/-524,126Microsoft AzureMIT
265AIolmo-2-0325-32b-instructAi21,251+/-113,334Ai2Apache-2.0
266Coherecommand-r-08-2024Cohere1,249+/-710,140CohereCC-BY-NC-4.0
267Mistralmistral-large-2402Mistral1,241+/-562,436MistralProprietary
268Amazonamazon-nova-micro-v1.0Amazon1,241+/-519,364AmazonProprietary
269AIjamba-1.5-miniAI21 Labs1,239+/-78,858AI21 LabsJamba Open
270Mistralministral-8b-2410Mistral1,237+/-94,781MistralMRL
271Googlegemini-pro-dev-apiGoogle1,235+/-718,354GoogleProprietary
272Alibabaqwen1.5-110b-chatAlibaba1,233+/-626,195AlibabaQianwen LICENSE
273Tencenthunyuan-standard-256kTencent1,233+/-122,728TencentProprietary
274REreka-flash-21b-20240226-onlineReka AI1,233+/-715,450Reka AIProprietary
275Alibabaqwen1.5-72b-chatAlibaba1,232+/-539,302AlibabaQianwen LICENSE
276Mistralmixtral-8x22b-instruct-v0.1Mistral1,229+/-551,416MistralApache 2.0
277Coherecommand-rCohere1,226+/-554,036CohereCC-BY-NC-4.0
278REreka-flash-21b-20240226Reka AI1,226+/-624,806Reka AIProprietary
279OpenAIgpt-3.5-turbo-0125OpenAI1,224+/-566,207OpenAIProprietary
280Metallama-3-8b-instructMeta1,223+/-4104,642MetaLlama 3 Community
281Coherec4ai-aya-expanse-8bCohere1,223+/-79,818CohereCC-BY-NC-4.0
282Mistralmistral-mediumMistral1,222+/-634,550MistralProprietary
283Googlegemini-proGoogle1,222+/-126,390GoogleProprietary
284AIllama-3.1-tulu-3-8bAi21,221+/-112,896Ai2Llama 3.1
28501yi-1.5-34b-chat01 AI1,213+/-524,14601 AIApache-2.0
286HUzephyr-orpo-141b-A35b-v0.1HuggingFace1,212+/-114,652HuggingFaceApache 2.0
287Metallama-3.1-8b-instructMeta1,211+/-449,605MetaLlama 3.1 Community
288IBgranite-3.1-8b-instructIBM1,208+/-113,090IBMApache 2.0
289Alibabaqwen1.5-32b-chatAlibaba1,203+/-621,741AlibabaQianwen LICENSE
290OpenAIgpt-3.5-turbo-1106OpenAI1,202+/-916,619OpenAIProprietary
291Googlegemma-2-2b-itGoogle1,199+/-446,616GoogleGemma license
292Microsoftphi-3-medium-4k-instructMicrosoft1,197+/-525,055MicrosoftMIT
293Mistralmixtral-8x7b-instruct-v0.1Mistral1,196+/-473,503MistralApache 2.0
294DAdbrx-instruct-previewDatabricks1,194+/-632,191DatabricksDBRX LICENSE
295INinternlm2_5-20b-chatInternLM1,191+/-79,901InternLMOther
296Alibabaqwen1.5-14b-chatAlibaba1,190+/-717,839AlibabaQianwen LICENSE
297Microsoftwizardlm-70bMicrosoft1,184+/-98,214MicrosoftLlama 2 Community
298DeepSeekdeepseek-llm-67b-chatDeepSeek1,184+/-124,932DeepSeekDeepSeek License
29901yi-34b-chat01 AI1,183+/-715,48301 AIYi License
300IBgranite-3.0-8b-instructIBM1,181+/-96,638IBMApache 2.0
301OPopenchat-3.5OpenChat1,181+/-107,968OpenChatApache-2.0
302OPopenchat-3.5-0106OpenChat1,181+/-812,637OpenChatApache-2.0
303Googlegemma-1.1-7b-itGoogle1,181+/-623,893GoogleGemma license
304SNsnowflake-arctic-instructSnowflake1,179+/-632,832SnowflakeApache 2.0
305IBgranite-3.1-2b-instructIBM1,178+/-113,188IBMApache 2.0
306ALtulu-2-dpo-70bAllenAI/UW1,177+/-106,535AllenAI/UWAI2 ImpACT Low-risk
307NOopenhermes-2.5-mistral-7bNousResearch1,174+/-105,006NousResearchApache-2.0
308LMvicuna-33bLMSYS1,172+/-622,479LMSYSNon-commercial
309NEstarling-lm-7b-betaNexusflow1,171+/-716,056NexusflowApache-2.0
310Microsoftphi-3-small-8k-instructMicrosoft1,170+/-617,766MicrosoftMIT
311Metallama-2-70b-chatMeta1,170+/-638,492MetaLlama 2 Community
312UCstarling-lm-7b-alphaUC Berkeley1,167+/-810,224UC BerkeleyCC-BY-NC-4.0
313Metallama-3.2-3b-instructMeta1,166+/-87,936MetaLlama 3.2
314NOnous-hermes-2-mixtral-8x7b-dpoNousResearch1,164+/-123,777NousResearchApache-2.0
315Alibabaqwq-32b-previewAlibaba1,155+/-113,231AlibabaApache 2.0
316IBgranite-3.0-2b-instructIBM1,155+/-86,837IBMApache 2.0
317Nvidiallama2-70b-steerlm-chatNvidia1,154+/-133,585NvidiaLlama 2 Community
318UPsolar-10.7b-instruct-v1.0Upstage AI1,151+/-134,155Upstage AICC-BY-NC-4.0
319COdolphin-2.2.1-mistral-7bCognitive Computations1,151+/-151,679Cognitive ComputationsApache-2.0
320MOmpt-30b-chatMosaicML1,149+/-122,572MosaicMLCC-BY-NC-SA-4.0
321Mistralmistral-7b-instruct-v0.2Mistral1,149+/-719,402MistralApache-2.0
322Microsoftwizardlm-13bMicrosoft1,148+/-97,044MicrosoftLlama 2 Community
323TIfalcon-180b-chatTII1,146+/-171,295TIIFalcon-180B TII License
324Alibabaqwen1.5-7b-chatAlibaba1,143+/-104,737AlibabaQianwen LICENSE
325Microsoftphi-3-mini-4k-instruct-june-2024Microsoft1,142+/-612,297MicrosoftMIT
326Metallama-2-13b-chatMeta1,141+/-719,174MetaLlama 2 Community
327LMvicuna-13bLMSYS1,140+/-719,367LMSYSLlama 2 Community
328Alibabaqwen-14b-chatAlibaba1,138+/-114,964AlibabaQianwen LICENSE
329Googlepalm-2Google1,137+/-98,554GoogleProprietary
330Googlegemma-7b-itGoogle1,136+/-98,925GoogleGemma license
331Metacodellama-34b-instructMeta1,136+/-97,366MetaLlama 2 Community
332HUzephyr-7b-betaHuggingFace1,130+/-911,118HuggingFaceMIT
333Microsoftphi-3-mini-128k-instructMicrosoft1,128+/-720,685MicrosoftMIT
334Microsoftphi-3-mini-4k-instructMicrosoft1,127+/-620,118MicrosoftMIT
335UWguanaco-33bUW1,126+/-122,921UWNon-commercial
336HUzephyr-7b-alphaHuggingFace1,126+/-161,785HuggingFaceMIT
337TOstripedhyena-nous-7bTogether AI1,120+/-115,182Together AIApache 2.0
338Metacodellama-70b-instructMeta1,118+/-181,143MetaLlama 2 Community
339Googlegemma-1.1-2b-itGoogle1,115+/-810,854GoogleGemma license
340LMvicuna-7bLMSYS1,114+/-96,923LMSYSLlama 2 Community
341HUsmollm2-1.7b-instructHuggingFace1,114+/-142,199HuggingFaceApache 2.0
342Metallama-3.2-1b-instructMeta1,110+/-88,045MetaLlama 3.2
343Mistralmistral-7b-instructMistral1,109+/-98,977MistralApache 2.0
344Metallama-2-7b-chatMeta1,107+/-714,148MetaLlama 2 Community
345Googlegemma-2b-itGoogle1,092+/-114,780GoogleGemma license
346Alibabaqwen1.5-4b-chatAlibaba1,089+/-97,597AlibabaQianwen LICENSE
347AIolmo-7b-instructAi21,073+/-116,328Ai2Apache-2.0
348UCkoala-13bUC Berkeley1,070+/-106,965UC BerkeleyNon-commercial
349STalpaca-13bStanford1,067+/-115,745StanfordNon-commercial
350NOgpt4all-13b-snoozyNomic AI1,065+/-151,743Nomic AINon-commercial
351MOmpt-7b-chatMosaicML1,061+/-123,924MosaicMLCC-BY-NC-SA-4.0
352TSchatglm3-6bTsinghua1,055+/-124,658TsinghuaApache-2.0
353RWRWKV-4-Raven-14BRWKV1,041+/-114,845RWKVApache 2.0
354TSchatglm2-6bTsinghua1,023+/-142,658TsinghuaApache-2.0
355OPoasst-pythia-12bOpenAssistant1,021+/-116,310OpenAssistantApache 2.0
356TSchatglm-6bTsinghua995+/-134,914TsinghuaNon-commercial
357LMfastchat-t5-3bLMSYS991+/-124,203LMSYSApache 2.0
358DAdolly-v2-12bDatabricks980+/-143,412DatabricksMIT
359Metallama-13bMeta972+/-162,391MetaNon-commercial
360STstablelm-tuned-alpha-7bStability AI952+/-133,287Stability AICC-BY-NC-SA-4.0

数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。

常见问题 (FAQ)

01

什么是 Text Generation Arena (LMArena)?

Text Generation Arena(原 LMSYS Chatbot Arena)是目前最具影响力的大模型匿名评测平台。用户向两个身份未知的模型提问,根据回答质量投票,系统通过 Elo 算法将数百万次投票汇聚为动态排行榜,被学术界和工业界广泛引用。

02

Arena Elo 分数是如何计算的?

Elo 算法源自国际象棋评分体系。每次对战后,胜者得分上升、败者下降,幅度取决于双方原始评分差距。95% 置信区间(CI)反映该模型参与对战次数的多少:CI 越窄说明数据越充分、排名越可信。

03

为什么同一模型会出现"Thinking"和普通两个版本?

部分模型支持"扩展思考"(Extended Thinking)模式,会在给出最终答案前进行更深入的内部推理。该模式通常在逻辑推理、数学和编程任务上得分更高,但响应时延也更长、成本更高。Arena 将两种模式分开评测,以便用户根据实际需求选择。

04

如何根据排行榜选择适合自己的大语言模型?

建议综合考虑:综合性能(看 Elo 总分)、成本(闭源 API 按量计费,开源可自部署)、中文支持、开源程度以及响应速度。