DataLearner 标志DataLearnerAI
最新AI资讯
大模型排行榜
大模型评测基准
大模型列表
大模型对比
资源中心
工具
语言中文
DataLearner 标志DataLearner AI

专注大模型评测、数据资源与实践教学的知识平台,持续更新可落地的 AI 能力图谱。

产品

  • 评测榜单
  • 模型对比
  • 数据资源

资源

  • 部署教程
  • 原创内容
  • 工具导航

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner 持续整合行业数据与案例,为科研、企业与开发者提供可靠的大模型情报与实践指南。

隐私政策服务条款
首页综合排行榜LMArena Coding Arena 代码能力排行榜

LMArena 评测赛道

文本生成代码数学图像编辑文字生成视频图生视频文生图

LMArena Coding Arena 代码能力排行榜

基于 LMArena Coding Arena 用户匿名投票的最新AI大模型代码编程能力排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。

榜首模型

Opus 4.7 (thinking)

最高得分

1555.00

模型数量

355

数据版本

2026年05月28日

数据来源: LM Arena

关于本排行榜

本排行榜展示了当前 AI 大模型在代码编程任务中的实力排名。数据来源于 LMArena (前身为 LMSYS Chatbot Arena)的 Coding 子赛道,通过真实用户匿名盲测投票评估各模型在代码编程任务中的表现。

评测方法概要

匿名盲测:用户发出编程问题后,由两个"隐藏身份"的模型分别给出代码解答,用户投票选出更好的回答,排除品牌偏见。

Elo 评分:采用 Bradley-Terry 模型计算 Elo 分数,分数越高说明该模型的代码回答越容易被用户选择。

覆盖多种编程场景:包括代码生成、Bug 修复、算法实现、代码解释等高频真实编程场景。

DataLearner 在原始数据基础上提供中文解读与深度分析,并将排行榜模型关联至 DataLearner 模型库,方便您一键查看模型详情、API 定价、评测得分等完整信息。

来源:全部国产模型
榜单历史快照月份:

排名总表

排名模型名称得分95% CI投票数机构许可证
AnthropicOpus 4.7 (thinking)Anthropic1555.00+/-95,578AnthropicProprietary
AnthropicClaude Opus 4.6 (thinking)Anthropic1551.00+/-78,335AnthropicProprietary
AnthropicClaude Opus 4.6Anthropic1546.00+/-79,596AnthropicProprietary
4AnthropicOpus 4.7Anthropic1546.00+/-95,903AnthropicProprietary
5AnthropicClaude Opus 4 (thinking-32k)Anthropic1530.00+/-77,628AnthropicProprietary
6智谱GLM 5.1智谱AI1527.00+/-103,756智谱AIMIT
7FAMuse SparkFacebook AI研究实验室1526.00+/-113,260Facebook AI研究实验室Proprietary
8Alibabaqwen3.7-max-previewAlibaba1525.00+/-181,137AlibabaProprietary
9Google Deep MindGemini 3.1 Pro PreviewGoogle Deep Mind1525.00+/-711,296Google Deep MindProprietary
10OpenAIgpt-5.5-highOpenAI1522.00+/-104,494OpenAIProprietary
11AnthropicClaude Sonnet 4.6Anthropic1522.00+/-87,216AnthropicProprietary
12OpenAIgpt-5.4-highOpenAI1521.00+/-87,181OpenAIProprietary
13AnthropicClaude Opus 4Anthropic1521.00+/-615,711AnthropicProprietary
14AnthropicClaude Sonnet 4.5 (thinking-32k)Anthropic1520.00+/-517,763AnthropicProprietary
15XImimo-v2.5-proXiaomi1520.00+/-104,275XiaomiMIT
16Google Deep MindGemini 3.0 Pro (Preview 11-2025)Google Deep Mind1519.00+/-78,575Google Deep MindProprietary
17OpenAIgpt-5.2-chat-latest-20260210OpenAI1518.00+/-78,304OpenAIProprietary
18Baiduernie-5.1Baidu1515.00+/-103,943BaiduProprietary
19AnthropicClaude Sonnet 4.5Anthropic1515.00+/-517,609AnthropicProprietary
20OpenAIgpt-5.5-instantOpenAI1515.00+/-87,212OpenAIProprietary
21xAIgrok-4.20-multi-agent-beta-0309xAI1515.00+/-77,631xAIProprietary
22Alibabaqwen3.5-max-previewAlibaba1514.00+/-85,491AlibabaProprietary
23Moonshotkimi-k2.6Moonshot1514.00+/-104,237MoonshotModified MIT
24OpenAIGPT-5.4OpenAI1513.00+/-87,884OpenAIProprietary
25AnthropicOpus 4.1 (thinking-16k)Anthropic1513.00+/-79,848AnthropicProprietary
26xAIgrok-4.20-beta-0309-reasoningxAI1512.00+/-87,705xAIProprietary
27Bytedancedola-seed-2.0-proBytedance1511.00+/-710,045BytedanceProprietary
28xAIgrok-4.20-beta1xAI1509.00+/-86,203xAIProprietary
29Google Deep MindGemini 3.0 FlashGoogle Deep Mind1509.00+/-86,383Google Deep MindProprietary
30OpenAIGPT-5.5OpenAI1508.00+/-94,672OpenAIProprietary
31Googlegemini-3.5-flashGoogle1506.00+/-122,592GoogleProprietary
32Alibabaqwen3.6-max-previewAlibaba1506.00+/-161,327AlibabaProprietary
33Moonshotkimi-k2.5-instantMoonshot1505.00+/-141,803MoonshotModified MIT
34AnthropicOpus 4.1Anthropic1505.00+/-515,538AnthropicProprietary
35Moonshot AIKimi K2 ThinkingMoonshot AI1503.00+/-79,469Moonshot AIModified MIT
36XImimo-v2-proXiaomi1503.00+/-86,196XiaomiProprietary
37Meituanlongcat-flash-chat-2602-expMeituan1503.00+/-86,475MeituanProprietary
38DeepSeekdeepseek-v4-proDeepSeek1500.00+/-94,940DeepSeekMIT
39xAIGrok 4.1 ThinkingxAI1499.00+/-614,270xAIProprietary
40OpenAIgpt-5.4-mini-highOpenAI1499.00+/-86,903OpenAIProprietary
41Googlegemma-4-31bGoogle1498.00+/-151,355GoogleApache 2.0
42AnthropicClaude Opus 4 (thinking-16k)Anthropic1498.00+/-86,674AnthropicProprietary
43OpenAIgpt-5.3-chat-latestOpenAI1497.00+/-87,910OpenAIProprietary
44Google Deep MindGemini 3.0 Flash (minimal)Google Deep Mind1495.00+/-612,797Google Deep MindProprietary
45智谱GLM-5智谱AI1495.00+/-85,384智谱AIMIT
46DeepSeekdeepseek-v4-pro-thinkingDeepSeek1494.00+/-94,535DeepSeekMIT
47阿里Qwen3.5-397B-A17B阿里巴巴1493.00+/-78,580阿里巴巴Apache 2.0
48xAIgrok-4.3xAI1493.00+/-94,422xAIProprietary
49百度ERNIE 5.0百度1492.00+/-78,166百度Proprietary
50Alibabaqwen3.6-plusAlibaba1492.00+/-95,403AlibabaProprietary
51OpenAIGPT-5.2 Pro (high)OpenAI1491.00+/-611,036OpenAIProprietary
52xAIGrok 4.1xAI1490.00+/-614,818xAIProprietary
53OpenAIGPT-5.1 Pro (high)OpenAI1490.00+/-78,210OpenAIProprietary
54XImimo-v2.5Xiaomi1490.00+/-94,584XiaomiMIT
55Amazonamazon-nova-experimental-chat-26-02-10Amazon1488.00+/-20841AmazonProprietary
56Moonshotkimi-k2-thinking-turboMoonshot1487.00+/-614,116MoonshotModified MIT
57智谱GLM-4.7智谱AI1486.00+/-122,411智谱AIMIT
58OpenAIGPT-5.2OpenAI1483.00+/-611,360OpenAIProprietary
59阿里Qwen3 Max (Preview)阿里巴巴1482.00+/-85,366阿里巴巴Proprietary
60Googlegemma-4-26b-a4bGoogle1480.00+/-151,365GoogleApache 2.0
61Anthropicclaude-haiku-4-5-20251001Anthropic1479.00+/-518,302AnthropicProprietary
62Amazonamazon-nova-experimental-chat-26-01-10Amazon1479.00+/-21736AmazonProprietary
63DeepSeekdeepseek-v4-flashDeepSeek1479.00+/-94,780DeepSeekMIT
64DeepSeekdeepseek-v4-flash-thinkingDeepSeek1478.00+/-94,709DeepSeekMIT
65MiniMaxAIMiniMax-M2.7MiniMaxAI1475.00+/-86,572MiniMaxAIModified MIT
66Alibabaqwen3-max-2025-09-23Alibaba1475.00+/-132,042AlibabaProprietary
67DeepSeek-AIDeepSeek V3.2 (thinking)DeepSeek-AI1475.00+/-78,193DeepSeek-AIMIT
68Meituanlongcat-flash-chatMeituan1474.00+/-132,233MeituanMIT
69DeepSeek-AIDeepSeek V3.2-Exp (thinking)DeepSeek-AI1474.00+/-131,919DeepSeek-AIMIT
70OpenAIGPT-5.1 InstantOpenAI1474.00+/-79,130OpenAIProprietary
71AnthropicClaude Sonnet 4 (thinking-32k)Anthropic1473.00+/-86,414AnthropicProprietary
72阿里Qwen3-235B-A22B-2507阿里巴巴1472.00+/-520,628阿里巴巴Apache 2.0
73百度ERNIE 5.0百度1472.00+/-131,960百度Proprietary
74OpenAIchatgpt-4o-latest-20250326OpenAI1469.00+/-515,865OpenAIProprietary
75DeepSeek-AIDeepSeek V3.2DeepSeek-AI1469.00+/-710,179DeepSeek-AIMIT
76MistralAIMistral Large 3MistralAI1468.00+/-79,554MistralAIApache 2.0
77Moonshotkimi-k2-0905-previewMoonshot1467.00+/-132,243MoonshotModified MIT
78OpenAIGPT-5-Pro (high)OpenAI1467.00+/-86,360OpenAIProprietary
79DeepSeek-AIDeepSeek V3.2-ExpDeepSeek-AI1466.00+/-122,501DeepSeek-AIMIT
80阿里Qwen3-VL-235B-A22B-Instruct阿里巴巴1466.00+/-132,315阿里巴巴Apache 2.0
81Google Deep MindGemini 2.5 Pro Experimental 03-25Google Deep Mind1465.00+/-425,765Google Deep MindProprietary
82DeepSeek-AIDeepSeek-R1-0528DeepSeek-AI1465.00+/-112,728DeepSeek-AIMIT
83XImimo-v2-omniXiaomi1464.00+/-21848XiaomiProprietary
84AnthropicClaude Opus 4Anthropic1464.00+/-77,903AnthropicProprietary
85OpenAIGPT-5OpenAI1464.00+/-85,991OpenAIProprietary
86DeepSeekdeepseek-v3.1-terminus-thinkingDeepSeek1463.00+/-24636DeepSeekMIT
87xAIgrok-4-1-fast-reasoningxAI1462.00+/-612,670xAIProprietary
88Tencenthunyuan-hy3-previewTencent1462.00+/-151,648Tencenttencent-hunyuan-community
89OpenAIgpt-5.4-nano-highOpenAI1460.00+/-86,894OpenAIProprietary
90Moonshot AIKimi K2Moonshot AI1460.00+/-85,244Moonshot AIModified MIT
91智谱GLM-4.6智谱AI1460.00+/-77,481智谱AIMIT
92OpenAIGPT-4.5OpenAI1459.00+/-131,939OpenAIProprietary
93Googlegemini-3.1-flash-lite-previewGoogle1459.00+/-79,137GoogleProprietary
94xAIGrok 4 FastxAI1459.00+/-161,249xAIProprietary
95OpenAIOpenAI o3OpenAI1459.00+/-611,756OpenAIProprietary
96Alibabaqwen3-coder-480b-a35b-instructAlibaba1457.00+/-94,849AlibabaApache 2.0
97DeepSeek-AIDeepSeek-V3.1 (thinking)DeepSeek-AI1457.00+/-131,904DeepSeek-AIMIT
98OpenAIgpt-4.1-2025-04-14OpenAI1456.00+/-79,316OpenAIProprietary
99MistralAIMagistral-Medium-2506MistralAI1456.00+/-520,392MistralAIProprietary
100Alibabaqwen3-vl-235b-a22b-thinkingAlibaba1455.00+/-141,625AlibabaApache 2.0
101Alibabaqwen3.5-122b-a10bAlibaba1455.00+/-87,029AlibabaApache 2.0
102智谱GLM-4.5智谱AI1454.00+/-94,772智谱AIMIT
103AnthropicClaude Sonnet 3.7 (thinking-32k)Anthropic1451.00+/-86,191AnthropicProprietary
104AnthropicClaude Sonnet 4Anthropic1449.00+/-77,396AnthropicProprietary
105Alibabaqwen3.5-27bAlibaba1448.00+/-86,863AlibabaApache 2.0
106DeepSeek-AIDeepSeek-V3.1DeepSeek-AI1448.00+/-122,624DeepSeek-AIMIT
107StepFunAIStep 3.5 FlashStepFunAI1447.00+/-78,364StepFunAIApache 2.0
108Alibabaqwen3-next-80b-a3b-instructAlibaba1446.00+/-94,794AlibabaApache 2.0
109Alibabaqwen3-235b-a22b-no-thinkingAlibaba1446.00+/-86,975AlibabaApache 2.0
110XImimo-v2-flash (non-thinking)Xiaomi1445.00+/-611,214XiaomiMIT
111DeepSeek-AIDeepSeek-R1DeepSeek-AI1444.00+/-122,317DeepSeek-AIMIT
112MiniMaxAIMiniMax M2.5MiniMaxAI1444.00+/-79,266MiniMaxAIModified MIT
113xAIGrok 3xAI1443.00+/-85,400xAIProprietary
114Alibabaqwen3-235b-a22b-thinking-2507Alibaba1442.00+/-151,611AlibabaApache 2.0
115ARtrinity-large-previewArcee AI1441.00+/-86,942Arcee AIApache 2.0
116Alibabaqwen3-30b-a3b-instruct-2507Alibaba1440.00+/-94,660AlibabaApache 2.0
117MiniMaxminimax-m2.1-previewMiniMax1439.00+/-103,426MiniMaxMIT
118DeepSeek-AIDeepSeek-V3.1 TerminusDeepSeek-AI1439.00+/-21778DeepSeek-AIMIT
119Tencenthunyuan-vision-1.5-thinkingTencent1438.00+/-27437TencentProprietary
120xAIgrok-4-fast-reasoningxAI1437.00+/-93,956xAIProprietary
121Alibabaqwen3.5-35b-a3bAlibaba1437.00+/-87,198AlibabaApache 2.0
122xAIgrok-4-0709xAI1436.00+/-78,155xAIProprietary
123Amazonamazon-nova-experimental-chat-12-10Amazon1435.00+/-21704AmazonProprietary
124OpenAIo3-mini-highOpenAI1435.00+/-122,596OpenAIProprietary
125Anthropicclaude-3-5-sonnet-20241022Anthropic1434.00+/-614,964AnthropicProprietary
126Alibabaqwen3-235b-a22bAlibaba1433.00+/-94,339AlibabaApache 2.0
127百度ERNIE 5.0百度1433.00+/-19916百度Proprietary
128OpenAIgpt-4.1-mini-2025-04-14OpenAI1433.00+/-76,918OpenAIProprietary
129Mistralmistral-medium-2505Mistral1433.00+/-85,900MistralProprietary
130OpenAIo1-2024-12-17OpenAI1433.00+/-103,973OpenAIProprietary
131Alibabaqwen3.5-flashAlibaba1432.00+/-78,187AlibabaProprietary
132OpenAIo4-mini-2025-04-16OpenAI1432.00+/-78,721OpenAIProprietary
133XImimo-v2-flash (thinking)Xiaomi1432.00+/-122,444XiaomiMIT
134OpenAIgpt-5-mini-highOpenAI1430.00+/-85,502OpenAIProprietary
135AnthropicClaude Sonnet 3.7Anthropic1429.00+/-77,146AnthropicProprietary
136DeepSeek-AIDeepSeek-V3-0324DeepSeek-AI1429.00+/-78,372DeepSeek-AIMIT
137Googlegemini-2.5-flash-preview-09-2025Google1429.00+/-86,846GoogleProprietary
138Z.aiglm-4.5-airZ.ai1427.00+/-86,104Z.aiMIT
139Google Deep MindGemini 2.5 FlashGoogle Deep Mind1424.00+/-425,169Google Deep MindProprietary
140Z.aiglm-4.7-flashZ.ai1423.00+/-112,687Z.aiMIT
141Alibabaqwen3-next-80b-a3b-thinkingAlibaba1421.00+/-112,677AlibabaApache 2.0
142智谱GLM-4.6V智谱AI1420.00+/-25536智谱AIMIT
143Amazonamazon-nova-experimental-chat-11-10Amazon1420.00+/-85,322AmazonProprietary
144OpenAIo1-previewOpenAI1417.00+/-95,123OpenAIProprietary
145ARtrinity-large-thinkingArcee AI1416.00+/-86,447Arcee AIApache 2.0
146MiniMaxminimax-m1MiniMax1416.00+/-86,489MiniMaxApache 2.0
147OpenAIo3-miniOpenAI1416.00+/-69,460OpenAIProprietary
148Mistralmistral-small-2506Mistral1413.00+/-103,360MistralApache 2.0
149ANling-flash-2.0Ant Group1412.00+/-151,528Ant GroupMIT
150Amazonamazon-nova-experimental-chat-10-20Amazon1411.00+/-122,293AmazonProprietary
151PRintellect-3Prime Intellect1409.00+/-19973Prime IntellectMIT
152Nvidianvidia-nemotron-3-super-120b-a12bNvidia1409.00+/-141,747NvidiaNVIDIA Open Model
153StepFunstep-3StepFun1408.00+/-171,233StepFunApache 2.0
154Alibabaqwen3-32bAlibaba1408.00+/-24513AlibabaApache 2.0
155Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia1405.00+/-22659NvidiaNvidia Open
156Z.aiglm-4.5vZ.ai1405.00+/-18991Z.aiMIT
157Alibabaqwen2.5-maxAlibaba1403.00+/-85,101AlibabaProprietary
158Tencenthunyuan-t1-20250711Tencent1400.00+/-20805TencentProprietary
159Tencenthunyuan-turbos-20250226Tencent1400.00+/-31275TencentProprietary
160Anthropicclaude-3-5-sonnet-20240620Anthropic1398.00+/-713,607AnthropicProprietary
161Googlegemini-2.5-flash-lite-preview-09-2025-no-thinkingGoogle1397.00+/-79,678GoogleProprietary
162Amazonnova-2-liteAmazon1397.00+/-122,519AmazonProprietary
163INmercury-2Inception AI1396.00+/-21768Inception AIProprietary
164Tencenthunyuan-turbos-20250416Tencent1394.00+/-141,776TencentProprietary
165Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia1391.00+/-30367NvidiaNvidia Open Model
166OpenAIGPT OSS 120BOpenAI1391.00+/-86,494OpenAIApache 2.0
167ANring-flash-2.0Ant Group1391.00+/-151,539Ant GroupMIT
168xAIgrok-3-mini-highxAI1390.00+/-103,296xAIProprietary
169Coherecommand-a-03-2025Cohere1390.00+/-610,219CohereCC-BY-NC-4.0
170Amazonamazon-nova-experimental-chat-10-09Amazon1389.00+/-24552AmazonProprietary
171OpenAIo1-miniOpenAI1388.00+/-78,478OpenAIProprietary
172DeepSeekdeepseek-v3DeepSeek1388.00+/-103,280DeepSeekDeepSeek
173Alibabaqwen3-30b-a3bAlibaba1387.00+/-94,531AlibabaApache 2.0
174xAIgrok-3-mini-betaxAI1387.00+/-94,255xAIProprietary
175Mistralmagistral-medium-2506Mistral1386.00+/-122,250MistralProprietary
176Alibabaqwq-32bAlibaba1385.00+/-94,046AlibabaApache 2.0
177Anthropicclaude-3-5-haiku-20241022Anthropic1384.00+/-611,248AnthropicProprietary
178MiniMaxminimax-m2MiniMax1384.00+/-151,547MiniMaxApache 2.0
179Googlegemini-2.5-flash-lite-preview-06-17-thinkingGoogle1384.00+/-86,001GoogleProprietary
180AIolmo-3.1-32b-instructAi21384.00+/-122,513Ai2Apache 2.0
181OpenAIgpt-5-nano-highOpenAI1382.00+/-151,684OpenAIProprietary
182Alibabaqwen-plus-0125Alibaba1380.00+/-18893AlibabaProprietary
183Metallama-3.1-405b-instruct-bf16Meta1375.00+/-76,249MetaLlama 3.1 Community
184DeepSeekdeepseek-v2.5-1210DeepSeek1375.00+/-171,079DeepSeekDeepSeek
185OpenAIgpt-4.1-nano-2025-04-14OpenAI1374.00+/-19807OpenAIProprietary
186Metallama-4-maverick-17b-128e-instructMeta1373.00+/-76,997MetaLlama 4
187Tencenthunyuan-turbo-0110Tencent1372.00+/-30299TencentProprietary
188StepFunstep-2-16k-exp-202412StepFun1371.00+/-20737StepFunProprietary
189OpenAIGPT OSS 20BOpenAI1370.00+/-132,167OpenAIApache 2.0
190NEathene-v2-chatNexusFlow1369.00+/-94,019NexusFlowNexusFlow
19101yi-lightning01 AI1369.00+/-104,31601 AIProprietary
192OpenAIgpt-4o-2024-05-13OpenAI1369.00+/-619,526OpenAIProprietary
193DeepSeekdeepseek-v2.5DeepSeek1368.00+/-94,252DeepSeekDeepSeek
194Metallama-3.1-405b-instruct-fp8Meta1368.00+/-79,714MetaLlama 3.1 Community
195INmercuryInception AI1367.00+/-29394Inception AIProprietary
196Tencenthunyuan-large-2025-02-10Tencent1367.00+/-25519TencentProprietary
197Googlegemini-2.0-flash-001Google1365.00+/-76,996GoogleProprietary
198AIolmo-3-32b-thinkAi21364.00+/-181,055Ai2Apache 2.0
199Nvidiallama-3.3-nemotron-49b-super-v1Nvidia1363.00+/-31286NvidiaNvidia
200Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia1363.00+/-103,277NvidiaNVIDIA Open Model
201Metallama-4-scout-17b-16e-instructMeta1362.00+/-95,255MetaLlama
202Mistralmistral-small-3.1-24b-instruct-2503Mistral1362.00+/-86,136MistralApache 2.0
203OpenAIgpt-4o-2024-08-06OpenAI1360.00+/-87,318OpenAIProprietary
204IBgranite-4.1-8bIBM1360.00+/-21944IBMApache 2.0
205xAIgrok-2-2024-08-13xAI1359.00+/-710,368xAIProprietary
206Googlegemma-3-27b-itGoogle1358.00+/-78,077GoogleGemma
207Alibabaqwen2.5-plus-1127Alibaba1357.00+/-141,553AlibabaProprietary
208Googlegemini-1.5-pro-002Google1356.00+/-79,175GoogleProprietary
209Tencenthunyuan-large-visionTencent1356.00+/-19964TencentProprietary
210Alibabaqwen2.5-72b-instructAlibaba1355.00+/-86,688AlibabaQwen
211AnthropicClaude3-OpusAnthropic1353.00+/-633,748AnthropicProprietary
212Mistralmistral-large-2407Mistral1353.00+/-87,589MistralMistral Research
213StepFunstep-1o-turbo-202506StepFun1353.00+/-151,504StepFunProprietary
214Alibabaqwen-max-0919Alibaba1353.00+/-112,756AlibabaQwen
215Z.aiglm-4-plusZ.ai1352.00+/-94,449Z.aiProprietary
216NEathene-70b-0725NexusFlow1350.00+/-113,122NexusFlowCC-BY-NC-4.0
217OpenAIgpt-4o-mini-2024-07-18OpenAI1349.00+/-710,927OpenAIProprietary
218Googlegemini-1.5-pro-001Google1347.00+/-812,747GoogleProprietary
219OpenAIgpt-4-turbo-2024-04-09OpenAI1347.00+/-717,104OpenAIProprietary
220Mistralmistral-large-2411Mistral1346.00+/-94,212MistralMRL
221Metallama-3.3-70b-instructMeta1345.00+/-78,748MetaLlama-3.3
222Googlegemini-2.0-flash-lite-preview-02-05Google1343.00+/-103,474GoogleProprietary
223Amazonamazon-nova-pro-v1.0Amazon1343.00+/-93,853AmazonProprietary
224Alibabaqwen2.5-coder-32b-instructAlibaba1342.00+/-19873AlibabaApache 2.0
225DeepSeekdeepseek-coder-v2DeepSeek1342.00+/-122,671DeepSeekDeepSeek License
226OpenAIgpt-4-1106-previewOpenAI1339.00+/-715,605OpenAIProprietary
227AIolmo-3.1-32b-thinkAi21338.00+/-151,569Ai2Apache 2.0
228Googlegemini-advanced-0514Google1338.00+/-98,138GoogleProprietary
229xAIgrok-2-mini-2024-08-13xAI1335.00+/-78,652xAIProprietary
230Metallama-3.1-70b-instructMeta1333.00+/-79,389MetaLlama 3.1 Community
231Tencenthunyuan-standard-2025-02-10Tencent1332.00+/-24549TencentProprietary
232OpenAIgpt-4-0125-previewOpenAI1331.00+/-815,289OpenAIProprietary
233Z.aiglm-4-plus-0111Z.ai1331.00+/-18894Z.aiProprietary
234IBibm-granite-h-smallIBM1329.00+/-171,264IBMApache 2.0
235Nvidiallama-3.1-nemotron-70b-instructNvidia1329.00+/-151,312NvidiaLlama 3.1
236OpenAIgpt-4-0314OpenAI1328.00+/-98,306OpenAIProprietary
237Googlegemma-3-12b-itGoogle1317.00+/-23543GoogleGemma
238Anthropicclaude-3-sonnet-20240229Anthropic1317.00+/-718,888AnthropicProprietary
239Googlegemini-1.5-flash-002Google1316.00+/-85,892GoogleProprietary
240REreka-core-20240904Reka AI1315.00+/-151,216Reka AIProprietary
241OpenAIgpt-4-0613OpenAI1313.00+/-813,719OpenAIProprietary
242Mistralmistral-small-24b-instruct-2501Mistral1312.00+/-122,083MistralApache 2.0
243AIjamba-1.5-largeAI21 Labs1312.00+/-151,440AI21 LabsJamba Open
244Nvidiallama-3.1-nemotron-51b-instructNvidia1311.00+/-22665NvidiaLlama 3.1
245Googlegemini-1.5-flash-001Google1310.00+/-810,680GoogleProprietary
246Googlegemma-3n-e4b-itGoogle1309.00+/-103,530GoogleGemma
247Z.aiglm-4-0520Z.ai1308.00+/-141,718Z.aiProprietary
248AIllama-3.1-tulu-3-70bAi21307.00+/-24450Ai2Llama 3.1
249Nvidianemotron-4-340b-instructNvidia1307.00+/-113,254NvidiaNVIDIA Open Model
250Microsoft AzurePhi 4 - 14BMicrosoft Azure1306.00+/-103,305Microsoft AzureMIT
251Amazonamazon-nova-lite-v1.0Amazon1305.00+/-103,060AmazonProprietary
252Metallama-3-70b-instructMeta1305.00+/-728,126MetaLlama 3 Community
253Googlegemma-2-27b-itGoogle1305.00+/-612,088GoogleGemma license
254Tencenthunyuan-standard-256kTencent1300.00+/-25497TencentProprietary
255Anthropicclaude-3-haiku-20240307Anthropic1300.00+/-720,898AnthropicProprietary
256Alibabaqwen2-72b-instructAlibaba1296.00+/-96,249AlibabaQianwen LICENSE
257Mistralmistral-large-2402Mistral1294.00+/-910,418MistralProprietary
258Coherec4ai-aya-expanse-32bCohere1292.00+/-94,685CohereCC-BY-NC-4.0
259REreka-flash-20240904Reka AI1290.00+/-151,207Reka AIProprietary
260Amazonamazon-nova-micro-v1.0Amazon1288.00+/-102,981AmazonProprietary
261IBgranite-3.1-8b-instructIBM1287.00+/-26478IBMApache 2.0
262Coherecommand-r-08-2024Cohere1280.00+/-131,783CohereCC-BY-NC-4.0
263AIolmo-2-0325-32b-instructAi21279.00+/-27427Ai2Apache-2.0
264Coherecommand-r-plus-08-2024Cohere1279.00+/-141,675CohereCC-BY-NC-4.0
265Alibabaqwen1.5-110b-chatAlibaba1279.00+/-104,763AlibabaQianwen LICENSE
266REreka-flash-21b-20240226-onlineReka AI1276.00+/-132,879Reka AIProprietary
267Mistralmixtral-8x22b-instruct-v0.1Mistral1276.00+/-98,780MistralApache 2.0
268Googlegemma-3-4b-itGoogle1275.00+/-24605GoogleGemma
269Mistralministral-8b-2410Mistral1274.00+/-19838MistralMRL
270Alibabaqwen1.5-72b-chatAlibaba1274.00+/-106,370AlibabaQianwen LICENSE
271OpenAIgpt-3.5-turbo-0125OpenAI1273.00+/-811,130OpenAIProprietary
272Googlegemini-1.5-flash-8b-001Google1272.00+/-86,069GoogleProprietary
273PRgemma-2-9b-it-simpoPrinceton1272.00+/-151,471PrincetonMIT
274Coherecommand-r-plusCohere1271.00+/-813,937CohereCC-BY-NC-4.0
275Googlegemma-2-9b-itGoogle1271.00+/-78,921GoogleGemma license
276REreka-flash-21b-20240226Reka AI1266.00+/-114,748Reka AIProprietary
277AIjamba-1.5-miniAI21 Labs1265.00+/-151,352AI21 LabsJamba Open
278Mistralmistral-mediumMistral1261.00+/-105,149MistralProprietary
279OpenAIgpt-3.5-turbo-1106OpenAI1261.00+/-162,121OpenAIProprietary
280Alibabaqwen1.5-32b-chatAlibaba1261.00+/-113,930AlibabaQianwen LICENSE
281Metallama-3.1-8b-instructMeta1259.00+/-78,582MetaLlama 3.1 Community
282Coherec4ai-aya-expanse-8bCohere1255.00+/-151,567CohereCC-BY-NC-4.0
283AIllama-3.1-tulu-3-8bAi21253.00+/-25476Ai2Llama 3.1
284Metallama-3-8b-instructMeta1252.00+/-818,374MetaLlama 3 Community
285DAdbrx-instruct-previewDatabricks1250.00+/-115,502DatabricksDBRX LICENSE
286IBgranite-3.1-2b-instructIBM1248.00+/-25508IBMApache 2.0
287Googlegemini-proGoogle1248.00+/-24678GoogleProprietary
288INinternlm2_5-20b-chatInternLM1247.00+/-141,684InternLMOther
28901yi-1.5-34b-chat01 AI1247.00+/-103,84101 AIApache-2.0
290HUzephyr-orpo-141b-A35b-v0.1HuggingFace1244.00+/-21831HuggingFaceApache 2.0
291Coherecommand-rCohere1242.00+/-99,645CohereCC-BY-NC-4.0
292IBgranite-3.0-8b-instructIBM1239.00+/-181,108IBMApache 2.0
293Googlegemini-pro-dev-apiGoogle1238.00+/-142,681GoogleProprietary
294Alibabaqwen1.5-14b-chatAlibaba1238.00+/-133,208AlibabaQianwen LICENSE
295Mistralmixtral-8x7b-instruct-v0.1Mistral1238.00+/-811,784MistralApache 2.0
296NEstarling-lm-7b-betaNexusflow1234.00+/-132,948NexusflowApache-2.0
297Microsoftphi-3-medium-4k-instructMicrosoft1230.00+/-103,973MicrosoftMIT
298OPopenchat-3.5-0106OpenChat1228.00+/-142,005OpenChatApache-2.0
299SNsnowflake-arctic-instructSnowflake1223.00+/-115,734SnowflakeApache 2.0
300Googlegemma-1.1-7b-itGoogle1216.00+/-104,332GoogleGemma license
301DeepSeekdeepseek-llm-67b-chatDeepSeek1216.00+/-24649DeepSeekDeepSeek License
302ALtulu-2-dpo-70bAllenAI/UW1213.00+/-21805AllenAI/UWAI2 ImpACT Low-risk
303Alibabaqwen1.5-7b-chatAlibaba1208.00+/-21772AlibabaQianwen LICENSE
304IBgranite-3.0-2b-instructIBM1208.00+/-181,134IBMApache 2.0
305UCstarling-lm-7b-alphaUC Berkeley1206.00+/-161,397UC BerkeleyCC-BY-NC-4.0
30601yi-34b-chat01 AI1204.00+/-132,34501 AIYi License
307Microsoftphi-3-small-8k-instructMicrosoft1203.00+/-123,219MicrosoftMIT
308OPopenchat-3.5OpenChat1201.00+/-20971OpenChatApache-2.0
309Alibabaqwen-14b-chatAlibaba1196.00+/-24599AlibabaQianwen LICENSE
310Microsoftphi-3-mini-4k-instruct-june-2024Microsoft1196.00+/-141,841MicrosoftMIT
311Googlegemma-2-2b-itGoogle1193.00+/-87,298GoogleGemma license
312LMvicuna-33bLMSYS1192.00+/-132,866LMSYSNon-commercial
313Microsoftwizardlm-70bMicrosoft1192.00+/-20988MicrosoftLlama 2 Community
314Microsoftphi-3-mini-4k-instructMicrosoft1186.00+/-123,449MicrosoftMIT
315NOopenhermes-2.5-mistral-7bNousResearch1185.00+/-23589NousResearchApache-2.0
316Mistralmistral-7b-instruct-v0.2Mistral1184.00+/-123,114MistralApache-2.0
317UPsolar-10.7b-instruct-v1.0Upstage AI1182.00+/-27482Upstage AICC-BY-NC-4.0
318Metallama-2-70b-chatMeta1177.00+/-105,717MetaLlama 2 Community
319Metallama-3.2-3b-instructMeta1175.00+/-161,351MetaLlama 3.2
320NOnous-hermes-2-mixtral-8x7b-dpoNousResearch1174.00+/-24575NousResearchApache-2.0
321Alibabaqwq-32b-previewAlibaba1173.00+/-24566AlibabaApache 2.0
322Googlegemma-1.1-2b-itGoogle1171.00+/-141,963GoogleGemma license
323Googlegemma-7b-itGoogle1167.00+/-171,381GoogleGemma license
324MOmpt-30b-chatMosaicML1166.00+/-35258MosaicMLCC-BY-NC-SA-4.0
325HUzephyr-7b-alphaHuggingFace1165.00+/-40201HuggingFaceMIT
326LMvicuna-13bLMSYS1162.00+/-142,389LMSYSLlama 2 Community
327Metallama-2-13b-chatMeta1161.00+/-132,626MetaLlama 2 Community
328HUsmollm2-1.7b-instructHuggingFace1159.00+/-33352HuggingFaceApache 2.0
329Metacodellama-34b-instructMeta1158.00+/-20853MetaLlama 2 Community
330Microsoftphi-3-mini-128k-instructMicrosoft1153.00+/-133,886MicrosoftMIT
331Googlepalm-2Google1152.00+/-21917GoogleProprietary
332HUzephyr-7b-betaHuggingFace1151.00+/-181,250HuggingFaceMIT
333Microsoftwizardlm-13bMicrosoft1150.00+/-22735MicrosoftLlama 2 Community
334Metallama-3.2-1b-instructMeta1148.00+/-161,346MetaLlama 3.2
335Nvidiallama2-70b-steerlm-chatNvidia1144.00+/-28467NvidiaLlama 2 Community
336Mistralmistral-7b-instructMistral1143.00+/-201,032MistralApache 2.0
337Googlegemma-2b-itGoogle1136.00+/-22742GoogleGemma license
338LMvicuna-7bLMSYS1130.00+/-23726LMSYSLlama 2 Community
339Alibabaqwen1.5-4b-chatAlibaba1130.00+/-171,283AlibabaQianwen LICENSE
340TOstripedhyena-nous-7bTogether AI1126.00+/-22704Together AIApache 2.0
341UWguanaco-33bUW1112.00+/-36263UWNon-commercial
342AIolmo-7b-instructAi21106.00+/-22772Ai2Apache-2.0
343Metallama-2-7b-chatMeta1101.00+/-141,956MetaLlama 2 Community
344TSchatglm3-6bTsinghua1089.00+/-26535TsinghuaApache-2.0
345MOmpt-7b-chatMosaicML1064.00+/-31397MosaicMLCC-BY-NC-SA-4.0
346UCkoala-13bUC Berkeley1064.00+/-24747UC BerkeleyNon-commercial
347RWRWKV-4-Raven-14BRWKV1058.00+/-27505RWKVApache 2.0
348OPoasst-pythia-12bOpenAssistant1049.00+/-25714OpenAssistantApache 2.0
349TSchatglm-6bTsinghua1034.00+/-27551TsinghuaNon-commercial
350TSchatglm2-6bTsinghua1029.00+/-35293TsinghuaApache-2.0
351STstablelm-tuned-alpha-7bStability AI1003.00+/-33363Stability AICC-BY-NC-SA-4.0
352STalpaca-13bStanford998.00+/-27626StanfordNon-commercial
353DAdolly-v2-12bDatabricks961.00+/-34396DatabricksMIT
354LMfastchat-t5-3bLMSYS906.00+/-30428LMSYSApache 2.0
355Metallama-13bMeta881.00+/-39304MetaNon-commercial

数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。

常见问题 (FAQ)

01

什么是 LMArena Coding Arena?

LMArena Coding Arena 是 LMArena 旗下专注于代码能力的匿名评测平台。用户提交真实编程任务(如调试、代码生成、算法实现),系统将不同模型的输出并排展示(隐藏模型名称),由用户投票选出更好的答案,最终通过 Elo 算法汇总形成动态排行榜。

02

Coding Arena 与 SWE-bench、HumanEval 等静态基准有什么区别?

SWE-bench、HumanEval、MBPP 等静态基准使用固定测试集和自动化评分,可重现性强但容易被针对性优化("刷榜")。Coding Arena 来自真实用户的开放式需求,测试内容不固定,更能反映模型在实际编程场景中的表现,两者互为补充。

03

国产大模型在代码能力方面表现如何?

DeepSeek、Qwen 等国产模型在 Coding Arena 表现亮眼,已跻身全球前列。DeepSeek 以 MIT 协议开源,Qwen 系列支持中文编程场景,是开发者选择开源代码模型的重要参考。

04

如何用 AI 辅助日常编程工作?

常见场景包括:代码补全与生成、调试、代码审查、单元测试生成,以及跨语言翻译。