DataLearner AI

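Arcada Labs Code Categories Arena: Coding Ability Leaderboard

An up-to-date leaderboard of large language models' coding ability, based on anonymous user votes in the Arcada Labs Code Categories Arena. A Bradley-Terry model produces a combined score and ranking across code subcategories such as Website, UI Component, Game Dev, and Data Visualization.

Top model: Claude Opus 4.6
Top score: 1346.00
Number of models: 127
Data version: May 31, 2026
Data source: Arcada Labs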


Full rankings

Rank | Model | Score | 95% CI | Votes | Organization | License
--- | --- | --- | --- | --- | --- | ---
1 | Claude Opus 4.6 | 1346.00 | n/a | 16,089 | Anthropic | Proprietary
2 | Claude Opus 4.7 (Thinking) | 1344.00 | n/a | 7,755 | Anthropic | Proprietary
3 | Claude Opus 4.6 (Thinking) | 1341.00 | n/a | 13,540 | Anthropic | Proprietary
4 | Kimi K2.6 | 1337.00 | n/a | 15,535 | Moonshot AI | Open Source
5 | GLM 5.1 | 1336.00 | n/a | 5,197 | Zhipu AI | Open Source
6 | Opus 4.7 | 1330.00 | n/a | 11,025 | Anthropic | Proprietary
7 | Claude Sonnet 4.6 | 1329.00 | n/a | 15,336 | Anthropic | Proprietary
8 | GLM 5 Turbo | 1329.00 | n/a | 14,085 | Zhipu AI | Proprietary
9 | MiMo-V2.5-Pro | 1327.00 | n/a | 3,587 | Xiaomi | Open Source
10 | Qwen3.7 Max | 1314.00 | n/a | 7,534 | Alibaba | Proprietary
11 | MiMo-V2.5 | 1309.00 | n/a | 15,671 | Xiaomi | Open Source
12 | Muse Spark | 1307.00 | n/a | 4,248 | Facebook AI Research Lab | Proprietary
13 | DeepSeek-V4-Pro | 1306.00 | n/a | 9,410 | DeepSeek | Open Source
14 | Gemini 3.5 Flash | 1302.00 | n/a | 6,073 | Google | Proprietary
15 | GLM 5 | 1302.00 | n/a | 30,971 | Zhipu AI | Open Source
16 | GPT-5.5 | 1302.00 | n/a | 8,045 | OpenAI | Proprietary
17 | Opus 4.5 | 1296.00 | n/a | 28,169 | Anthropic | Proprietary
18 | Gemini 3.1 Pro Preview | 1296.00 | n/a | 23,948 | Google Deep Mind | Proprietary
19 | Kimi K2.5 (Thinking) | 1294.00 | n/a | 30,129 | Moonshot AI | Open Source
20 | MiniMax M2.7 | 1286.00 | n/a | 24,347 | MiniMax | Open Source
21 | GLM-5V-Turbo | 1286.00 | n/a | 19,033 | Zhipu AI | Proprietary
22 | Gemini 3.1 Pro Preview | 1283.00 | n/a | 25,876 | Google Deep Mind | Proprietary
23 | Qwen 3.6 Plus Preview | 1283.00 | n/a | 16,861 | Alibaba | Proprietary
24 | Claude Opus 4.8 | 1282.00 | n/a | 6,131 | Anthropic | Proprietary
25 | GLM 4.7 | 1275.00 | n/a | 38,816 | Zhipu AI | Open Source
26 | Grok 4.20 Beta (Reasoning) | 1272.00 | n/a | 17,718 | xAI | Proprietary
27 | DeepSeek-V4-Flash | 1270.00 | n/a | 15,684 | DeepSeek | Open Source
28 | GPT-5.4 (Design Skill, Medium) | 1269.00 | n/a | 6,369 | OpenAI | Proprietary
29 | GPT-5.4 (Medium) | 1267.00 | n/a | 13,383 | OpenAI | Proprietary
30 | MiniMax M2.5 | 1262.00 | n/a | 11,504 | MiniMax | Open Source
31 | Grok 4.3 | 1262.00 | n/a | 12,334 | xAI | Proprietary
32 | Grok 4.20 Beta | 1253.00 | n/a | 18,535 | xAI | Proprietary
33 | Gemini 3 Flash Preview | 1245.00 | n/a | 4,446 | Google | Proprietary
34 | MiniMax M2.1 | 1245.00 | n/a | 20,892 | MiniMax | Open Source
35 | Claude Sonnet 4.5 (Thinking) | 1238.00 | n/a | 32,348 | Anthropic | Proprietary
36 | Claude Sonnet 4.5 | 1237.00 | n/a | 33,156 | Anthropic | Proprietary
37 | Qwen3.5-397B-A17B | 1235.00 | n/a | 8,129 | Alibaba | Open Source
38 | GPT-5.4 (Low) | 1234.00 | n/a | 14,824 | OpenAI | Proprietary
39 | GPT-5.4 (None) | 1234.00 | n/a | 16,608 | OpenAI | Proprietary
40 | GLM 4.7 Flash | 1233.00 | n/a | 11,706 | Zhipu AI | Open Source
41 | Claude 3.7 Sonnet | 1232.00 | n/a | 15,317 | Anthropic | Proprietary
42 | DeepSeek-V3.1 (Thinking) | 1231.00 | n/a | 16,327 | DeepSeek | Open Source
43 | Claude Opus 4.1 (Thinking) | 1226.00 | n/a | 15,778 | Anthropic | Proprietary
44 | DeepSeek V3.2-Exp | 1226.00 | n/a | 19,549 | DeepSeek-AI | Open Source
45 | GPT-5.1 (high) | 1226.00 | n/a | 16,146 | OpenAI | Proprietary
46 | GPT-5.2 (None) | 1225.00 | n/a | 24,334 | OpenAI | Proprietary
47 | GPT-5.2 (medium) | 1225.00 | n/a | 23,098 | OpenAI | Proprietary
48 | GPT-5 (high) | 1224.00 | n/a | 13,476 | OpenAI | Proprietary
49 | Qwen3.5 Plus 02-15 | 1223.00 | n/a | 17,272 | Alibaba | Proprietary
50 | DeepSeek V3.2 | 1222.00 | n/a | 24,178 | DeepSeek-AI | Open Source
51 | GPT-5.2 (Low) | 1222.00 | n/a | 24,599 | OpenAI | Proprietary
52 | Claude Opus 4.1 | 1221.00 | n/a | 32,495 | Anthropic | Proprietary
53 | GLM 4.6 | 1221.00 | n/a | 16,997 | Zhipu AI | Open Source
54 | GLM 4.5 | 1220.00 | n/a | 19,727 | Zhipu AI | Open Source
55 | GPT-5 (Minimal) | 1219.00 | n/a | 31,838 | OpenAI | Proprietary
56 | GPT-5.1 (Medium) | 1217.00 | n/a | 21,393 | OpenAI | Proprietary
57 | Claude Opus 4 | 1216.00 | n/a | 16,750 | Anthropic | Proprietary
58 | Step 3.7 Flash | 1216.00 | n/a | 3,137 | StepFun | Open Source
59 | GPT-5.1 (Low) | 1211.00 | n/a | 22,262 | OpenAI | Proprietary
60 | MiMo-V2-Flash | 1211.00 | n/a | 32,252 | Xiaomi | Open Source
61 | Gemini 2.5-Pro | 1209.00 | n/a | 7,044 | Google Deep Mind | Proprietary
62 | GPT-5.1 Codex | 1206.00 | n/a | 1,807 | OpenAI | Proprietary
63 | GPT-5.1 (None) | 1206.00 | n/a | 22,399 | OpenAI | Proprietary
64 | GPT-5.2 (High) | 1205.00 | n/a | 4,167 | OpenAI | Proprietary
65 | GPT-5.3 Codex | 1200.00 | n/a | 15,763 | OpenAI | Proprietary
66 | Qwen3 Coder 480B A35B Instruct | 1198.00 | n/a | 1,958 | Alibaba | Open Source
67 | Claude Sonnet 4 | 1197.00 | n/a | 17,619 | Anthropic | Proprietary
68 | Mistral Large 3 (2512) | 1197.00 | n/a | 29,272 | Mistral | Open Source
69 | DeepSeek-R1-0528 | 1194.00 | n/a | 18,052 | DeepSeek-AI | Open Source
70 | GLM 4.5 Air | 1193.00 | n/a | 17,361 | Zhipu AI | Open Source
71 | Claude Sonnet 4 (Thinking) | 1192.00 | n/a | 16,301 | Anthropic | Proprietary
72 | MiniMax M2 Stable | 1190.00 | n/a | 10,933 | MiniMax | Open Source
73 | AesCoder-4B | 1182.00 | n/a | 37,423 | DesignFlow | Open Source
74 | Mistral Medium 3.5 | 1178.00 | n/a | 8,390 | Mistral | Open Source
75 | Mistral Medium 3.1 (2508) | 1176.00 | n/a | 26,826 | Mistral | Proprietary
76 | Trinity Large Thinking | 1174.00 | n/a | 10,557 | Arcee AI | Open Source
77 | GPT-5 mini (Default) | 1171.00 | n/a | 31,116 | OpenAI | Proprietary
78 | Claude Haiku 4.5 | 1170.00 | n/a | 34,519 | Anthropic | Proprietary
79 | DeepSeek-V3.1 | 1167.00 | n/a | 20,375 | DeepSeek-AI | Open Source
80 | Qwen3 Max | 1167.00 | n/a | 32,079 | Alibaba | Proprietary
81 | DeepSeek-V3-0324 | 1163.00 | n/a | 19,366 | DeepSeek-AI | Open Source
82 | Prime Intellect: INTELLECT-3 | 1162.00 | n/a | 29,267 | Prime Intellect | Open Source
83 | Gemini 2.5 Flash Preview 09-2025 | 1159.00 | n/a | 19,439 | Google | Proprietary
84 | Kimi K2 0905 Preview | 1153.00 | n/a | 1,504 | Moonshot AI | Open Source
85 | GPT-5.1 Codex Mini | 1151.00 | n/a | 31,457 | OpenAI | Proprietary
86 | Grok 4 Fast | 1151.00 | n/a | 35,534 | xAI | Proprietary
87 | Grok 4.1 Fast | 1148.00 | n/a | 34,086 | xAI | Proprietary
88 | Grok 4.1 Fast (Reasoning) | 1144.00 | n/a | 31,697 | xAI | Proprietary
89 | GPT-5 nano (Default) | 1140.00 | n/a | 6,710 | OpenAI | Proprietary
90 | Kimi K2 Turbo Preview | 1139.00 | n/a | 2,096 | Moonshot AI | Open Source
91 | Gemini 2.5 Flash Lite Preview 09-2025 | 1136.00 | n/a | 6,860 | Google | Proprietary
92 | Gemini 3.1 Flash-Lite Preview | 1128.00 | n/a | 20,906 | Google | Proprietary
93 | Mistral Medium 3 (2505) | 1124.00 | n/a | 6,396 | Mistral | Proprietary
94 | Ministral 3 14B (2512) | 1120.00 | n/a | 2,379 | Mistral | Open Source
95 | Gemini 2.5 Flash | 1114.00 | n/a | 6,960 | Google | Proprietary
96 | v0-1.5-md | 1112.00 | n/a | 11,086 | Vercel | Proprietary
97 | Grok 3 | 1108.00 | n/a | 26,957 | xAI | Proprietary
98 | Ministral 3 8B (2512) | 1108.00 | n/a | 2,427 | Mistral | Open Source
99 | Grok 4 Fast (Reasoning) | 1101.00 | n/a | 36,078 | xAI | Proprietary
100 | Qwen3-235B-A22B-2507 | 1094.00 | n/a | 6,932 | Alibaba | Open Source
101 | Kimi K2 | 1089.00 | n/a | 1,352 | Moonshot AI (Legacy) | Open Source
102 | Magistral Medium 1.2 (2509) | 1089.00 | n/a | 5,851 | Mistral | Proprietary
103 | Qwen3-235B-A22B-Thinking-2507 | 1088.00 | n/a | 6,169 | Alibaba | Open Source
104 | GPT-4.1 | 1081.00 | n/a | 1,747 | OpenAI | Proprietary
105 | OpenAI o3 | 1075.00 | n/a | 1,365 | OpenAI | Proprietary
106 | Grok 4 | 1072.00 | n/a | 24,117 | xAI | Proprietary
107 | Devstral Medium | 1068.00 | n/a | 7,158 | Mistral | Proprietary
108 | Ministral 3 3B (2512) | 1065.00 | n/a | 2,852 | Mistral | Open Source
109 | Codestral 2508 | 1062.00 | n/a | 6,746 | Mistral | Proprietary
110 | Qwen3-235B-A22B | 1057.00 | n/a | 5,154 | Alibaba | Open Source
111 | Grok Code Fast 1 | 1054.00 | n/a | 4,296 | xAI | Proprietary
112 | GPT-4.1 mini | 1049.00 | n/a | 1,566 | OpenAI | Proprietary
113 | Magistral Small 1.2 (2509) | 1041.00 | n/a | 6,448 | Mistral | Open Source
114 | o4-mini | 1031.00 | n/a | 2,011 | OpenAI | Proprietary
115 | Olmo 3.1 32B Think | 1030.00 | n/a | 16,219 | Allen AI | Open Source
116 | GPT-4.1 nano | 1018.00 | n/a | 1,901 | OpenAI | Proprietary
117 | GPT OSS 120B | 1018.00 | n/a | 5,268 | OpenAI | Open Source
118 | Qwen3 30B-A3B | 997.00 | n/a | 2,575 | Alibaba | Open Source
119 | Grok 3 Mini | 985.00 | n/a | 7,626 | xAI | Proprietary
120 | Llama 3.1 Nemotron Ultra 253B | 984.00 | n/a | 3,172 | NVIDIA | Open Source
121 | Mistral Small 3.2 | 962.00 | n/a | 1,243 | Mistral | Open Source
122 | Llama 4 Maverick | 935.00 | n/a | 1,678 | Facebook AI Research Lab | Open Source
123 | Mistral Large 2.1 (2411) | 918.00 | n/a | 1,317 | Mistral | Proprietary
124 | GPT-4o | 916.00 | n/a | 1,780 | OpenAI | Proprietary
125 | Codestral 2 (2501) | 889.00 | n/a | 1,444 | Mistral | Open Source
126 | Devstral Small 1.1 | 862.00 | n/a | 1,250 | Mistral | Open Source
127 | Llama 4 Scout | 845.00 | n/a | 1,275 | Facebook AI Research Lab | Open Source

Data is for reference only; the official source takes precedence.

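About this leaderboard

The data for this leaderboard comes from Design Arena, a crowdsourced platform for anonymous head-to-head comparisons that focuses on evaluating AI design-code generation. It is developed by Arcada Labs, which is backed by Y Combinator.

Unlike LMArena, which evaluates general text and programming ability, Design Arena's code leaderboard specifically tests a model's ability to generate front-end code with a visual result. The platform divides code tasks into subcategories including Website, UI Component, Game Dev, Data Visualization, SVG, Web App, and Mobile, each with its own leaderboard.

This page shows the combined Code Categories leaderboard: user votes from all subcategories are pooled, and a single Bradley-Terry model (an Elo-like algorithm) is fitted to produce the overall ranking. Every vote carries equal weight and subcategories are not weighted, so subcategories with more votes (such as Website) have more influence on the combined score. A higher score indicates a stronger overall human preference for the model's output in design-code generation.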
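As a rough illustration of how a Bradley-Terry score can be derived from pairwise votes, the sketch below fits model strengths with the classic minorization-maximization (MM) update. This is not Arcada Labs' actual pipeline: the function names, the `(winner, loser)` vote format, and the Elo-like scaling constants (base 1000, scale 400) are all assumptions made for this example. It also assumes every model has at least one win.

```python
import math
from collections import defaultdict


def fit_bradley_terry(votes, iters=200):
    """Fit Bradley-Terry strengths from (winner, loser) pairs via MM updates.

    Hypothetical helper for illustration; assumes each model wins at least once.
    """
    models = sorted({m for pair in votes for m in pair})
    wins = defaultdict(int)    # total wins per model
    games = defaultdict(int)   # comparisons per unordered pair of models
    for winner, loser in votes:
        wins[winner] += 1
        games[frozenset((winner, loser))] += 1
    for m in models:
        if wins[m] == 0:
            raise ValueError(f"model {m!r} has no wins; strength would collapse to 0")

    strength = {m: 1.0 for m in models}
    for _ in range(iters):
        updated = {}
        for i in models:
            denom = 0.0
            for j in models:
                if i == j:
                    continue
                n_ij = games.get(frozenset((i, j)), 0)
                if n_ij:
                    denom += n_ij / (strength[i] + strength[j])
            updated[i] = wins[i] / denom
        # Strengths are only defined up to a constant factor, so fix the
        # geometric mean at 1 after every pass.
        log_mean = sum(math.log(v) for v in updated.values()) / len(updated)
        strength = {m: v / math.exp(log_mean) for m, v in updated.items()}
    return strength


def win_probability(strength, a, b):
    """Bradley-Terry probability that model a is preferred over model b."""
    return strength[a] / (strength[a] + strength[b])


def to_rating(strength, base=1000.0, scale=400.0):
    """Map strengths onto an Elo-like scale (base and scale are assumed values)."""
    return {m: base + scale * math.log10(s) for m, s in strength.items()}


# Toy votes: "A" beats "B" three times, "B" beats "A" once.
toy_votes = [("A", "B")] * 3 + [("B", "A")]
toy_strength = fit_bradley_terry(toy_votes)
toy_ratings = to_rating(toy_strength)
```

With only two models the fit reduces to the observed win ratio, so in the toy data "A" ends up three times as strong as "B" and is preferred with probability 0.75. On a real leaderboard the fit runs over all model pairs at once.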

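Frequently asked questions (FAQ)

01

What is the Arcada Labs Code Categories Arena?

The Arcada Labs Code Categories Arena is an anonymous evaluation platform focused on design-code generation. It covers code generation subcategories such as Website, UI Component, Game Dev, and Data Visualization, and pools the votes into a combined leaderboard.

02

How does Arcada Code Arena differ from LMArena Coding Arena?

LMArena Coding Arena mainly evaluates general programming ability, such as code generation, debugging, and algorithm implementation. Arcada Code Arena focuses on front-end design code with a visual result, such as HTML pages, interactive UIs, charts, SVGs, and prototypes.

03

What is the ranking methodology?

Arcada Labs pools the raw votes from all code subcategories and then fits a Bradley-Terry model. Every vote carries equal weight and subcategories are not weighted separately, so subcategories with more votes have more influence on the combined score.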
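The effect of equal-weight pooling can be seen in a small sketch. For a single pair of models, a Bradley-Terry fit reduces to the observed win rate, so comparing the pooled win rate with a per-category average shows how a high-volume subcategory dominates. The vote counts and function names below are invented for illustration and are not Arcada Labs data.

```python
def pooled_win_rate(category_results):
    """Win rate when every vote counts equally, regardless of subcategory.

    category_results maps subcategory -> (wins for model A, total votes).
    """
    wins = sum(w for w, _ in category_results.values())
    total = sum(n for _, n in category_results.values())
    return wins / total


def category_averaged_win_rate(category_results):
    """Win rate when every subcategory counts equally, regardless of vote volume."""
    rates = [w / n for w, n in category_results.values()]
    return sum(rates) / len(rates)


# Hypothetical head-to-head record of model A against model B.
example = {
    "Website": (600, 1000),  # A wins 60% of a high-volume subcategory
    "SVG": (30, 100),        # A wins 30% of a low-volume subcategory
}

pooled = pooled_win_rate(example)               # 630 / 1100, about 0.573
averaged = category_averaged_win_rate(example)  # (0.60 + 0.30) / 2 = 0.45
```

Under pooling, model A looks preferred overall because the Website votes outnumber the SVG votes ten to one; if each subcategory were weighted equally, the same record would put A below 50%.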

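04

Which kinds of models perform better in design-code scenarios?

Large models with strong visual understanding and front-end code generation ability usually perform better. Specialized models optimized for UI and code generation may also stand out on tasks involving layout, interaction, and visual detail.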