
Artificial Analysis Intelligence Index: AI Model Intelligence Leaderboard

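Artificial Analysis Intelligence Index v4.0 combines 10 established benchmarks (including GDPval-AA, Terminal-Bench, GPQA Diamond, and SciCode) to evaluate and rank AI models across mathematics, science, coding, reasoning, and other dimensions.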

Top model: Claude Opus 4.8 (max)
Highest score: 61
Number of models: 201
Data version: May 31, 2026
Data source: Artificial Analysis

Overall ranking

Rank | Model | Intelligence Index | Organization
1 | Claude Opus 4.8 (max) | 61 | Anthropic
2 | GPT-5.5 (xhigh) | 60 | OpenAI
3 | GPT-5.5 (high) | 59 | OpenAI
4 | Opus 4.7 (max) | 57 | Anthropic
5 | Gemini 3.1 Pro Preview | 57 | Google Deep Mind
6 | GPT-5.5 (medium) | 57 | OpenAI
7 | Qwen3.7 Max | 57 | Alibaba
8 | Gemini 3.5 Flash | 55 | Google
9 | Gemini 3.5 Flash (medium) | 55 | Google
10 | Kimi K2.6 | 54 | Moonshot AI
11 | MiMo-V2.5-Pro | 54 | Xiaomi
12 | GPT-5.3 Codex (xhigh) | 54 | OpenAI
13 | Grok 4.3 (high) | 53 | xAI
14 | Muse Spark | 52 | Facebook AI Research
15 | Opus 4.7 (high) | 52 | Anthropic
16 | Claude Sonnet 4.6 (max) | 52 | Anthropic
17 | DeepSeek-V4-Pro (max) | 52 | DeepSeek-AI
18 | GLM 5.1 | 51 | Zhipu AI
19 | GPT-5.5 (low) | 51 | OpenAI
20 | Qwen 3.6 Plus Preview | 50 | Alibaba
21 | DeepSeek-V4-Pro (high) | 50 | DeepSeek-AI
22 | MiniMax-M2.7 | 50 | MiniMaxAI
23 | MiMo-V2.5 | 49 | Xiaomi
24 | GPT-5.4 mini (xhigh) | 49 | OpenAI
25 | Grok 4.3 (medium) | 49 | xAI
26 | GLM-5-Turbo | 47 | Zhipu AI
27 | DeepSeek-V4-Flash (max) | 47 | DeepSeek-AI
28 | DeepSeek-V4-Flash (high) | 46 | DeepSeek-AI
29 | Qwen3.6-27B | 46 | Alibaba
30 | Qwen3.5-397B-A17B | 45 | Alibaba
31 | Nova 2 Omni(Preview) | 45 | Amazon
32 | Claude Sonnet 4.6 (non-reasoning) | 44 | Anthropic
33 | GPT-5.4 nano (xhigh) | 44 | OpenAI
34 | Grok 4.3 (low) | 44 | xAI
35 | GLM 5.1 | 44 | Zhipu AI
36 | Qwen3.6-35B-A3B | 43 | Alibaba
37 | MiMo-V2-Omni | 43 | Xiaomi
38 | Gemini 3.5 Flash (minimal) | 43 | Google
39 | Kimi K2.6 | 43 | Moonshot AI
40 | GLM-5V-Turbo | 43 | Zhipu AI
41 | Claude Sonnet 4.6 (Non-reasoning, Low Effort) | 43 | Anthropic
42 | Hy3-preview | 42 | Tencent
43 | GPT-5.5 Instant (May 2026) | 42 | OpenAI
44 | Qwen3.5-122B-A10B | 42 | Alibaba
45 | Gemini 2.0 Flash Experimental | 41 | DeepMind
46 | GPT-5.5 (non-reasoning) | 41 | OpenAI
47 | Qwen3.5-397B-A17B | 40 | Alibaba
48 | DeepSeek-V4-Pro | 39 | DeepSeek-AI
49 | Mistral Medium 3.5 | 39 | Mistral
50 | Gemma 4 31B | 39 | DeepMind
51 | Qwen3.5-Omni-Plus | 39 | Alibaba
52 | Step 3.5 Flash | 38 | StepFunAI
53 | Ring-2.6-1T | 38 | InclusionAI
54 | OpenAI o3 | 38 | OpenAI
55 | GPT-5.4 nano | 38 | OpenAI
56 | GPT-5.4 mini (medium) | 38 | OpenAI
57 | Command A+ | 37 | Cohere
58 | Qwen3.6-27B | 37 | Alibaba
59 | Haiku 4.5 | 37 | Anthropic
60 | DeepSeek-V4-Flash | 36 | DeepSeek-AI
61 | JT-35B-Flash | 36 | China Mobile
62 | NVIDIA Nemotron 3 Super | 36 | NVIDIA
63 | Qwen3.5-122B-A10B | 36 | Alibaba
64 | Nova 2 Pro(Preview) (medium) | 36 | Amazon
65 | MiMo-V2.5-Pro | 36 | Xiaomi
66 | Gemini 2.5-Pro | 35 | Google Deep Mind
67 | Nova 2 Lite (high) | 35 | Amazon
68 | Hy3-preview | 34 | Tencent
69 | Ling-2.6-1T | 34 | InclusionAI
70 | Doubao Seed Code | 34 | ByteDance Seed
71 | Gemini 3.1 Flash-Lite | 34 | Google
72 | GPT OSS 120B (high) | 33 | OpenAI
73 | Mercury 2 | 33 | Inception
74 | Qwen3.5-9B-Instruct | 32 | Alibaba
75 | Gemma 4 31B | 32 | DeepMind
76 | K-EXAONE | 32 | LG AI Research
77 | Nova 2 Pro(Preview) (low) | 32 | Amazon
78 | Trinity Large Thinking | 32 | Arcee AI
79 | Qwen3.6-35B-A3B | 32 | Alibaba
80 | Gemma 4 26B A4B | 31 | DeepMind
81 | Haiku 4.5 | 31 | Anthropic
82 | Grok 4.3 | 31 | xAI
83 | Qwen3.5-35B-A3B | 31 | Alibaba
84 | MiMo-V2-Flash | 30 | Xiaomi
85 | EXAONE 4.5 33B | 30 | LG AI Research
86 | Nova 2 Lite (medium) | 30 | Amazon
87 | ERNIE 5.0 | 29 | Baidu
88 | Nemotron Cascade 2 30B A3B | 28 | NVIDIA
89 | Qwen3-Coder-Next | 28 | Alibaba
90 | Nova 2 Omni(Preview) (medium) | 28 | Amazon
91 | Mistral Small 4 | 28 | Mistral
92 | Qwen3.5-9B-Instruct | 27 | Alibaba
93 | Magistral Medium 1.2 | 27 | Mistral
94 | Gemma 4 26B A4B | 27 | DeepMind
95 | Qwen3.5 4B | 27 | Alibaba
96 | Qwen3-Next | 27 | Alibaba
97 | Ling 2.6 Flash | 26 | InclusionAI
98 | Solar Pro 3 | 26 | Upstage
99 | Qwen3.5-Omni-Flash | 26 | Alibaba
100 | JT-MINI | 25 | China Mobile
101 | Nova 2 Lite (low) | 25 | Amazon
102 | GPT OSS 20B (high) | 24 | OpenAI
103 | GPT OSS 120B (low) | 24 | OpenAI
104 | GPT-5.4 nano | 24 | OpenAI
105 | NVIDIA Nemotron 3 Nano | 24 | NVIDIA
106 | LongCat Flash Lite | 24 | LongCat
107 | K-EXAONE | 23 | LG AI Research
108 | GPT-5.4 mini | 23 | OpenAI
109 | Nova 2 Omni(Preview) (low) | 23 | Amazon
110 | Nova 2 Pro(Preview) | 23 | Amazon
111 | Mi:dm K 2.5 Pro | 23 | Korea Telecom
112 | Mistral Large 3 | 23 | MistralAI
113 | Qwen3.5 4B | 23 | Alibaba
114 | INTELLECT-3 | 22 | Prime Intellect
115 | Devstral 2 | 22 | Mistral
116 | Solar Open 100B | 22 | Upstage
117 | Nemotron 3 Nano Omni 30B A3B Reasoning | 21 | NVIDIA
118 | GPT OSS 20B (low) | 21 | OpenAI
119 | Qwen3-Next | 20 | Alibaba
120 | Devstral Small 2 | 19 | Mistral
121 | Motif-2-12.7B | 19 | Motif Technologies
122 | Nova Premier | 19 | Amazon
123 | Gemma 4 E4B | 19 | DeepMind
124 | Llama Nemotron Super 49B v1.5 | 19 | Meta
125 | Mistral Small 4 | 19 | Mistral
126 | Llama 4 Maverick | 18 | Facebook AI Research
127 | Magistral Small 1.2 | 18 | Mistral
128 | Sarvam 105B (high) | 18 | Sarvam
129 | Nova 2 Lite | 18 | Amazon
130 | MiniCPM5-1B | 18 | OpenBMB
131 | Llama3.1-405B | 17 | Facebook AI Research
132 | EXAONE 4.0 32B | 17 | LG AI Research
133 | Nova 2 Omni(Preview) | 17 | Amazon
134 | Qwen3.5 2B | 16 | Alibaba
135 | Nanbeige4.1-3B | 16 | Nanbeige
136 | Ministral 3 14B | 16 | MistralAI
137 | Falcon-H1R-7B | 16 | TII UAE
138 | Qwen3-Omni-30B-A3B | 16 | Alibaba
139 | Step3 VL 10B | 15 | StepFun
140 | Gemma 4 E2B | 15 | DeepMind
141 | Llama Nemotron Ultra | 15 | NVIDIA
142 | ERNIE-4.5-300B-A47B | 15 | Baidu
143 | Solar Pro 2 | 15 | Upstage
144 | NVIDIA Nemotron Nano 12B v2 VL | 15 | NVIDIA
145 | Ministral 3 8B | 15 | MistralAI
146 | Gemma 4 E4B | 15 | DeepMind
147 | NVIDIA Nemotron Nano 9B V2 | 15 | NVIDIA
148 | Granite 4.1 30B | 15 | IBM
149 | NVIDIA Nemotron 3 Nano 4B | 15 | NVIDIA
150 | Qwen3.5 2B | 15 | Alibaba
151 | Llama Nemotron Super 49B v1.5 | 15 | Meta
152 | Llama3.3-70B-Instruct | 14 | Facebook AI Research
153 | Kimi Linear 48B A3B Instruct | 14 | Kimi
154 | Ring-flash-2.0 | 14 | InclusionAI
155 | Solar Pro 2 | 14 | Upstage
156 | Llama 4 Scout | 14 | Facebook AI Research
157 | C4AI Command A (202503) | 13 | CohereAI
158 | Llama 3.1 Nemotron 70B | 13 | NVIDIA
159 | NVIDIA Nemotron 3 Nano | 13 | NVIDIA
160 | NVIDIA Nemotron Nano 9B V2 | 13 | NVIDIA
161 | MiniCPM-V 4.6 1.3B | 13 | OpenBMB
162 | Granite 4.1 8B | 12 | IBM
163 | Sarvam 30B (high) | 12 | Sarvam
164 | Gemma 4 E2B | 12 | DeepMind
165 | R1 1776 | 12 | Perplexity
166 | Llama 3.2-Vision-90B | 12 | Facebook AI Research
167 | EXAONE 4.0 32B | 12 | LG AI Research
168 | Ministral 3 3B | 11 | Mistral
169 | Jamba 1.7 Large | 11 | AI21 Labs
170 | Granite 4.0 H Small | 11 | IBM
171 | Qwen3-Omni-30B-A3B | 11 | Alibaba
172 | Qwen3.5 0.8B | 11 | Alibaba
173 | LFM2 24B A2B | 10 | Liquid AI
174 | Phi 4 - 14B | 10 | Microsoft Azure
175 | Amazon Nova Micro | 10 | Amazon
176 | NVIDIA Nemotron Nano 12B v2 VL | 10 | NVIDIA
177 | Phi-4-multimodal-instruct | 10 | Microsoft Azure
178 | Qwen3.5 0.8B | 10 | Alibaba
179 | Jamba Reasoning 3B | 10 | AI21 Labs
180 | Gemini 3.0 Flash | 10 | Google Deep Mind
181 | Ling-mini-2.0 | 9 | InclusionAI
182 | Llama 3.2-Vision-11B | 9 | Facebook AI Research
183 | Granite 4.1 3B | 9 | IBM
184 | Phi-4-mini-instruct (3.8B) | 8 | Microsoft Azure
185 | Exaone 4.0 1.2B | 8 | LG AI Research
186 | Exaone 4.0 1.2B | 8 | LG AI Research
187 | LFM2.5-1.2B-Thinking | 8 | Liquid AI
188 | Jamba 1.7 Mini | 8 | AI21 Labs
189 | LFM2 2.6B | 8 | Liquid AI
190 | LFM2.5-1.2B-Instruct | 8 | Liquid AI
191 | Granite 4.0 H 1B | 8 | IBM
192 | Gemma 3-270M | 8 | Google Deep Mind
193 | Apertus 70B Instruct | 8 | Swiss AI
194 | Granite 4.0 Micro | 8 | IBM
195 | Granite 4.0 1B | 7 | IBM
196 | LFM2 8B A1B | 7 | Liquid AI
197 | LFM2.5-VL-1.6B | 6 | Liquid AI
198 | Granite 4.0 350M | 6 | IBM
199 | Apertus 8B Instruct | 6 | Swiss AI
200 | Granite 4.0 H 350M | 5 | IBM
201 | Tiny Aya Global | 5 | Cohere

The data is for reference only; official sources take precedence.
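The ranking above lists models in descending order of Intelligence Index, and models with equal scores still receive consecutive ranks. A minimal sketch of that ordering follows, using a few rows from the table. The `Entry` type and `rank_entries` function are hypothetical names, and keeping tied entries in input order is an assumption, since the page does not say how ties are broken.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    """One leaderboard row (hypothetical type, for illustration only)."""
    model: str
    score: int
    organization: str


def rank_entries(entries: list[Entry]) -> list[tuple[int, Entry]]:
    """Sort by score, highest first, and number the rows consecutively.

    sorted() is stable even with reverse=True, so tied entries keep their
    input order. That tie-breaking rule is an assumption, not the page's.
    """
    ordered = sorted(entries, key=lambda e: e.score, reverse=True)
    return list(enumerate(ordered, start=1))


# A handful of rows taken from the ranking table, deliberately out of order.
rows = [
    Entry("Opus 4.7 (max)", 57, "Anthropic"),
    Entry("GPT-5.5 (xhigh)", 60, "OpenAI"),
    Entry("Gemini 3.1 Pro Preview", 57, "Google Deep Mind"),
    Entry("Claude Opus 4.8 (max)", 61, "Anthropic"),
    Entry("GPT-5.5 (high)", 59, "OpenAI"),
]

ranked = rank_entries(rows)
for position, entry in ranked:
    print(f"{position} | {entry.model} | {entry.score} | {entry.organization}")
```

The two models tied at 57 come out in positions 4 and 5 in the order they were supplied, which mirrors how the table assigns distinct ranks to equal scores.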

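Benchmark composition (Intelligence Index v4.0)

The Intelligence Index combines 10 rigorous benchmarks to measure AI model capability broadly and to avoid overfitting to any single dimension.

GDPval-AA: agentic real-world tasks
τ²-Bench: agentic tool use
Terminal-Bench: agentic coding
SciCode: coding
AA-LCR: long-context reasoning
AA-Omniscience: knowledge and hallucination detection
IFBench: instruction following
Humanity's Last Exam: reasoning and knowledge
GPQA Diamond: scientific reasoning
CritPt: physics reasoning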

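Frequently asked questions (FAQ)

What is the Artificial Analysis Intelligence Index?
Artificial Analysis Intelligence Index v4.0 is a composite index that aggregates 10 challenging evaluations, covering mathematics, science, coding, agentic tasks, and reasoning, to measure AI capability broadly. It is designed to discourage overfitting to any single dimension and to provide one unified score for tracking model progress.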
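How is the Intelligence Index calculated?
The index combines scores from 10 evaluations: GDPval-AA (agentic real-world tasks), τ²-Bench (tool use), Terminal-Bench Hard (agentic coding), SciCode (coding), AA-LCR (long-context reasoning), AA-Omniscience (knowledge and hallucination detection), IFBench (instruction following), Humanity's Last Exam (reasoning), GPQA Diamond (scientific reasoning), and CritPt (physics reasoning). Artificial Analysis runs all tests independently on standardized hardware.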
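This page does not give the aggregation formula. Purely as an illustration, the sketch below assumes every benchmark is reported on a 0 to 100 scale and that the index is a weighted mean with equal weights by default. Both are assumptions, `composite_index` is a hypothetical name, and the input scores are placeholders rather than real results.

```python
from typing import Optional

# The 10 benchmarks named in the answer above.
BENCHMARKS = (
    "GDPval-AA",
    "τ²-Bench",
    "Terminal-Bench Hard",
    "SciCode",
    "AA-LCR",
    "AA-Omniscience",
    "IFBench",
    "Humanity's Last Exam",
    "GPQA Diamond",
    "CritPt",
)


def composite_index(
    scores: dict[str, float],
    weights: Optional[dict[str, float]] = None,
) -> float:
    """Weighted mean of per-benchmark scores.

    Equal weighting is an assumption; the real index may weight or
    normalize benchmarks differently.
    """
    missing = [name for name in BENCHMARKS if name not in scores]
    if missing:
        raise ValueError(f"missing benchmark scores: {missing}")
    if weights is None:
        weights = {name: 1.0 for name in BENCHMARKS}
    total_weight = sum(weights[name] for name in BENCHMARKS)
    weighted_sum = sum(scores[name] * weights[name] for name in BENCHMARKS)
    return weighted_sum / total_weight


# Placeholder inputs: 50 on every benchmark except SciCode at 70.
placeholder_scores = {name: 50.0 for name in BENCHMARKS}
placeholder_scores["SciCode"] = 70.0

print(composite_index(placeholder_scores))  # (9 * 50 + 70) / 10 = 52.0
```

Passing a custom `weights` dictionary shows how sensitive a composite score is to the weighting scheme, which is one reason to consult the published methodology before comparing index values.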
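How does this differ from the LMArena leaderboard?
LMArena rankings are based on crowdsourced user votes (Elo ratings from blind A/B comparisons) and reflect subjective human preference. The Artificial Analysis Intelligence Index instead scores models on standardized automated benchmarks, measuring technical capability in specific domains. Both are useful: LMArena captures real user experience, while the AA Intelligence Index provides reproducible technical measurements.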
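To make the Elo side of the comparison concrete, here is the textbook Elo update for a single pairwise vote. It is a sketch of the general technique, not LMArena's actual rating pipeline, which this page does not describe. The K-factor of 32 and the starting ratings are assumed values.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))


def update_elo(
    rating_a: float,
    rating_b: float,
    outcome_a: float,
    k: float = 32.0,  # assumed K-factor; real systems tune this
) -> tuple[float, float]:
    """Return the new (A, B) ratings after one comparison.

    outcome_a is 1.0 if A was preferred, 0.0 if B was, 0.5 for a tie.
    """
    delta = k * (outcome_a - expected_score(rating_a, rating_b))
    return rating_a + delta, rating_b - delta


# Two models start level; a voter prefers model A in a blind comparison.
new_a, new_b = update_elo(1000.0, 1000.0, outcome_a=1.0)
print(new_a, new_b)  # 1016.0 984.0
```

Because each update depends on who happened to vote and which pair they saw, Elo-style ratings track preference over time, whereas a benchmark score is fixed once the test set and scoring rule are fixed.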
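Where can I find the original data?
The original leaderboard and detailed methodology are available at artificialanalysis.ai. The Intelligence Index methodology is described on its Intelligence Index page.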