What is the Artificial Analysis Intelligence Index?

The Artificial Analysis Intelligence Index v4.0 is a composite benchmark that aggregates performance across 10 evaluations spanning mathematics, science, coding, agentic tasks, and reasoning to measure AI capabilities holistically.

How is the Intelligence Index calculated?

The index aggregates scores from 10 benchmarks: GDPval-AA, τ²-Bench, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, and CritPt. All tests are independently run on standardized hardware.

How does this differ from LMArena?

LMArena uses crowdsourced user votes (Elo ratings) reflecting subjective preferences. The AA Intelligence Index uses standardized automated benchmarks with objective scoring across specific technical domains.

Where can I find the original data?

The original leaderboard is available at artificialanalysis.ai/leaderboards/models and the methodology at artificialanalysis.ai/evaluations/artificial-analysis-intelligence-index.

Artificial Analysis Intelligence Index AI模型智能指数排行榜

Name: Artificial Analysis Intelligence Index AI模型智能指数排行榜
Creator: DataLearner
License: https://creativecommons.org/licenses/by/4.0/

Artificial Analysis Intelligence Index v4.0 综合了10项权威评测基准（GDPval-AA、Terminal-Bench、GPQA Diamond、SciCode等），从数学、科学、编程、推理等多维度对AI模型进行全面评估和排名。

榜首模型

Claude Opus 4.8 (max)

最高得分

模型数量

201

数据版本

2026年05月31日

数据来源: Artificial Analysis

来源：全部国产模型

榜单历史快照月份:

排名总表

排名	模型名称	智能指数	机构
	Claude Opus 4.8 (max)Anthropic	61	Anthropic
	GPT-5.5 (xhigh)OpenAI	60	OpenAI
	GPT-5.5 (high)OpenAI	59	OpenAI
4	Opus 4.7 (max)Anthropic	57	Anthropic
5	Gemini 3.1 Pro PreviewGoogle Deep Mind	57	Google Deep Mind
6	GPT-5.5 (medium)OpenAI	57	OpenAI
7	Qwen3.7 MaxAlibaba	57	Alibaba
8	Gemini 3.5 FlashGoogle	55	Google
9	Gemini 3.5 Flash (medium)Google	55	Google
10	Kimi K2.6Moonshot AI	54	Moonshot AI
11	MiMo-V2.5-ProXiaomi	54	Xiaomi
12	GPT-5.3 Codex (xhigh)OpenAI	54	OpenAI
13	Grok 4.3 (high)xAI	53	xAI
14	Muse SparkFacebook AI研究实验室	52	Facebook AI研究实验室
15	Opus 4.7 (high)Anthropic	52	Anthropic
16	Claude Sonnet 4.6 (max)Anthropic	52	Anthropic
17	DeepSeek-V4-Pro (max)DeepSeek-AI	52	DeepSeek-AI
18	GLM 5.1智谱AI	51	智谱AI
19	GPT-5.5 (low)OpenAI	51	OpenAI
20	Qwen 3.6 Plus Preview阿里巴巴	50	阿里巴巴
21	DeepSeek-V4-Pro (high)DeepSeek-AI	50	DeepSeek-AI
22	MiniMax-M2.7MiniMaxAI	50	MiniMaxAI
23	MiMo-V2.5Xiaomi	49	Xiaomi
24	GPT-5.4 mini (xhigh)OpenAI	49	OpenAI
25	Grok 4.3 (medium)xAI	49	xAI
26	GLM-5-Turbo智谱AI	47	智谱AI
27	DeepSeek-V4-Flash (max)DeepSeek-AI	47	DeepSeek-AI
28	DeepSeek-V4-Flash (high)DeepSeek-AI	46	DeepSeek-AI
29	Qwen3.6-27B阿里巴巴	46	阿里巴巴
30	Qwen3.5-397B-A17B阿里巴巴	45	阿里巴巴
31	Nova 2 Omni（Preview）亚马逊	45	亚马逊
32	Claude Sonnet 4.6 (non-reasoning)Anthropic	44	Anthropic
33	GPT-5.4 nano (xhigh)OpenAI	44	OpenAI
34	Grok 4.3 (low)xAI	44	xAI
35	GLM 5.1智谱AI	44	智谱AI
36	Qwen3.6-35B-A3B阿里巴巴	43	阿里巴巴
37	MiMo-V2-OmniXiaomi	43	Xiaomi
38	Gemini 3.5 Flash (minimal)Google	43	Google
39	Kimi K2.6Moonshot AI	43	Moonshot AI
40	GLM-5V-Turbo智谱AI	43	智谱AI
41	Claude Sonnet 4.6 (Non-reasoning, Low Effort)Anthropic	43	Anthropic
42	Hy3-previewTencent	42	Tencent
43	GPT-5.5 Instant (May 2026)OpenAI	42	OpenAI
44	Qwen3.5-122B-A10B阿里巴巴	42	阿里巴巴
45	Gemini 2.0 Flash ExperimentalDeepMind	41	DeepMind
46	GPT-5.5 (non-reasoning)OpenAI	41	OpenAI
47	Qwen3.5-397B-A17B阿里巴巴	40	阿里巴巴
48	DeepSeek-V4-ProDeepSeek-AI	39	DeepSeek-AI
49	Mistral Medium 3.5Mistral	39	Mistral
50	Gemma 4 31BDeepMind	39	DeepMind
51	Qwen3.5-Omni-Plus阿里巴巴	39	阿里巴巴
52	Step 3.5 FlashStepFunAI	38	StepFunAI
53	Ring-2.6-1TInclusionAI	38	InclusionAI
54	OpenAI o3OpenAI	38	OpenAI
55	GPT-5.4 nanoOpenAI	38	OpenAI
56	GPT-5.4 mini (medium)OpenAI	38	OpenAI
57	Command A+Cohere	37	Cohere
58	Qwen3.6-27B阿里巴巴	37	阿里巴巴
59	Haiku 4.5Anthropic	37	Anthropic
60	DeepSeek-V4-FlashDeepSeek-AI	36	DeepSeek-AI
61	JT-35B-FlashChina Mobile	36	China Mobile
62	NVIDIA Nemotron 3 SuperNVIDIA	36	NVIDIA
63	Qwen3.5-122B-A10B阿里巴巴	36	阿里巴巴
64	Nova 2 Pro（Preview） (medium)亚马逊	36	亚马逊
65	MiMo-V2.5-ProXiaomi	36	Xiaomi
66	Gemini 2.5-ProGoogle Deep Mind	35	Google Deep Mind
67	Nova 2 Lite (high)亚马逊	35	亚马逊
68	Hy3-previewTencent	34	Tencent
69	Ling-2.6-1TInclusionAI	34	InclusionAI
70	Doubao Seed CodeByteDance Seed	34	ByteDance Seed
71	Gemini 3.1 Flash-LiteGoogle	34	Google
72	GPT OSS 120B (high)OpenAI	33	OpenAI
73	Mercury 2Inception	33	Inception
74	Qwen3.5-9B-Instruct阿里巴巴	32	阿里巴巴
75	Gemma 4 31BDeepMind	32	DeepMind
76	K-EXAONELG AI Research	32	LG AI Research
77	Nova 2 Pro（Preview） (low)亚马逊	32	亚马逊
78	Trinity Large ThinkingArcee AI	32	Arcee AI
79	Qwen3.6-35B-A3B阿里巴巴	32	阿里巴巴
80	Gemma 4 26B A4BDeepMind	31	DeepMind
81	Haiku 4.5Anthropic	31	Anthropic
82	Grok 4.3xAI	31	xAI
83	Qwen3.5-35B-A3B阿里巴巴	31	阿里巴巴
84	MiMo-V2-FlashXiaomi	30	Xiaomi
85	EXAONE 4.5 33BLG AI Research	30	LG AI Research
86	Nova 2 Lite (medium)亚马逊	30	亚马逊
87	ERNIE 5.0百度	29	百度
88	Nemotron Cascade 2 30B A3BNVIDIA	28	NVIDIA
89	Qwen3-Coder-Next阿里巴巴	28	阿里巴巴
90	Nova 2 Omni（Preview） (medium)亚马逊	28	亚马逊
91	Mistral Small 4Mistral	28	Mistral
92	Qwen3.5-9B-Instruct阿里巴巴	27	阿里巴巴
93	Magistral Medium 1.2Mistral	27	Mistral
94	Gemma 4 26B A4BDeepMind	27	DeepMind
95	Qwen3.5 4BAlibaba	27	Alibaba
96	Qwen3-Next阿里巴巴	27	阿里巴巴
97	Ling 2.6 FlashInclusionAI	26	InclusionAI
98	Solar Pro 3Upstage	26	Upstage
99	Qwen3.5-Omni-Flash阿里巴巴	26	阿里巴巴
100	JT-MINIChina Mobile	25	China Mobile
101	Nova 2 Lite (low)亚马逊	25	亚马逊
102	GPT OSS 20B (high)OpenAI	24	OpenAI
103	GPT OSS 120B (low)OpenAI	24	OpenAI
104	GPT-5.4 nanoOpenAI	24	OpenAI
105	NVIDIA Nemotron 3 NanoNVIDIA	24	NVIDIA
106	LongCat Flash LiteLongCat	24	LongCat
107	K-EXAONELG AI Research	23	LG AI Research
108	GPT-5.4 miniOpenAI	23	OpenAI
109	Nova 2 Omni（Preview） (low)亚马逊	23	亚马逊
110	Nova 2 Pro（Preview）亚马逊	23	亚马逊
111	Mi:dm K 2.5 ProKorea Telecom	23	Korea Telecom
112	Mistral Large 3MistralAI	23	MistralAI
113	Qwen3.5 4BAlibaba	23	Alibaba
114	INTELLECT-3Prime Intellect	22	Prime Intellect
115	Devstral 2Mistral	22	Mistral
116	Solar Open 100BUpstage	22	Upstage
117	Nemotron 3 Nano Omni 30B A3B ReasoningNVIDIA	21	NVIDIA
118	GPT OSS 20B (low)OpenAI	21	OpenAI
119	Qwen3-Next阿里巴巴	20	阿里巴巴
120	Devstral Small 2Mistral	19	Mistral
121	Motif-2-12.7BMotif Technologies	19	Motif Technologies
122	Nova PremierAmazon	19	Amazon
123	Gemma 4 E4BDeepMind	19	DeepMind
124	Llama Nemotron Super 49B v1.5Meta	19	Meta
125	Mistral Small 4Mistral	19	Mistral
126	Llama 4 MaverickFacebook AI研究实验室	18	Facebook AI研究实验室
127	Magistral Small 1.2Mistral	18	Mistral
128	Sarvam 105B (high)Sarvam	18	Sarvam
129	Nova 2 Lite亚马逊	18	亚马逊
130	MiniCPM5-1BOpenBMB	18	OpenBMB
131	Llama3.1-405BFacebook AI研究实验室	17	Facebook AI研究实验室
132	EXAONE 4.0 32BLG AI Research	17	LG AI Research
133	Nova 2 Omni（Preview）亚马逊	17	亚马逊
134	Qwen3.5 2BAlibaba	16	Alibaba
135	Nanbeige4.1-3BNanbeige	16	Nanbeige
136	Ministral 3 14BMistralAI	16	MistralAI
137	Falcon-H1R-7BTII UAE	16	TII UAE
138	Qwen3-Omni-30B-A3B阿里巴巴	16	阿里巴巴
139	Step3 VL 10BStepFun	15	StepFun
140	Gemma 4 E2BDeepMind	15	DeepMind
141	Llama Nemotron UltraNVIDIA	15	NVIDIA
142	ERNIE-4.5-300B-A47B百度	15	百度
143	Solar Pro 2Upstage	15	Upstage
144	NVIDIA Nemotron Nano 12B v2 VLNVIDIA	15	NVIDIA
145	Ministral 3 8BMistralAI	15	MistralAI
146	Gemma 4 E4BDeepMind	15	DeepMind
147	NVIDIA Nemotron Nano 9B V2NVIDIA	15	NVIDIA
148	Granite 4.1 30BIBM	15	IBM
149	NVIDIA Nemotron 3 Nano 4BNVIDIA	15	NVIDIA
150	Qwen3.5 2BAlibaba	15	Alibaba
151	Llama Nemotron Super 49B v1.5Meta	15	Meta
152	Llama3.3-70B-InstructFacebook AI研究实验室	14	Facebook AI研究实验室
153	Kimi Linear 48B A3B InstructKimi	14	Kimi
154	Ring-flash-2.0InclusionAI	14	InclusionAI
155	Solar Pro 2Upstage	14	Upstage
156	Llama 4 ScoutFacebook AI研究实验室	14	Facebook AI研究实验室
157	C4AI Command A (202503)CohereAI	13	CohereAI
158	Llama 3.1 Nemotron 70BNVIDIA	13	NVIDIA
159	NVIDIA Nemotron 3 NanoNVIDIA	13	NVIDIA
160	NVIDIA Nemotron Nano 9B V2NVIDIA	13	NVIDIA
161	MiniCPM-V 4.6 1.3BOpenBMB	13	OpenBMB
162	Granite 4.1 8BIBM	12	IBM
163	Sarvam 30B (high)Sarvam	12	Sarvam
164	Gemma 4 E2BDeepMind	12	DeepMind
165	R1 1776Perplexity	12	Perplexity
166	Llama 3.2-Vision-90BFacebook AI研究实验室	12	Facebook AI研究实验室
167	EXAONE 4.0 32BLG AI Research	12	LG AI Research
168	Ministral 3 3BMistral	11	Mistral
169	Jamba 1.7 LargeAI21 Labs	11	AI21 Labs
170	Granite 4.0 H SmallIBM	11	IBM
171	Qwen3-Omni-30B-A3B阿里巴巴	11	阿里巴巴
172	Qwen3.5 0.8BAlibaba	11	Alibaba
173	LFM2 24B A2BLiquid AI	10	Liquid AI
174	Phi 4 - 14BMicrosoft Azure	10	Microsoft Azure
175	Amazon Nova Micro亚马逊	10	亚马逊
176	NVIDIA Nemotron Nano 12B v2 VLNVIDIA	10	NVIDIA
177	Phi-4-multimodal-instruct Microsoft Azure	10	Microsoft Azure
178	Qwen3.5 0.8BAlibaba	10	Alibaba
179	Jamba Reasoning 3BAI21 Labs	10	AI21 Labs
180	Gemini 3.0 FlashGoogle Deep Mind	10	Google Deep Mind
181	Ling-mini-2.0InclusionAI	9	InclusionAI
182	Llama 3.2-Vision-11BFacebook AI研究实验室	9	Facebook AI研究实验室
183	Granite 4.1 3BIBM	9	IBM
184	Phi-4-mini-instruct (3.8B)Microsoft Azure	8	Microsoft Azure
185	Exaone 4.0 1.2BLG AI Research	8	LG AI Research
186	Exaone 4.0 1.2BLG AI Research	8	LG AI Research
187	LFM2.5-1.2B-ThinkingLiquid AI	8	Liquid AI
188	Jamba 1.7 MiniAI21 Labs	8	AI21 Labs
189	LFM2 2.6BLiquid AI	8	Liquid AI
190	LFM2.5-1.2B-InstructLiquid AI	8	Liquid AI
191	Granite 4.0 H 1BIBM	8	IBM
192	Gemma 3-270MGoogle Deep Mind	8	Google Deep Mind
193	Apertus 70B InstructSwiss AI	8	Swiss AI
194	Granite 4.0 MicroIBM	8	IBM
195	Granite 4.0 1BIBM	7	IBM
196	LFM2 8B A1BLiquid AI	7	Liquid AI
197	LFM2.5-VL-1.6BLiquid AI	6	Liquid AI
198	Granite 4.0 350MIBM	6	IBM
199	Apertus 8B InstructSwiss AI	6	Swiss AI
200	Granite 4.0 H 350MIBM	5	IBM
201	Tiny Aya GlobalCohere	5	Cohere

数据仅供参考，以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。

评测基准组成（Intelligence Index v4.0）

Intelligence Index 综合10项严格的评测基准，全面衡量AI模型能力，避免单一维度的过拟合。

GDPval-AA

智能体真实任务

τ²-Bench

智能体工具调用

Terminal-Bench

智能体编程

SciCode

编程能力

AA-LCR

长上下文推理

AA-Omniscience

知识与幻觉检测

IFBench

指令遵循

Humanity's Last Exam

推理与知识

GPQA Diamond

科学推理

CritPt

物理推理

常见问题 (FAQ)

什么是 Artificial Analysis Intelligence Index？▼

Artificial Analysis Intelligence Index v4.0 是一个综合评测指数，聚合了10项具有挑战性的评估——涵盖数学、科学、编程、智能体任务和推理——以全面衡量AI能力。它旨在防止单一维度的过拟合，提供一个统一分数来追踪模型进步。

智能指数是如何计算的？▼

该指数综合了10项评测的分数：GDPval-AA（智能体真实任务）、τ²-Bench（工具调用）、Terminal-Bench Hard（智能体编程）、SciCode（编程）、AA-LCR（长上下文推理）、AA-Omniscience（知识与幻觉检测）、IFBench（指令遵循）、Humanity's Last Exam（推理）、GPQA Diamond（科学推理）和 CritPt（物理推理）。所有测试由 Artificial Analysis 在标准化硬件上独立运行。

这与 LMArena 排行榜有什么区别？▼

LMArena 排名基于众包用户投票（盲测A/B对比的Elo评分），反映主观的人类偏好。而 Artificial Analysis Intelligence Index 使用标准化的自动评测基准进行客观评分，衡量特定领域的技术能力。两者各有价值——LMArena 捕捉真实用户体验，而 AA Intelligence Index 提供可复现的技术测量。

在哪里可以找到原始数据？▼

原始排行榜和详细方法论可在 artificialanalysis.ai 查看。Intelligence Index 的方法论详见 Intelligence Index 页面。

Artificial Analysis Intelligence Index AI模型智能指数排行榜

榜首模型

Claude Opus 4.8 (max)

最高得分

模型数量

201

数据版本

2026年05月31日

排名

模型名称

智能指数

机构

Claude Opus 4.8 (max)Anthropic

Anthropic

GPT-5.5 (xhigh)OpenAI

OpenAI

GPT-5.5 (high)OpenAI

OpenAI

Opus 4.7 (max)Anthropic

Anthropic

Gemini 3.1 Pro PreviewGoogle Deep Mind

Google Deep Mind

GPT-5.5 (medium)OpenAI

OpenAI

Qwen3.7 MaxAlibaba

Alibaba

Gemini 3.5 FlashGoogle

Google

Gemini 3.5 Flash (medium)Google

Google

Kimi K2.6Moonshot AI

Moonshot AI

MiMo-V2.5-ProXiaomi

Xiaomi

GPT-5.3 Codex (xhigh)OpenAI

OpenAI

Grok 4.3 (high)xAI

xAI

Muse SparkFacebook AI研究实验室

Facebook AI研究实验室

Opus 4.7 (high)Anthropic

Anthropic

Claude Sonnet 4.6 (max)Anthropic

Anthropic

DeepSeek-V4-Pro (max)DeepSeek-AI

DeepSeek-AI

GLM 5.1智谱AI

智谱AI

GPT-5.5 (low)OpenAI

OpenAI

Qwen 3.6 Plus Preview阿里巴巴

阿里巴巴

DeepSeek-V4-Pro (high)DeepSeek-AI

DeepSeek-AI

MiniMax-M2.7MiniMaxAI

MiniMaxAI

MiMo-V2.5Xiaomi

Xiaomi

GPT-5.4 mini (xhigh)OpenAI

OpenAI

Grok 4.3 (medium)xAI

xAI

GLM-5-Turbo智谱AI

智谱AI

DeepSeek-V4-Flash (max)DeepSeek-AI

DeepSeek-AI

DeepSeek-V4-Flash (high)DeepSeek-AI

DeepSeek-AI

Qwen3.6-27B阿里巴巴

阿里巴巴

Qwen3.5-397B-A17B阿里巴巴

阿里巴巴

Nova 2 Omni（Preview）亚马逊

亚马逊

Claude Sonnet 4.6 (non-reasoning)Anthropic

Anthropic

GPT-5.4 nano (xhigh)OpenAI

OpenAI

Grok 4.3 (low)xAI

xAI

GLM 5.1智谱AI

智谱AI

Qwen3.6-35B-A3B阿里巴巴

阿里巴巴

MiMo-V2-OmniXiaomi

Xiaomi

Gemini 3.5 Flash (minimal)Google

Google

Kimi K2.6Moonshot AI

Moonshot AI

GLM-5V-Turbo智谱AI

智谱AI

Claude Sonnet 4.6 (Non-reasoning, Low Effort)Anthropic

Anthropic

Hy3-previewTencent

Tencent

GPT-5.5 Instant (May 2026)OpenAI

OpenAI

Qwen3.5-122B-A10B阿里巴巴

阿里巴巴

Gemini 2.0 Flash ExperimentalDeepMind

DeepMind

GPT-5.5 (non-reasoning)OpenAI

OpenAI

Qwen3.5-397B-A17B阿里巴巴

阿里巴巴

DeepSeek-V4-ProDeepSeek-AI

DeepSeek-AI

Mistral Medium 3.5Mistral

Mistral

Gemma 4 31BDeepMind

DeepMind

Qwen3.5-Omni-Plus阿里巴巴

阿里巴巴

Step 3.5 FlashStepFunAI

StepFunAI

Ring-2.6-1TInclusionAI

InclusionAI

OpenAI o3OpenAI

OpenAI

GPT-5.4 nanoOpenAI

OpenAI

GPT-5.4 mini (medium)OpenAI

OpenAI

Command A+Cohere

Cohere

Qwen3.6-27B阿里巴巴

阿里巴巴

Haiku 4.5Anthropic

Anthropic

DeepSeek-V4-FlashDeepSeek-AI

DeepSeek-AI

JT-35B-FlashChina Mobile

China Mobile

NVIDIA Nemotron 3 SuperNVIDIA

NVIDIA

Qwen3.5-122B-A10B阿里巴巴

阿里巴巴

Nova 2 Pro（Preview） (medium)亚马逊

亚马逊

MiMo-V2.5-ProXiaomi

Xiaomi

Gemini 2.5-ProGoogle Deep Mind

Google Deep Mind

Nova 2 Lite (high)亚马逊

亚马逊

Hy3-previewTencent

Tencent

Ling-2.6-1TInclusionAI

InclusionAI

Doubao Seed CodeByteDance Seed

ByteDance Seed

Gemini 3.1 Flash-LiteGoogle

Google

GPT OSS 120B (high)OpenAI

OpenAI

Mercury 2Inception

Inception

Qwen3.5-9B-Instruct阿里巴巴

阿里巴巴

Gemma 4 31BDeepMind

DeepMind

K-EXAONELG AI Research

LG AI Research

Nova 2 Pro（Preview） (low)亚马逊

亚马逊

Trinity Large ThinkingArcee AI

Arcee AI

Qwen3.6-35B-A3B阿里巴巴

阿里巴巴

Gemma 4 26B A4BDeepMind

DeepMind

Haiku 4.5Anthropic

Anthropic

Grok 4.3xAI

xAI

Qwen3.5-35B-A3B阿里巴巴

阿里巴巴

MiMo-V2-FlashXiaomi

Xiaomi

EXAONE 4.5 33BLG AI Research

LG AI Research

Nova 2 Lite (medium)亚马逊

亚马逊

ERNIE 5.0百度

百度

Nemotron Cascade 2 30B A3BNVIDIA

NVIDIA

Qwen3-Coder-Next阿里巴巴

阿里巴巴

Nova 2 Omni（Preview） (medium)亚马逊

亚马逊

Mistral Small 4Mistral

Mistral

Qwen3.5-9B-Instruct阿里巴巴

阿里巴巴

Magistral Medium 1.2Mistral

Mistral

Gemma 4 26B A4BDeepMind

DeepMind

Qwen3.5 4BAlibaba

Alibaba

Qwen3-Next阿里巴巴

阿里巴巴

Ling 2.6 FlashInclusionAI

InclusionAI

Solar Pro 3Upstage

Upstage

Qwen3.5-Omni-Flash阿里巴巴

阿里巴巴

100

JT-MINIChina Mobile

China Mobile

101

Nova 2 Lite (low)亚马逊

亚马逊

102

GPT OSS 20B (high)OpenAI

OpenAI

103

GPT OSS 120B (low)OpenAI

OpenAI

104

GPT-5.4 nanoOpenAI

OpenAI

105

NVIDIA Nemotron 3 NanoNVIDIA

NVIDIA

106

LongCat Flash LiteLongCat

LongCat

107

K-EXAONELG AI Research

LG AI Research

108

GPT-5.4 miniOpenAI

OpenAI

109

Nova 2 Omni（Preview） (low)亚马逊

亚马逊

110

Nova 2 Pro（Preview）亚马逊

亚马逊

111

Mi:dm K 2.5 ProKorea Telecom

Korea Telecom

112

Mistral Large 3MistralAI

MistralAI

113

Qwen3.5 4BAlibaba

Alibaba

114

INTELLECT-3Prime Intellect

Prime Intellect

115

Devstral 2Mistral

Mistral

116

Solar Open 100BUpstage

Upstage

117

Nemotron 3 Nano Omni 30B A3B ReasoningNVIDIA

NVIDIA

118

GPT OSS 20B (low)OpenAI

OpenAI

119

Qwen3-Next阿里巴巴

阿里巴巴

120

Devstral Small 2Mistral

Mistral

121

Motif-2-12.7BMotif Technologies

Motif Technologies

122

Nova PremierAmazon

Amazon

123

Gemma 4 E4BDeepMind

DeepMind

124

Llama Nemotron Super 49B v1.5Meta