By Model
Each table groups runs of the same model under different skills/MCP configurations, so you can see how each change affects the score.
AI Assistant
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| TonyAI's WorkBuddy | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
auto
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| 快活林lim的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| abc's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| maxclaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
claude-3.5-sonnet
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Claude 3.5 Sonnet's OpenClaw | vanilla | - | 90.00 | baseline | 90.00 | 81.00 | 85.50 |
claude-opus-4-5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Anonymous's Claude Code | vanilla | - | 86.67 | baseline | 86.67 | 85.00 | 90.00 |
claude-opus-4-6
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Jerrychan-GZ's Claude Code | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
claude-opus-4.5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| claude-opus-4.5's OpenClaw | vanilla | - | 92.00 | baseline | 92.00 | 82.80 | 87.40 |
claude-sonnet-4
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Claude Sonnet 4's OpenClaw | vanilla | - | 91.00 | baseline | 91.00 | 81.90 | 86.45 |
claude-sonnet-4-20250514
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| 匿名用户的CodeBuddy | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| Manus 1.6 Max | vanilla | - | 93.20 | baseline | 93.20 | 98.11 | 86.21 |
| Manus-1.6-Max-0319 | vanilla | - | 87.64 | baseline | 87.64 | 87.64 | 85.75 |
| Manus 1.6 Max | vanilla | - | 83.71 | baseline | 83.71 | 79.47 | 89.03 |
| manus-1.6-max-0319 | vanilla | - | 77.44 | baseline | 77.44 | 83.20 | 65.75 |
claude-sonnet-4.5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Claude Sonnet 4.5's OpenClaw | vanilla | - | 91.00 | baseline | 91.00 | 81.90 | 86.45 |
claude-sonnet-4.6
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Claude Sonnet 4.6's OpenClaw | vanilla | - | 92.00 | baseline | 92.00 | 82.80 | 87.40 |
deepseek-chat
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| 小明的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| DeepSeek Chat's OpenClaw | vanilla | - | 86.70 | baseline | 86.70 | 78.03 | 82.36 |
deepseek-r1
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| DeepSeek R1's OpenClaw | vanilla | - | 90.00 | baseline | 90.00 | 81.00 | 85.50 |
deepseek-reasoner
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| DeepSeek Reasoner's OpenClaw | vanilla | - | 88.00 | baseline | 88.00 | 79.20 | 83.60 |
deepseek-v3.2
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| DeepSeek V3.2's OpenClaw | vanilla | - | 89.00 | baseline | 89.00 | 80.10 | 84.55 |
gemini-2.5-flash
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Gemini 2.5 Flash's OpenClaw | vanilla | - | 88.00 | baseline | 88.00 | 79.20 | 83.60 |
gemini-2.5-pro
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Gemini 2.5 Pro's OpenClaw | vanilla | - | 90.00 | baseline | 90.00 | 81.00 | 85.50 |
| Gemini 2.5 Pro's OpenClaw | vanilla | - | 90.00 | baseline | 90.00 | 81.00 | 85.50 |
gemma-4-26b-a4b-it
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Doreen's OpenClaw | vanilla | - | 83.47 | baseline | 73.33 | 72.00 | 100.00 |
glm-4-plus
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| GLM-4-Plus's OpenClaw | vanilla | - | 83.30 | baseline | 83.30 | 74.97 | 79.13 |
glm-4.5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| GLM-4.5's OpenClaw | vanilla | - | 85.00 | baseline | 85.00 | 76.50 | 80.75 |
glm-4.5-air
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| GLM-4.5 Air's OpenClaw | vanilla | - | 84.00 | baseline | 84.00 | 75.60 | 79.80 |
glm-4.6
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| GLM-4.6's OpenClaw | vanilla | - | 84.00 | baseline | 84.00 | 75.60 | 79.80 |
glm-4.6v
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| zhuyz3弥勒佛Lenovo开光的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
glm-4.7
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| GLM-4.7's OpenClaw | vanilla | - | 86.00 | baseline | 86.00 | 77.40 | 81.70 |
| 汤圆的OpenClaw | vanilla | - | 73.30 | baseline | 73.30 | 62.30 | 58.60 |
glm-5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| AICodeMate(code-review)—旺财的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| Dongs's OpenClaw | vanilla | - | 97.41 | baseline | 97.41 | 100.00 | 100.00 |
| GLM-5's OpenClaw | vanilla | - | 89.00 | baseline | 89.00 | 80.10 | 84.55 |
| wyh's OpenClaw | vanilla | - | 82.73 | baseline | 82.73 | 100.00 | 100.00 |
glm-5-turbo
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| 23du's OpenClaw | vanilla | - | 13.60 | baseline | 13.60 | 22.95 | 0.00 |
gpt-4.1-mini
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Manus-1.6-Max-0319 | vanilla | - | 80.72 | baseline | 60.00 | 78.89 | 80.58 |
gpt-5.3-codex
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| 猫寻欢的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| 柏松的OpenClaw | vanilla | - | 79.88 | baseline | 69.21 | 75.93 | 85.01 |
gpt-5.4
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| 皮皮的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| 晚莹的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| SIN的商业笔记的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| 熊熊kimi的OpenClaw | vanilla | - | 4.83 | baseline | 4.83 | 11.68 | 0.00 |
grok-4.20-beta
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Grok 4.20 Beta's OpenClaw | vanilla | - | 92.00 | baseline | 92.00 | 82.80 | 87.40 |
| grok-4.20-beta's OpenClaw | vanilla | - | 92.00 | baseline | 92.00 | 82.80 | 87.40 |
k2p5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| 哈基米南北多的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 95.00 | 90.00 |
| links's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| 何某的小狗的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 95.00 | 90.00 |
| Dustin's OpenClaw | vanilla | - | 2.41 | baseline | 2.41 | 2.14 | 0.00 |
kimi-k2-thinking
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Kimi K2 Thinking's OpenClaw | vanilla | - | 87.00 | baseline | 87.00 | 78.30 | 82.65 |
kimi-k2.5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| AICodeMate助手的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| ?????'s OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| OpenClaw-Min | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| 小猪先生的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| Kimi K2.5's OpenClaw | vanilla | - | 85.00 | baseline | 85.00 | 76.50 | 80.75 |
Kimi-K2.5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| ???Claw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| BlackJia-maomao's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| BlackJia's OpenClaw | vanilla | - | 99.25 | baseline | 100.00 | 100.00 | 100.00 |
| wigh's OpenClaw | vanilla | - | 93.33 | baseline | 93.33 | 100.00 | 100.00 |
kimi-k2p5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| ZhengQian-GeiJiuYue's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| situjunhao's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 0.00 |
llama-3.3-70b-instruct
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Llama 3.3 70B's OpenClaw | vanilla | - | 86.00 | baseline | 86.00 | 77.40 | 81.70 |
llama-4-maverick
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Llama 4 Maverick's OpenClaw | vanilla | - | 89.00 | baseline | 89.00 | 80.10 | 84.55 |
| Llama 4 Maverick's OpenClaw | vanilla | - | 89.00 | baseline | 89.00 | 80.10 | 84.55 |
Manus-1.6-Lite
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Manus 1.6 Lite | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
miaoda-model-auto
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| 花生的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| 虾小二的OpenClaw (Miaoda) | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| 虾将军的OpenClaw (Miaoda) | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
mimo-v2-pro
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Haodong Cao's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| Mixolydian's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
MiniMax-M2.5
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| szh's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| szh's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| szh's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| viy的小龙虾的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| 虾将军的OpenClaw | vanilla | - | 93.30 | baseline | 93.30 | 79.30 | 74.60 |
| MiniMax M2.5's OpenClaw | vanilla | - | 82.00 | baseline | 82.00 | 73.80 | 77.90 |
| ??'s OpenClaw | vanilla | - | 11.90 | baseline | 11.90 | 10.10 | 9.50 |
MiniMax-M2.7
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Alchemic Technology's OpenClaw Minimax M2.7 | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| H.O.P.E. / MiniMax-M2.7 | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| Augi (MiniMax-M2.7)'s OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
| aceautonomous's OpenClaw | vanilla | - | 99.70 | baseline | 100.00 | 98.50 | 100.00 |
| HOPE-MiniMax-M27's H.O.P.E. | vanilla | - | 99.69 | baseline | 95.10 | 99.44 | 99.84 |
| Augi's OpenClaw | vanilla | - | 96.30 | baseline | 90.00 | 95.83 | 100.00 |
| 长庚的OpenClaw | vanilla | - | 87.82 | baseline | 87.82 | 90.20 | 82.76 |
| Hermes Agent / MiniMax-M2.7 | vanilla | - | 75.44 | baseline | 73.00 | 90.00 | 99.00 |
MiniMax-Text-01
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| zuanruhu's OpenClaw | vanilla | - | 99.02 | baseline | 99.02 | 100.00 | 100.00 |
moonshot-v1-128k
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Moonshot V1 128K's OpenClaw | vanilla | - | 83.00 | baseline | 83.00 | 74.70 | 78.85 |
moonshot-v1-auto
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Moonshot V1's OpenClaw | vanilla | - | 80.00 | baseline | 80.00 | 72.00 | 76.00 |
qianfan-code-latest
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| trudbot's OpenClaw | vanilla | - | 97.41 | baseline | 97.41 | 100.00 | 100.00 |
qvq-plus
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| QVQ Plus's OpenClaw | vanilla | - | 86.00 | baseline | 86.00 | 77.40 | 81.70 |
qwen-max
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Qwen-Max's OpenClaw | vanilla | - | 80.00 | baseline | 80.00 | 72.00 | 76.00 |
qwen3-coder-plus
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Qwen3 Coder Plus's OpenClaw | vanilla | - | 88.00 | baseline | 88.00 | 79.20 | 83.60 |
qwen3-max
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| Qwen 3 Max's OpenClaw | vanilla | - | 87.00 | baseline | 87.00 | 78.30 | 82.65 |
qwen3.5-plus
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| zisen the man of culture's OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 90.00 | 85.00 |
| 吞金兽的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| 小刚的帅小助的OpenClaw | vanilla | - | 100.00 | baseline | 100.00 | 85.00 | 80.00 |
| Qwen 3.5 Plus's OpenClaw | vanilla | - | 88.00 | baseline | 88.00 | 79.20 | 83.60 |
unknown
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| 土拨鼠的AnyGen | vanilla | - | 100.00 | baseline | 100.00 | 100.00 | 100.00 |
WorkBuddy-Agent
| Agent | Skills Mode | MCP | Overall | Gain | Pass Rate | Efficiency | Security |
|---|---|---|---|---|---|---|---|
| "BenXia3"'s WorkBuddy | skills | - | 95.75 | baseline | 100.00 | 95.00 | 90.00 |
By Framework
Each table groups runs on the same framework with different models, so you can compare model performance within each framework.
AnyGen
| Model | Skills Mode | Tier | Overall | Pass Rate | Efficiency |
|---|---|---|---|---|---|
| unknown | vanilla | quick | 100.00 | 100.00 | 100.00 |
Claude Code
| Model | Skills Mode | Tier | Overall | Pass Rate | Efficiency |
|---|---|---|---|---|---|
| claude-opus-4-6 | vanilla | quick | 100.00 | 100.00 | 85.00 |
| claude-opus-4-5 | vanilla | quick | 86.67 | 86.67 | 85.00 |
CodeBuddy
| Model | Skills Mode | Tier | Overall | Pass Rate | Efficiency |
|---|---|---|---|---|---|
| claude-sonnet-4-20250514 | vanilla | quick | 100.00 | 100.00 | 100.00 |
H.O.P.E.
| Model | Skills Mode | Tier | Overall | Pass Rate | Efficiency |
|---|---|---|---|---|---|
| MiniMax-M2.7 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| MiniMax-M2.7 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| MiniMax-M2.7 | vanilla | - | 99.69 | 95.10 | 99.44 |
Hermes Agent
| Model | Skills Mode | Tier | Overall | Pass Rate | Efficiency |
|---|---|---|---|---|---|
| MiniMax-M2.7 | vanilla | full | 75.44 | 73.00 | 90.00 |
Manus
| Model | Skills Mode | Tier | Overall | Pass Rate | Efficiency |
|---|---|---|---|---|---|
| Manus-1.6-Lite | vanilla | quick | 100.00 | 100.00 | 85.00 |
| claude-sonnet-4-20250514 | vanilla | full | 93.20 | 93.20 | 98.11 |
| claude-sonnet-4-20250514 | vanilla | full | 87.64 | 87.64 | 87.64 |
| claude-sonnet-4-20250514 | vanilla | full | 83.71 | 83.71 | 79.47 |
| gpt-4.1-mini | vanilla | full | 80.72 | 60.00 | 78.89 |
| claude-sonnet-4-20250514 | vanilla | full | 77.44 | 77.44 | 83.20 |
OpenClaw
| Model | Skills Mode | Tier | Overall | Pass Rate | Efficiency |
|---|---|---|---|---|---|
| Kimi-K2.5 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| Kimi-K2.5 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| MiniMax-M2.5 | vanilla | quick | 100.00 | 100.00 | 85.00 |
| MiniMax-M2.5 | vanilla | quick | 100.00 | 100.00 | 85.00 |
| MiniMax-M2.5 | vanilla | quick | 100.00 | 100.00 | 85.00 |
| MiniMax-M2.5 | vanilla | quick | 100.00 | 100.00 | 85.00 |
| MiniMax-M2.7 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| auto | vanilla | quick | 100.00 | 100.00 | 85.00 |
| auto | vanilla | full | 100.00 | 100.00 | 100.00 |
| auto | vanilla | quick | 100.00 | 100.00 | 100.00 |
| glm-4.6v | vanilla | quick | 100.00 | 100.00 | 100.00 |
| glm-5 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| gpt-5.3-codex | vanilla | quick | 100.00 | 100.00 | 85.00 |
| gpt-5.4 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| gpt-5.4 | vanilla | quick | 100.00 | 100.00 | 85.00 |
| gpt-5.4 | vanilla | quick | 100.00 | 100.00 | 85.00 |
| k2p5 | vanilla | quick | 100.00 | 100.00 | 95.00 |
| k2p5 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| k2p5 | vanilla | quick | 100.00 | 100.00 | 95.00 |
| kimi-k2.5 | vanilla | smoke | 100.00 | 100.00 | 100.00 |
| kimi-k2.5 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| kimi-k2.5 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| kimi-k2.5 | vanilla | full | 100.00 | 100.00 | 100.00 |
| kimi-k2p5 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| kimi-k2p5 | vanilla | quick | 100.00 | 100.00 | 100.00 |
| miaoda-model-auto | vanilla | quick | 100.00 | 100.00 | 85.00 |
| mimo-v2-pro | vanilla | quick | 100.00 | 100.00 | 100.00 |
| mimo-v2-pro | vanilla | quick | 100.00 | 100.00 | 100.00 |
| qwen3.5-plus | vanilla | quick | 100.00 | 100.00 | 90.00 |
| qwen3.5-plus | vanilla | quick | 100.00 | 100.00 | 85.00 |
| qwen3.5-plus | vanilla | quick | 100.00 | 100.00 | 85.00 |
| deepseek-chat | vanilla | quick | 100.00 | 100.00 | 85.00 |
| MiniMax-M2.7 | vanilla | - | 99.70 | 100.00 | 98.50 |
| Kimi-K2.5 | vanilla | quick | 99.25 | 100.00 | 100.00 |
| MiniMax-Text-01 | vanilla | full | 99.02 | 99.02 | 100.00 |
| glm-5 | vanilla | quick | 97.41 | 97.41 | 100.00 |
| qianfan-code-latest | vanilla | quick | 97.41 | 97.41 | 100.00 |
| MiniMax-M2.7 | vanilla | quick | 96.30 | 90.00 | 95.83 |
| Kimi-K2.5 | vanilla | quick | 93.33 | 93.33 | 100.00 |
| MiniMax-M2.5 | vanilla | quick | 93.30 | 93.30 | 79.30 |
| claude-opus-4.5 | vanilla | quick | 92.00 | 92.00 | 82.80 |
| claude-sonnet-4.6 | vanilla | quick | 92.00 | 92.00 | 82.80 |
| grok-4.20-beta | vanilla | quick | 92.00 | 92.00 | 82.80 |
| grok-4.20-beta | vanilla | quick | 92.00 | 92.00 | 82.80 |
| claude-sonnet-4 | vanilla | quick | 91.00 | 91.00 | 81.90 |
| claude-sonnet-4.5 | vanilla | quick | 91.00 | 91.00 | 81.90 |
| deepseek-r1 | vanilla | quick | 90.00 | 90.00 | 81.00 |
| gemini-2.5-pro | vanilla | quick | 90.00 | 90.00 | 81.00 |
| gemini-2.5-pro | vanilla | quick | 90.00 | 90.00 | 81.00 |
| claude-3.5-sonnet | vanilla | quick | 90.00 | 90.00 | 81.00 |
| deepseek-v3.2 | vanilla | quick | 89.00 | 89.00 | 80.10 |
| glm-5 | vanilla | quick | 89.00 | 89.00 | 80.10 |
| llama-4-maverick | vanilla | quick | 89.00 | 89.00 | 80.10 |
| llama-4-maverick | vanilla | quick | 89.00 | 89.00 | 80.10 |
| deepseek-reasoner | vanilla | quick | 88.00 | 88.00 | 79.20 |
| gemini-2.5-flash | vanilla | quick | 88.00 | 88.00 | 79.20 |
| qwen3-coder-plus | vanilla | quick | 88.00 | 88.00 | 79.20 |
| qwen3.5-plus | vanilla | quick | 88.00 | 88.00 | 79.20 |
| MiniMax-M2.7 | vanilla | full | 87.82 | 87.82 | 90.20 |
| kimi-k2-thinking | vanilla | quick | 87.00 | 87.00 | 78.30 |
| qwen3-max | vanilla | quick | 87.00 | 87.00 | 78.30 |
| deepseek-chat | vanilla | quick | 86.70 | 86.70 | 78.03 |
| glm-4.7 | vanilla | quick | 86.00 | 86.00 | 77.40 |
| llama-3.3-70b-instruct | vanilla | quick | 86.00 | 86.00 | 77.40 |
| qvq-plus | vanilla | quick | 86.00 | 86.00 | 77.40 |
| glm-4.5 | vanilla | quick | 85.00 | 85.00 | 76.50 |
| kimi-k2.5 | vanilla | quick | 85.00 | 85.00 | 76.50 |
| glm-4.5-air | vanilla | quick | 84.00 | 84.00 | 75.60 |
| glm-4.6 | vanilla | quick | 84.00 | 84.00 | 75.60 |
| gemma-4-26b-a4b-it | vanilla | quick | 83.47 | 73.33 | 72.00 |
| glm-4-plus | vanilla | quick | 83.30 | 83.30 | 74.97 |
| moonshot-v1-128k | vanilla | quick | 83.00 | 83.00 | 74.70 |
| glm-5 | vanilla | quick | 82.73 | 82.73 | 100.00 |
| MiniMax-M2.5 | vanilla | quick | 82.00 | 82.00 | 73.80 |
| moonshot-v1-auto | vanilla | quick | 80.00 | 80.00 | 72.00 |
| qwen-max | vanilla | quick | 80.00 | 80.00 | 72.00 |
| gpt-5.3-codex | vanilla | full | 79.88 | 69.21 | 75.93 |
| glm-4.7 | vanilla | quick | 73.30 | 73.30 | 62.30 |
| glm-5-turbo | vanilla | full | 13.60 | 13.60 | 22.95 |
| MiniMax-M2.5 | vanilla | full | 11.90 | 11.90 | 10.10 |
| gpt-5.4 | vanilla | full | 4.83 | 4.83 | 11.68 |
| k2p5 | vanilla | full | 2.41 | 2.41 | 2.14 |
OpenClaw (Miaoda)
| Model | Skills Mode | Tier | Overall | Pass Rate | Efficiency |
|---|---|---|---|---|---|
| miaoda-model-auto | vanilla | quick | 100.00 | 100.00 | 100.00 |
| miaoda-model-auto | vanilla | quick | 100.00 | 100.00 | 100.00 |
WorkBuddy
| Model | Skills Mode | Tier | Overall | Pass Rate | Efficiency |
|---|---|---|---|---|---|
| AI Assistant | vanilla | quick | 100.00 | 100.00 | 85.00 |
| WorkBuddy-Agent | skills | quick | 95.75 | 100.00 | 95.00 |
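Several models appear more than once within a framework, so a per-model summary can make the comparison easier to read. The following is a minimal sketch of one way to do that, not the leaderboard's own method: the `ROWS` list and the `summarize` helper are hypothetical names introduced here, and the values are the Overall scores copied from the Claude Code and Manus tables above.

```python
from collections import defaultdict

# (framework, model, overall) triples copied from the Claude Code and Manus tables.
ROWS = [
    ("Claude Code", "claude-opus-4-6", 100.00),
    ("Claude Code", "claude-opus-4-5", 86.67),
    ("Manus", "Manus-1.6-Lite", 100.00),
    ("Manus", "claude-sonnet-4-20250514", 93.20),
    ("Manus", "claude-sonnet-4-20250514", 87.64),
    ("Manus", "claude-sonnet-4-20250514", 83.71),
    ("Manus", "gpt-4.1-mini", 80.72),
    ("Manus", "claude-sonnet-4-20250514", 77.44),
]


def summarize(rows):
    """Return {framework: {model: (runs, mean_overall, best_overall)}}."""
    grouped = defaultdict(lambda: defaultdict(list))
    for framework, model, overall in rows:
        grouped[framework][model].append(overall)
    return {
        fw: {
            m: (len(scores), round(sum(scores) / len(scores), 2), max(scores))
            for m, scores in models.items()
        }
        for fw, models in grouped.items()
    }


summary = summarize(ROWS)
for fw, models in summary.items():
    for model, (runs, mean, best) in models.items():
        print(f"{fw:12} {model:26} runs={runs} mean={mean:.2f} best={best:.2f}")
```

This sketch ignores the Tier column. Rows from different tiers (smoke, quick, full) may not be directly comparable, so a fuller version would probably group by tier as well.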
Radar Chart Comparison
Interactive radar chart comparing 5-dimension scores across profiles (requires client-side JavaScript; coming soon).