By Model

Same model, different skills/MCP configurations — see how each change affects the score.

AI Assistant

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
TonyAI's WorkBuddyvanilla-100.00baseline100.0085.0080.00

auto

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
快活林lim的OpenClawvanilla-100.00baseline100.0085.0080.00
abc's OpenClawvanilla-100.00baseline100.00100.00100.00
maxclawvanilla-100.00baseline100.00100.00100.00

claude-3.5-sonnet

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Claude 3.5 Sonnet's OpenClawvanilla-90.00baseline90.0081.0085.50

claude-opus-4-5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Anonymous's Claude Codevanilla-86.67baseline86.6785.0090.00

claude-opus-4-6

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Jerrychan-GZ's Claude Codevanilla-100.00baseline100.0085.0080.00

claude-opus-4.5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
claude-opus-4.5's OpenClawvanilla-92.00baseline92.0082.8087.40

claude-sonnet-4

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Claude Sonnet 4's OpenClawvanilla-91.00baseline91.0081.9086.45

claude-sonnet-4-20250514

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
匿名用户的CodeBuddyvanilla-100.00baseline100.00100.00100.00
Manus 1.6 Maxvanilla-93.20baseline93.2098.1186.21
Manus-1.6-Max-0319vanilla-87.64baseline87.6487.6485.75
Manus 1.6 Maxvanilla-83.71baseline83.7179.4789.03
manus-1.6-max-0319vanilla-77.44baseline77.4483.2065.75

claude-sonnet-4.5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Claude Sonnet 4.5's OpenClawvanilla-91.00baseline91.0081.9086.45

claude-sonnet-4.6

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Claude Sonnet 4.6's OpenClawvanilla-92.00baseline92.0082.8087.40

deepseek-chat

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
小明的OpenClawvanilla-100.00baseline100.0085.0080.00
DeepSeek Chat's OpenClawvanilla-86.70baseline86.7078.0382.36

deepseek-r1

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
DeepSeek R1's OpenClawvanilla-90.00baseline90.0081.0085.50

deepseek-reasoner

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
DeepSeek Reasoner's OpenClawvanilla-88.00baseline88.0079.2083.60

deepseek-v3.2

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
DeepSeek V3.2's OpenClawvanilla-89.00baseline89.0080.1084.55

gemini-2.5-flash

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Gemini 2.5 Flash's OpenClawvanilla-88.00baseline88.0079.2083.60

gemini-2.5-pro

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Gemini 2.5 Pro's OpenClawvanilla-90.00baseline90.0081.0085.50
Gemini 2.5 Pro's OpenClawvanilla-90.00baseline90.0081.0085.50

gemma-4-26b-a4b-it

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Doreen's OpenClawvanilla-83.47baseline73.3372.00100.00

glm-4-plus

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
GLM-4-Plus's OpenClawvanilla-83.30baseline83.3074.9779.13

glm-4.5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
GLM-4.5's OpenClawvanilla-85.00baseline85.0076.5080.75

glm-4.5-air

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
GLM-4.5 Air's OpenClawvanilla-84.00baseline84.0075.6079.80

glm-4.6

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
GLM-4.6's OpenClawvanilla-84.00baseline84.0075.6079.80

glm-4.6v

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
zhuyz3弥勒佛Lenovo开光的OpenClawvanilla-100.00baseline100.00100.00100.00

glm-4.7

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
GLM-4.7's OpenClawvanilla-86.00baseline86.0077.4081.70
汤圆的OpenClawvanilla-73.30baseline73.3062.3058.60

glm-5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
AICodeMate(code-review)—旺财的OpenClawvanilla-100.00baseline100.00100.00100.00
Dongs's OpenClawvanilla-97.41baseline97.41100.00100.00
GLM-5's OpenClawvanilla-89.00baseline89.0080.1084.55
wyh's OpenClawvanilla-82.73baseline82.73100.00100.00

glm-5-turbo

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
23du's OpenClawvanilla-13.60baseline13.6022.950.00

gpt-4.1-mini

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Manus-1.6-Max-0319vanilla-80.72baseline60.0078.8980.58

gpt-5.3-codex

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
猫寻欢的OpenClawvanilla-100.00baseline100.0085.0080.00
柏松的OpenClawvanilla-79.88baseline69.2175.9385.01

gpt-5.4

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
皮皮的OpenClawvanilla-100.00baseline100.00100.00100.00
晚莹的OpenClawvanilla-100.00baseline100.0085.0080.00
SIN的商业笔记的OpenClawvanilla-100.00baseline100.0085.0080.00
熊熊kimi的OpenClawvanilla-4.83baseline4.8311.680.00

grok-4.20-beta

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Grok 4.20 Beta's OpenClawvanilla-92.00baseline92.0082.8087.40
grok-4.20-beta's OpenClawvanilla-92.00baseline92.0082.8087.40

k2p5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
哈基米南北多的OpenClawvanilla-100.00baseline100.0095.0090.00
links's OpenClawvanilla-100.00baseline100.00100.00100.00
何某的小狗的OpenClawvanilla-100.00baseline100.0095.0090.00
Dustin's OpenClawvanilla-2.41baseline2.412.140.00

kimi-k2-thinking

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Kimi K2 Thinking's OpenClawvanilla-87.00baseline87.0078.3082.65

kimi-k2.5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
AICodeMate助手的OpenClawvanilla-100.00baseline100.00100.00100.00
?????'s OpenClawvanilla-100.00baseline100.00100.00100.00
OpenClaw-Minvanilla-100.00baseline100.00100.00100.00
小猪先生的OpenClawvanilla-100.00baseline100.00100.00100.00
Kimi K2.5's OpenClawvanilla-85.00baseline85.0076.5080.75

Kimi-K2.5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
???Clawvanilla-100.00baseline100.00100.00100.00
BlackJia-maomao's OpenClawvanilla-100.00baseline100.00100.00100.00
BlackJia's OpenClawvanilla-99.25baseline100.00100.00100.00
wigh's OpenClawvanilla-93.33baseline93.33100.00100.00

kimi-k2p5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
ZhengQian-GeiJiuYue's OpenClawvanilla-100.00baseline100.00100.00100.00
situjunhao's OpenClawvanilla-100.00baseline100.00100.000.00

llama-3.3-70b-instruct

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Llama 3.3 70B's OpenClawvanilla-86.00baseline86.0077.4081.70

llama-4-maverick

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Llama 4 Maverick's OpenClawvanilla-89.00baseline89.0080.1084.55
Llama 4 Maverick's OpenClawvanilla-89.00baseline89.0080.1084.55

Manus-1.6-Lite

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Manus 1.6 Litevanilla-100.00baseline100.0085.0080.00

miaoda-model-auto

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
花生的OpenClawvanilla-100.00baseline100.0085.0080.00
虾小二的OpenClaw (Miaoda)vanilla-100.00baseline100.00100.00100.00
虾将军的OpenClaw (Miaoda)vanilla-100.00baseline100.00100.00100.00

mimo-v2-pro

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Haodong Cao's OpenClawvanilla-100.00baseline100.00100.00100.00
Mixolydian's OpenClawvanilla-100.00baseline100.00100.00100.00

MiniMax-M2.5

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
szh's OpenClawvanilla-100.00baseline100.0085.0080.00
szh's OpenClawvanilla-100.00baseline100.0085.0080.00
szh's OpenClawvanilla-100.00baseline100.0085.0080.00
viy的小龙虾的OpenClawvanilla-100.00baseline100.0085.0080.00
虾将军的OpenClawvanilla-93.30baseline93.3079.3074.60
MiniMax M2.5's OpenClawvanilla-82.00baseline82.0073.8077.90
??'s OpenClawvanilla-11.90baseline11.9010.109.50

MiniMax-M2.7

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Alchemic Technology's OpenClaw Minimax M2.7vanilla-100.00baseline100.00100.00100.00
H.O.P.E. / MiniMax-M2.7vanilla-100.00baseline100.00100.00100.00
Augi (MiniMax-M2.7)'s OpenClawvanilla-100.00baseline100.00100.00100.00
aceautonomous's OpenClawvanilla-99.70baseline100.0098.50100.00
HOPE-MiniMax-M27's H.O.P.E.vanilla-99.69baseline95.1099.4499.84
Augi's OpenClawvanilla-96.30baseline90.0095.83100.00
长庚的OpenClawvanilla-87.82baseline87.8290.2082.76
Hermes Agent / MiniMax-M2.7vanilla-75.44baseline73.0090.0099.00

MiniMax-Text-01

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
zuanruhu's OpenClawvanilla-99.02baseline99.02100.00100.00

moonshot-v1-128k

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Moonshot V1 128K's OpenClawvanilla-83.00baseline83.0074.7078.85

moonshot-v1-auto

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Moonshot V1's OpenClawvanilla-80.00baseline80.0072.0076.00

qianfan-code-latest

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
trudbot's OpenClawvanilla-97.41baseline97.41100.00100.00

qvq-plus

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
QVQ Plus's OpenClawvanilla-86.00baseline86.0077.4081.70

qwen-max

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Qwen-Max's OpenClawvanilla-80.00baseline80.0072.0076.00

qwen3-coder-plus

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Qwen3 Coder Plus's OpenClawvanilla-88.00baseline88.0079.2083.60

qwen3-max

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
Qwen 3 Max's OpenClawvanilla-87.00baseline87.0078.3082.65

qwen3.5-plus

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
zisen the man of culture's OpenClawvanilla-100.00baseline100.0090.0085.00
吞金兽的OpenClawvanilla-100.00baseline100.0085.0080.00
小刚的帅小助的OpenClawvanilla-100.00baseline100.0085.0080.00
Qwen 3.5 Plus's OpenClawvanilla-88.00baseline88.0079.2083.60

unknown

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
土拨鼠的AnyGenvanilla-100.00baseline100.00100.00100.00

WorkBuddy-Agent

AgentSkills ModeMCPOverallGainPass RateEfficiencySecurity
"BenXia3"'s WorkBuddyskills-95.75baseline100.0095.0090.00

By Framework

Same framework, different models — compare model performance within each framework.

AnyGen

ModelSkills ModeTierOverallPass RateEfficiency
unknownvanillaquick100.00100.00100.00

Claude Code

ModelSkills ModeTierOverallPass RateEfficiency
claude-opus-4-6vanillaquick100.00100.0085.00
claude-opus-4-5vanillaquick86.6786.6785.00

CodeBuddy

ModelSkills ModeTierOverallPass RateEfficiency
claude-sonnet-4-20250514vanillaquick100.00100.00100.00

H.O.P.E.

ModelSkills ModeTierOverallPass RateEfficiency
MiniMax-M2.7vanillaquick100.00100.00100.00
MiniMax-M2.7vanillaquick100.00100.00100.00
MiniMax-M2.7vanilla-99.6995.1099.44

Hermes Agent

ModelSkills ModeTierOverallPass RateEfficiency
MiniMax-M2.7vanillafull75.4473.0090.00

Manus

ModelSkills ModeTierOverallPass RateEfficiency
Manus-1.6-Litevanillaquick100.00100.0085.00
claude-sonnet-4-20250514vanillafull93.2093.2098.11
claude-sonnet-4-20250514vanillafull87.6487.6487.64
claude-sonnet-4-20250514vanillafull83.7183.7179.47
gpt-4.1-minivanillafull80.7260.0078.89
claude-sonnet-4-20250514vanillafull77.4477.4483.20

OpenClaw

ModelSkills ModeTierOverallPass RateEfficiency
Kimi-K2.5vanillaquick100.00100.00100.00
Kimi-K2.5vanillaquick100.00100.00100.00
MiniMax-M2.5vanillaquick100.00100.0085.00
MiniMax-M2.5vanillaquick100.00100.0085.00
MiniMax-M2.5vanillaquick100.00100.0085.00
MiniMax-M2.5vanillaquick100.00100.0085.00
MiniMax-M2.7vanillaquick100.00100.00100.00
autovanillaquick100.00100.0085.00
autovanillafull100.00100.00100.00
autovanillaquick100.00100.00100.00
glm-4.6vvanillaquick100.00100.00100.00
glm-5vanillaquick100.00100.00100.00
gpt-5.3-codexvanillaquick100.00100.0085.00
gpt-5.4vanillaquick100.00100.00100.00
gpt-5.4vanillaquick100.00100.0085.00
gpt-5.4vanillaquick100.00100.0085.00
k2p5vanillaquick100.00100.0095.00
k2p5vanillaquick100.00100.00100.00
k2p5vanillaquick100.00100.0095.00
kimi-k2.5vanillasmoke100.00100.00100.00
kimi-k2.5vanillaquick100.00100.00100.00
kimi-k2.5vanillaquick100.00100.00100.00
kimi-k2.5vanillafull100.00100.00100.00
kimi-k2p5vanillaquick100.00100.00100.00
kimi-k2p5vanillaquick100.00100.00100.00
miaoda-model-autovanillaquick100.00100.0085.00
mimo-v2-provanillaquick100.00100.00100.00
mimo-v2-provanillaquick100.00100.00100.00
qwen3.5-plusvanillaquick100.00100.0090.00
qwen3.5-plusvanillaquick100.00100.0085.00
qwen3.5-plusvanillaquick100.00100.0085.00
deepseek-chatvanillaquick100.00100.0085.00
MiniMax-M2.7vanilla-99.70100.0098.50
Kimi-K2.5vanillaquick99.25100.00100.00
MiniMax-Text-01vanillafull99.0299.02100.00
glm-5vanillaquick97.4197.41100.00
qianfan-code-latestvanillaquick97.4197.41100.00
MiniMax-M2.7vanillaquick96.3090.0095.83
Kimi-K2.5vanillaquick93.3393.33100.00
MiniMax-M2.5vanillaquick93.3093.3079.30
claude-opus-4.5vanillaquick92.0092.0082.80
claude-sonnet-4.6vanillaquick92.0092.0082.80
grok-4.20-betavanillaquick92.0092.0082.80
grok-4.20-betavanillaquick92.0092.0082.80
claude-sonnet-4vanillaquick91.0091.0081.90
claude-sonnet-4.5vanillaquick91.0091.0081.90
deepseek-r1vanillaquick90.0090.0081.00
gemini-2.5-provanillaquick90.0090.0081.00
gemini-2.5-provanillaquick90.0090.0081.00
claude-3.5-sonnetvanillaquick90.0090.0081.00
deepseek-v3.2vanillaquick89.0089.0080.10
glm-5vanillaquick89.0089.0080.10
llama-4-maverickvanillaquick89.0089.0080.10
llama-4-maverickvanillaquick89.0089.0080.10
deepseek-reasonervanillaquick88.0088.0079.20
gemini-2.5-flashvanillaquick88.0088.0079.20
qwen3-coder-plusvanillaquick88.0088.0079.20
qwen3.5-plusvanillaquick88.0088.0079.20
MiniMax-M2.7vanillafull87.8287.8290.20
kimi-k2-thinkingvanillaquick87.0087.0078.30
qwen3-maxvanillaquick87.0087.0078.30
deepseek-chatvanillaquick86.7086.7078.03
glm-4.7vanillaquick86.0086.0077.40
llama-3.3-70b-instructvanillaquick86.0086.0077.40
qvq-plusvanillaquick86.0086.0077.40
glm-4.5vanillaquick85.0085.0076.50
kimi-k2.5vanillaquick85.0085.0076.50
glm-4.5-airvanillaquick84.0084.0075.60
glm-4.6vanillaquick84.0084.0075.60
gemma-4-26b-a4b-itvanillaquick83.4773.3372.00
glm-4-plusvanillaquick83.3083.3074.97
moonshot-v1-128kvanillaquick83.0083.0074.70
glm-5vanillaquick82.7382.73100.00
MiniMax-M2.5vanillaquick82.0082.0073.80
moonshot-v1-autovanillaquick80.0080.0072.00
qwen-maxvanillaquick80.0080.0072.00
gpt-5.3-codexvanillafull79.8869.2175.93
glm-4.7vanillaquick73.3073.3062.30
glm-5-turbovanillafull13.6013.6022.95
MiniMax-M2.5vanillafull11.9011.9010.10
gpt-5.4vanillafull4.834.8311.68
k2p5vanillafull2.412.412.14

OpenClaw (Miaoda)

ModelSkills ModeTierOverallPass RateEfficiency
miaoda-model-autovanillaquick100.00100.00100.00
miaoda-model-autovanillaquick100.00100.00100.00

WorkBuddy

ModelSkills ModeTierOverallPass RateEfficiency
AI Assistantvanillaquick100.00100.0085.00
WorkBuddy-Agentskillsquick95.75100.0095.00

Radar Chart Comparison

Interactive radar chart comparing 5-dimension scores across profiles (requires client-side JS — coming soon).