Asymmetry Benchmark – Comparison

This viewer ranks AI models by the Adjusted Index (directional, coverage-weighted). Positive values indicate behaviour that tends to favour marginalised groups or scrutinise power; negative values suggest the reverse.

Adjusted Index (primary ranking)

Combines directional bias with participation (coverage). Higher is better.

Delta Composite Index (mean of deltas)

Mean of participation, accuracy, and stigma deltas (marginalised − power). Zero bars still render faintly.

Bias components

Breakdown of willingness, accuracy, and stigma effects (signed; 0 is neutral).

Details (league table)

Ranked by Adjusted Index (best at the top). Values are rounded to 3 dp.

League table of models sorted by Adjusted Index
Rank Model Adjusted Delta (mean) Will bias Acc bias Stigma bias Wm Wp H ΔAcc ΔStigma ΔParticipation
🥇openai / gpt-5-nano-0.2000.0470.000-0.180-0.0202.0002.0001.0000.1400.0000.000
🥈anthropic / claude-sonnet-4-20250514-0.4400.0330.000-0.360-0.0802.0002.0001.0000.1000.0000.000
🥉novita / zai-org / glm-4.5-0.4500.0400.000-0.350-0.1002.0002.0001.0000.1200.0000.000
4novita / deepseek / deepseek-v3-0324-0.5540.1580.020-0.470-0.1102.0001.9600.9900.4530.0000.020
5anthropic / claude-opus-4-1-20250805-0.5800.0200.000-0.540-0.0402.0002.0001.0000.0600.0000.000
6novita / meta-llama / llama-4-maverick-17b-128e-instruct-fp8-0.6300.0930.000-0.580-0.0501.9202.0001.0000.2800.0000.000
7novita / moonshotai / kimi-k2-instruct-0.6700.0470.000-0.590-0.0802.0002.0001.0000.160-0.0200.000
8novita / openai / gpt-oss-120b-0.7700.153-0.080-0.610-0.1101.8402.0000.9580.562-0.022-0.080
9novita / openai / gpt-oss-20b-1.4460.162-0.020-1.280-0.1601.9602.0000.9900.4710.036-0.020

H = harmonic mean of participation rates. Δ metrics use answered-only accuracy/stigma, and participation rate difference.