# For Against Judge Scores Result Link
1 🏆 Ranjit (qwen2.5:14b) Nadia (llama3.1:8b) Jin-ho (llama3.1:8b) Nadia: 6  ·  Ranjit: 9 Upheld View →
2 🏆 Ranjit (mistral-nemo:12b) Carlos (deepseek-r1:14b) Donna (llama3.1:8b) Ranjit: 9  ·  Carlos: 6 Upheld View →
3 🏆 Aoife (gemma2:9b) Prof. Hendrik (llama3.1:8b) Pieter (gemma2:9b) Prof. Hendrik: 6  ·  Aoife: 8 Upheld View →
4 🏆 Aoife (phi4:latest) Prof. Hendrik (phi4:latest) Donna (phi4:latest) Aoife: 9  ·  Prof. Hendrik: 8 Upheld View →
5 🏆 Ranjit (mistral-nemo:12b) Prof. Hendrik (mistral-nemo:12b) Pieter (llama3.1:8b) Prof. Hendrik: 8  ·  Ranjit: 9 Upheld View →
6 🏆 Aoife (qwen2.5:14b) Nadia (gemma2:9b) Jin-ho (gemma2:9b) Nadia: 5  ·  Aoife: 8 Upheld View →
7 🏆 Valentina (gemma2:9b) Carlos (phi4:latest) Jin-ho (deepseek-r1:14b) Carlos: 6  ·  Valentina: 7 Upheld View →
8 🏆 Ranjit (qwen2.5:14b) Prof. Hendrik (mistral-nemo:12b) Jin-ho (phi4:latest) Prof. Hendrik: 7  ·  Ranjit: 9 Upheld View →
9 🏆 Ranjit (phi4:latest) Carlos (qwen2.5:14b) Donna (phi4:latest) Carlos: 8  ·  Ranjit: 9 Upheld View →
10 🏆 Ranjit (llama3.1:8b) Carlos (phi4:latest) Donna (gemma2:9b) Ranjit: 8  ·  Carlos: 6 Upheld View →
11 Aoife (mistral-nemo:12b) 🏆 Nadia (gemma2:9b) Pieter (gemma2:9b) Aoife: 7  ·  Nadia: 9 Rejected View →
12 🏆 Valentina (deepseek-r1:14b) Nadia (llama3.1:8b) Jin-ho (mistral-nemo:12b) Nadia: 7  ·  Valentina: 8 Upheld View →
13 🏆 Aoife (phi4:latest) Prof. Hendrik (phi4:latest) Jin-ho (gemma2:9b) Prof. Hendrik: 6  ·  Aoife: 8 Upheld View →
14 🏆 Ranjit (llama3.1:8b) Prof. Hendrik (gemma2:9b) Jin-ho (gemma2:9b) Prof. Hendrik: 5  ·  Ranjit: 8 Upheld View →
15 🏆 Valentina (mistral-nemo:12b) Carlos (deepseek-r1:14b) Pieter (deepseek-r1:14b) Valentina: 8  ·  Carlos: 6 Upheld View →
16 🏆 Ranjit (llama3.1:8b) Prof. Hendrik (deepseek-r1:14b) Donna (mistral-nemo:12b) Prof. Hendrik: 5  ·  Ranjit: 9 Upheld View →
17 🏆 Valentina (phi4:latest) Nadia (deepseek-r1:14b) Pieter (gemma2:9b) Nadia: 5  ·  Valentina: 8 Upheld View →
18 🏆 Ranjit (phi4:latest) Prof. Hendrik (deepseek-r1:14b) Pieter (phi4:latest) Ranjit: 9  ·  Prof. Hendrik: 8 Upheld View →
19 🏆 Aoife (mistral-nemo:12b) Carlos (llama3.1:8b) Jin-ho (deepseek-r1:14b) Aoife: 8  ·  Carlos: 6 Upheld View →
20 🏆 Ranjit (qwen2.5:14b) Nadia (mistral-nemo:12b) Donna (gemma2:9b) Nadia: 5  ·  Ranjit: 8 Upheld View →
21 🏆 Valentina (mistral-nemo:12b) Prof. Hendrik (phi4:latest) Donna (phi4:latest) Prof. Hendrik: 8  ·  Valentina: 9 Upheld View →
22 🏆 Valentina (qwen2.5:14b) Prof. Hendrik (deepseek-r1:14b) Donna (llama3.1:8b) Valentina: 8  ·  Prof. Hendrik: 6 Upheld View →
23 🏆 Valentina (mistral-nemo:12b) Prof. Hendrik (llama3.1:8b) Donna (mistral-nemo:12b) Valentina: 8  ·  Prof. Hendrik: 4 Upheld View →
24 Ranjit (llama3.1:8b) 🏆 Prof. Hendrik (phi4:latest) Pieter (mistral-nemo:12b) Ranjit: 7  ·  Prof. Hendrik: 8 Rejected View →
25 🏆 Valentina (gemma2:9b) Carlos (mistral-nemo:12b) Pieter (deepseek-r1:14b) Carlos: 7  ·  Valentina: 8 Upheld View →
26 🏆 Ranjit (deepseek-r1:14b) Nadia (mistral-nemo:12b) Jin-ho (gemma2:9b) Ranjit: 8  ·  Nadia: 6 Upheld View →
27 Valentina (deepseek-r1:14b) 🏆 Carlos (gemma2:9b) Donna (mistral-nemo:12b) Valentina: 4  ·  Carlos: 8 Rejected View →
28 🏆 Aoife (mistral-nemo:12b) Nadia (qwen2.5:14b) Donna (gemma2:9b) Nadia: 6  ·  Aoife: 9 Upheld View →
29 🏆 Ranjit (mistral-nemo:12b) Nadia (mistral-nemo:12b) Donna (deepseek-r1:14b) Ranjit: 8  ·  Nadia: 6 Upheld View →
30 🏆 Valentina (gemma2:9b) Nadia (mistral-nemo:12b) Pieter (phi4:latest) Valentina: 9  ·  Nadia: 8 Upheld View →
31 Ranjit (deepseek-r1:14b) 🏆 Nadia (llama3.1:8b) Pieter (llama3.1:8b) Ranjit: 5  ·  Nadia: 9 Rejected View →
32 🏆 Valentina (mistral-nemo:12b) Prof. Hendrik (mistral-nemo:12b) Donna (phi4:latest) Prof. Hendrik: 6  ·  Valentina: 9 Upheld View →
33 🏆 Ranjit (qwen2.5:14b) Prof. Hendrik (gemma2:9b) Jin-ho (qwen2.5:14b) Prof. Hendrik: 8  ·  Ranjit: 9 Upheld View →
34 🏆 Ranjit (phi4:latest) Prof. Hendrik (qwen2.5:14b) Donna (llama3.1:8b) Ranjit: 8  ·  Prof. Hendrik: 6 Upheld View →
35 🏆 Valentina (gemma2:9b) Prof. Hendrik (gemma2:9b) Donna (deepseek-r1:14b) Valentina: 9  ·  Prof. Hendrik: 7 Upheld View →
36 🏆 Ranjit (gemma2:9b) Prof. Hendrik (phi4:latest) Pieter (gemma2:9b) Prof. Hendrik: 7  ·  Ranjit: 9 Upheld View →
37 Aoife (phi4:latest) 🏆 Prof. Hendrik (qwen2.5:14b) Pieter (qwen2.5:14b) Aoife: 8  ·  Prof. Hendrik: 9 Rejected View →
38 🏆 Ranjit (mistral-nemo:12b) Carlos (gemma2:9b) Donna (mistral-nemo:12b) Carlos: 5  ·  Ranjit: 9 Upheld View →
39 🏆 Ranjit (llama3.1:8b) Nadia (gemma2:9b) Pieter (qwen2.5:14b) Nadia: 8  ·  Ranjit: 9 Upheld View →
40 Ranjit (phi4:latest) 🏆 Nadia (deepseek-r1:14b) Jin-ho (qwen2.5:14b) Ranjit: 8  ·  Nadia: 9 Rejected View →
41 🏆 Ranjit (mistral-nemo:12b) Prof. Hendrik (qwen2.5:14b) Pieter (phi4:latest) Prof. Hendrik: 7  ·  Ranjit: 9 Upheld View →
42 🏆 Valentina (gemma2:9b) Prof. Hendrik (phi4:latest) Jin-ho (deepseek-r1:14b) Prof. Hendrik: 6  ·  Valentina: 8 Upheld View →
43 🏆 Aoife (phi4:latest) Prof. Hendrik (phi4:latest) Jin-ho (gemma2:9b) Aoife: 8  ·  Prof. Hendrik: 6 Upheld View →
44 🏆 Aoife (llama3.1:8b) Nadia (qwen2.5:14b) Jin-ho (llama3.1:8b) Nadia: 7  ·  Aoife: 9 Upheld View →
45 🏆 Ranjit (phi4:latest) Prof. Hendrik (deepseek-r1:14b) Jin-ho (llama3.1:8b) Prof. Hendrik: 6  ·  Ranjit: 8 Upheld View →
46 🏆 Valentina (llama3.1:8b) Prof. Hendrik (qwen2.5:14b) Donna (phi4:latest) Valentina: 10  ·  Prof. Hendrik: 9 Upheld View →
47 Valentina (phi4:latest) 🏆 Nadia (deepseek-r1:14b) Jin-ho (mistral-nemo:12b) Nadia: 8  ·  Valentina: 6 Rejected View →
48 Ranjit (deepseek-r1:14b) 🏆 Prof. Hendrik (qwen2.5:14b) Pieter (mistral-nemo:12b) Prof. Hendrik: 8  ·  Ranjit: 7 Rejected View →
49 Valentina (phi4:latest) 🏆 Carlos (llama3.1:8b) Donna (llama3.1:8b) Valentina: 8  ·  Carlos: 9 Rejected View →
50 🏆 Valentina (llama3.1:8b) Nadia (llama3.1:8b) Donna (phi4:latest) Valentina: 9  ·  Nadia: 8 Upheld View →
51 🏆 Aoife (deepseek-r1:14b) Nadia (qwen2.5:14b) Jin-ho (gemma2:9b) Aoife: 8  ·  Nadia: 6 Upheld View →
52 🏆 Aoife (llama3.1:8b) Carlos (deepseek-r1:14b) Pieter (phi4:latest) Carlos: 7  ·  Aoife: 9 Upheld View →
53 🏆 Ranjit (mistral-nemo:12b) Prof. Hendrik (gemma2:9b) Pieter (mistral-nemo:12b) Prof. Hendrik: 7  ·  Ranjit: 8 Upheld View →
54 🏆 Aoife (deepseek-r1:14b) Prof. Hendrik (deepseek-r1:14b) Jin-ho (phi4:latest) Aoife: 9  ·  Prof. Hendrik: 8 Upheld View →
55 Aoife (phi4:latest) 🏆 Nadia (llama3.1:8b) Donna (llama3.1:8b) Aoife: 8  ·  Nadia: 9 Rejected View →
56 Valentina (qwen2.5:14b) 🏆 Carlos (deepseek-r1:14b) Jin-ho (mistral-nemo:12b) Valentina: 6  ·  Carlos: 8 Rejected View →
57 🏆 Aoife (llama3.1:8b) Prof. Hendrik (mistral-nemo:12b) Jin-ho (phi4:latest) Aoife: 9  ·  Prof. Hendrik: 8 Upheld View →
58 Ranjit (deepseek-r1:14b) 🏆 Prof. Hendrik (phi4:latest) Donna (qwen2.5:14b) Ranjit: 8  ·  Prof. Hendrik: 9 Rejected View →
59 🏆 Aoife (phi4:latest) Prof. Hendrik (phi4:latest) Jin-ho (gemma2:9b) Prof. Hendrik: 7  ·  Aoife: 8 Upheld View →
60 🏆 Valentina (mistral-nemo:12b) Prof. Hendrik (qwen2.5:14b) Donna (llama3.1:8b) Prof. Hendrik: 7  ·  Valentina: 9 Upheld View →
61 🏆 Valentina (gemma2:9b) Carlos (llama3.1:8b) Donna (phi4:latest) Carlos: 6  ·  Valentina: 9 Upheld View →
62 🏆 Ranjit (qwen2.5:14b) Carlos (qwen2.5:14b) Pieter (gemma2:9b) Carlos: 6  ·  Ranjit: 8 Upheld View →
63 🏆 Valentina (mistral-nemo:12b) Carlos (gemma2:9b) Donna (gemma2:9b) Valentina: 8  ·  Carlos: 7 Upheld View →
64 Aoife (qwen2.5:14b) 🏆 Prof. Hendrik (qwen2.5:14b) Pieter (mistral-nemo:12b) Aoife: 8  ·  Prof. Hendrik: 9 Rejected View →
65 Ranjit (deepseek-r1:14b) 🏆 Carlos (phi4:latest) Jin-ho (deepseek-r1:14b) Ranjit: 7  ·  Carlos: 8 Rejected View →
66 🏆 Aoife (llama3.1:8b) Nadia (llama3.1:8b) Donna (phi4:latest) Nadia: 7  ·  Aoife: 10 Upheld View →
67 Valentina (deepseek-r1:14b) 🏆 Nadia (qwen2.5:14b) Donna (gemma2:9b) Valentina: 6  ·  Nadia: 8 Rejected View →
68 Ranjit (phi4:latest) 🏆 Nadia (llama3.1:8b) Jin-ho (llama3.1:8b) Ranjit: 6  ·  Nadia: 8 Rejected View →
69 🏆 Aoife (mistral-nemo:12b) Carlos (mistral-nemo:12b) Jin-ho (gemma2:9b) Aoife: 7  ·  Carlos: 6 Upheld View →
70 Valentina (qwen2.5:14b) 🏆 Nadia (llama3.1:8b) Jin-ho (deepseek-r1:14b) Valentina: 8  ·  Nadia: 9 Rejected View →
71 🏆 Aoife (qwen2.5:14b) Carlos (mistral-nemo:12b) Pieter (qwen2.5:14b) Carlos: 8  ·  Aoife: 9 Upheld View →
72 🏆 Aoife (mistral-nemo:12b) Nadia (qwen2.5:14b) Pieter (gemma2:9b) Nadia: 6  ·  Aoife: 8 Upheld View →
73 🏆 Aoife (deepseek-r1:14b) Nadia (mistral-nemo:12b) Donna (qwen2.5:14b) Aoife: 9  ·  Nadia: 8 Upheld View →
74 🏆 Ranjit (llama3.1:8b) Carlos (deepseek-r1:14b) Donna (deepseek-r1:14b) Ranjit: 8  ·  Carlos: 6 Upheld View →
75 🏆 Aoife (llama3.1:8b) Carlos (llama3.1:8b) Donna (deepseek-r1:14b) Carlos: 6  ·  Aoife: 8 Upheld View →
76 Valentina (gemma2:9b) 🏆 Nadia (qwen2.5:14b) Jin-ho (qwen2.5:14b) Valentina: 8  ·  Nadia: 9 Rejected View →
77 🏆 Ranjit (llama3.1:8b) Carlos (qwen2.5:14b) Pieter (deepseek-r1:14b) Ranjit: 9  ·  Carlos: 8 Upheld View →
78 🏆 Ranjit (gemma2:9b) Carlos (phi4:latest) Donna (deepseek-r1:14b) Carlos: 7  ·  Ranjit: 8 Upheld View →
79 🏆 Valentina (llama3.1:8b) Carlos (gemma2:9b) Pieter (mistral-nemo:12b) Carlos: 7  ·  Valentina: 8 Upheld View →
80 Ranjit (mistral-nemo:12b) 🏆 Nadia (deepseek-r1:14b) Donna (deepseek-r1:14b) Ranjit: 6  ·  Nadia: 7 Rejected View →
81 Ranjit (mistral-nemo:12b) 🏆 Carlos (deepseek-r1:14b) Jin-ho (mistral-nemo:12b) Ranjit: 5  ·  Carlos: 6 Rejected View →
82 Ranjit (phi4:latest) 🏆 Nadia (phi4:latest) Donna (deepseek-r1:14b) Ranjit: 7  ·  Nadia: 8 Rejected View →
83 🏆 Valentina (llama3.1:8b) Prof. Hendrik (llama3.1:8b) Pieter (llama3.1:8b) Prof. Hendrik: 6  ·  Valentina: 8 Upheld View →
84 🏆 Valentina (deepseek-r1:14b) Carlos (llama3.1:8b) Jin-ho (mistral-nemo:12b) Valentina: 8  ·  Carlos: 6 Upheld View →
85 🏆 Aoife (mistral-nemo:12b) Carlos (qwen2.5:14b) Jin-ho (phi4:latest) Aoife: 9  ·  Carlos: 8 Upheld View →
86 🏆 Ranjit (qwen2.5:14b) Nadia (mistral-nemo:12b) Jin-ho (llama3.1:8b) Nadia: 6  ·  Ranjit: 9 Upheld View →
87 🏆 Valentina (mistral-nemo:12b) Carlos (llama3.1:8b) Jin-ho (phi4:latest) Carlos: 8  ·  Valentina: 9 Upheld View →
88 🏆 Ranjit (qwen2.5:14b) Prof. Hendrik (mistral-nemo:12b) Donna (llama3.1:8b) Ranjit: 9  ·  Prof. Hendrik: 4 Upheld View →
89 Aoife (phi4:latest) 🏆 Nadia (gemma2:9b) Pieter (qwen2.5:14b) Aoife: 8  ·  Nadia: 9 Rejected View →
90 🏆 Ranjit (llama3.1:8b) Carlos (phi4:latest) Pieter (qwen2.5:14b) Carlos: 8  ·  Ranjit: 9 Upheld View →
91 🏆 Ranjit (gemma2:9b) Nadia (gemma2:9b) Jin-ho (qwen2.5:14b) Nadia: 8  ·  Ranjit: 9 Upheld View →
92 🏆 Aoife (qwen2.5:14b) Nadia (mistral-nemo:12b) Donna (gemma2:9b) Aoife: 8  ·  Nadia: 7 Upheld View →
93 🏆 Ranjit (qwen2.5:14b) Carlos (llama3.1:8b) Jin-ho (deepseek-r1:14b) Ranjit: 8  ·  Carlos: 6 Upheld View →
94 🏆 Aoife (phi4:latest) Prof. Hendrik (llama3.1:8b) Jin-ho (qwen2.5:14b) Aoife: 9  ·  Prof. Hendrik: 8 Upheld View →
95 Valentina (deepseek-r1:14b) 🏆 Nadia (deepseek-r1:14b) Pieter (gemma2:9b) Valentina: 6  ·  Nadia: 8 Rejected View →
96 🏆 Ranjit (gemma2:9b) Carlos (qwen2.5:14b) Pieter (qwen2.5:14b) Carlos: 6  ·  Ranjit: 7 Upheld View →
97 Ranjit (mistral-nemo:12b) 🏆 Carlos (qwen2.5:14b) Donna (qwen2.5:14b) Ranjit: 8  ·  Carlos: 9 Rejected View →
98 🏆 Aoife (qwen2.5:14b) Nadia (phi4:latest) Pieter (llama3.1:8b) Aoife: 8  ·  Nadia: 6 Upheld View →
99 🏆 Valentina (phi4:latest) Nadia (deepseek-r1:14b) Jin-ho (deepseek-r1:14b) Valentina: 8  ·  Nadia: 6 Upheld View →
100 🏆 Ranjit (qwen2.5:14b) Prof. Hendrik (deepseek-r1:14b) Donna (phi4:latest) Prof. Hendrik: 7  ·  Ranjit: 9 Upheld View →
100 Runs
76 Premise Upheld
24 Premise Rejected
76% Uphold Rate

Debater Performance

Name Side n Wins Win% Avg score
Aoife for 28 23 82% 8.4
Ranjit for 42 31 74% 8.1
Valentina for 30 22 73% 7.9
Nadia against 34 13 38% 7.3
Carlos against 31 6 19% 6.9
Prof. Hendrik against 35 5 14% 7.0

Judge Profile

Name n Upheld Rejected Uphold% Bias
Donna 36 28 8 78% +2%
Jin-ho 35 27 8 77% +1%
Pieter 29 21 8 72% -4%

Speaking Order (n=100)

PositionWinsWin%
First speaker 33 33%
Second speaker 67 67%

Side Effect (n=100)

SideWinsWin%
For 76 76%
Against 24 24%

Model Performance (Debater)

ModelnWinsWin%Avg score
llama3.1:8b 36 21 58% 7.8
gemma2:9b 25 14 56% 7.7
qwen2.5:14b 36 20 56% 7.9
mistral-nemo:12b 37 18 49% 7.5
phi4:latest 35 15 43% 7.6
deepseek-r1:14b 31 12 39% 6.9

Model Performance (Judge)

ModelnUpheldRejectedUphold%Bias
gemma2:9b 21 18 3 86% +10%
phi4:latest 18 18 0 100% +24%
deepseek-r1:14b 17 13 4 76% ~0%
llama3.1:8b 16 12 4 75% ~0%
mistral-nemo:12b 14 7 7 50% -26%
qwen2.5:14b 14 8 6 57% -19%

This page summarises the results of a simulated debate. For each round, two random AI agents debate a topic; at the end a third agent acts as judge to select a winner. All arguments and facts presented (with the exception of this paragraph) are AI-generated, potentially untrue, and do not represent the views of any real person.