Models Overview
Performance overview of all tested LLM models.
Filter Options
Model Performance Overview
| Model | Performance | Tests Completed | Last Test | Actions |
|---|---|---|---|---|
|
|
0.551
|
147 tests
|
Oct. 10, 2025, 1:02 p.m.
|
Details |
|
google/gemma-3-12b-it
AI Language Model
|
0.535
|
172374 tests
|
Oct. 29, 2025, 6:44 a.m.
|
Details |
|
google/gemini-2.5-flash
AI Language Model
|
0.532
|
172433 tests
|
Oct. 29, 2025, 6:44 a.m.
|
Details |
|
mistralai/mistral-7b-instruct
AI Language Model
|
0.532
|
217790 tests
|
Dec. 17, 2025, 12:02 a.m.
|
Details |
|
qwen/qwen3-14b
AI Language Model
|
0.513
|
217804 tests
|
Dec. 17, 2025, 12:02 a.m.
|
Details |
|
qwen/qwen-2.5-7b-instruct
AI Language Model
|
0.492
|
217830 tests
|
Dec. 17, 2025, 12:02 a.m.
|
Details |
|
qwen/qwen3-8b
AI Language Model
|
0.486
|
217779 tests
|
Dec. 17, 2025, 12:02 a.m.
|
Details |
|
deepseek/deepseek-r1-distill-qwen-14b
AI Language Model
|
0.473
|
172368 tests
|
Oct. 29, 2025, 6:44 a.m.
|
Details |
|
|
0.446
|
162 tests
|
Oct. 29, 2025, 7:42 a.m.
|
Details |
|
microsoft/phi-3.5-mini-128k-instruct
AI Language Model
|
0.361
|
172370 tests
|
Oct. 29, 2025, 6:44 a.m.
|
Details |
|
meta-llama/llama-3.1-8b-instruct
AI Language Model
|
0.325
|
217755 tests
|
Dec. 17, 2025, 12:02 a.m.
|
Details |
|
|
0.273
|
174 tests
|
Oct. 10, 2025, 10:38 a.m.
|
Details |
|
neversleep/noromaid-20b
AI Language Model
|
0.173
|
172368 tests
|
Oct. 29, 2025, 6:44 a.m.
|
Details |
|
|
0.149
|
1294 tests
|
Nov. 27, 2025, 12:05 a.m.
|
Details |
|
[email protected]/Mistral-7B-Instruct-v0.2-6e34c66b
Fine-Tuned
Based on: mistralai/Mistral-7B-Instruct-v0.2
|
0.040
|
17 tests
|
Oct. 2, 2025, 7:47 a.m.
|
Details |
|
microsoft/phi-3-medium-128k-instruct
AI Language Model
|
0.008
|
172374 tests
|
Oct. 29, 2025, 6:44 a.m.
|
Details |
|
|
0.000
|
1442 tests
|
Nov. 27, 2025, 12:06 a.m.
|
Details |