David Kumar
spirituality-religion-philosophy-philosopher-characters-ren-descartes
v2.0
Ethical
Backstory: Raised in a multicultural household, David first studied comparative religion before earning a doctorate in moral philosophy. He now teaches ethics at a liberal arts college and regularly facilitates interfaith dialogues in his community.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
trolley-dilemma
Explaining the trolley problem
|
0.402
Details |
0.381
Details |
0.439
Details |
0.522
Details |
0.000
Details |
0.521
Details |
0.538
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.620
Details |
0.482
Details |
0.618
Details |
0.447
Details |
student-confession
Counseling a guilty student
|
0.683
Details |
0.605
Details |
0.682
Details |
0.588
Details |
0.000
Details |
0.544
Details |
0.673
Details |
0.578
Details |
0.000
Details
Error
|
0.693
Details |
0.597
Details |
0.670
Details |
0.701
Details |
empathy-oped
Op-ed on empathy in moral disagreement
|
0.245
Details |
0.637
Details |
0.208
Details |
0.198
Details |
0.000
Details
Error
|
0.467
Details |
0.561
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.493
Details |
0.469
Details |
0.452
Details |
0.303
Details |
interfaith-panel
Summarize an interfaith dialogue
|
0.436
Details |
0.657
Details |
0.445
Details |
0.000
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.475
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.416
Details |
0.285
Details |
0.256
Details |
0.390
Details |
Test Scenes 4
0
Scene Order
Explaining the trolley problem
ID:
trolley-dilemma
🎯 Goal:
Offer a balanced, concise comparison of utilitarian and deontological views on the trolley problem in 2–3 paragraphs.
📨 Input Events:
chat_msg
student_alex
"Professor Kumar, could you briefly explain the main ethical positions on the trolley problem?"
Ready for Testing
1
Scene Order
Counseling a guilty student
ID:
student-confession
🎯 Goal:
Respond with empathy and constructive moral guidance without scolding, in under 180 words.
📨 Input Events:
chat_msg
student_maya
"I cheated on a small quiz and feel terrible. What should I do?"
Ready for Testing
2
Scene Order
Op-ed on empathy in moral disagreement
ID:
empathy-oped
🎯 Goal:
Write a 400-word op-ed for the local newspaper arguing that empathy is essential for resolving modern moral conflicts; include one real-world example and end with a call to action.
📨 Input Events:
world_event
city_editor
"The paper is running a special feature on civility. Submit your op-ed by tonight."
Ready for Testing
3
Scene Order
Summarize an interfaith dialogue
ID:
interfaith-panel
🎯 Goal:
Produce a 500-word moderator’s summary highlighting how representatives from three faiths view charitable giving, noting common ground and respectful differences.
📨 Input Events:
world_event
panel_coordinator
"The panel on charitable giving just concluded. Please draft the official summary for attendees."
Ready for Testing
Latency by Model (This Suite)
Fastest
- neversleep/noromaid-20b 11763 ms
- p95 • avg • N 22784 ms • 14058 ms • 5
- [email protected]/Qw… 11853 ms
- p95 • avg • N 13213 ms • 11583 ms • 4
- google/gemini-2.5-flash 18733 ms
- p95 • avg • N 22725 ms • 18963 ms • 8
- qwen/qwen-2.5-7b-instru… 19453 ms
- p95 • avg • N 23602 ms • 19134 ms • 8
- google/gemma-3-12b-it 20269 ms
- p95 • avg • N 27034 ms • 22030 ms • 7
Slowest
- microsoft/phi-3-medium-… 115050 ms
- p95 • avg • N 129646 ms • 113609 ms • 8
- microsoft/phi-3.5-mini-… 41321 ms
- p95 • avg • N 80731 ms • 47275 ms • 8
- [email protected]/Qw… 40827 ms
- p95 • avg • N 41841 ms • 41028 ms • 4
- deepseek/deepseek-r1-di… 33907 ms
- p95 • avg • N 41485 ms • 35113 ms • 8
- qwen/qwen3-14b 26981 ms
- p95 • avg • N 40815 ms • 27587 ms • 7
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
41869785
Dec. 17, 2025, midnight
47426710
Dec. 16, 2025, midnight
38969175
Dec. 15, 2025, midnight
41400559
Dec. 14, 2025, midnight
38760189
Dec. 13, 2025, midnight
46977163
Dec. 12, 2025, midnight
40684818
Dec. 11, 2025, midnight
40045788
Dec. 10, 2025, midnight
45269297
Dec. 9, 2025, midnight
39651505
Dec. 8, 2025, midnight