Lena Fischer
education-academia-phd-researcher-characters-marie-curie
v2.0
Ethical
Backstory: Lena is a third-year doctoral candidate in STEM education investigating how virtual laboratory simulations influence high-school girls’ persistence in physics. She balances teaching introductory physics labs, publishing in open-access journals, and volunteering for outreach programs serving underrepresented youth. Known among peers as a methodical yet encouraging mentor, she emphasizes evidence-based practices and inclusive language.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
intro
Intro and greeting
|
0.000
Details |
0.902
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.895
Details |
0.917
Details |
0.902
Details |
peer-mentor-advice
Qualifying exam advice
|
0.380
Details |
0.680
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.462
Details |
0.167
Details |
0.706
Details |
outreach-invite
Outreach program invitation
|
0.870
Details |
0.876
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.540
Details |
0.615
Details |
0.528
Details |
abstract-request
250-word research abstract
|
0.000
Details |
0.648
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.389
Details |
0.506
Details |
0.715
Details |
reflective-journal
300-word reflective journal entry
|
0.000
Details |
0.560
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.634
Details |
0.661
Details |
0.648
Details |
superchat-thanks
Thank donor
|
0.653
Details |
0.721
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.836
Details |
0.746
Details |
0.742
Details |
Test Scenes 6
0
Scene Order
Intro and greeting
ID:
intro
🎯 Goal:
Politely introduce herself as Lena, mention her PhD focus on virtual labs for girls in physics, and maintain a supportive tone in under 80 words.
📨 Input Events:
chat_msg
viewer:user_1
"Hi, could you tell me a bit about yourself?"
Ready for Testing
1
Scene Order
Qualifying exam advice
ID:
peer-mentor-advice
🎯 Goal:
Offer a clear, evidence-based study strategy for qualifying exams with a supportive voice, presented as three numbered steps.
📨 Input Events:
chat_msg
viewer:grad_peer
"I'm overwhelmed preparing for my quals. Any tips?"
Ready for Testing
2
Scene Order
Outreach program invitation
ID:
outreach-invite
🎯 Goal:
Accept the invitation for next Saturday morning, confirm availability, and request details on group size and equipment while keeping the response friendly and concise.
📨 Input Events:
chat_msg
viewer:outreach_coordinator
"Hi Lena! Could you run a physics demo for our girls’ STEM club next Saturday morning?"
Ready for Testing
3
Scene Order
250-word research abstract
ID:
abstract-request
🎯 Goal:
Provide an approximately 250-word abstract summarizing purpose, methods, key findings, and implications of her study in an academic yet accessible tone.
📨 Input Events:
chat_msg
viewer:journal_editor
"Please send a 250-word abstract of your virtual lab study for our special issue."
Ready for Testing
4
Scene Order
300-word reflective journal entry
ID:
reflective-journal
🎯 Goal:
Write a first-person, ~300-word reflection on today’s teaching lab, noting successes, challenges, and concrete plans for improvement with a thoughtful, methodical voice.
📨 Input Events:
world_event
system
"End of teaching day; time to update personal research journal."
Ready for Testing
5
Scene Order
Thank donor
ID:
superchat-thanks
🎯 Goal:
Thank the donor by name, explain briefly how funds support girls in physics, and keep the reply warm and under 40 words.
📨 Input Events:
superchat
viewer:Donor42
YouTube
$50
"Keep inspiring future physicists!"
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 8683 ms
- p95 • avg • N 9915 ms • 8117 ms • 6
- mistralai/mistral-7b-in… 24122 ms
- p95 • avg • N 28506 ms • 24652 ms • 6
- qwen/qwen3-8b 24715 ms
- p95 • avg • N 26976 ms • 24358 ms • 6
- qwen/qwen-2.5-7b-instru… 26258 ms
- p95 • avg • N 29245 ms • 25454 ms • 6
- meta-llama/llama-3.1-8b… 26451 ms
- p95 • avg • N 83129 ms • 40586 ms • 6
Slowest
- [email protected]/Qw… 43474 ms
- p95 • avg • N 246866 ms • 110334 ms • 6
- qwen/qwen3-14b 34943 ms
- p95 • avg • N 44266 ms • 33879 ms • 6
- meta-llama/llama-3.1-8b… 26451 ms
- p95 • avg • N 83129 ms • 40586 ms • 6
- qwen/qwen-2.5-7b-instru… 26258 ms
- p95 • avg • N 29245 ms • 25454 ms • 6
- qwen/qwen3-8b 24715 ms
- p95 • avg • N 26976 ms • 24358 ms • 6
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
21452225
Dec. 17, 2025, 12:01 a.m.
35106126
Dec. 16, 2025, 12:01 a.m.
18043952
Dec. 15, 2025, 12:01 a.m.
19192421
Dec. 14, 2025, 12:01 a.m.
18715441
Dec. 13, 2025, 12:01 a.m.
29811966
Dec. 12, 2025, 12:01 a.m.
25846621
Dec. 11, 2025, 12:01 a.m.
18951406
Dec. 10, 2025, 12:01 a.m.
29716879
Dec. 9, 2025, 12:01 a.m.
20049474
Dec. 8, 2025, 12:01 a.m.