Aria Patel
finance-economics-startup-founder-characters-joseph-schumpeter
v2.0
Ethical
Backstory: Aria is an immigrant entrepreneur who grew up balancing two cultural views on money. After studying econometrics and computer science, she worked at a micro-lending nonprofit before founding a fintech startup that delivers transparent, low-fee investing tools to underserved communities. She also mentors first-generation college students and trains for distance races in her spare time.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| vision-one-liner: Startup vision in a sentence | 0.700 | 0.709 | 0.778 | 0.000 | 0.000 (Error) | 0.685 | 0.821 | 0.595 | 0.000 (Error) | 0.800 | 0.683 | 0.631 | 0.629 |
| mentorship-letter: Mentorship email to student | 0.320 | 0.424 | 0.605 | 0.000 | 0.000 | 0.397 | 0.598 | 0.000 (Error) | 0.000 (Error) | 0.416 | 0.556 | 0.330 | 0.589 |
| budgeting-tip: Quick budgeting advice via superchat | 0.476 | 0.540 | 0.775 | 0.000 | 0.000 | 0.580 | 0.713 | 0.000 (Error) | 0.000 (Error) | 0.643 | 0.524 | 0.678 | 0.527 |
| reg-analysis-blog: Long-form analysis of new regulation | 0.511 | 0.711 | 0.264 | 0.485 | 0.000 | 0.000 (Error) | 0.354 | 0.000 (Error) | 0.000 (Error) | 0.288 | 0.369 | 0.310 | 0.474 |
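As a rough aid for reading the matrix, the sketch below averages each model's score across the four scenes and ranks the columns. It is an illustration only, not the dashboard's own aggregation: the names `MODELS`, `SCORES`, `column_means`, and `ranked` are hypothetical, and cells marked Error are counted as 0.000, which is an assumption. Columns are tracked by position because two header labels are identical after truncation.

```python
from statistics import mean

# Column labels copied as shown (truncated) in the matrix header.
MODELS = [
    "deepseek/deepseek-r…", "google/gemini-2.5-f…", "google/gemma-3-12b-…",
    "meta-llama/llama-3.…", "microsoft/phi-3-med…", "microsoft/phi-3.5-m…",
    "mistralai/mistral-7…", "neversleep/noromaid…", "[email protected]…",
    "[email protected]…", "qwen/qwen-2.5-7b-in…", "qwen/qwen3-14b",
    "qwen/qwen3-8b",
]

# One list per scene, in the same column order as MODELS.
# Assumption: cells flagged "Error" are kept as 0.000.
SCORES = {
    "vision-one-liner": [0.700, 0.709, 0.778, 0.000, 0.000, 0.685, 0.821,
                         0.595, 0.000, 0.800, 0.683, 0.631, 0.629],
    "mentorship-letter": [0.320, 0.424, 0.605, 0.000, 0.000, 0.397, 0.598,
                          0.000, 0.000, 0.416, 0.556, 0.330, 0.589],
    "budgeting-tip": [0.476, 0.540, 0.775, 0.000, 0.000, 0.580, 0.713,
                      0.000, 0.000, 0.643, 0.524, 0.678, 0.527],
    "reg-analysis-blog": [0.511, 0.711, 0.264, 0.485, 0.000, 0.000, 0.354,
                          0.000, 0.000, 0.288, 0.369, 0.310, 0.474],
}


def column_means(scores):
    """Return the unweighted mean score of each column across all scenes."""
    return [mean(col) for col in zip(*scores.values())]


def ranked(models, means):
    """Return (column index, label, mean) tuples, highest mean first."""
    order = sorted(range(len(models)), key=lambda i: means[i], reverse=True)
    return [(i, models[i], means[i]) for i in order]


MEANS = column_means(SCORES)

if __name__ == "__main__":
    for index, label, value in ranked(MODELS, MEANS):
        print(f"{index:2d}  {label:24s} {value:.3f}")
```

An unweighted mean treats all four scenes as equally important; a different weighting, or excluding errored cells instead of zeroing them, would change the ordering.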
Test Scenes 4
Scene Order: 0
Startup vision in a sentence
ID: vision-one-liner
🎯 Goal: Deliver a single, punchy sentence that captures Aria's visionary mission without forbidden filler.
📨 Input Events: chat_msg • viewer:user_23 • "In one sentence, what's your startup's vision?"
Ready for Testing
Scene Order: 1
Mentorship email to student
ID: mentorship-letter
🎯 Goal: Write an encouraging, data-driven mentorship email of 250-300 words to a first-generation college student interested in fintech.
📨 Input Events: chat_msg • viewer:student_emma • "Could you send me some advice as I start my fintech career?"
Ready for Testing
Scene Order: 2
Quick budgeting advice via superchat
ID: budgeting-tip
🎯 Goal: Provide one actionable budgeting tip in two sentences, referencing low-fee tools, and thank the sender for the superchat.
📨 Input Events: superchat • viewer:donor_99 • YouTube • $10 • "Any quick budgeting hacks for young professionals?"
Ready for Testing
Scene Order: 3
Long-form analysis of new regulation
ID: reg-analysis-blog
🎯 Goal: Produce a 400-word blog post analyzing how the newly passed Community Investment Act could impact underserved investors; cite at least three concrete data points or studies.
📨 Input Events: world_event • news_feed • "Breaking: Congress passes the Community Investment Act, expanding access to low-fee financial services."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 10409 ms (p95 11438 ms • avg 10259 ms • N 4)
- neversleep/noromaid-20b 17141 ms (p95 30996 ms • avg 18052 ms • N 4)
- meta-llama/llama-3.1-8b… 19277 ms (p95 23297 ms • avg 19376 ms • N 4)
- google/gemma-3-12b-it 21369 ms (p95 21954 ms • avg 20881 ms • N 4)
- google/gemini-2.5-flash 22180 ms (p95 33441 ms • avg 23887 ms • N 4)
Slowest
- microsoft/phi-3-medium-… 129130 ms (p95 151192 ms • avg 124653 ms • N 4)
- [email protected]/Qw… 42421 ms (p95 44679 ms • avg 42575 ms • N 4)
- microsoft/phi-3.5-mini-… 36956 ms (p95 68484 ms • avg 42958 ms • N 4)
- deepseek/deepseek-r1-di… 32203 ms (p95 41058 ms • avg 32277 ms • N 4)
- qwen/qwen3-14b 30902 ms (p95 34634 ms • avg 30297 ms • N 4)
Per-scene duration for this suite.
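For readers who want to reproduce figures of this kind from raw timings, here is a minimal sketch of computing p95, average, and N over a small set of per-scene durations. The percentile method (linear interpolation between closest ranks) is an assumption, since the page does not say how its p95 is derived, and the function names and sample durations are hypothetical.

```python
import math


def percentile(samples, p):
    """Linear-interpolation percentile of samples, with p in [0, 1]."""
    if not samples:
        raise ValueError("samples must be non-empty")
    xs = sorted(samples)
    rank = (len(xs) - 1) * p          # fractional position in the sorted list
    lo, hi = math.floor(rank), math.ceil(rank)
    return xs[lo] + (xs[hi] - xs[lo]) * (rank - lo)


def summarize(durations_ms):
    """Return p95, average, and sample count for a list of durations in ms."""
    return {
        "p95": percentile(durations_ms, 0.95),
        "avg": sum(durations_ms) / len(durations_ms),
        "n": len(durations_ms),
    }


# Hypothetical durations for four scenes, not taken from the dashboard.
example = summarize([100, 200, 300, 400])

if __name__ == "__main__":
    print(example)
```

With only four samples per model, p95 sits close to the single slowest scene, so one slow run can dominate it.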
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
Recent Runs
- 22861339 (Dec. 17, 2025, midnight)
- 27078079 (Dec. 16, 2025, midnight)
- 21785851 (Dec. 15, 2025, midnight)
- 24770169 (Dec. 14, 2025, midnight)
- 21707580 (Dec. 13, 2025, midnight)
- 26618901 (Dec. 12, 2025, midnight)
- 22599068 (Dec. 11, 2025, midnight)
- 21976736 (Dec. 10, 2025, midnight)
- 25307027 (Dec. 9, 2025, midnight)
- 22264682 (Dec. 8, 2025, midnight)