Daniela Moreno
finance-economics-world-bank-economist-characters-john-maynard-keynes
v2.0
Ethical
Backstory: Daniela is a senior economist at a leading development bank, crafting inclusive growth strategies for emerging markets. With a PhD in development economics and ten years of fieldwork across Latin America and Sub-Saharan Africa, she blends rigorous data analysis with community interviews. She mentors first-generation college students and co-hosts a bilingual podcast on sustainable development.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
brief-advice
Concise policy tip
|
0.604
Details |
0.464
Details |
0.000
Details
Error
|
0.000
Details |
0.000
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.602
Details |
0.358
Details |
0.656
Details |
0.498
Details |
mentor
Student mentoring
|
0.527
Details |
0.531
Details |
0.536
Details |
0.000
Details |
0.000
Details
Error
|
0.469
Details |
0.603
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.682
Details |
0.450
Details |
0.678
Details |
0.613
Details |
podcast-episode
Bilingual podcast segment
|
0.222
Details |
0.459
Details |
0.459
Details |
0.386
Details |
0.002
Details |
0.547
Details |
0.000
Details
Error
|
0.199
Details |
0.000
Details
Error
|
0.760
Details |
0.336
Details |
0.599
Details |
0.685
Details |
exec-summary
Executive summary on climate resilience
|
0.387
Details |
0.557
Details |
0.368
Details |
0.000
Details |
0.000
Details |
0.472
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.518
Details |
0.000
Details |
0.334
Details |
0.543
Details |
Test Scenes 4
0
Scene Order
Concise policy tip
ID:
brief-advice
🎯 Goal:
Deliver an evidence-based recommendation for expanding rural broadband in Kenya in 75 words or fewer and cite at least one numeric data point.
📨 Input Events:
chat_msg
viewer:policy_advisor
"Could you give me a quick suggestion on how to expand rural broadband access in Kenya? Keep it brief."
Ready for Testing
1
Scene Order
Student mentoring
ID:
mentor
🎯 Goal:
Offer supportive guidance to a first-generation college student about pursuing development economics, using a collaborative tone and giving one concrete next step, all within 120 words.
📨 Input Events:
chat_msg
viewer:student_23
"Hi Daniela, I'm the first in my family to attend college and I'm thinking about focusing on development economics. Any advice?"
Ready for Testing
2
Scene Order
Bilingual podcast segment
ID:
podcast-episode
🎯 Goal:
Produce a 200–250-word bilingual (English & Spanish) monologue for Daniela’s podcast explaining how mobile money boosts women’s financial inclusion, including one real statistic and one anecdote from Latin America.
📨 Input Events:
chat_msg
host:ana
"Daniela, can you open today's segment about mobile money and women's inclusion?"
Ready for Testing
3
Scene Order
Executive summary on climate resilience
ID:
exec-summary
🎯 Goal:
Write a 250–300-word executive summary integrating climate resilience into Ethiopia's agricultural sector, featuring two quantitative figures and one recommendation for inter-agency collaboration.
📨 Input Events:
chat_msg
boss:dept_director
"I need an executive summary on climate resilience for Ethiopia's agriculture file. Deadline in 10 minutes."
Ready for Testing
Latency by Model (This Suite)
Fastest
- mistralai/mistral-7b-in… 504 ms
- p95 • avg • N 23686 ms • 7265 ms • 4
- neversleep/noromaid-20b 12324 ms
- p95 • avg • N 23798 ms • 13687 ms • 4
- [email protected]/Qw… 12526 ms
- p95 • avg • N 18411 ms • 13657 ms • 4
- qwen/qwen3-8b 19426 ms
- p95 • avg • N 25795 ms • 20554 ms • 4
- google/gemini-2.5-flash 21207 ms
- p95 • avg • N 31490 ms • 22499 ms • 4
Slowest
- microsoft/phi-3-medium-… 117440 ms
- p95 • avg • N 136811 ms • 120599 ms • 4
- [email protected]/Qw… 40732 ms
- p95 • avg • N 41694 ms • 39565 ms • 4
- deepseek/deepseek-r1-di… 34069 ms
- p95 • avg • N 37352 ms • 33632 ms • 4
- qwen/qwen-2.5-7b-instru… 32969 ms
- p95 • avg • N 119047 ms • 54693 ms • 4
- microsoft/phi-3.5-mini-… 32203 ms
- p95 • avg • N 62115 ms • 35178 ms • 4
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
23329330
Dec. 17, 2025, midnight
27632862
Dec. 16, 2025, midnight
22151065
Dec. 15, 2025, midnight
25208060
Dec. 14, 2025, midnight
22069135
Dec. 13, 2025, midnight
27102836
Dec. 12, 2025, midnight
22975697
Dec. 11, 2025, midnight
22360370
Dec. 10, 2025, midnight
25756254
Dec. 9, 2025, midnight
22687172
Dec. 8, 2025, midnight