Linda Yang
finance-economics-investment-analyst-characters-joseph-de-la-vega
v2.0
Ethical
Backstory: Raised in Vancouver, Linda earned dual degrees in statistics and environmental economics before joining a mid-sized asset manager. She now covers North American renewable-energy equities, favoring low-volatility strategies backed by rigorous valuation models. Meticulous and risk-averse at work, she volunteers as a mentor to first-generation college students in her spare time.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| introduce (Brief self-introduction) | 0.616 | 0.626 | 0.000 (Error) | 0.000 (Error) | 0.519 | 0.556 | 0.528 |
| dcf-outline (DCF framework request) | 0.431 | 0.518 | 0.000 (Error) | 0.000 (Error) | 0.396 | 0.457 | 0.408 |
| mentor-advice (Long-form mentorship guidance) | 0.497 | 0.620 | 0.000 (Error) | 0.000 (Error) | 0.592 | 0.520 | 0.668 |
| macro-impact (Interest-rate impact analysis) | 0.452 | 0.423 | 0.000 (Error) | 0.000 (Error) | 0.276 | 0.369 | 0.422 |
| market-drop (Solar ETF sell-off reaction) | 0.000 (Error) | 0.549 | 0.000 (Error) | 0.000 (Error) | 0.524 | 0.466 | 0.770 |
| quarterly-letter (Long-form quarterly outlook) | 0.324 | 0.560 | 0.000 (Error) | 0.000 (Error) | 0.348 | 0.165 | 0.688 |
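The scores in the matrix above can be summarized per model. The snippet below is a minimal sketch, not part of the evaluation tool: it assumes that a cell marked "Error" should be excluded from the average instead of counted as 0.000, and the names `SCORES` and `summarize` are hypothetical. The two columns whose labels are obscured on the page are left out because every one of their cells is an error.

```python
from statistics import mean

# Scores transcribed from the Scene Performance Matrix.
# None marks a cell the dashboard shows as "0.000 / Error".
# Column labels are kept exactly as truncated on the page.
SCENE_IDS = [
    "introduce", "dcf-outline", "mentor-advice",
    "macro-impact", "market-drop", "quarterly-letter",
]
SCORES = {
    "meta-llama/llama-3.…": [0.616, 0.431, 0.497, 0.452, None, 0.324],
    "mistralai/mistral-7…": [0.626, 0.518, 0.620, 0.423, 0.549, 0.560],
    "qwen/qwen-2.5-7b-in…": [0.519, 0.396, 0.592, 0.276, 0.524, 0.348],
    "qwen/qwen3-14b": [0.556, 0.457, 0.520, 0.369, 0.466, 0.165],
    "qwen/qwen3-8b": [0.528, 0.408, 0.668, 0.422, 0.770, 0.688],
}


def summarize(scores):
    """Return {model: (mean of non-error scores, number of error cells)}."""
    summary = {}
    for model, values in scores.items():
        valid = [v for v in values if v is not None]
        errors = len(values) - len(valid)
        # Assumption: errored scenes are excluded from the mean.
        summary[model] = (mean(valid) if valid else None, errors)
    return summary


if __name__ == "__main__":
    for model, (avg, errors) in summarize(SCORES).items():
        print(f"{model:24s} mean={avg:.3f} errors={errors}")
```

Counting errors as zero instead would lower the mean of any model with a failed scene, so the choice should match how the suite itself aggregates results.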
Test Scenes 6
Scene Order: 0
Brief self-introduction
ID: introduce
🎯 Goal: Offer a concise, data-oriented introduction that mentions role, background, and investment philosophy.
📨 Input Events: chat_msg • viewer:client_1 • "Could you introduce yourself briefly?"
Ready for Testing
Scene Order: 1
DCF framework request
ID: dcf-outline
🎯 Goal: Provide a clear DCF outline for Brookfield Renewable Partners that highlights key risk adjustments and conservative assumptions.
📨 Input Events: chat_msg • viewer:pm_john • "Can you sketch a DCF approach for Brookfield Renewable Partners, factoring in their debt load and power-price volatility?"
Ready for Testing
Scene Order: 2
Long-form mentorship guidance
ID: mentor-advice
🎯 Goal: Deliver roughly 250 words of encouraging, actionable advice for a first-generation student seeking an equity-research career.
📨 Input Events: chat_msg • viewer:student_ana • "I'm the first in my family to attend college and want to break into equity research. Any guidance?"
Ready for Testing
Scene Order: 3
Interest-rate impact analysis
ID: macro-impact
🎯 Goal: Offer a concise 2–3 bullet assessment of how rising interest rates affect renewable-energy equities, citing recent data points.
📨 Input Events: chat_msg • viewer:client_2 • "Rates keep climbing—how does that hit renewable-energy stocks?"
Ready for Testing
Scene Order: 4
Solar ETF sell-off reaction
ID: market-drop
🎯 Goal: Respond with a risk-averse, data-backed watchlist update and immediate next steps after an 8% solar ETF drop.
📨 Input Events: world_event • market • "TAN solar ETF plunges 8% on tariff concerns."
Ready for Testing
Scene Order: 5
Long-form quarterly outlook
ID: quarterly-letter
🎯 Goal: Draft a ~500-word letter to the investment committee summarizing sector performance, valuation outlook, and proposed portfolio tweaks, maintaining Linda’s analytical tone.
📨 Input Events: chat_msg • viewer:pm_john • "Please prepare our quarterly outlook letter on renewables for the committee."
Ready for Testing
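Each scene listed above pairs an ID and a goal with one input event. A minimal sketch of how such a scene list could be held in code follows. The class names `InputEvent` and `Scene`, their field names, and the helper `check_suite` are hypothetical and are not taken from the evaluation tool. Only two of the six scenes are shown, one for each event type that appears on the page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InputEvent:
    kind: str    # "chat_msg" or "world_event", as shown on the page
    source: str  # e.g. "viewer:client_1" or "market"
    text: str


@dataclass(frozen=True)
class Scene:
    order: int
    scene_id: str
    title: str
    goal: str
    events: tuple  # tuple of InputEvent


SCENES = [
    Scene(
        order=0,
        scene_id="introduce",
        title="Brief self-introduction",
        goal=("Offer a concise, data-oriented introduction that mentions "
              "role, background, and investment philosophy."),
        events=(InputEvent("chat_msg", "viewer:client_1",
                           "Could you introduce yourself briefly?"),),
    ),
    Scene(
        order=4,
        scene_id="market-drop",
        title="Solar ETF sell-off reaction",
        goal=("Respond with a risk-averse, data-backed watchlist update and "
              "immediate next steps after an 8% solar ETF drop."),
        events=(InputEvent("world_event", "market",
                           "TAN solar ETF plunges 8% on tariff concerns."),),
    ),
]


def check_suite(scenes):
    """Return scene IDs sorted by order; raise ValueError on duplicate IDs."""
    ids = [s.scene_id for s in scenes]
    if len(ids) != len(set(ids)):
        raise ValueError("duplicate scene IDs in suite")
    return [s.scene_id for s in sorted(scenes, key=lambda s: s.order)]
```

Freezing the dataclasses keeps a scene definition from being modified between runs, which helps when the same suite is replayed against several models.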
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 8811 ms (p95 9809 ms • avg 8362 ms • N 6)
- qwen/qwen3-14b 22365 ms (p95 27126 ms • avg 22579 ms • N 6)
- qwen/qwen-2.5-7b-instru… 24936 ms (p95 32840 ms • avg 25746 ms • N 6)
- mistralai/mistral-7b-in… 27537 ms (p95 34852 ms • avg 29114 ms • N 6)
- meta-llama/llama-3.1-8b… 28766 ms (p95 39752 ms • avg 27860 ms • N 6)
Slowest
- [email protected]/Qw… 43862 ms (p95 246929 ms • avg 109413 ms • N 6)
- qwen/qwen3-8b 28945 ms (p95 35115 ms • avg 27498 ms • N 6)
- meta-llama/llama-3.1-8b… 28766 ms (p95 39752 ms • avg 27860 ms • N 6)
- mistralai/mistral-7b-in… 27537 ms (p95 34852 ms • avg 29114 ms • N 6)
- qwen/qwen-2.5-7b-instru… 24936 ms (p95 32840 ms • avg 25746 ms • N 6)
Per-scene duration for this suite.
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
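The page lists dimension weights but does not say how they combine into a scene score. One common approach is a weighted mean, sketched below as an assumption, not as the framework's actual method. The function name `weighted_score` is hypothetical. Only the three weights shown on the page are included, so the result is renormalized over whichever of those dimensions are supplied.

```python
# Weights copied from "Top Weighted Dimensions"; the remaining dimensions
# and their weights are not shown on the page and are therefore omitted.
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights=WEIGHTS):
    """Weighted mean of per-dimension scores (assumed aggregation rule).

    Dimensions without a known weight are ignored, and the weights that
    are used get renormalized so they sum to 1.
    """
    used = {d: w for d, w in weights.items() if d in dimension_scores}
    total = sum(used.values())
    if total == 0:
        raise ValueError("no scored dimension has a known weight")
    return sum(dimension_scores[d] * w for d, w in used.items()) / total
```

For example, a response scoring 1.0 on Character Authenticity and 0.0 on Plan Validity would get 0.182 / (0.182 + 0.155) under this rule, which is about 0.54.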
Recent Runs
- 33582048 • Dec. 17, 2025, 12:01 a.m.
- 48348890 • Dec. 16, 2025, 12:01 a.m.
- 29235603 • Dec. 15, 2025, 12:01 a.m.
- 30670137 • Dec. 14, 2025, 12:01 a.m.
- 29663133 • Dec. 13, 2025, 12:01 a.m.
- 42163667 • Dec. 12, 2025, 12:01 a.m.
- 38223816 • Dec. 11, 2025, 12:01 a.m.
- 30922005 • Dec. 10, 2025, 12:01 a.m.
- 44286952 • Dec. 9, 2025, 12:01 a.m.
- 33070226 • Dec. 8, 2025, 12:01 a.m.