Linda Yang

finance-economics-investment-analyst-characters-joseph-de-la-vega v2.0 Ethical
Backstory: Raised in Vancouver, Linda earned dual degrees in statistics and environmental economics before joining a mid-sized asset manager. She now covers North American renewable-energy equities, favoring low-volatility strategies backed by rigorous valuation models. Meticulous and risk-averse at work, she volunteers as a mentor to first-generation college students in her spare time.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | Description | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected] | [email protected] | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|
| introduce | Brief self-introduction | 0.616 | 0.626 | 0.000 (Error) | 0.000 (Error) | 0.519 | 0.556 | 0.528 |
| dcf-outline | DCF framework request | 0.431 | 0.518 | 0.000 (Error) | 0.000 (Error) | 0.396 | 0.457 | 0.408 |
| mentor-advice | Long-form mentorship guidance | 0.497 | 0.620 | 0.000 (Error) | 0.000 (Error) | 0.592 | 0.520 | 0.668 |
| macro-impact | Interest-rate impact analysis | 0.452 | 0.423 | 0.000 (Error) | 0.000 (Error) | 0.276 | 0.369 | 0.422 |
| market-drop | Solar ETF sell-off reaction | 0.000 (Error) | 0.549 | 0.000 (Error) | 0.000 (Error) | 0.524 | 0.466 | 0.770 |
| quarterly-letter | Long-form quarterly outlook | 0.324 | 0.560 | 0.000 (Error) | 0.000 (Error) | 0.348 | 0.165 | 0.688 |
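The matrix mixes real scores with cells flagged "Error" that display as 0.000, so a plain column average would understate any model with a failed scene. The sketch below shows one way to summarize the matrix, assuming that Error cells should be excluded rather than counted as zero. That treatment is an assumption, not something the dashboard states. The two columns where every scene errored are left out, and the names `SCORES`, `mean_valid`, and `summary` are illustrative.

```python
from statistics import mean

# Scores copied from the Scene Performance Matrix, in scene order:
# introduce, dcf-outline, mentor-advice, macro-impact, market-drop, quarterly-letter.
# None marks a cell flagged "Error" (displayed as 0.000 on the dashboard).
SCORES = {
    "meta-llama/llama-3.…": [0.616, 0.431, 0.497, 0.452, None, 0.324],
    "mistralai/mistral-7…": [0.626, 0.518, 0.620, 0.423, 0.549, 0.560],
    "qwen/qwen-2.5-7b-in…": [0.519, 0.396, 0.592, 0.276, 0.524, 0.348],
    "qwen/qwen3-14b": [0.556, 0.457, 0.520, 0.369, 0.466, 0.165],
    "qwen/qwen3-8b": [0.528, 0.408, 0.668, 0.422, 0.770, 0.688],
}


def mean_valid(scores):
    """Average only the scenes that produced a score (assumption: skip errors)."""
    valid = [s for s in scores if s is not None]
    return mean(valid) if valid else None


summary = {model: mean_valid(scores) for model, scores in SCORES.items()}
best_model = max(summary, key=summary.get)

for model, avg in sorted(summary.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{model:<24} {avg:.3f}")
```

Under this treatment qwen/qwen3-8b has the highest mean and mistralai/mistral-7… is second. Counting the one Error cell as zero would lower only the meta-llama average.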
Test Scenes (6)

Scene 0: Brief self-introduction
ID: introduce
🎯 Goal: Offer a concise, data-oriented introduction that mentions role, background, and investment philosophy.
📨 Input Events:
chat_msg viewer:client_1: "Could you introduce yourself briefly?"
Status: Ready for Testing

Scene 1: DCF framework request
ID: dcf-outline
🎯 Goal: Provide a clear DCF outline for Brookfield Renewable Partners that highlights key risk adjustments and conservative assumptions.
📨 Input Events:
chat_msg viewer:pm_john: "Can you sketch a DCF approach for Brookfield Renewable Partners, factoring in their debt load and power-price volatility?"
Status: Ready for Testing

Scene 2: Long-form mentorship guidance
ID: mentor-advice
🎯 Goal: Deliver roughly 250 words of encouraging, actionable advice for a first-generation student seeking an equity-research career.
📨 Input Events:
chat_msg viewer:student_ana: "I'm the first in my family to attend college and want to break into equity research. Any guidance?"
Status: Ready for Testing

Scene 3: Interest-rate impact analysis
ID: macro-impact
🎯 Goal: Offer a concise 2–3 bullet assessment of how rising interest rates affect renewable-energy equities, citing recent data points.
📨 Input Events:
chat_msg viewer:client_2: "Rates keep climbing—how does that hit renewable-energy stocks?"
Status: Ready for Testing

Scene 4: Solar ETF sell-off reaction
ID: market-drop
🎯 Goal: Respond with a risk-averse, data-backed watchlist update and immediate next steps after an 8% solar ETF drop.
📨 Input Events:
world_event market: "TAN solar ETF plunges 8% on tariff concerns."
Status: Ready for Testing

Scene 5: Long-form quarterly outlook
ID: quarterly-letter
🎯 Goal: Draft a ~500-word letter to the investment committee summarizing sector performance, valuation outlook, and proposed portfolio tweaks, maintaining Linda’s analytical tone.
📨 Input Events:
chat_msg viewer:pm_john: "Please prepare our quarterly outlook letter on renewables for the committee."
Status: Ready for Testing
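Each scene above has the same shape: an order, an ID, a goal, and one or more input events made up of an event type, a source, and a quoted message. A minimal sketch of how such a scene could be represented in code follows. The class and field names (`InputEvent`, `Scene`, `kind`, `source`) are hypothetical and are not taken from the evaluation tool itself. Only two of the six scenes are shown.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class InputEvent:
    kind: str    # event type as listed, e.g. "chat_msg" or "world_event"
    source: str  # sender or channel, e.g. "viewer:client_1" or "market"
    text: str    # the quoted message


@dataclass(frozen=True)
class Scene:
    order: int
    scene_id: str
    title: str
    goal: str
    events: Tuple[InputEvent, ...]


SCENES = [
    Scene(
        order=0,
        scene_id="introduce",
        title="Brief self-introduction",
        goal=("Offer a concise, data-oriented introduction that mentions "
              "role, background, and investment philosophy."),
        events=(InputEvent("chat_msg", "viewer:client_1",
                           "Could you introduce yourself briefly?"),),
    ),
    Scene(
        order=4,
        scene_id="market-drop",
        title="Solar ETF sell-off reaction",
        goal=("Respond with a risk-averse, data-backed watchlist update and "
              "immediate next steps after an 8% solar ETF drop."),
        events=(InputEvent("world_event", "market",
                           "TAN solar ETF plunges 8% on tariff concerns."),),
    ),
]

# Scenes triggered by something other than a chat message.
non_chat = [s.scene_id for s in SCENES
            if any(e.kind != "chat_msg" for e in s.events)]
print(non_chat)
```

In the full suite, market-drop is the only scene driven by a world_event rather than a chat_msg.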
Latency by Model (This Suite)

Fastest
| Model | Latency (ms) | p95 (ms) | avg (ms) | N |
|---|---|---|---|---|
| [email protected]/Qw… | 8811 | 9809 | 8362 | 6 |
| qwen/qwen3-14b | 22365 | 27126 | 22579 | 6 |
| qwen/qwen-2.5-7b-instru… | 24936 | 32840 | 25746 | 6 |
| mistralai/mistral-7b-in… | 27537 | 34852 | 29114 | 6 |
| meta-llama/llama-3.1-8b… | 28766 | 39752 | 27860 | 6 |

Slowest
| Model | Latency (ms) | p95 (ms) | avg (ms) | N |
|---|---|---|---|---|
| [email protected]/Qw… | 43862 | 246929 | 109413 | 6 |
| qwen/qwen3-8b | 28945 | 35115 | 27498 | 6 |
| meta-llama/llama-3.1-8b… | 28766 | 39752 | 27860 | 6 |
| mistralai/mistral-7b-in… | 27537 | 34852 | 29114 | 6 |
| qwen/qwen-2.5-7b-instru… | 24936 | 32840 | 25746 | 6 |
Per-scene duration for this suite.
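The Fastest and Slowest lists overlap: three models appear in both, so the ten rows describe seven distinct entries. The sketch below merges the two lists into one ranking and flags entries whose p95 is far above their average. It assumes that identical rows refer to the same model, and that the two truncated `[email protected]/Qw…` rows are different models because their figures differ. The 2.0 ratio threshold and the names `LatencyRow`, `merged`, and `heavy_tail` are illustrative choices, not part of the dashboard.

```python
from typing import NamedTuple


class LatencyRow(NamedTuple):
    model: str
    headline_ms: int  # the unlabeled figure shown next to the model name
    p95_ms: int
    avg_ms: int
    n: int


FASTEST = [
    LatencyRow("[email protected]/Qw…", 8811, 9809, 8362, 6),
    LatencyRow("qwen/qwen3-14b", 22365, 27126, 22579, 6),
    LatencyRow("qwen/qwen-2.5-7b-instru…", 24936, 32840, 25746, 6),
    LatencyRow("mistralai/mistral-7b-in…", 27537, 34852, 29114, 6),
    LatencyRow("meta-llama/llama-3.1-8b…", 28766, 39752, 27860, 6),
]
SLOWEST = [
    LatencyRow("[email protected]/Qw…", 43862, 246929, 109413, 6),
    LatencyRow("qwen/qwen3-8b", 28945, 35115, 27498, 6),
    LatencyRow("meta-llama/llama-3.1-8b…", 28766, 39752, 27860, 6),
    LatencyRow("mistralai/mistral-7b-in…", 27537, 34852, 29114, 6),
    LatencyRow("qwen/qwen-2.5-7b-instru…", 24936, 32840, 25746, 6),
]

# Drop exact duplicate rows (order-preserving), then rank by headline latency.
merged = sorted(dict.fromkeys(FASTEST + SLOWEST), key=lambda r: r.headline_ms)

TAIL_RATIO = 2.0  # hypothetical cut-off for "p95 far above average"
heavy_tail = [r for r in merged if r.p95_ms / r.avg_ms > TAIL_RATIO]

for row in merged:
    flag = "  <- heavy tail" if row in heavy_tail else ""
    print(f"{row.model:<26} {row.headline_ms:>6} ms{flag}")
```

With these figures only the slowest entry crosses the threshold: its p95 of 246929 ms is more than twice its 109413 ms average.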
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
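Only the three largest weights are listed, so it helps to see how much of the total they account for. The sketch below sums them and, assuming the framework's weights are normalized to 1.0 (an assumption the page does not confirm), derives the share left for the unlisted dimensions. The variable names are illustrative.

```python
# Weights copied from the "Top Weighted Dimensions" list.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}

top_share = sum(TOP_WEIGHTS.values())

# Assumption: all dimension weights sum to 1.0, so the rest is the remainder.
remaining_share = 1.0 - top_share

print(f"top three: {top_share:.3f}, unlisted dimensions: {remaining_share:.3f}")
```

Under that assumption the three listed dimensions carry a little under half of the total weight.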
Recent Runs
| Run ID | Date |
|---|---|
| 33582048 | Dec. 17, 2025, 12:01 a.m. |
| 48348890 | Dec. 16, 2025, 12:01 a.m. |
| 29235603 | Dec. 15, 2025, 12:01 a.m. |
| 30670137 | Dec. 14, 2025, 12:01 a.m. |
| 29663133 | Dec. 13, 2025, 12:01 a.m. |
| 42163667 | Dec. 12, 2025, 12:01 a.m. |
| 38223816 | Dec. 11, 2025, 12:01 a.m. |
| 30922005 | Dec. 10, 2025, 12:01 a.m. |
| 44286952 | Dec. 9, 2025, 12:01 a.m. |
| 33070226 | Dec. 8, 2025, 12:01 a.m. |
Latency Overview (This Suite)