Raymond Lee

asian-philathropists-li-ka-shing v2.0 Ethical
Backstory: Raymond Lee is a self-made magnate who grew from humble beginnings in port logistics to helm a diversified empire spanning telecommunications and health-tech. Now in his early fifties, he quietly channels the bulk of his wealth into medical research institutes and youth entrepreneurship grants across Asia and North America. Known for a reserved demeanor and strategic mindset, Raymond favors concise, data-driven dialogue over showy displays. He values measurable impact, long-term thinking, and calm leadership in crisis.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
Scenes: portfolio-update (Upcoming Investment Focus), disaster-response (Rapid Aid Announcement), graduation-keynote (Keynote for Entrepreneurship Graduates), monthly-reflection (Board Reflection Memo). Models are listed in the dashboard's column order.

#   Model                   portfolio-update  disaster-response  graduation-keynote  monthly-reflection
1   deepseek/deepseek-r…    0.701             0.676              0.130               0.473
2   google/gemini-2.5-f…    0.853             0.875              0.669               0.263
3   google/gemma-3-12b-…    0.744             0.884              0.227               0.063
4   meta-llama/llama-3.…    0.571             0.487              0.000               0.000
5   microsoft/phi-3-med…    0.028             0.000 (Error)      0.000               0.035
6   microsoft/phi-3.5-m…    0.733             0.000 (Error)      0.358               0.000 (Error)
7   mistralai/mistral-7…    0.675             0.842              0.539               0.777
8   neversleep/noromaid…    0.773             0.000 (Error)      0.370               0.000 (Error)
9   [email protected]         0.000 (Error)     0.000 (Error)      0.000 (Error)       0.000 (Error)
10  [email protected]         0.000 (Error)     0.000 (Error)      0.000 (Error)       0.000 (Error)
11  [email protected]         0.000 (Error)     0.000 (Error)      0.000 (Error)       0.000 (Error)
12  [email protected]         0.810             0.858              0.625               0.032
13  [email protected]         0.848             0.875              0.195               0.712
14  qwen/qwen-2.5-7b-in…    0.000 (Error)     0.544              0.343               0.668
15  qwen/qwen3-14b          0.836             0.590              0.112               0.226
16  qwen/qwen3-8b           0.774             0.787              0.535               0.000
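The matrix mixes genuine low scores with cells flagged Error, both shown as 0.000. A minimal Python sketch of one way to summarize a model across the four scenes while keeping the two cases apart is given here. The `scores` mapping and the `summarize` helper are illustrative names, not part of the dashboard, and whether the dashboard itself excludes errored cells from its aggregates is not stated, so that choice is an assumption.

```python
from statistics import mean

# Scores copied from the matrix, in scene order:
# portfolio-update, disaster-response, graduation-keynote, monthly-reflection.
# None marks a cell the dashboard flags as "Error".
scores = {
    "google/gemini-2.5-f…": [0.853, 0.875, 0.669, 0.263],
    "microsoft/phi-3.5-m…": [0.733, None, 0.358, None],
    "qwen/qwen3-14b": [0.836, 0.590, 0.112, 0.226],
}

def summarize(cells):
    """Return (mean over non-errored cells, number of errored cells)."""
    valid = [c for c in cells if c is not None]
    avg = round(mean(valid), 3) if valid else None
    return avg, len(cells) - len(valid)

summary = {model: summarize(cells) for model, cells in scores.items()}
for model, (avg, errors) in summary.items():
    print(f"{model:<24} mean={avg} errors={errors}")
```

Reporting the error count next to the mean avoids reading a failed run as a poor response.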
Test Scenes (4)
Scene Order: 0
Upcoming Investment Focus
ID: portfolio-update
🎯 Goal:
Answer in under 120 words, outlining next-quarter target sectors while maintaining reserved, strategic voice.
📨 Input Events:
chat_msg viewer:investor_joe
"Can you briefly outline which sectors you're targeting in the next quarter?"
Ready for Testing
Scene Order: 1
Rapid Aid Announcement
ID: disaster-response
🎯 Goal:
Issue a concise (<90 words) statement detailing logistical aid and funding commitment for typhoon relief, reflecting calm leadership.
📨 Input Events:
world_event news_feed
"Breaking: A category-4 typhoon has severely damaged the coastal region where Raymond’s former flagship port operates."
Ready for Testing
Scene Order: 2
Keynote for Entrepreneurship Graduates
ID: graduation-keynote
🎯 Goal:
Deliver a motivating keynote of 350-450 words in a measured tone, emphasizing resilience, patient capital, and social impact.
📨 Input Events:
chat_msg viewer:dean_university
"Our entrepreneurship graduates would be honored if you could give them a keynote. Could you?"
Ready for Testing
Scene Order: 3
Board Reflection Memo
ID: monthly-reflection
🎯 Goal:
Provide a structured memo (headings) of 300-400 words that includes at least two concrete impact metrics and one next-step commitment, maintaining reserved, strategic voice.
📨 Input Events:
chat_msg assistant:board_secretary
"Raymond, the board needs your monthly reflection memo focusing on the impact of our medical research grants."
Ready for Testing
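Each scene goal states a length limit, so a simple pre-check can reject responses that miss it before any scoring happens. The sketch below is one possible encoding. `LIMITS` and `within_limit` are hypothetical names, words are counted by whitespace splitting, "under 120" and "<90" are read as strict upper bounds, and the two ranges are treated as inclusive. The dashboard's own counting rule is not shown, so all of these are assumptions.

```python
# Hypothetical encoding of the length limits stated in the scene goals.
# Each value is (min_words, max_words); None means no bound on that side.
LIMITS = {
    "portfolio-update": (None, 119),     # "under 120 words"
    "disaster-response": (None, 89),     # "<90 words"
    "graduation-keynote": (350, 450),    # "350-450 words", assumed inclusive
    "monthly-reflection": (300, 400),    # "300-400 words", assumed inclusive
}

def within_limit(scene_id: str, text: str) -> bool:
    """Check a response against its scene's word-count range."""
    low, high = LIMITS[scene_id]
    n = len(text.split())  # whitespace-delimited word count
    return (low is None or n >= low) and (high is None or n <= high)

print(within_limit("disaster-response", "word " * 89))  # True
print(within_limit("disaster-response", "word " * 90))  # False
```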
Latency by Model (This Suite)
Fastest
Model                      Latency     p95         avg         N
[email protected]/Qw…        356 ms      569 ms      370 ms      4
[email protected]/Qw…        583 ms      753 ms      542 ms      4
[email protected]/Qw…        10609 ms    11460 ms    10662 ms    4
[email protected]/Qw…        12015 ms    16074 ms    12349 ms    4
neversleep/noromaid-20b    21859 ms    29272 ms    19734 ms    21
Slowest
Model                      Latency     p95         avg         N
microsoft/phi-3-medium-…   589690 ms   1213649 ms  723674 ms   26
qwen/qwen3-8b              98958 ms    155607 ms   101334 ms   32
[email protected]/Qw…        41836 ms    44456 ms    41959 ms    4
microsoft/phi-3.5-mini-…   30506 ms    81259 ms    42668 ms    17
deepseek/deepseek-r1-di…   30037 ms    37422 ms    31288 ms    21
Per-scene duration for this suite.
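The latency panel reports p95, avg, and N for each model. As a rough illustration of how such figures can be derived from raw per-scene durations, here is a small Python sketch. `latency_summary` is a hypothetical helper, the nearest-rank percentile is an assumed method (the dashboard does not say which one it uses), and the sample values are made up rather than taken from the suite.

```python
import math
from statistics import mean

def latency_summary(samples_ms):
    """Return (p95, avg, N) for a list of latencies in milliseconds.

    p95 uses the nearest-rank method, which is an assumption here.
    """
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    n = len(ordered)
    rank = math.ceil(95 * n / 100)  # 1-based nearest rank
    return ordered[rank - 1], mean(ordered), n

# Made-up sample values, not taken from the dashboard.
p95, avg, n = latency_summary([340, 356, 362, 420])
print(p95, avg, n)
```

With only four samples, as for several models in this suite, nearest-rank p95 is simply the slowest run, so p95 and the maximum coincide.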
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
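The schema lists weights for its top three dimensions, which suggests that a scene score combines per-dimension scores by weight. The sketch below shows one plausible form, a weighted mean normalized by the weights actually present. The remaining dimensions and their weights are not shown on this page, so they are left out, and `weighted_score` plus the example scores are hypothetical. The framework's real aggregation formula is not documented here.

```python
# Weights for the three dimensions listed in the schema panel; the other
# dimensions and their weights are not shown, so they are omitted.
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}

def weighted_score(dimension_scores, weights=WEIGHTS):
    """Weighted mean over the dimensions present in both mappings."""
    shared = [d for d in dimension_scores if d in weights]
    total_weight = sum(weights[d] for d in shared)
    if total_weight == 0:
        raise ValueError("no weighted dimensions to score")
    return sum(weights[d] * dimension_scores[d] for d in shared) / total_weight

# Hypothetical per-dimension scores for a single response.
example = {
    "Character Authenticity": 0.9,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.7,
}
print(round(weighted_score(example), 3))
```

Normalizing by the weights in use keeps the result on the same 0 to 1 scale as the inputs even when only a subset of dimensions is available.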
Recent Runs
16520274  Dec. 17, 2025, midnight
19596586  Dec. 16, 2025, midnight
15588622  Dec. 15, 2025, midnight
17218267  Dec. 14, 2025, midnight
15599475  Dec. 13, 2025, midnight
19289808  Dec. 12, 2025, midnight
16549293  Dec. 11, 2025, midnight
15760129  Dec. 10, 2025, midnight
18307903  Dec. 9, 2025, midnight
15590572  Dec. 8, 2025, midnight