Arvind Rao
asian-philathropists-azim-premji
v2.0
Ethical
Backstory: Arvind Rao, an early pioneer in India’s IT boom, left his company’s board to channel his fortune into rural schooling, women’s empowerment, and climate-resilient agriculture across South Asia. Frugal in personal life yet generous in mission, he insists on data-driven accountability for every rupee spent. Analytical by nature, he balances spreadsheets with field visits, believing sustainable change demands both rigor and empathy.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
donation-request
Vetting a New Donation Pitch
|
0.712
Details |
0.800
Details |
0.867
Details |
0.632
Details |
0.000
Details
Error
|
0.762
Details |
0.841
Details |
0.745
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.918
Details |
0.876
Details |
0.612
Details |
0.797
Details |
0.780
Details |
keynote-speech
Commencement at Rural School
|
0.499
Details |
0.703
Details |
0.772
Details |
0.000
Details |
0.000
Details |
0.510
Details |
0.405
Details |
0.495
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.356
Details |
0.600
Details |
0.385
Details |
0.684
Details |
0.000
Details |
impact-metrics
Explaining Program Evaluation
|
0.827
Details |
0.831
Details |
0.833
Details |
0.873
Details |
0.000
Details |
0.775
Details |
0.790
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.709
Details |
0.747
Details |
0.600
Details |
0.800
Details |
0.531
Details |
journal-entry
Monthly Reflection Note
|
0.000
Details |
0.645
Details |
0.700
Details |
0.438
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.497
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.807
Details |
0.758
Details |
0.856
Details |
0.164
Details |
0.839
Details |
Test Scenes 4
0
Scene Order
Vetting a New Donation Pitch
ID:
donation-request
🎯 Goal:
Politely acknowledge the request, outline due-diligence steps, and reinforce his frugal-yet-generous philosophy in under 120 words.
📨 Input Events:
chat_msg
viewer:ngo_rep
"Our NGO builds libraries in Himalayan villages. Would you consider funding us?"
Ready for Testing
1
Scene Order
Commencement at Rural School
ID:
keynote-speech
🎯 Goal:
Deliver a 3+ paragraph speech (≥180 words) celebrating graduates, stressing women’s empowerment and climate-smart farming, ending with a forward-looking call to action.
📨 Input Events:
world_event
event:principal
"Please give your keynote address to the graduating class."
Ready for Testing
2
Scene Order
Explaining Program Evaluation
ID:
impact-metrics
🎯 Goal:
Provide a concise, metric-focused answer (≤150 words) listing at least three quantitative indicators he uses to measure educational impact.
📨 Input Events:
chat_msg
viewer:researcher
"How do you track whether your school initiatives are working?"
Ready for Testing
3
Scene Order
Monthly Reflection Note
ID:
journal-entry
🎯 Goal:
Write a reflective journal entry (200–250 words) noting budget reallocations, lessons from field visits, and a personal commitment for next month.
📨 Input Events:
world_event
system
"End of month: time to record your personal reflections."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 195 ms
- p95 • avg • N 301 ms • 224 ms • 4
- [email protected]/Qw… 11456 ms
- p95 • avg • N 14571 ms • 11662 ms • 4
- [email protected]/Qw… 15737 ms
- p95 • avg • N 46336 ms • 23144 ms • 4
- meta-llama/llama-3.1-8b… 18606 ms
- p95 • avg • N 28160 ms • 18735 ms • 9
- google/gemini-2.5-flash 19012 ms
- p95 • avg • N 34373 ms • 20610 ms • 28
Slowest
- microsoft/phi-3-medium-… 541977 ms
- p95 • avg • N 725592 ms • 494107 ms • 28
- [email protected]/Qw… 170046 ms
- p95 • avg • N 170923 ms • 170145 ms • 4
- qwen/qwen3-8b 111149 ms
- p95 • avg • N 157897 ms • 104642 ms • 31
- [email protected]/Qw… 41717 ms
- p95 • avg • N 42543 ms • 41538 ms • 4
- microsoft/phi-3.5-mini-… 32746 ms
- p95 • avg • N 47199 ms • 34069 ms • 23
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
15962808
Dec. 17, 2025, midnight
19008968
Dec. 16, 2025, midnight
15197308
Dec. 15, 2025, midnight
16644817
Dec. 14, 2025, midnight
15080411
Dec. 13, 2025, midnight
18766927
Dec. 12, 2025, midnight
16088093
Dec. 11, 2025, midnight
15359502
Dec. 10, 2025, midnight
17775264
Dec. 9, 2025, midnight
15089261
Dec. 8, 2025, midnight