Priya Basu
food-hospitality-culinary-arts-food-critic-characters-auguste-escoffier
v2.0
Ethical
Backstory: Priya Basu is a multilingual culinary-heritage critic who travels among diasporic communities worldwide. She documents how traditional recipes adapt abroad, weaving together rigorous history, anthropology, and first-hand interviews with home cooks and restaurateurs. Her prose is scholarly yet warm, often peppered with greetings in local languages.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| airport-intro (Arrival in São Paulo) | 0.625 | 0.688 | 0.000 (Error) | 0.000 (Error) | 0.364 | 0.625 | 0.615 |
| translate-punjabi-dish (Menu translation request) | 0.819 | 0.830 | 0.000 (Error) | 0.000 (Error) | 0.000 | 0.709 | 0.614 |
| quote-street-vendor (Capturing a vendor quote) | 0.699 | 0.715 | 0.000 (Error) | 0.000 (Error) | 0.669 | 0.729 | 0.540 |
| superchat-thanks (Thanking a supporter) | 0.460 | 0.689 | 0.000 (Error) | 0.000 (Error) | 0.716 | 0.479 | 0.711 |
| longform-tempura-article (Article excerpt on Brazilian Tempura) | 0.253 | 0.510 | 0.000 (Error) | 0.000 (Error) | 0.336 | 0.417 | 0.344 |
| longform-newsletter-reflection (Bilingual newsletter reflection) | 0.485 | 0.397 | 0.000 (Error) | 0.000 (Error) | 0.198 | 0.094 | 0.320 |
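
To compare models across the whole suite, the per-scene scores in the matrix can be averaged per column. The sketch below is only an illustration: it assumes a plain unweighted mean over the six scenes (the dashboard may aggregate differently), leaves out the two columns that errored on every scene, and uses the hypothetical names `SCORES`, `mean_scores`, `MEANS`, and `BEST_MODEL`.

```python
# Scores copied from the Scene Performance Matrix, in scene order.
# The two columns that errored on every scene are left out.
SCORES = {
    "meta-llama/llama-3.…": [0.625, 0.819, 0.699, 0.460, 0.253, 0.485],
    "mistralai/mistral-7…": [0.688, 0.830, 0.715, 0.689, 0.510, 0.397],
    "qwen/qwen-2.5-7b-in…": [0.364, 0.000, 0.669, 0.716, 0.336, 0.198],
    "qwen/qwen3-14b": [0.625, 0.709, 0.729, 0.479, 0.417, 0.094],
    "qwen/qwen3-8b": [0.615, 0.614, 0.540, 0.711, 0.344, 0.320],
}


def mean_scores(scores):
    """Return the unweighted mean score per model (an assumed aggregation)."""
    return {model: sum(vals) / len(vals) for model, vals in scores.items()}


MEANS = mean_scores(SCORES)
BEST_MODEL = max(MEANS, key=MEANS.get)

if __name__ == "__main__":
    # Print models from highest to lowest mean score.
    for model, mean in sorted(MEANS.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{model:24s} {mean:.3f}")
```

Under this assumed aggregation, the `mistralai/mistral-7…` column has the highest mean (about 0.638) and `qwen/qwen-2.5-7b-in…` the lowest (about 0.381), the latter pulled down by its 0.000 on `translate-punjabi-dish`.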
Test Scenes 6
Scene Order: 0
Arrival in São Paulo
ID: airport-intro
🎯 Goal: Clearly state her purpose of researching diasporic foodways, greet the asker warmly, and mention at least one historical reference within two sentences.
📨 Input Events: chat_msg • viewer:ricardo • "What brings you to São Paulo?"
Ready for Testing
Scene Order: 1
Menu translation request
ID: translate-punjabi-dish
🎯 Goal: Provide an accurate English translation of 'Sarson da Saag' and add a concise cultural note in no more than two sentences.
📨 Input Events: chat_msg • chef:amarjit • "Can you help me translate 'Sarson da Saag' for my pop-up menu?"
Ready for Testing
Scene Order: 2
Capturing a vendor quote
ID: quote-street-vendor
🎯 Goal: Respond by weaving in a direct quote from the vendor and briefly situate it anthropologically, all in one paragraph of under 80 words.
📨 Input Events: world_event • vendor:luis • "My grandmother always said feijoada tastes better the second day."
Ready for Testing
Scene Order: 3
Thanking a supporter
ID: superchat-thanks
🎯 Goal: Thank the supporter by name, reference their comment, and add one historical insight about Trinidadian roti in 2–3 sentences.
📨 Input Events: superchat • supporter:samira • YouTube • $20 • "Loved your last article on Trinidad roti evolutions!"
Ready for Testing
Scene Order: 4
Article excerpt on Brazilian Tempura
ID: longform-tempura-article
🎯 Goal: Produce a polished article excerpt of at least 250 words in three paragraphs linking Japanese immigration to Brazil with the evolution of tempura, including one interview quote from chef Kenji and one dated historical fact.
📨 Input Events: chat_msg • editor:marco • "Send me the tempura section for tomorrow's feature."
Ready for Testing
Scene Order: 5
Bilingual newsletter reflection
ID: longform-newsletter-reflection
🎯 Goal: Write a reflective newsletter entry in four paragraphs (~300 words) alternating English and Spanish each paragraph, sharing personal observations from multiple diasporic kitchens.
📨 Input Events: chat_msg • newsletter-platform • "Draft your next subscriber update."
Ready for Testing
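
Several scene goals above carry mechanical length constraints ("one paragraph of under 80 words", "at least 250 words in three paragraphs"). How the suite actually checks these is not stated on this page; the following is a minimal sketch of how such checks could be pre-screened, assuming whitespace-delimited words and blank-line-delimited paragraphs. All function names are hypothetical.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-delimited tokens (an assumed definition of 'word')."""
    return len(text.split())


def count_paragraphs(text: str) -> int:
    """Count blocks of text separated by one or more blank lines."""
    blocks = re.split(r"\n\s*\n", text.strip())
    return len([b for b in blocks if b.strip()])


def meets_vendor_quote_limits(reply: str) -> bool:
    """quote-street-vendor: one paragraph of under 80 words."""
    return count_paragraphs(reply) == 1 and count_words(reply) < 80


def meets_tempura_excerpt_limits(reply: str) -> bool:
    """longform-tempura-article: at least 250 words in three paragraphs."""
    return count_paragraphs(reply) == 3 and count_words(reply) >= 250
```

These checks cover only the countable parts of each goal; requirements such as including a direct quote or a dated historical fact would need a separate judgment step.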
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 7267 ms (p95 7559 ms • avg 6848 ms • N 6)
- qwen/qwen-2.5-7b-instru… 23209 ms (p95 99683 ms • avg 37200 ms • N 8)
- qwen/qwen3-14b 25255 ms (p95 55736 ms • avg 30068 ms • N 11)
- meta-llama/llama-3.1-8b… 25679 ms (p95 80919 ms • avg 37607 ms • N 10)
- qwen/qwen3-8b 26414 ms (p95 34139 ms • avg 26629 ms • N 12)
Slowest
- [email protected]/Qw… 38540 ms (p95 41660 ms • avg 39022 ms • N 6)
- mistralai/mistral-7b-in… 28849 ms (p95 37651 ms • avg 30279 ms • N 12)
- qwen/qwen3-8b 26414 ms (p95 34139 ms • avg 26629 ms • N 12)
- meta-llama/llama-3.1-8b… 25679 ms (p95 80919 ms • avg 37607 ms • N 10)
- qwen/qwen3-14b 25255 ms (p95 55736 ms • avg 30068 ms • N 11)
Per-scene duration for this suite.
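
The p95, avg, and N figures listed for each model can be derived from a list of per-scene durations. The page does not say which percentile method the dashboard uses, so this sketch assumes the nearest-rank definition; `summarize_latency` and the sample durations are hypothetical.

```python
import math


def summarize_latency(durations_ms):
    """Return p95 (nearest-rank), mean, and sample count for durations in ms."""
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    n = len(ordered)
    # Nearest-rank percentile: the smallest value with at least 95% of
    # the samples at or below it.
    rank = math.ceil(95 * n / 100)
    return {"p95": ordered[rank - 1], "avg": sum(ordered) / n, "n": n}


if __name__ == "__main__":
    # Made-up durations, not taken from the suite.
    sample = [100, 200, 300, 400, 500, 600, 700, 800, 900, 1000]
    print(summarize_latency(sample))
```

With small sample counts such as the N of 6 to 12 shown here, nearest-rank p95 is simply the largest observed duration, which may explain why some p95 values sit far above the average.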
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
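
Dimension weights like these are typically combined into a single scene score as a weighted average. The page lists only the top three weights and does not describe the aggregation, so the sketch below is an assumption: it normalizes by the sum of the weights actually supplied, and the name `weighted_score` and the per-dimension scores in the usage example are hypothetical.

```python
# Weights as listed under "Top Weighted Dimensions"; the remaining
# dimensions of the schema are not shown on this page.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights):
    """Weighted average of per-dimension scores, normalized over the
    dimensions that were actually scored (an assumed aggregation)."""
    total_weight = sum(weights[name] for name in dimension_scores)
    if total_weight == 0:
        raise ValueError("no weighted dimensions were scored")
    weighted_sum = sum(weights[name] * score
                       for name, score in dimension_scores.items())
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Made-up per-dimension scores for illustration only.
    example = {
        "Character Authenticity": 0.8,
        "Plan Validity": 0.6,
        "Contextual Intelligence": 0.7,
    }
    print(round(weighted_score(example, TOP_WEIGHTS), 3))
```

Normalizing by the supplied weights keeps the result in the same 0 to 1 range as the inputs even when only a subset of dimensions is available.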
Recent Runs
- 37619916 • Dec. 17, 2025, 12:01 a.m.
- 53175721 • Dec. 16, 2025, 12:01 a.m.
- 33083976 • Dec. 15, 2025, 12:01 a.m.
- 34591310 • Dec. 14, 2025, 12:01 a.m.
- 33581948 • Dec. 13, 2025, 12:01 a.m.
- 46574318 • Dec. 12, 2025, 12:01 a.m.
- 42805251 • Dec. 11, 2025, 12:01 a.m.
- 35351592 • Dec. 10, 2025, 12:01 a.m.
- 48980254 • Dec. 9, 2025, 12:01 a.m.
- 37397372 • Dec. 8, 2025, 12:01 a.m.