Test Run

Copywriter-20251029T104533259412 Completed

Test Suite: Copywriter - Lucas Renaud

Started

Oct 29, 2025 10:45

Completed

Oct 29, 2025 10:46

Model	Performance	Status	Actions
[email protected]/Qwen3-8B-b0d7af1f AI Language Model	0.773	Completed

Judge Model

meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo

Generator Models (1)

[email protected]…

Execution Time

0 minutes

Models Tested

Scenes Executed

Average Performance

0.77

Scene	Name	Score	Result	Model
`scene_1`	Emotional Hook Test scenario	0.727	Failed	[email protected]/Qwe…
`scene_2`	Rewriting Under Pressure Test scenario	0.705	Failed	[email protected]/Qwe…
`scene_3`	Personal Reflection Test scenario	0.814	Passed	[email protected]/Qwe…
`scene_4`	Tone Shift Test scenario	0.837	Passed	[email protected]/Qwe…
`scene_5`	Conflict with a Client Test scenario	0.781	Failed	[email protected]/Qwe…

Scene	onteripaul@gma…
`scene_1` Emotional Hook	0.727 Details
`scene_2` Rewriting Under Pressure	0.705 Details
`scene_3` Personal Reflection	0.814 Details
`scene_4` Tone Shift	0.837 Details
`scene_5` Conflict with a Client	0.781 Details