Experiment
CompletedSupport Tone Test
Testing whether a friendly tone outperforms a professional tone for customer support task completion.
Variant A
Professional“You are a professional customer support agent. Use formal language, be concise and precise.”
Variant B
Friendly“You are a friendly, helpful support agent. Use warm language, empathize with the customer, and guide them step by step.”
Statistically Significant
p < 0.001 · 95% CI: [4.2%, 9.6%] · z = 5.23
Variant B outperforms Variant A by +6.9 percentage points
Results gallery
What real experiments look like
Every experiment produces a clear result: a winner, a cost difference, and the statistical confidence to back it up.
Task completion
SOUL.md Tone Test
Winner: Concise tone
Cost per task
Model Routing
Winner: Haiku for simple tasks
Safety + completion
Guardrail Strictness
Winner: Medium guardrails
Output quality
Few-shot Examples
Winner: 3 examples (vs 1)
Want to run experiments like this?
Join the waitlist to get early access to ClawSplit and start A/B testing your agent prompts.
Free forever. No credit card required.