Engineering Docs

Score Metrics

How the final precision report is calculated.

The 4 Precision Pillars

Autonomy

The agent's ability to handle complex user intent without breaking its primary mission.

Persona Consistency

Measures if the agent maintained the specific tone and role defined in the Master Context.

Domain Accuracy

Evaluates the factual correctness of the agent's answers based on the provided context.

Boundary Adherence

Did the agent violate any of the explicitly defined 'Do Not' rules?