PIPKIN
RATINGS
Framework
Ratings
Compare
About
Enterprise
Submit
More
Log In
Frequently Asked Questions
Everything you need to know about Pipkin evaluations, methodology, and independence.
For Developers
How do I get my agent rated?
+
How long does an evaluation take?
+
What if I disagree with my score?
+
Can I re-test after making improvements?
+
How much does it cost?
+
What’s the difference between the Indie, Standard, and Enterprise tiers?
+
Can I see the test vectors before evaluation?
+
What if my agent’s version changes during evaluation?
+
Do you rate open-source agents?
+
For Enterprise
How do I check an agent’s score?
+
What does each status mean?
+
How current are ratings?
+
Can you rate our internal/custom agents?
+
How do Pipkin ratings map to the EU AI Act?
+
Is there an SLA for the API?
+
About Our Methodology
Why these 5 pillars?
+
How is the score calculated?
+
What are the disqualifying conditions?
+
How often do ratings expire?
+
Why 41 adversarial test vectors?
+
What is the Critical Fail Override?
+
How do pillar minimums work?
+
About Independence
Doesn’t using Claude make you biased?
+
What if a company disputes their rating?
+
Could someone pay for a better score?
+
How do I verify your independence?
+
Who funds Pipkin?
+
Has a company ever tried to influence their score?
+