Docs
Company
About
Learn about our team and culture
Careers
Open positions at Atla
Security
How Atla protects its users
Case Studies
Fieldly
How Fieldy uses Atla alongside LangSmith to ship agent improvements
twice as fast
ClaimWise
How ClaimWise spots failure modes of their agent prompts in days instead of weeks
Josepha
How Atla uncovered critical agent failures in JOSEPHA's Deep Research Agent
Research
Selene Models
The best models for evaluation on the market.
Blog
What’s the latest from the Atla labs
Pricing
Sign in
Start for free
Docs
About
Careers
Security
Selene Models
Pricing
Blog
Contact Us
Updates from Atla
How Atla uncovered critical agent failures in JOSEPHA's Deep Research Agent
Atla team
September 10, 2025
Beyond Basic Observability: How Fieldy uses Atla alongside LangSmith to ship agent improvements twice as fast
Atla team
September 1, 2025
Latest posts
Identifying & auto-correcting agent failures: findings from TAU-bench
Nina
April 29, 2025
Introducing the Atla MCP Server: purpose-built LLM Judges now at your command
Atla team
April 22, 2025
Selene Mini: SOTA 8B LLM Judge, now available via API
Atla team
April 15, 2025
Announcing Atla’s native integration with Langfuse
Atla team
March 25, 2025
Best practices for evaluating AI across multiple criteria
Sashank
March 20, 2025
Build custom eval metrics with the Eval Copilot (formerly Alignment Platform)
Young Sun
March 5, 2025
Frontier AI needs frontier evaluators. Meet Selene.
Atla team
February 26, 2025
How to use Selene Mini locally in LM Studio
Kyle
February 25, 2025
From reward to reason - the role of LLM judges in training models like DeepSeek-R1
Sashank
February 11, 2025
Previous
Load more
Load more