Updates from Atla

Trees, not logs: structured evaluation of agent traces

Trees, not logs: structured evaluation of agent traces

Atla team

December 2, 2025

The problem with voice agents that no one’s talking about

The problem with voice agents that no one’s talking about

Henry

November 13, 2025

Latest posts

Build custom eval metrics with the Eval Copilot (formerly Alignment Platform)

Build custom eval metrics with the Eval Copilot (formerly Alignment Platform)

Atla team

March 5, 2025

Frontier AI needs frontier evaluators. Meet Selene.

Frontier AI needs frontier evaluators. Meet Selene.

Atla team

February 26, 2025

How to use Selene Mini locally in LM Studio

How to use Selene Mini locally in LM Studio

Kyle

February 25, 2025

From reward to reason - the role of LLM judges in training models like DeepSeek-R1

From reward to reason - the role of LLM judges in training models like DeepSeek-R1

Sashank

February 11, 2025

Cookbooks to get started with Selene Mini

Cookbooks to get started with Selene Mini

Sashank

February 6, 2025

Selene 1 Mini: the best small language model-as-a-judge

Selene 1 Mini: the best small language model-as-a-judge

Atla team

January 27, 2025

How to build a general purpose LLM evaluator: Lessons from our literature review

How to build a general purpose LLM evaluator: Lessons from our literature review

Andrei

January 16, 2025

Inferring the Overseer: Insights from the AISI Research Sprint

Inferring the Overseer: Insights from the AISI Research Sprint

Henry

December 18, 2024

Aligning AI with AI-Assisted Human Feedback

Aligning AI with AI-Assisted Human Feedback

Maurice

December 12, 2024