Research

Leading AI innovation.

We design large-scale AI systems guided by research that ensures they are safe, secure, privacy-first, and reliably aligned with human intent.

Our mission: build AI that is safe, secure, and privacy-first — designed to serve people, not exploit them.

Our vision. Shape a future where AI empowers humanity with trust, safety, and purpose — where the technology amplifies what we're capable of without amplifying what's harmful.

We publish what we learn, work openly with the broader research community, and welcome external review. Open inquiry is how this field gets safer.

Read our publications →

Three principles. One direction.

The principles we evaluate every research direction against — before the work starts, and again before anything ships.

Safety first

Every step we take starts with one question: is it safe, secure, and privacy-first by design for the people who'll use it?

Aligned by design

We build AI that understands and respects human goals, values, and intentions — not systems that optimize for proxies and hope for the best.

Trust through transparency

We protect user data and design systems people can rely on. Capabilities and limits are documented in plain language.

Where the research happens.

Safety & Alignment

Making AI do what people actually want

Reward modeling, interpretability, and robustness research aimed at the alignment problem head-on. Published, peer-reviewed, and reproducible.

Privacy & Security

Protecting data and standing up to threats

Differential privacy, secure aggregation, adversarial robustness, and red-teaming. Building AI that's safe against the threats we expect — and the ones we don't.

Policy & Impact

How AI changes the world, and how to steer it

Working with policymakers, civil society, and standards bodies on the institutional questions: governance, deployment thresholds, second-order effects.