# Anthropic Fellows Program — AI Safety

[Anthropic](https://www.jorb.ai/firms/anthropic.md) · London · United Kingdom · [Research / Applied Science](https://www.jorb.ai/jobs/research-applied-science.md)

Anthropic is hiring a Anthropic Fellows Program — AI Safety in London. Posted 2026-04-10; applications close 2026-06-09.

**Apply**: https://job-boards.greenhouse.io/anthropic/jobs/5183044008

Posted 12d ago.

## Role details

## About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We aim for AI to be safe and beneficial for our users and for society. Our team includes researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

**Apply using this link**. The next cohort of Anthropic fellows starts on July 20, 2026. **Apply by April 26, 2026** to be considered for this cohort. We will continue accepting applications for later cohorts on a rolling basis. In exceptional circumstances, we may accommodate fellows starting outside of usual cohort timelines.

This page is specific to one of the Anthropic Fellows Workstreams. See also the main **Anthropic Fellows posting**.

## Anthropic Fellows Program overview

The Anthropic Fellows Program is designed to foster AI research and engineering talent. We provide funding and mentorship to promising technical talent—regardless of previous experience.

Fellows will primarily use external infrastructure (e.g., open-source models, public APIs) to work on an empirical project aligned with our research priorities, with the goal of producing a **public output** (e.g., a paper submission). In earlier cohorts, over 80% of fellows produced papers.

We run multiple cohorts of Fellows each year and review applications on a rolling basis. This application is for cohorts starting in July 2026 and beyond.

## What to expect

  
- 4 months of full-time research
  
- Direct mentorship from Anthropic researchers
  
- Access to a shared workspace (in either Berkeley, California or London, UK)
  
- Connection to the broader AI safety and security research community
  
- Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD + benefits (vary by country)
  
- Funding for compute (~$15k/month) and other research expenses

## Interview process

The interview process includes an initial application and reference check, technical assessments and interviews, and a research discussion.

**We encourage you to apply even if you do not meet every single qualification.** Not all strong candidates will meet every qualification as listed. We value diverse perspectives and encourage applications from underrepresented groups.

## Compensation

The expected base stipend for this role is 3,850 USD / 2,310 GBP / 4,300 CAD per week, with an expectation of 40 hours per week for 4 months (extension possible).

## Fellows workstreams

To expand the Anthropic Fellows program across teams, we anticipate significant overlap in skills and responsibilities across workstreams and will consider candidates for all workstreams by default.

Some workstreams may include unique assessment steps; please indicate workstream preferences in the application.

Current workstreams include:

  
- AI Safety Fellows
  
- AI Security Fellows
  
- ML Systems & Performance Fellows
  
- Reinforcement Learning Fellows
  
- Economics & Societal Impacts Fellows

This page is specific to one of the Anthropic Fellows Workstreams. See also the main **Anthropic Fellows posting**.

## Across the workstreams, you may be a good fit if you:

  
- Are motivated by making AI safe and beneficial for society
  
- Are excited to transition into empirical AI research and seek a full-time role at Anthropic
  
- Have a strong technical background in computer science, mathematics, or physics
  
-  Thrive in fast-paced, collaborative environments
  
- Can implement ideas quickly and communicate clearly

## Strong candidates may also have:

  
- Strong background in a discipline relevant to a specific Fellows workstream (e.g., economics, social sciences, or cybersecurity)
  
- Experience in research or engineering related to their workstream

## Candidates must be:

  
- Fluent in Python programming
  
- Available to work full-time on the Fellows program

## AI Safety Fellows

### Mentors, research areas, & past projects

Fellows will undergo a project selection and mentor matching process. Potential mentors include:

  
- Sam Bowman
  
- Sara Price
  
- Alex Tamkin
  
- Nina Panickssery
  
- Trenton Bricken
  
- Logan Graham
  
- Jascha Sohl-Dickstein
  
- Joe Benton
  
- Collin Burns
  
- Fabien Roger
  
- Samuel Marks
  
- Kyle Fish
  
- Ethan Perez

Our mentors will lead projects in select AI safety research areas, such as:

  
- **Scalable Oversight:** Developing techniques to keep highly capable models helpful and honest, even as they surpass human-level intelligence in various domains.
  
- **Adversarial Robustness and AI Control:** Creating methods to ensure advanced AI systems remain safe and harmless in unfamiliar or adversarial scenarios.
  
- **Model Organisms:** Creating model organisms of misalignment to improve empirical understanding of alignment failures.
  
- **Model Internals / Mechanistic Interpretability:** Advancing understanding of internal workings of large language models to enable targeted interventions and safety measures.
  
- **AI Welfare:** Improving understanding of potential AI welfare and developing related evaluations and mitigations.

For past projects, read about:

  
- Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data — Alex Cloud and Minh Le, et al., mentors including Samuel Marks and Owain Evans
  
- Open-source circuits — Michael Hanna and Mateusz Piotrowski with mentorship from Emmanuel Ameisen and Jack Lindsey

For a full list of representative projects for each area, see: Introducing the Anthropic Fellows Program for AI Safety Research, Recommendations for Technical AI Safety Research Directions.

## Unique candidate criteria

You might be a particularly great fit for this workstream if you:

  
- Are motivated by reducing catastrophic risks from advanced AI systems
  
- Have experience with empirical ML research projects
  
- Have experience working with large language models
  
- Have experience in one of the research areas mentioned above
  
- Have a track record of open-source contributions

## Logistics

**Logistics Requirements:** To participate in the Fellows program, you must have work authorization in the US, UK, or Canada and be located in that country during the program.

**Workspace Locations:** We have designated shared workspaces in London and Berkeley where fellows will work from and mentors will visit. **We are also open to remote fellows in the UK, US, or Canada**. We will ask you about your availability to work from Berkeley or London (full- or part-time) during the program.

**Visa Sponsorship:** We are not currently able to sponsor visas for fellows. To participate, you need to have or independently obtain full-time work authorization in the UK, the US, or Canada.

**Program Duration:** The program runs for 4 months, full-time. If you cannot commit to the full duration, please still apply and note your constraints in the application. We review these requests case-by-case.

**Please note:** We do not guarantee full-time offers to fellows. In previous cohorts, 25–50% of fellows received a full-time offer; many others have gone on to contribute to AI safety and security at other organizations.

Applications and interviews are managed by Constellation, our official recruiting partner. Constellation also runs the Berkeley workspace that hosts fellows. Clicking "Apply here" will redirect you to Constellation's application portal. You can expect to receive emails from Constellation with application updates.

## Apply here

**Apply here**

## Policies for full-time roles

The following policies apply to full-time roles (not to the Fellows Program). Minimum education, field of study, experience, location-based hybrid policy, and visa sponsorship are outlined below. This information does not apply to the Fellows Program.

## Logistics

**Minimum education:** Bachelor’s degree or equivalent

**Required field of study:** A field relevant to the role, demonstrated through coursework, training, or professional experience

**Minimum years of experience:** Experience aligned with internal job level requirements

**Location-based hybrid policy:** Most roles require in-office presence at least 25% of the time; some roles may require more.

**Visa sponsorship:** We sponsor visas where possible, but not for every role. If offered, we will pursue visa assistance with an immigration lawyer.

**We encourage you to apply even if you do not meet every single qualification.** Not all strong candidates will meet every qualification as listed. We value diverse perspectives and encourage applications from underrepresented groups. For security, Anthropic recruiters will contact you from @anthropic.com emails. If in doubt, visit anthropic.com/careers for confirmed openings.

## How we're different

We believe the highest-impact AI research is big science, conducted by a cohesive team focused on large-scale research efforts. We value impact and the long-term goals of steerable, trustworthy AI, viewing AI research as an empirical science with collaboration and communication as core skills.

Read about our research directions in recent work, including GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

## Come work with us!

Anthropic is a public-benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible hours, and a collaborative office space. Learn about our policy for candidate AI usage in the application process: AI usage guidance.

## More open roles at Anthropic

- [Anthropic Fellows Program — ML Systems & Performance](https://www.jorb.ai/jobs/69d8499d193675065559e149.md) — London, posted 12d ago
- [Anthropic Fellows Program — Reinforcement Learning](https://www.jorb.ai/jobs/69d8499d193675065559e14e.md) — London, posted 12d ago
- [Anthropic Fellows Program — AI Security](https://www.jorb.ai/jobs/69d8499d193675065559e14b.md) — London, posted 12d ago
- [Anthropic Fellows Program](https://www.jorb.ai/jobs/69d8499d193675065559e14c.md) — London, posted 12d ago
- [Emerging Account Executive, Startups](https://www.jorb.ai/jobs/69af8b969243cd01c9194896.md) — New York, posted 1mo ago

---

Updated: 2026-04-22
Canonical: https://www.jorb.ai/jobs/69d8499d193675065559e14d
