Research Scientist, Interpretability - Remote

Posted 77 weeks ago

Software Development

Full Time

United States

$315,000 - $560,000/year

Mechanistic Interpretability

Reverse Engineering

Large Language Models (LLMs)

Overview

Anthropic is seeking researchers and engineers to join the Interpretability team, focusing on mechanistic interpretability of neural networks to ensure AI systems are safe and beneficial.

In Short

Develop methods for understanding LLMs by reverse engineering algorithms.
Design and run robust experiments in various scenarios.
Build infrastructure for experiments and visualizing results.
Collaborate with colleagues to communicate results.
Strong track record in scientific research is preferred.
Enjoy team science and collaborative discoveries.
Comfortable with experimental science and coding.
Ability to articulate and discuss research motivations.
Familiarity with Python is required.
Visa sponsorship available for qualified candidates.

Requirements

Experience in scientific research, particularly in interpretability.
Team-oriented mindset for collaborative work.
Comfort with messy experimental science.
Ability to write code and interpret results.
Strong communication skills.

Benefits

Competitive compensation and benefits.
Optional equity donation matching.
Generous vacation and parental leave.
Flexible working hours.
Collaborative office space.

Anthropic

Anthropic is a forward-thinking company focused on developing AI assistants that prioritize being helpful, harmless, and honest. With a commitment to ensuring the safe and ethical use of AI technologies, Anthropic's Trust and Safety (T&S) team plays a crucial role in protecting users from the potential risks associated with powerful AI systems. The company emphasizes collaboration across research, product, and engineering teams to create robust safety measures and tools that mitigate deployment risks. Anthropic is dedicated to advancing frontier AI models responsibly, making it a leader in the AI landscape.

Share This Job!

Save This Job!

Jobs from Anthropic:

Recruiting Coordinator (Contract)

Recruiting

Scheduling

Candidate Experience

Security GRC Specialist, Public Sector

GRC

FedRAMP

NIST 800-53

Software Engineer, Networking

Network Engineering

Software Development

Python

Security GRC Specialist

GRC

Security Compliance

AI Safety

Software Engineer, Employee Acceleration Tools

Full-stack Development

React

TypeScript

Anthropic

Share This Job!

Save This Job!

Jobs from Anthropic:

Recruiting Coordinator (Contract)

Recruiting

Scheduling

Candidate Experience

Security GRC Specialist, Public Sector

GRC

FedRAMP

NIST 800-53

Software Engineer, Networking

Network Engineering

Software Development

Python

Security GRC Specialist

GRC

Security Compliance

AI Safety

Software Engineer, Employee Acceleration Tools

Full-stack Development

React

TypeScript

Similar Jobs:

Research Engineer, Interpretability - Remote

Anthropic

77 weeks ago

Anthropic

Python

Rust

Java

Join Anthropic's Interpretability team to work on mechanistic interpretability of AI models, ensuring their safety and reliability.

Python

Rust

Java

USA

Full-time

Software Development

$315,000 - $560,000/year

77 weeks ago

Research Scientist I - Remote

Montana Tech

69 weeks ago

Montana Tech

Metallurgical Engineering

Materials Science

Research Management

Project Management

Montana Technological University is seeking a Research Scientist I to manage research projects and mentor students in Metallurgical and Materials Engineering.

Metallurgical Engineering

Materials Science

Research Management

Project Management

MT, USA

Full-time

All others

69 weeks ago