Remote Otter LogoRemoteOtter

Research Scientist, Interpretability - Remote

Posted 22 weeks ago
Software Development
Full Time
United States
$315,000 - $560,000/year

Overview

Anthropic is seeking researchers and engineers to join the Interpretability team, focusing on mechanistic interpretability of neural networks to ensure AI systems are safe and beneficial.

In Short

  • Develop methods for understanding LLMs by reverse engineering algorithms.
  • Design and run robust experiments in various scenarios.
  • Build infrastructure for experiments and visualizing results.
  • Collaborate with colleagues to communicate results.
  • Strong track record in scientific research is preferred.
  • Enjoy team science and collaborative discoveries.
  • Comfortable with experimental science and coding.
  • Ability to articulate and discuss research motivations.
  • Familiarity with Python is required.
  • Visa sponsorship available for qualified candidates.

Requirements

  • Experience in scientific research, particularly in interpretability.
  • Team-oriented mindset for collaborative work.
  • Comfort with messy experimental science.
  • Ability to write code and interpret results.
  • Strong communication skills.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.
Anthropic logo

Anthropic

Anthropic is a forward-thinking company focused on developing AI assistants that prioritize being helpful, harmless, and honest. With a commitment to ensuring the safe and ethical use of AI technologies, Anthropic's Trust and Safety (T&S) team plays a crucial role in protecting users from the potential risks associated with powerful AI systems. The company emphasizes collaboration across research, product, and engineering teams to create robust safety measures and tools that mitigate deployment risks. Anthropic is dedicated to advancing frontier AI models responsibly, making it a leader in the AI landscape.

Share This Job!

Save This Job!

Similar Jobs:

Anthropic logo

Research Engineer, Interpretability - Remote

Anthropic

22 weeks ago

Join Anthropic's Interpretability team to work on mechanistic interpretability of AI models, ensuring their safety and reliability.

USA
Full-time
Software Development
$315,000 - $560,000/year
Montana Tech logo

Research Scientist I - Remote

Montana Tech

14 weeks ago

Montana Technological University is seeking a Research Scientist I to manage research projects and mentor students in Metallurgical and Materials Engineering.

MT, USA
Full-time
All others
Sensei Ag logo

Research Scientist - Remote

Sensei Ag

14 weeks ago

Join Sensei Ag as a Research Scientist to lead innovative research in plant biology and nutrition.

CA, USA
Full-time
All others
$100,000 - $140,000/year
Constructive Dialogue Institute logo

Research Scientist - Remote

Constructive Dialogue Institute

16 weeks ago

Join CDI as a Research Scientist to lead evaluations and advance research in higher education.

NY, USA
Full-time
Data Analysis
90000 - 105000/year

Join Oura as a Research Scientist to drive scientific innovation and analyze complex health data.

CA, USA
Full-time
Data Analysis