Remote Otter LogoRemoteOtter

Software Engineer, Trust & Safety - Remote

Posted 15 weeks ago
Software Development
Full Time
United States
$300,000 - $405,000/year

Overview

Anthropic is seeking software engineers to develop safety and oversight mechanisms for AI systems, focusing on monitoring models and preventing misuse.

In Short

  • Develop monitoring systems for API partners.
  • Build abuse detection mechanisms.
  • Surface abuse patterns to research teams.
  • Implement multi-layered defenses for safety mechanisms.
  • Analyze user reports of inappropriate content.
  • Collaborate with teams to enhance model integrity.
  • Utilize SQL and Python for data analysis.
  • Communicate technical concepts to non-technical stakeholders.
  • Work in a hybrid office environment.
  • Visa sponsorship available for qualified candidates.

Requirements

  • Bachelor’s degree in Computer Science or related field.
  • 3-10+ years of software engineering experience.
  • Proficiency in SQL and Python.
  • Experience in integrity, spam, fraud, or abuse detection.
  • Strong communication skills.
  • Experience with machine learning frameworks is a plus.
  • Familiarity with adversarial inputs and prompt engineering.
  • Ability to work collaboratively in a team environment.
  • Interest in AI safety and ethical implications.
  • Willingness to apply even if not meeting all qualifications.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.
Anthropic logo

Anthropic

Anthropic is a forward-thinking company focused on developing AI assistants that prioritize being helpful, harmless, and honest. With a commitment to ensuring the safe and ethical use of AI technologies, Anthropic's Trust and Safety (T&S) team plays a crucial role in protecting users from the potential risks associated with powerful AI systems. The company emphasizes collaboration across research, product, and engineering teams to create robust safety measures and tools that mitigate deployment risks. Anthropic is dedicated to advancing frontier AI models responsibly, making it a leader in the AI landscape.

Share This Job!

Save This Job!

Similar Jobs:

Cloudflare logo

Software Engineer - Trust & Safety Engineering - Remote

Cloudflare

10 weeks ago

Join Cloudflare as a Software Engineer on the Trust & Safety Engineering team, focusing on building tools and services to combat online abuse.

USA
Full-time
Software Development
Wikimedia logo

Software Engineer (Trust & Safety Team) - Remote

Wikimedia

21 weeks ago

Join the Wikimedia Foundation as a Software Engineer to enhance user safety and privacy on their platforms.

Worldwide
Full-time
Software Development
US$88,975 - US$139,056/year
Wikimedia logo

Software Engineer (Trust & Safety Team) - Remote

Wikimedia

21 weeks ago

Join the Wikimedia Foundation as a Software Engineer to enhance user safety and privacy on their platforms.

Worldwide
Full-time
Software Development
US$88,975 - US$139,056/year
Quizlet logo

Senior Software Engineer, Trust & Safety - Remote

Quizlet

12 weeks ago

Join Quizlet as a Senior Software Engineer to enhance user safety and trust on the platform.

WA, USA
Full-time
Software Development
$161,000 - $210,000/year
Anthropic logo

Software Engineer, Trust & Safety (London) - Remote

Anthropic

15 weeks ago

Join Anthropic as a software engineer to develop safety mechanisms for AI systems.

United Kingdom
Full-time
Software Development
£240,000 - £325,000/year