RemoteOtter

Software Engineer, Trust & Safety (London) - Remote

Posted 15 weeks ago
Software Development
Full Time
United Kingdom
£240,000 - £325,000/year

Overview

Anthropic is seeking software engineers to develop safety and oversight mechanisms for AI systems, focusing on monitoring models and preventing misuse.

In Short

  • Develop monitoring systems for API partners.
  • Build abuse detection mechanisms.
  • Surface abuse patterns to research teams.
  • Implement multi-layered defenses for safety mechanisms.
  • Analyze user reports of inappropriate content.
  • 3-8+ years of software engineering experience required.
  • Proficiency in SQL, Python, and data analysis tools.
  • Strong communication skills needed.
  • Experience with machine learning frameworks is a plus.
  • Visa sponsorship available.

Requirements

  • Bachelor’s degree in Computer Science or related field.
  • Experience in integrity, spam, fraud, or abuse detection.
  • Ability to explain technical concepts to non-technical stakeholders.
  • Experience with trust and safety mechanisms for AI/ML systems.
  • Familiarity with prompt engineering and adversarial inputs.
  • Experience building custom internal tooling.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.

Anthropic

Anthropic is a forward-thinking company focused on developing AI assistants that prioritize being helpful, harmless, and honest. With a commitment to ensuring the safe and ethical use of AI technologies, Anthropic's Trust and Safety (T&S) team plays a crucial role in protecting users from the potential risks associated with powerful AI systems. The company emphasizes collaboration across research, product, and engineering teams to create robust safety measures and tools that mitigate deployment risks. Anthropic is dedicated to advancing frontier AI models responsibly, making it a leader in the AI landscape.

Similar Jobs:

Software Engineer, Trust & Safety - Remote

Anthropic

15 weeks ago

Join Anthropic as a software engineer to develop safety mechanisms for AI systems.

United States
Full-time
Software Development
$300,000 - $405,000/year

Software Engineer - Trust & Safety Engineering - Remote

Cloudflare

10 weeks ago

Join Cloudflare as a Software Engineer on the Trust & Safety Engineering team, focusing on building tools and services to combat online abuse.

USA
Full-time
Software Development

Software Engineer (Trust & Safety Team) - Remote

Wikimedia

21 weeks ago

Join the Wikimedia Foundation as a Software Engineer to enhance user safety and privacy on their platforms.

Worldwide
Full-time
Software Development
US$88,975 - US$139,056/year

Senior Software Engineer, Trust & Safety - Remote

Quizlet

12 weeks ago

Join Quizlet as a Senior Software Engineer to enhance user safety and trust on the platform.

WA, USA
Full-time
Software Development
$161,000 - $210,000/year