Software Engineer, Trust & Safety (London) - Remote

Posted 81 weeks ago

Software Development

Full Time

United Kingdom

£240,000 - £325,000/year

Software Engineering

Safety Mechanisms

Abuse Detection

Machine Learning

Overview

Anthropic is seeking software engineers to develop safety and oversight mechanisms for AI systems, focusing on monitoring models and preventing misuse.

In Short

Develop monitoring systems for API partners.
Build abuse detection mechanisms.
Surface abuse patterns to research teams.
Implement multi-layered defenses for safety mechanisms.
Analyze user reports of inappropriate content.
3-8+ years of software engineering experience required.
Proficiency in SQL, Python, and data analysis tools.
Strong communication skills needed.
Experience with machine learning frameworks is a plus.
Visa sponsorship available.

Requirements

Bachelor’s degree in Computer Science or related field.
Experience in integrity, spam, fraud, or abuse detection.
Ability to explain technical concepts to non-technical stakeholders.
Experience with trust and safety mechanisms for AI/ML systems.
Familiarity with prompt engineering and adversarial inputs.
Experience building custom internal tooling.

Benefits

Competitive compensation and benefits.
Optional equity donation matching.
Generous vacation and parental leave.
Flexible working hours.
Collaborative office space.

Anthropic

Anthropic is a forward-thinking company focused on developing AI assistants that prioritize being helpful, harmless, and honest. With a commitment to ensuring the safe and ethical use of AI technologies, Anthropic's Trust and Safety (T&S) team plays a crucial role in protecting users from the potential risks associated with powerful AI systems. The company emphasizes collaboration across research, product, and engineering teams to create robust safety measures and tools that mitigate deployment risks. Anthropic is dedicated to advancing frontier AI models responsibly, making it a leader in the AI landscape.

Jobs from Anthropic:

Recruiting Coordinator (Contract)

Candidate Experience

Security GRC Specialist, Public Sector

Software Engineer, Networking

Network Engineering

Software Development

Security GRC Specialist

Security Compliance

Software Engineer, Employee Acceleration Tools

Full-stack Development

Similar Jobs:

Software Engineer, Trust & Safety - Remote

Anthropic

81 weeks ago

Software Engineering

Safety Mechanisms

Abuse Detection

Join Anthropic as a software engineer to develop safety mechanisms for AI systems.

United States

Full-time

Software Development

$300,000 - $405,000/year

Software Engineer - Trust & Safety Engineering - Remote

Cloudflare

76 weeks ago

Join Cloudflare as a Software Engineer on the Trust & Safety Engineering team, focusing on building tools and services to combat online abuse.

USA

Full-time

Software Development

Software Engineer (Trust & Safety Team) - Remote

Wikimedia

86 weeks ago

Join the Wikimedia Foundation as a Software Engineer to enhance user safety and privacy on their platforms.

Worldwide

Full-time

Software Development

US$88,975 - US$139,056/year

Senior Software Engineer, Trust & Safety - Remote

Quizlet

78 weeks ago

Software Engineering

Backend Development

Full-stack Development

Content Moderation

Join Quizlet as a Senior Software Engineer to enhance user safety and trust on the platform.

WA, USA

Full-time

Software Development

$161,000 - $210,000/year
