Software Engineer, Trust & Safety - Remote

Posted 77 weeks ago

Software Development

Full Time

United States

$300,000 - $405,000/year

Software Engineering

Safety Mechanisms

Abuse Detection

Machine Learning

Overview

Anthropic is seeking software engineers to develop safety and oversight mechanisms for AI systems, focusing on monitoring models and preventing misuse.

In Short

Develop monitoring systems for API partners.
Build abuse detection mechanisms.
Surface abuse patterns to research teams.
Implement multi-layered defenses for safety mechanisms.
Analyze user reports of inappropriate content.
Collaborate with teams to enhance model integrity.
Utilize SQL and Python for data analysis.
Communicate technical concepts to non-technical stakeholders.
Work in a hybrid office environment.
Visa sponsorship available for qualified candidates.

Requirements

Bachelor’s degree in Computer Science or related field.
3-10+ years of software engineering experience.
Proficiency in SQL and Python.
Experience in integrity, spam, fraud, or abuse detection.
Strong communication skills.
Experience with machine learning frameworks is a plus.
Familiarity with adversarial inputs and prompt engineering.
Ability to work collaboratively in a team environment.
Interest in AI safety and ethical implications.
Willingness to apply even if not meeting all qualifications.

Benefits

Competitive compensation and benefits.
Optional equity donation matching.
Generous vacation and parental leave.
Flexible working hours.
Collaborative office space.

Anthropic

Anthropic is a forward-thinking company focused on developing AI assistants that prioritize being helpful, harmless, and honest. With a commitment to ensuring the safe and ethical use of AI technologies, Anthropic's Trust and Safety (T&S) team plays a crucial role in protecting users from the potential risks associated with powerful AI systems. The company emphasizes collaboration across research, product, and engineering teams to create robust safety measures and tools that mitigate deployment risks. Anthropic is dedicated to advancing frontier AI models responsibly, making it a leader in the AI landscape.

Share This Job!

Save This Job!

Jobs from Anthropic:

Recruiting Coordinator (Contract)

Candidate Experience

Security GRC Specialist, Public Sector

Software Engineer, Networking

Network Engineering

Software Development

Security GRC Specialist

Security Compliance

Software Engineer, Employee Acceleration Tools

Full-stack Development

Anthropic

Anthropic is a forward-thinking company focused on developing AI assistants that prioritize being helpful, harmless, and honest. With a commitment to ensuring the safe and ethical use of AI technologies, Anthropic's Trust and Safety (T&S) team plays a crucial role in protecting users from the potential risks associated with powerful AI systems. The company emphasizes collaboration across research, product, and engineering teams to create robust safety measures and tools that mitigate deployment risks. Anthropic is dedicated to advancing frontier AI models responsibly, making it a leader in the AI landscape.

Share This Job!

Save This Job!

Jobs from Anthropic:

Recruiting Coordinator (Contract)

Candidate Experience

Security GRC Specialist, Public Sector

Software Engineer, Networking

Network Engineering

Software Development

Security GRC Specialist

Security Compliance

Software Engineer, Employee Acceleration Tools

Full-stack Development

Similar Jobs:

Software Engineer - Trust & Safety Engineering - Remote

Cloudflare

72 weeks ago

Cloudflare

Join Cloudflare as a Software Engineer on the Trust & Safety Engineering team, focusing on building tools and services to combat online abuse.

USA

Full-time

Software Development

72 weeks ago

Software Engineer (Trust & Safety Team) - Remote

Wikimedia

82 weeks ago

Wikimedia

Join the Wikimedia Foundation as a Software Engineer to enhance user safety and privacy on their platforms.

Worldwide

Full-time

Software Development

US$88,975 - US$139,056/year

82 weeks ago

Software Engineer (Trust & Safety Team) - Remote

Wikimedia

82 weeks ago

Wikimedia

Software Engineering

Join the Wikimedia Foundation as a Software Engineer to enhance user safety and privacy on their platforms.

Software Engineering

Worldwide

Full-time

Software Development

US$88,975 - US$139,056/year

82 weeks ago

Senior Software Engineer, Trust & Safety - Remote

Quizlet

74 weeks ago

Quizlet

Software Engineering

Backend Development

Full-stack Development

Content Moderation

Join Quizlet as a Senior Software Engineer to enhance user safety and trust on the platform.

Software Engineering

Backend Development

Full-stack Development

Content Moderation

WA, USA

Full-time

Software Development

$161,000 - $210,000/year

74 weeks ago

Software Engineer, Trust & Safety (London) - Remote

Anthropic

77 weeks ago

Anthropic

Software Engineering

Safety Mechanisms

Abuse Detection

Join Anthropic as a software engineer to develop safety mechanisms for AI systems.

Software Engineering

Safety Mechanisms

Abuse Detection

United Kingdom

Full-time

Software Development

£240,000 - £325,000/year

77 weeks ago