Anthropic8 months ago

Software Engineer, Safeguards

Anthropic

$320,000–$485,000 year

Hybrid · New York City, New York, United States or San Francisco, California, United States

New York City, New York, United States or San Francisco, California, United StatesHybridFull Time$320,000–$485,000 yearSenior LevelBachelors DegreeAI ServicesStartup

Type

Full Time

Level

Senior Level

Education

Bachelors Degree

Company size

Startup

Industry

AI Services

Job Summary

Develop monitoring systems to detect unwanted behaviors from API partners and potentially take automated enforcement actions; surface these in internal dashboards to analysts for manual review. Build abuse detection mechanisms and infrastructure. Surface abuse patterns to research teams to harden models at the training stage. Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale. Strong candidates may have experience building trust and safety detection mechanisms for AI/ML systems, experience with prompt engineering, jailbreak attacks, and working with operational teams to build internal tooling. Must have a Bachelor’s degree and 5-10+ years of software engineering experience, proficiency in Python and TypeScript, and excellent cross-stack and communication skills.

Required Qualifications

Bachelor’s degree in Computer Science, Software Engineering or comparable experience
5-10+ years of experience in a software engineering position
Proficiency in Python and Typescript
Ability to work across the stack
Strong communication skills and ability to explain complex technical concepts to non-technical stakeholders

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started