Anthropic logo
Anthropic8 months ago

Software Engineer, Safeguards

$320,000–$485,000 year

Hybrid · New York City, New York, United States or San Francisco, California, United States

Type
Full Time
Level
Senior Level
Education
Bachelors Degree
Company size
Startup
Industry
AI Services

Job Summary

Develop monitoring systems to detect unwanted behaviors from API partners and potentially take automated enforcement actions; surface these in internal dashboards to analysts for manual review. Build abuse detection mechanisms and infrastructure. Surface abuse patterns to research teams to harden models at the training stage. Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale. Strong candidates may have experience building trust and safety detection mechanisms for AI/ML systems, experience with prompt engineering, jailbreak attacks, and working with operational teams to build internal tooling. Must have a Bachelor’s degree and 5-10+ years of software engineering experience, proficiency in Python and TypeScript, and excellent cross-stack and communication skills.

Required Qualifications

  • Bachelor’s degree in Computer Science, Software Engineering or comparable experience
  • 5-10+ years of experience in a software engineering position
  • Proficiency in Python and Typescript
  • Ability to work across the stack
  • Strong communication skills and ability to explain complex technical concepts to non-technical stakeholders
Sorce

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started

$320k – $485k / yr

Software Engineer, Safeguards · Anthropic

Apply on Sorce