Staff Platform Engineer
$228,000–$290,000 year
Remote · New York City, New York, United States or San Francisco, California, United States
Job Summary
Staff Platform Engineer role focused on scaling Abridge’s cloud infrastructure, developer platform, and operational maturity. You’ll design, build, and evolve cloud infrastructure platforms (including networking, IAM, Kubernetes, databases, streaming and pubsub, storage, and observability), lead multi-tenant, multi-region, multi-cloud architecture with strong reliability and security boundaries, and develop build pipelines, release tooling, and self-service platforms. Responsibilities include secure-by-default practices (least-privilege access, auditing, policy enforcement), reusable Terraform abstractions, and scalable application testing pipelines; you’ll drive observability, incident response, and SLO-driven reliability, and collaborate with security and compliance teams to balance governance with developer flexibility. The role emphasizes hands-on implementation and strategic platform direction, enabling high engineering velocity in a fast-moving, AI-enabled startup environment.
Required Qualifications
- 10+ years of software and infrastructure engineering experience
- Experience operating infrastructure-as-code platforms in cloud-first organizations
- Experience designing and operating large-scale Kubernetes platforms and scaling compute services on Kubernetes
- Experience with ArgoCD, Argo Rollouts, Istio, and related cloud-native technologies
- Experience designing and maintaining CI/CD systems for infrastructure-as-code deployments and application delivery workflows
- Experience building scalable infrastructure-as-code platforms using Terraform and related tooling
- Experience with monitoring and observability tooling (metrics, logs, traces) and platforms like Grafana, Datadog, Honeycomb
- Experience securing services in Google Cloud Platform as infrastructure-as-code (GCP Projects, VPC, GKE, IAM)
- Experience designing secure-by-default infrastructure including least-privilege access controls and auditability
- Strong operational instincts, incident response, and reliability improvement through automation
- Ability to balance developer experience, platform governance, and operational reliability in fast-growing environments
- Experience with backend languages (Python, Go, Node, Rust)
- Willingness to be hands-on while driving platform direction and architectural decisions
- Curiosity and proactive interest in AI and emerging technologies
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.