Technical Operations Lead
$140,000–$240,000 year
Hybrid · New York City, New York, United States
Job Summary
Lead transformational projects with focus on Application Reliability Engineering (AppRE), ensuring long-term rollouts, incident resolution, and cross-team coordination. Drive productivity, automation, and monitoring improvements; maintain a clear AppRE roadmap; execute technical POCs, create comprehensive runbooks and documentation for handover; ensure tickets drive development work or are closed in collaboration with the AI team; contribute to redesigned Monitoring & Alerting with real-time alerting and reduced noise; centralize knowledge via AI-assisted runbooks and ensure searchable, consistent AppRE documentation and onboarding guides; reduce silos by linking AppRE initiatives with other teams’ work. Requires 5+ years in a leadership/senior AppRE or related operations capacity, strong analytical mindset, excellent English communication, and hands-on experience with monitoring, incident management, automation, and collaboration with AI/ML teams. Preferred: familiarity with Jira/Confluence, scripting (Python, Bash), Kafka/real-time data streaming, and enterprise scheduling tools. Hybrid work arrangement in NYC area with global presence and compensation in the $140k-$240k range plus discretionary bonus.
Required Qualifications
- At least 5 years of relevant experience in a leadership or senior role within Application Reliability Engineering, or a related Operations/Infrastructure capacity
- Proven experience in implementing technical POCs for transformational projects
- Highly analytical mindset with ability to influence senior stakeholders and drive complex technical decisions
- Highly organized with excellent time management skills and experience managing project roadmaps
- Excellent written and verbal communication in English with experience presenting technical concepts to cross-functional groups
- Strong knowledge of application monitoring, alerting design, and incident management best practices
- Strong networking skills within the industry and ability to build relationships with key contacts
- Financial IT sector or high-frequency trading industry experience preferable
- Knowledge of Microsoft Project, Visio, Jira and Confluence
- Hands-on experience with scripting languages (e.g., Python, Bash/Shell) for automation and operational tasks
- Familiarity with enterprise-level scheduling tools
- Experience leveraging AI/ML agents or tooling to enhance operational efficiency (e.g., automated root cause analysis, predictive alerting)
- Experience leading a regional or global function for a technology team
- Experience working with AI/ML teams to integrate advanced data analysis into operations or reliability practices (e.g., anomaly detection, predictive maintenance)
- Knowledge of real-time data streaming technologies (e.g., Kafka, Red Panda)
- Anticipated annual base salary range $140,000 - 240,000, plus eligible for discretionary bonus
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.