wf/wellsfargojobs logo
wf/wellsfargojobs4 days ago

Systems Operations Manager

On-site · Hyderabad, Telangana, India

Type
Full Time
Level
Senior Level
Education
Not Specified
Company size
Unknown

Job Summary

Systems Operations Manager leads L2 production support for critical applications and platforms with an SRE/AIOps mindset, focusing on reducing MTTR, improving uptime, and driving an automation-first approach. Responsibilities include incident, problem, and change management aligned with ITIL/SRE practices; RCA and permanent fixes; building self-service runbooks and knowledge bases; integrating AI/ML-driven insights, predictive monitoring, and anomaly detection; overseeing production releases and environment readiness; ensuring SLA/SLO adherence; and mentoring a operations team to improve reliability, scalability, and resilience.

Required Qualifications

  • Strong experience in production support, application operations, or platform management
  • Proven track record in reducing MTTR and improving system uptime
  • Experience with observability tools (monitoring, logging, tracing) and incident management platforms
  • Solid understanding of SRE principles (SLIs, SLOs, error budgets, reliability engineering)
  • Hands-on experience with automation tools (scripting, orchestration, CI/CD pipelines)
  • Familiarity with AI/automation in IT operations (AIOps) and self-healing systems
  • Knowledge of ITIL processes and service management frameworks
  • Strong analytical, troubleshooting, and problem-solving skills
  • Excellent leadership, communication, and stakeholder management abilities
  • Develop and promote self-service solutions (runbooks, knowledge bases, automated healing tools) for L1 teams and end users
  • Integrate AI/ML-driven insights, predictive monitoring, and anomaly detection into operations
  • Collaborate with engineering teams to improve system reliability, scalability, and resilience (SRE principles)
  • Oversee production releases, deployments, and environment readiness
  • Ensure SLA/SLO adherence and drive continuous service improvement initiatives
  • Lead, mentor, and develop operations teams with a focus on reliability engineering practices
Sorce

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started

wf/wellsfargojobs

Systems Operations Manager

Apply on Sorce