Manager, Incident Ops and Observability Job at F5, Seattle, WA

N1lwOHZHZmNhanZ4K0F6SDlqMThBbnVNWEE9PQ==
  • F5
  • Seattle, WA

Job Description

Site Reliability Engineering Manager

At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.

Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.

We are seeking a manager to help build our new Site Reliability Engineering team to strengthen operational excellence across the Infrastructure & Security and F5 Digital organization. This role will play an important part in Digital's incident management strategy, building out the Reliability Operations Center and monitoring capabilities and technologies required to help Digital understand problems before our users do.

The ideal candidate will bring deep expertise in incident lifecycle managementfrom detection and triage to resolution and post-mortemand will collaborate cross-functionally to drive continuous improvement in our security posture. This leader will operationalize a world-class incident management program while also defining and implementing the vision for observability across F5's hybrid infrastructure and cloud environments. This role requires strong leadership, technical acumen, and the ability to operate under pressure while maintaining clear communication with stakeholders at all levels.

Key Responsibilities:

  • Lead the global Incident Response (IR) program, optimizing processes across detection, triage, containment, remediation, and post-incident analysis.
  • Hire, mentor and train global team members on incident response best practices and observability tooling.
  • Serve as technical lead and head engineer for creation and management of monitoring tools and services to support F5 infrastructure and business systems.
  • Serve as the primary incident commander during major incidents, ensuring timely resolution, excellent communication, and stakeholder alignment.
  • Define and continuously refine incident response policies, procedures, and runbooks to ensure consistent and effective handling of incidents.
  • Drive improvements in detection, escalation, and resolution through automation, tooling, and process enhancements.
  • Define and report KPIs for service reliability, incident response, and observability maturity to senior leadership.
  • Conduct root cause analyses and lead post-incident reviews to identify lessons learned and prevent recurrence.
  • Design and lead cross-functional tabletop exercises to strengthen organizational preparedness, communication, and response coordination during major incidents.
  • Maintain detailed incident records and metrics to support auditing, compliance, and continuous improvement.
  • Collaborate with ServiceNow teams and architects to manage incidents.
  • Establish and maintain on-call rotations with teams who own critical applications across the Digital organization.

Qualifications:

  • 5+ years of experience in running NOC/SOC/SRE teams with a focus on monitoring and observability.
  • 10+ years managing incident response, IT service management, or a related field.
  • Proven track record of managing complex security incidents in cloud and hybrid environments.
  • Experience with SIEM, SOAR, and log analysis tools (e.g., Splunk, DataDog, Panther, Crowdstrike).
  • Experience with observability tools, especially tooling focused on synthetics, metrics, and infrastructure telemetry (e.g. Grafana, ThousandEyes, LogicMonitor, Pingdom, Zabbix).
  • Excellent communication skills with the ability to convey technical information to both technical and non-technical audiences.
  • Ability to lead under pressure, prioritize effectively, and make decisions in high-stakes situations.
  • Familiarity with AWS, Google Workspace, and common SaaS platforms.
  • Bachelor's degree in Computer Science, Cybersecurity, Information Systems, or related field (or equivalent experience).

Preferred Qualifications:

  • Experience working in infrastructure, IT, or security organizations.
  • Familiarity with tools such as Tableau, PowerBI, or other reporting/analytics platforms.
  • Comfortable navigating ambiguity, with a proactive approach to problem-solving.
  • Strong interest in scaling operations and driving impact in security-focused initiatives.

Job Tags

Night shift

Similar Jobs

HCA

Registered Nurse PRN Job at HCA

HCA Healthcare, a premier provider in high-quality, comprehensive healthcare services, is seeking a dedicated Registered Nurse (PRN) to join our dynamic team. In this role, you will be pivotal in delivering exceptional care to patients and supporting their recovery across... 

Loenbro

Electrical Project Manager Job at Loenbro

 ...Job Summary: The Electrical Project Manager position will initially focus on business and relationship development with a goal of developing a healthy, sustainable backlog. Oversee the design, fabrication | procurement and installation per the scope of supply and contract... 

TopView Group

Brand Ambassador Job at TopView Group

 ...tours. We own and manage the portfolio of brands, including TopView Sightseeing, Event...  ...Yorkers and visitors.About Our Brand Ambassador PositionAs a Brand Ambassador for TopView...  ...outgoing personality, with a passion for travel, tourism, and exploring new destinations... 

Greystar

Maintenance Technician - Ryder Junction Job at Greystar

ABOUT GREYSTAR Greystar is a leading, fully integrated global real estate platform offering expertise in property management, investment management, development, and construction services in institutional-quality rental housing. Headquartered in Charleston, South ...

Strategic Partners & Media

Junior Web Developer Job at Strategic Partners & Media

Are you a detail-oriented and tech-savvy individual with a passion for web development? Strategic Partners & Media is looking for a Junior Web Developer to join our team and support a variety of exciting digital projects. This role blends front-end development, website...