Escalation Team Leader

עדיין מחפשים עבודה במנועי חיפוש? הגיע הזמן להשתדרג!

במקום לעבור לבד על אלפי מודעות, Jobify מנתחת את קורות החיים שלך ומציגה לך רק משרות שבאמת מתאימות לך.

מעל 80,000 משרות • 4,000 חדשות ביום
חינם. בלי פרסומות. בלי אותיות קטנות.

Cyberint, a Check Point Company

תל אביב - יפו

Cyberint, a Check Point Company

תל אביב - יפו
מלאה, היברידית
25,000-35,000 ₪ הערכה מבוססת AI ולא שכר שהתקבל מהמעסיק
הערכה מבוססת AI ולא שכר של המעסיק

We are looking for a technically strong and AI-savvy Escalation & Reliability Manager to own production reliability, incident management, and cross-functional prioritization. This role leads our AI-driven automation strategy, drives self-healing infrastructure development, and sets a new standard for modern reliability engineering.

Responsibilities

:Own production incidents and escalations end-to-end — from mitigation to RCA to corrective action
.Lead the design and development of self-healing systems capable of detecting, diagnosing, and remediating incidents au3-5 years in SRE or Incident Management
.Mandatory: Hands-on experience applied to operational challenges (AIOps, anomaly detection, LLM-based automation, or auto-remediation)
.Proven track record of automating workflows and reducing manual toil at scale
.Strong cloud background (AWS/Azure/GCP) and experience with Kubernetes, Docker, and CI/CD
.Proficiency with observability tools (Grafana, Prometheus, ELK) and scripting (Python, Bash)
.Demonstrated leadership in high-pressure, cross-functional environments
.tenuously
.Drive automation of repetitive operational workflows using AI/ML-based solutions to reduce toil and MTTR
.Lead and mentor the SRE team; improve monitoring, alerting, and observability
.Manage the cross-functional Squad handling customer and production issues; align priorities across Support, QA, R&D, and Sources
.Track key operational metrics and lead long-term reliability improvements

.
Desired Backgroun

d:3-5 years in SRE or Incident Managemen
t.Mandatory: Hands-on experience applied to operational challenges (AIOps, anomaly detection, LLM-based automation, or auto-remediation
).Proven track record of automating workflows and reducing manual toil at scal
e.Strong cloud background (AWS/Azure/GCP) and experience with Kubernetes, Docker, and CI/C
D.Proficiency with observability tools (Grafana, Prometheus, ELK) and scripting (Python, Bash
).Demonstrated leadership in high-pressure, cross-functional environment

s.
Advanta

gesBackground in cybersecurity or SaaS platfor
ms.Experience with LLMOps, AI agents, or orchestration platforms (e.g., n8n, Tempora

l).
Key Attrib

utesStrong ownership, accountability, and composure under press
ure.Passionate about leveraging AI to automate workflows, reduce toil, and accelerate incident resolut
ion.Visionary about self-healing operations — able to both define the strategy and drive its implementat
ion.Collaborative leader with the ability to align cross-functional stakehold
ers.Technically hands-on systems-level thinker with the drive to engineer scalable, long-term soluti

ons.