עדיין מחפשים עבודה במנועי חיפוש? הגיע הזמן להשתדרג!

במקום לעבור לבד על אלפי מודעות, Jobify מנתחת את קורות החיים שלך ומציגה לך רק משרות שבאמת מתאימות לך.

מעל 80,000 משרות • 4,000 חדשות ביום
חינם. בלי פרסומות. בלי אותיות קטנות.

חברה בתחום גרפיקה ועיצוב

רמת גן

חברה בתחום גרפיקה ועיצוב

רמת גן
מלאה
35,000-55,000 ₪ הערכה מבוססת AI ולא שכר שהתקבל מהמעסיק
הערכה מבוססת AI ולא שכר של המעסיק

seeking a Site Reliability Engineer who excels at bridging the gap between infrastructure and development. In this role, you will work closely with engineering teams to ensure the reliability, scalability, and performance of our systems. A strong emphasis will be placed on observability - designing and implementing effective monitoring, logging, tracing and alerting solutions to provide deep visibility into system behavior. You should be comfortable collaborating with developers, presenting technical insights, and helping shape best practices. Your responsibilities will include incident management, automation and improvement of our observability solutions, and continuous performance tuning to ensure our platform can scale and evolve with our business needs.

Role:
Ensure production systems meet or exceed established SLAs and SLOs by actively maintaining and enhancing system performance and uptime.
Design and maintain end-to-end observability systemsincluding monitoring, logging, and distributed tracingto detect anomalies and enable proactive issue resolution.
Work closely with engineering teams to improve how their applications are monitored and alerted on. Help define meaningful alerts, reduce noise, and ensure developers are accountable for the operational health of their services.
Optimize application performance on Kubernetes through resource tuning, scaling strategies, and deep performance analysis.
* Provide guidance on reliability-first design, instrumenting code for observability, and using Grafana dashboards to drive decision-making and incident response.

Requirements:
5+ years in SRE, DevOps, or Production Engineering roles
Deep expertise in AWS, Kubernetes, Linux
Being responsible of deploying and tuning monitoring tools like Prometheus, Thanos and any time-series databases for storing metrics.
Logging responsibilities with ELK stack, Loki, Grafana or any alternatives.
Experience with tracing opentelemetry, tempo, jaeger
Strong understanding of incident management processes and best practices.
Experience with automation tools and practices for deployment and infrastructure management.
Excellent communication and collaboration skills, with the ability to work effectively in a team environment.
Ownership mindset, proactive and reliable

This position is open to all candidates.

לכל המשרות של DevOps SRE Engineer

עדיין מחפשים עבודה במנועי חיפוש? הגיע הזמן להשתדרג!

DevOps SRE Engineer

DevOps SRE Engineer

הכשרות רלוונטיות

מהנדסי DevOps

DevOps

DEVOPS ENGINEER קורס

מהנדס DevOps