Mploy - Jobs

Observability & Monitoring Engineer jobs in Tel Aviv

Observability & Monitoring Engineer

 

23/07/2025

 Tel Aviv

 Kela Technologies

We’re looking for a hands-on Observability & Monitoring Engineer who will own the visibility, health, and reliability of Kela’s production environments. This role is all about designing, building, and maintaining Kela’s monitoring and alerting stack, enabling the organization to detect, respond to, and prevent issues before they impact our customers.

Observability & Monitoring Stack Ownership:

  • Design, implement, and maintain our observability stack, covering metrics, logs, and traces across all production sites and components (physical and software layers).
  • Work with tools like Prometheus, Grafana, ELK, or similar.
  • Ensure clear, accessible dashboards for both internal and customer-facing stakeholders, showing site/component health, uptime, and anomalies.
  • Define and continuously tune alert thresholds, escalation paths, and severity levels.
  • Reduce alert noise and focus on actionable signals.
  • Ensure all critical services have effective monitoring coverage (availability, performance, resource usage, errors, etc.).
  • Partner with R&D to define, request, and validate telemetry data: logs, custom metrics, traces, etc.
  • Advocate for observability best practices in product and feature development.
  • Influence logging and monitoring standards across engineering teams.
  • Automate routine monitoring tasks, health checks, and anomaly detection scripts.
  • Drive self-healing initiatives: build tools and automation for faster incident mitigation.
  • Create and maintain runbooks for alert response, ensuring that Support and Delivery teams have clear operational guidance.

  • Contribute to incident post-mortems and help drive continuous improvement based on lessons learned.
  • Participate in incident response and on-call rotations (where applicable).
  • Support real-time production incident bridges with data-driven analysis from the observability stack.

Must Have:

  • 5+ years of experience in Observability, SRE, Production Engineering, or DevOps roles focused on monitoring and system reliability.
  • Deep hands-on experience with monitoring tools like Datadog, Prometheus, Grafana, ELK, or equivalents.
  • Strong experience with Linux systems and networking basics (HTTP, DNS, firewalls, proxies).
  • Experience with Kubernetes, microservices architecture, or multi-cluster environments.
  • Background working in hybrid environments (on-prem + cloud).
  • Prior experience implementing self-healing automation in production environments.
  • Proven track record in alerting design, threshold tuning, and incident detection at scale.
  • Experience with log pipelines, metrics collection frameworks, and distributed tracing tools.
  • Solid scripting and automation skills (Python, Bash, etc.).
  • Experience participating in incident response processes, on-call rotations, and root cause analysis.
  • Excellent communication skills - able to explain system status and health metrics to both engineers and non-technical stakeholders.

