SRE (Site Reliability Engineering)
CodeValue
Software Engineering
Herzliya, Israel
Posted on Aug 31, 2025
Description
At CodeValue, we are looking for an SRE (Site Reliability Engineer) to join our Passive Systems team within the Retail Division. In this role, you will monitor, troubleshoot, and support mission-critical financial systems across multiple channels (web, mobile, and core banking). You will work closely with technical squads, support centers, and business stakeholders to ensure smooth system operations and high service availability.
Key Responsibilities
- Manage Tier-2 incidents and service requests (via Jira, Mitav, etc.), including prioritization and SLA compliance.
- Perform root cause analysis, provide first-level investigation, and ensure full resolution lifecycle tracking.
- Oversee continuous monitoring and control of systems using tools such as Splunk, Glassbox, and Dynatrace.
- Collaborate with squads to resolve cross-system issues and support new system rollouts.
- Build and maintain infrastructure for new and existing systems (certificates, servers, vaults, services, etc.).
- Maintain strong communication with support centers and provide professional service with high availability.
Requirements
Mandatory Requirements:
- At least 2-3 years of experience as an SRE.
- Experience with at least one of the following: Splunk, Dynatrace, Glassbox (Splunk or Dynatrace is a big advantage).
- Experience with at least one cloud platform: GCP, AWS, or Azure.
Advantages:
- Experience working with Kubernetes (K8s) or OpenShift.
- Knowledge of core architecture components (ESB, TOBE, Masila).
- Experience with certificates and server management.
- Familiarity with incident management platforms (Jira, Mitav).