Site Reliability Engineer
Radware
This job is no longer accepting applications
Radware is a global leader in cybersecurity and application delivery solutions for physical, cloud, and software-defined data centers.
At Radware, we live and breathe cybersecurity. It is our passion. Each day, our international team works to earn the trust of more than 12,500 organizations around the globe. Keeping them safe is our mission. To that end, we go head-to-head with politically motivated hacktivists, dangerous nation-state threat actors, and other notorious cyber attackers. These are not your average adversaries. Backed by nearly 30 years of experience, Radware is best known for its technical excellence and innovative network and application security solutions. That is why it is so important that we build our team with bold and bright talent.
What is the job:
We are looking for an experienced Site Reliability Engineer to drive the reliability, observability, and automation practices across our private cloud infrastructure and operations. In this role, you will work within a team of site reliability engineers, own the engineering roadmap for monitoring and automation, and act as a key liaison between development, operations, and platform teams. You bring at least 4 years of hands-on people management experience and a deep technical background in SRE or DevOps disciplines.
What will you do?
Automation & Infrastructure
- Design, develop, and maintain automation tools to support infrastructure and operations teams at scale.
- Manage pipelines and infrastructure workflows using Jenkins, Ansible, Python, and Bash.
- Drive the adoption of infrastructure-as-code practices across the organization.
- Collaborate with system engineers to improve scalability, performance, and fault tolerance of critical systems.
Monitoring & Observability
- Build and extend monitoring and alerting systems using Grafana, the ELK (Elastic) stack, Zabbix, and custom scripts.
- Implement and enforce observability best practices to ensure full visibility into systems, applications, and infrastructure.
- Define and track SLIs, SLOs, and error budgets across key services.
- Partner with development teams to embed observability earlier in the software development lifecycle.
Database & Platform Support
- Support monitoring and infrastructure integration for databases including MongoDB and PostgreSQL.
- Maintain documentation and champion knowledge sharing around automation, monitoring, and reliability practices.
What you need:
4+ years of overall experience in SRE, DevOps, or infrastructure automation roles.
Strong scripting skills in Python and Bash; comfortable building and maintaining production-grade automation.
Hands-on experience with infrastructure automation tools, particularly Ansible.
Solid experience with monitoring and observability platforms - ELK stack, Grafana, and Zabbix.
Good understanding of CI/CD pipelines and related tooling, including Jenkins.
Familiarity with managing and monitoring MongoDB and PostgreSQL in a production environment.
Comfortable working in Linux-based environments.
Excellent problem-solving skills and strong written and verbal communication.
Ability to support the following:
- Experience with cloud providers - AWS, GCP, or Azure.
- Exposure to containerization technologies such as Docker and Kubernetes.
- Familiarity with infrastructure provisioning using Terraform.
- Experience introducing SRE practices (SLOs, error budgets, chaos engineering) at an organizational level.
- Exposure to and experience with migrating or building AI tools to improve processes.
What We Offer:
· A senior leadership role with real influence over the reliability engineering culture and roadmap.
· A collaborative, high-trust environment that values autonomy, ownership, and learning.
· Competitive compensation, benefits, and professional development support.
· Opportunity to shape practices that impact the entire engineering organization.
Why you should join us:
Employees from more than 40 countries have chosen Radware as a place where they can belong. Radware has been recognized by Glassdoor and BDI as one of the World’s Best Places to Work, ranking among the top 100 companies across the globe in the IT category. Radware has also been named a Gold Winner for Application Security in the 2023 Globee Cybersecurity Awards, a Leader in DDoS Protection by Forrester, and a Leader in the WAF market by Quadrant Knowledge Solutions. We are equally committed to our people. We strive to create a dynamic work environment that celebrates diversity, promotes equality, and thrives on the unique contributions of each individual. If you are ready to be part of a global-minded company that is inspired to create a better, safer future, and if you want to fight for the good guys and be at the forefront of helping companies protect their most critical assets from today’s cyber adversaries, then you’ve found the right fit at Radware.
Primary Location: IL-IL-Tel Aviv
Work Locations:
Job: Information Systems