Senior Technical DevOps Engineer - SRE

Emirates Group

UAE

Ref: NP598-537

Job description / Role

Employment: Full Time

The DevSecOps and Site Reliability Engineering chapter within Emirates IT is seeking skilled Senior Engineers to join our dynamic, high-performing team of experts. Our team is actively working on maturing DevSecOps and reliability engineering practices at every phase of the software lifecycle, from planning to operations. With established partnerships with leading hyperscalers and globally hosted key workloads, we aim to expand our team with DevOps and Site Reliability Engineers. With these new team members, we will continue our journey of enhancing our DevOps and reliability practices and driving end-to-end automation.

As a key technologist in the DevOps and Reliability engineering team, Senior Technical DevOps Engineer (SRE) , will take an active role in adopting enterprise level technology solutions provided for CI/CD, Container Orchestration, Public/Private Cloud services, Vault services, Observability and to implement the same in the projects/applications using the GitOps approach of everything as code including infrastructure lifecycle management, configuration management, application build & deployment pipelines. He/she applies DevOps/Site reliability principles to ensure speed, availability, performance, efficiency, change management, monitoring, emergency response, and capacity planning and act as a bridge between development and operations by applying an engineering mindset to system administration.

In this role, you will:

- Adopt/implement end-to-end Continuous Integration and Continuous Deployment pipelines for applications/projects using GitOps approach.
- Implement/configure comprehensive monitoring/observability/rules/alerts/dashboards for infrastructure, application, database, interfaces, etc., in alignment with the Observability maturity framework.
- Provision, configure, update non-prod and prod infrastructure/environments (Public/Private Cloud and VM/Kubernetes based) using "Infra/Config as code" and GitOps approach.
- Conduct triages/postmortems for any critical production issues to document the actionable findings to improve the system.
- Assist the FinOps team in establishing cost optimization practices with fit-for-purpose infrastructure for application workloads.
- Create all necessary document artifacts on Infra/Environments that are essential for efficient issue troubleshooting and to effectively onboard new team members.
- Perform/coordinate the scheduled security patching/upgrades to the operating systems, application runtimes along with the platform and application teams to eliminate the vulnerabilities reported.
- Remove the toil by automating the processes that are repeatedly performed manually and contribute back to the enterprise-wide reusable repository of Digital Platform teams.
- Measure and report, at pre-defined intervals, the progress/maturity of the DevOps/Reliability improvements done in alignment with the KPIs/Metrics published by the SRE chapter.
- Participate in the regular drill of backup, resiliency, chaos engineering, switchover testing, disaster recovery, etc.

Requirements:

Qualifications & Experience
To be considered for this role, you must meet the below requirements:

- Degree or Honours in Computer Science or similar subject.
- Overall 5+ years of experience in Information Technology (with 5+ years of hands-on experience with DevOps, Container Orchestration, Observability, Cloud)
- 5+ Years of hands-on experience on DevOps implementation (Git, Git Branching Strategies, CICD pipelines, Jenkins, Binary Artifact Repository, Docker, Shell Scripting, Python)
- 5+ Years of hands-on experience on Container Orchestration technologies development/Management/Troubleshooting (Kubernetes, HELM, Container Registry)
- 3+ Years of hands-on experience on Infrastructure/Configuration management (Terraform, Ansible)
- 3+ Years of hands-on experience on AWS Cloud - Admninistration/Migration
- 3+ Years of hands-on experience on Systems/Application APM/Infrastructure Monitoring, Dashboards, alerting and analytics
- 3+ Years of experience working with Linux Operating System
- 2+ Years of hands-on experience on application development
- Excellent exposure and experience on ITIL and Agile Frameworks

About the Company

A fast-growing international airline with one of the youngest fleets in the sky and more than 400 awards for excellence worldwide.

Get personalised updates on latest vacancies
Similar jobs you may be interested in
Senior Cloud System Engineer - Oracle Easy Apply
Michael Page
Saudi Arabia 3 Sep
Senior IT System Engineer Easy Apply
Staffconnect
Dubai 29 Aug
Role - System And Storage Manager Easy Apply
Staffconnect
Dubai 29 Aug
Sr. DevOps Engineer Easy Apply
Staffconnect
Abu Dhabi 29 Aug
Project Engineer (MultiCloud) Easy Apply
Staffconnect
Dubai 29 Aug
Job Alerts by Email
  • Personalised updates on latest career opportunities
  • Insights on hiring and employment activity in your industry
  • Typically sent twice a month