Cloud Engineer

apartmentinter-prompt global (ip global) placeAbu Dhabi calendar_month 

Job Description

Role Summar:

The Cloud Operations Engineer is responsible for the operational support, monitoring, troubleshooting, and incident management of cloud infrastructure environments, including Software Defined Infrastructure (SDI), Cloud Native Infrastructure, Cloud Execution Environments, Containerized Network Functions (CNF), and Virtual Network Functions (VNF).

The role supports telecom cloud and core network infrastructure services within a 24x7 operational environment.

Level: L1/L2 Operations Support Engineer
Domain: Telecom Cloud / Core Network Infrastructure / Kubernetes Operations
Work Pattern: 24x7 Shift-Based Operations Support
Duration: 12 Months (likely to extend)
Location: UAE Data Centers

Security Requirements: Government security clearance required (Valid UAE ID mandatory)

Key Responsibilities

Operations & Monitoring
  • Monitor cloud infrastructure, SDI, CNF, VNF, and cloud-native environments.
  • Perform proactive health checks and identify service degradation.
  • Handle alarms, events, and incidents from monitoring systems
  • Perform first-level troubleshooting and service restoration.
  • Escalate complex issues to second-line support, vendors, and engineering teams.
Incident Management
  • Manage incidents according to ITIL processes
  • Perform fault isolation and root cause analysis
  • Coordinate with customer operations teams during critical incidents
  • Provide timely incident updates, communication, and reporting
Cloud Native Operations
  • Monitor Kubernetes clusters and cloud-native work
  • Validate pod, node, and container health
  • Investigate container failures, resource exhaustion, and application alarms
  • Support deployment validation and post-maintenance verification
Infrastructure Management
  • Support compute, storage, virtualisation, and networking platforms
  • Monitor cluster health, redundancy, and resource utilisation
  • Execute routine operational activities, backups, and housekeeping
  • Support maintenance windows and software upgrades
Documentation & Reporting
  • Maintain operational procedures, and troubleshooting guides
  • Produce incident reports and operational summaries
  • Update knowledge bases and support documentation

Required Technical Skills

Cloud & Infrastructure Platforms
  • Software Defined Infrastructure (SDI)
  • Cloud Execution Environments (CEE)
  • Cloud Native Infrastructure Solutions (CNIS)
  • Cloud Container Distribution Platforms
  • Cloud Native Network Functions (CNF)
  • Virtual Network Functions (VNF)
Linux Administration (Mandatory)
  • Red Hat Enterprise Linux (RHEL)
  • Linux troubleshooting and performance analysis
  • CPU, memory, filesystem, and process management
  • Shell scripting fundamentals
Kubernetes & Containers (Mandatory)
  • Kubernetes architecture and operations
  • Docker and container technologies
  • Pod, Deployment, StatefulSet, Service, and Ingress management
  • kubectl troubleshooting
  • Container log analysis
Networking (Mandatory)
  • TCP/IP
  • VLANs
  • Routing and Switching
  • DNS, DHCP, and NTP
  • Load balancing concepts
  • Firewall and security group fundamentals
Infrastructure & Virtualization
  • VMware vSphere or equivalent platforms
  • Compute, storage, and networking concepts
  • High availability and disaster recovery
  • Server and operating system troubleshooting
Monitoring & Automation
  • Prometheus
  • Grafana
  • ELK Stack
  • Operational monitoring tools
  • Ansible (preferred)
  • Git and CI/CD fundamentals
Telecom Domain Knowledge (Preferred)
  • 4G EPC Architecture
  • 5G Core Architecture
  • IMS Architecture
  • CNF/VNF deployment concepts
  • Telco Cloud operations
Required Competencies
  • Strong troubleshooting and analytical skills.
  • Ability to work under pressure during critical incidents.
  • Excellent communication and customer-facing skills.
  • Knowledge of ITIL Incident, Problem, and Change Management processes.
  • Ability to work in 24x7 shift rotations and on-call environments.
Education & Experience
  • Bachelor's degree in Computer Science, Telecommunications, Information Technology, or a related discipline.
  • 2–5 years of experience in Cloud Infrastructure, SDI, Telecom Operations, or related environments.
  • Experience supporting Kubernetes-based and cloud-native platforms is highly desirable.
Preferred Certifications
  • RHCSA (Red Hat Certified System Administrator)
  • CKA (Certified Kubernetes Administrator)
  • VMware VCP
  • ITIL Foundation
  • Cloud Infrastructure or Cloud Native Platform Certifications
thumb_up_altRecommended

Senior AI Engineer - Agent AI

apartmentUnikieplaceAbu Dhabi
Generation) systems  •  Implement prompt engineering, tool/function calling, and multi-agent orchestration  •  Deploy AI solutions using Azure cloud services  •  Ensure production readiness including CI/CD, monitoring, and security  •  Collaborate with stakeholders...
check_circleNew offer

Data Platform Engineer

apartmentdicetek llcplaceAbu Dhabi
structured for investigations and audits. Suitable candidates should have strong experience with cloud-based data platforms, data engineering, and working with high-volume, business critical data. Certifications Microsoft Certified: Azure Data Engineer...
business_centerHigh salary

Azure Administrator Engineer

apartmentbravanticplaceAbu Dhabi
Administrator Engineer to lead and execute cloud migration initiatives and provide ongoing operational support across our Microsoft cloud estate. The role is hands-on and delivery-focused, with three core pillars: on-premises to Azure migrations, day-to-day...