Cloud Engineer
inter-prompt global (ip global) Abu Dhabi
Job Description
Role Summar:
The Cloud Operations Engineer is responsible for the operational support, monitoring, troubleshooting, and incident management of cloud infrastructure environments, including Software Defined Infrastructure (SDI), Cloud Native Infrastructure, Cloud Execution Environments, Containerized Network Functions (CNF), and Virtual Network Functions (VNF).The role supports telecom cloud and core network infrastructure services within a 24x7 operational environment.
Level: L1/L2 Operations Support EngineerDomain: Telecom Cloud / Core Network Infrastructure / Kubernetes Operations
Work Pattern: 24x7 Shift-Based Operations Support
Duration: 12 Months (likely to extend)
Location: UAE Data Centers
Security Requirements: Government security clearance required (Valid UAE ID mandatory)
Key Responsibilities
Operations & Monitoring- Monitor cloud infrastructure, SDI, CNF, VNF, and cloud-native environments.
- Perform proactive health checks and identify service degradation.
- Handle alarms, events, and incidents from monitoring systems
- Perform first-level troubleshooting and service restoration.
- Escalate complex issues to second-line support, vendors, and engineering teams.
- Manage incidents according to ITIL processes
- Perform fault isolation and root cause analysis
- Coordinate with customer operations teams during critical incidents
- Provide timely incident updates, communication, and reporting
- Monitor Kubernetes clusters and cloud-native work
- Validate pod, node, and container health
- Investigate container failures, resource exhaustion, and application alarms
- Support deployment validation and post-maintenance verification
- Support compute, storage, virtualisation, and networking platforms
- Monitor cluster health, redundancy, and resource utilisation
- Execute routine operational activities, backups, and housekeeping
- Support maintenance windows and software upgrades
- Maintain operational procedures, and troubleshooting guides
- Produce incident reports and operational summaries
- Update knowledge bases and support documentation
Required Technical Skills
Cloud & Infrastructure Platforms- Software Defined Infrastructure (SDI)
- Cloud Execution Environments (CEE)
- Cloud Native Infrastructure Solutions (CNIS)
- Cloud Container Distribution Platforms
- Cloud Native Network Functions (CNF)
- Virtual Network Functions (VNF)
- Red Hat Enterprise Linux (RHEL)
- Linux troubleshooting and performance analysis
- CPU, memory, filesystem, and process management
- Shell scripting fundamentals
- Kubernetes architecture and operations
- Docker and container technologies
- Pod, Deployment, StatefulSet, Service, and Ingress management
- kubectl troubleshooting
- Container log analysis
- TCP/IP
- VLANs
- Routing and Switching
- DNS, DHCP, and NTP
- Load balancing concepts
- Firewall and security group fundamentals
- VMware vSphere or equivalent platforms
- Compute, storage, and networking concepts
- High availability and disaster recovery
- Server and operating system troubleshooting
- Prometheus
- Grafana
- ELK Stack
- Operational monitoring tools
- Ansible (preferred)
- Git and CI/CD fundamentals
- 4G EPC Architecture
- 5G Core Architecture
- IMS Architecture
- CNF/VNF deployment concepts
- Telco Cloud operations
- Strong troubleshooting and analytical skills.
- Ability to work under pressure during critical incidents.
- Excellent communication and customer-facing skills.
- Knowledge of ITIL Incident, Problem, and Change Management processes.
- Ability to work in 24x7 shift rotations and on-call environments.
- Bachelor's degree in Computer Science, Telecommunications, Information Technology, or a related discipline.
- 2–5 years of experience in Cloud Infrastructure, SDI, Telecom Operations, or related environments.
- Experience supporting Kubernetes-based and cloud-native platforms is highly desirable.
- RHCSA (Red Hat Certified System Administrator)
- CKA (Certified Kubernetes Administrator)
- VMware VCP
- ITIL Foundation
- Cloud Infrastructure or Cloud Native Platform Certifications
UnikieAbu Dhabi
Generation) systems
• Implement prompt engineering, tool/function calling, and multi-agent orchestration
• Deploy AI solutions using Azure cloud services
• Ensure production readiness including CI/CD, monitoring, and security
• Collaborate with stakeholders...
dicetek llcAbu Dhabi
structured for investigations and audits.
Suitable candidates should have strong experience with cloud-based data platforms, data engineering, and working with high-volume, business critical data.
Certifications
Microsoft Certified: Azure Data Engineer...
bravanticAbu Dhabi
Administrator Engineer to lead and execute cloud migration initiatives and provide ongoing operational support across our Microsoft cloud estate. The role is hands-on and delivery-focused, with three core pillars: on-premises to Azure migrations, day-to-day...