Senior AI Infrastructure Engineer
Quantum Talent Group Abu Dhabi
Job Description
We are looking for a skilled senior AI infrastructure engineer to manage the provisioning, deployment, optimization and maintenance of Red Hat OpenShift Container Platform and Red Hat OpenShift AI platform to support AI/ML workflows. The ideal candidate is expected to deploy and maintain production Kubernetes environments, providing expertise in OpenShift to our clients and partners.Proficiency with virtualization, container orchestration, configuration management, and their capabilities are required, as well as familiarity with networking, databases, operating systems, and AI/ML concepts.
Responsibilities:
Administration of Red Hat OpenShift Solutions- Deploy, configure, and manage OpenShift Container Platform, OpenShift AI, and associated Kubernetes infrastructure for a variety of client environments.
- Engage in all aspects of OpenShift administration, including the management of users and policies, resources, networking configuration, creation and management of applications, and configuration of pod scheduling and cluster scaling.
- Perform routine upgrades, patching and maintenance to ensure infrastructure is secure and up to date.
- Implement and maintain automated solutions for provisioning and configuring the OpenShift environment and its associated infrastructure.
- Monitor and analyze performance metrics, working proactively to prevent issues before they impact operations.
- Troubleshoot and resolve infrastructure issues, ensuring minimal downtime and high level of performance.
- Provide best practice guidance on configuration for container orchestration platforms across multiple applications and projects.
Maintain thorough documentation for processes, platform architecture, system configurations, and troubleshooting steps.
Qualifications:
Required Skills & Experience:
- Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent work experience)
- 5+ years experience provisioning and administering container orchestration platforms to support mission critical workloads.
- Proven experience as a Red Hat OpenShift administrator or Kubernetes administrator in a production environment.
- Proficiency with Red Hat Enterprise Linux (RHEL), Red Hat Core OS (RHCOS), or similar Red Hat-based Linux distributions.
- Experience with automation tools for infrastructure provisioning and configuration such as Ansible and Terraform.
- Familiarity with monitoring and logging tools such as Prometheus, Grafana, or similar.
- Understanding of networking principles and security best practices in a containerized environment.
- Excellent communication skills and ability to articulate information to both technical and non-technical stakeholders.
- Ability to work collaboratively in a cross-functional environment and adapt to evolving requirements and priorities.
- Strong analytical, problem-solving, and critical thinking skills with a keen attention to detail
Preferred:
- Certification(s): Red Hat Specialist in OpenShift Administration, Red Hat Certified Specialist in OpenShift AI and/or Certified Kubernetes Administrator (CKA)
- Familiarity with cloud platforms and integrating OpenShift within hybrid/multi-cloud environments
- Knowledge of AI/ML workload management on OpenShift AI is a plus, as well as familiarity with GPU management and training and inferencing applications
WSP in the Middle EastAbu Dhabi
degree preferred).
• Minimum of 20 years of experience in infrastructure and utilities supervision, with at least 10 years in a senior resident engineer or site supervision role within a construction consultancy in Abu Dhabi.
• In-depth knowledge of Abu...
Abu Dhabi
Job Title:
Mechanical Engineer
Department:
Operations
Reporting To:
Site Manager
Site:
ES-
Job Purpose:
The Data Center Mechanical Facility Engineer will be responsible for Data Center Mechanical Engineering Operations within a Data Center...
WSP in the Middle EastAbu Dhabi
degree preferred).
• Minimum of 20 years of experience in infrastructure and utilities supervision, with at least 10 years in a senior resident engineer or site supervision role within a construction consultancy in Abu Dhabi.
• In-depth knowledge of Abu...