AI Infrastructure & Platform Operations Engineer (remote in the EU)

Remote Full-time
Company Description

Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure, and sovereign infrastructure for modern AI, machine learning, and data-intensive applications. By combining open source innovation with deep expertise in Kubernetes orchestration, Mirantis empowers platform engineering teams to deliver composable, production-ready developer platforms across any environment—on-premises, in the cloud, at the edge, or in sovereign data centers. As enterprises navigate the growing complexity of AI-driven workloads, Mirantis delivers the automation, GPU orchestration, and policy-driven control needed to manage infrastructure with confidence and agility. Committed to open standards and freedom from lock-in, Mirantis ensures that customers retain full control of their infrastructure strategy.
Mirantis serves many of the world’s leading enterprises, including Adobe, DocuSign, Liberty Mutual, PayPal, Reliance Jio, Societe Generale, Splunk, and Volkswagen. Learn more at www.mirantis.com.

Job Description

We are building a European AI Infrastructure & Platform Operations team responsible for operating large-scale AI infrastructure environments powered by NVIDIA GPUs, high-performance networking, Kubernetes, and next-generation platform technologies.
The team is responsible for ensuring the availability, performance, and operational stability of critical AI infrastructure platforms deployed across multiple datacenters. Working at the intersection of infrastructure, networking, and platform operations, you will help support the environments that power modern AI workloads.
This is an opportunity to work with some of the latest technologies in AI infrastructure while contributing to the evolution of AI-powered operational services through platforms such as k0rdent AI.
Responsibilities:
Monitor, operate, and support production AI infrastructure platforms.
Investigate and resolve infrastructure, networking, hardware, and platform-related incidents.
Support NVIDIA GPU infrastructure and associated platform services.
Monitor and troubleshoot Kubernetes-based environments.
Investigate performance, availability, and reliability issues across infrastructure and platform components.
Collaborate with engineering teams, hardware vendors, datacenter personnel, and service delivery teams to resolve technical issues.
Participate in incident response, root cause analysis, and operational improvement activities.
Contribute to improvements in monitoring, observability, automation, and operational processes.
Maintain operational documentation, runbooks, and knowledge articles.

Qualifications


3+ years of experience in infrastructure operations, platform operations, network operations, site reliability engineering, cloud operations, datacenter operations, or related technical roles.
Strong Linux administration and troubleshooting skills.
Good understanding of networking concepts and experience diagnosing infrastructure-related issues.
Working knowledge of Kubernetes in production environments.
Experience supporting production infrastructure and services.
Strong analytical and problem-solving skills.
Experience working within structured operational and incident management processes.
Excellent communication and collaboration skills.
Ability to work within a shift-based operational environment.
Experience in one or more of the following areas is highly desirable:
NVIDIA GPU infrastructure and accelerated computing platforms.
InfiniBand networking and NVIDIA UFM.
Kubernetes platform operations.
AI infrastructure or HPC environments.
Site Reliability Engineering (SRE) or Platform Engineering.
Observability platforms such as Grafana, Prometheus, ELK, or OpenTelemetry.
Infrastructure automation technologies and Infrastructure-as-Code practices.
Large-scale distributed systems and production platforms.

Additional Information

What does Mirantis offer you?
Work with some of the most advanced AI infrastructure environments in production today.
Gain exposure to NVIDIA GPU technologies, Kubernetes platforms, and high-performance networking environments.
Help define how next-generation AI infrastructure is operated and supported.
Be part of a team shaping the future of AI-powered operations through k0rdent AI.
Join a growing organisation investing heavily in AI infrastructure and platform services.
It is understood that Mirantis, Inc. may use automated decision-making technology (ADMT) for specific employment-related decisions. Opting out of ADMT use is requested for decisions about evaluation and review connected with the specific employment decision for the position applied for. You also have the right to appeal any decisions made by ADMT by sending your request to [email protected]
By submitting your resume, you consent to the processing and storage of your personal data in accordance with applicable data protection laws, for the purposes of considering your application for current and future job opportunities.

We are a Leader for Container Management in G2 (#2 after AWS)!