[Remote] Principal Operations Engineer, Hardware β Data Center Operations
Note: The job is a remote job and is open to candidates in USA. Fluidstack is focused on delivering compute infrastructure for AI, aiming to enhance human freedom through technology. They are seeking a Principal Operations Engineer, Hardware to lead the operational hardware fleet across their AI data centers, ensuring reliability and continuous improvement of deployed systems.Responsibilities10+ years of hands-on experience operating mission-critical hardware infrastructure, with at least 5 years as the senior technical voice on a site, campus, or fleetData center operations experience strongly preferred; hyperscale, large HPC, cloud, or other mission-critical compute infrastructure experience consideredDeep working command of GPU systems, server platforms, storage infrastructure, firmware lifecycle management, and hardware diagnostics β earned in the field, not from a textbookDemonstrated ability to author, approve, and execute high-risk MOPs and change records in live production environmentsA track record of leading root cause analysis on significant hardware events and driving corrective actions to closureA track record of holding OEMs, ODMs, service vendors, and deployment partners accountable β you know how to enforce a standard without burning the relationshipStrong written communication: operational health assessments, RCAs, procedure reviews, and design review feedback are second natureComfort operating as the senior technical voice across operations, hardware engineering, network, facilities, supply chain, and customer-facing teamsWillingness to travel extensively across the fleet. 50-75%Skills10+ years of hands-on experience operating mission-critical hardware infrastructure, with at least 5 years as the senior technical voice on a site, campus, or fleetData center operations experience strongly preferred; hyperscale, large HPC, cloud, or other mission-critical compute infrastructure experience consideredDeep working command of GPU systems, server platforms, storage infrastructure, firmware lifecycle management, and hardware diagnostics β earned in the field, not from a textbookDemonstrated ability to author, approve, and execute high-risk MOPs and change records in live production environmentsA track record of leading root cause analysis on significant hardware events and driving corrective actions to closureA track record of holding OEMs, ODMs, service vendors, and deployment partners accountable β you know how to enforce a standard without burning the relationshipStrong written communication: operational health assessments, RCAs, procedure reviews, and design review feedback are second natureComfort operating as the senior technical voice across operations, hardware engineering, network, facilities, supply chain, and customer-facing teamsWillingness to travel extensively across the fleet. 50-75%Bachelor's degree in Computer Engineering, Electrical Engineering, Computer Science, or related fieldHyperscale or large-scale compute operational experience supporting thousands of servers and accelerator systemsDirect experience operating modern GPU platforms at production scaleStrong working knowledge of Linux administration, hardware management tooling, and production troubleshooting workflowsExperience supporting liquid-cooled compute infrastructure and the operational practices required to maintain itExperience operating across multiple sites or as part of a global fleet operations functionExperience standing up new sites from deployment handover through steady-stateExperience contributing operational requirements into hardware platform decisions, reference architectures, or productized data center buildsScripting and automation experience in support of fleet-scale hardware operationsBenefitsOffers EquityRetirement or pension plan, in line with local norms.Health, dental, and vision insurance.Generous PTO policy, in line with local norms.Company OverviewFluidstack accelerates the worldβs most ambitious AI projects by removing the bottlenecks to compute. It was founded in 2017, and is headquartered in London, England, GBR, with a workforce of 51-200 employees. Its website is http://www.flare-global.com.Company H1B SponsorshipFluidstack has a track record of offering H1B sponsorships, with 1 in 2026, 1 in 2025, 2 in 2024. Please note that this does not guarantee sponsorship for this specific role.