[Remote] Principal Data Center Facilities Engineer
Note: This is a remote position open to candidates in the USA.

Oracle is a leading technology company that powers innovations across various industries. Oracle is seeking a Principal Data Center Facilities Engineer to provide technical support and leadership in data center operations, focusing on reliability, compliance, and efficiency across global sites.

Responsibilities

- Define and govern enterprise standards for preventive and predictive maintenance across electrical, mechanical, liquid-to-chip, and life-safety systems; ensure consistent execution and documentation across all sites
- Lead advanced diagnostics for systemic reliability issues; drive remediation programs that eliminate recurring failure modes and materially improve site availability and energy efficiency
- Set performance targets and reliability KPIs; establish dashboards and audits to validate adherence to SLAs and regulatory requirements
- Lead high-severity incident response across regions; coordinate rapid stabilization, stakeholder communications, and executive updates; ensure durable corrective and preventive actions are implemented at scale
- Institutionalize standardized root cause analysis (RCA) methodologies and CAPA governance; track cross-site action closure and effectiveness
- Spearhead collaboration with design, construction, facility engineering, and operations teams to validate designs and influence decisions so that new systems integrate with existing infrastructure and tooling and align with operations objectives
- Lead technical oversight for site assessments, design reviews, uptime requirements, commissioning strategy, and integrated systems testing on high-priority builds and retrofits
- Approve commissioning documentation and acceptance criteria; validate performance against design intent, reliability targets, and maintainability standards before handover
- Serve as principal SME for complex and novel scenarios in operations of mission-critical systems (power, cooling, BAS, fire/life safety)
- Act as a senior engineering reviewer as needed and support the Engineering Change Advisory Board (CAB) process, ensuring changes are assessed, approved, and implemented in alignment with operational risk and change management protocols
- Govern operational procedures, MOP/SOP/EOP standards, and access controls; ensure safe work practices and audit readiness across all facilities
- Align operations with capacity, performance, and sustainability objectives; oversee optimization initiatives that reduce PUE, improve utilization, and control operating costs
- Provide SME technical support for contract negotiations, including generation and review of contracts, change orders, and due diligence validation
- Define and maintain best-in-class policies, technical standards, and playbooks; chair engineering review boards and lead the process for high-risk changes
- Provide senior engineering guidance for complex design challenges, lifecycle strategies, and modernization roadmaps (e.g., controls upgrades, UPS/generator strategies, cooling transformations)
- Prepare and review technical reports and business cases that guide investment decisions and long-horizon planning
- Define competency standards and training curricula for facilities teams and partners; mentor senior engineers and lead cross-site knowledge-sharing forums
- Develop and deliver advanced training on complex systems, incident response, and change management; certify proficiency for high-risk procedures
- Publish lessons learned and best practices from RCAs, commissioning, and optimization programs to uplift global operational maturity
- Set organizational vendor strategy and performance expectations; oversee selection, onboarding, and governance for colocation partners and critical service providers
- Ensure contract compliance, safety adherence, and availability of critical spares, consumables, and fuel; direct the most complex escalations and joint remediation for systemic issues
- Use performance analytics and SLAs to drive continuous improvement and cost optimization with vendors across the organization
- Influence cross-functional leaders and external stakeholders to gain alignment on strategic objectives. Foster partnerships with key business leaders, stakeholders, and customers, identifying opportunities to expand partnerships and promote long-term organizational success. Champion transparency and inclusivity by actively seeking, listening to, and incorporating diverse perspectives
- Drive implementation of ideas that increase the efficiency and effectiveness of processes, protocols, and workflows across the department. Seek and integrate diverse feedback on approaches and methods to enhance efficiency and ensure changes align with organizational goals
- Manage and provide direction on timelines and budgets for critical high-impact projects, ensuring timely completion and adherence to requirements. Anticipate and plan for shifts in resources or timelines as business priorities change
- Lead specialized, advanced problem-solving efforts, serving as an escalation point for highly complex issues. Guide others in using data-driven techniques to address ambiguous or novel issues, identify root causes, and implement solutions that prevent recurrence
- Leverage deep industry knowledge and expertise within a specialty area to serve as a thought leader within the organization. Maintain and evolve that expertise by proactively monitoring emerging trends, technologies, and industry standards so the organization remains current with best practices. Champion continuous learning and professional development across teams; apply new knowledge to drive advancement and mentor others to do the same

Skills

- 10+ years of experience in data center design and critical infrastructure operations
- Strong background in electrical, mechanical, and controls systems
- Experience in administering and maintaining mission-critical environments
- Ability to read, write, and speak English
- Exceptional customer focus and effective collaboration with internal data center teams and business partners
- Ability to travel to onsite locations

Benefits

- Medical, dental, and vision insurance, including expert medical opinion
- Short-term disability and long-term disability
- Life insurance and AD&D
- Supplemental life insurance (Employee/Spouse/Child)
- Health care and dependent care Flexible Spending Accounts
- Pre-tax commuter and parking benefits
- 401(k) Savings and Investment Plan with company match
- Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
- 11 paid holidays
- Paid sick leave: 72 hours of paid sick leave upon date of hire, refreshed each calendar year. Unused balance carries over each year up to a maximum cap of 112 hours.
- Paid parental leave
- Adoption assistance
- Employee Stock Purchase Plan
- Financial planning and group legal
- Voluntary benefits including auto, homeowner, and pet insurance

Company Overview

Oracle is an integrated cloud application and platform services company that sells a range of enterprise information technology solutions. It was founded in 1977 and is headquartered in Austin, Texas, USA, with a workforce of more than 10,000 employees. Its website is https://www.oracle.com/.