[Remote] SRE Platform Engineer

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. GE Vernova is seeking a Platform System Reliability Engineer to manage their EKS Kubernetes environment, which supports global grid software SaaS products. This role involves ensuring the security, scalability, and resilience of the infrastructure while overseeing the full lifecycle of production clusters.ResponsibilitiesHelp design and deploy hardened EKS clusters across multiple AWS regions, ensuring consistent security baselinesBuild and maintain reusable Terraform and Ansible modules for automated provisioning of cloud infrastructure services including networking services, compute, storage, queue and cache, etcImplement "Policy as Code" guardrails and secure network perimeters (ESPs) in alignment with NERC CIP and IEC 62443 standardsStandardize run books, operating processes required to run critical infrastructure with highest reliabilityDefine and enforce Kubernetes resource quotas, limit ranges, and Pod Priority classes to ensure mission-critical services receive prioritized compute resourcesManage the ingress strategy and service mesh architecture to facilitate secure, performant connectivity between distributed micro servicesLead platform-level smoke, load testing and disaster recovery exercises to validate that the infrastructure can meet 99.99% uptime targetsPartner with application teams to right-size containerized workloads, optimizing for both performance and cloud cost (FinOps)Act as the highest technical escalation point for complex Kubernetes internals, troubleshooting issues such as failed pods, memory leaks, and network partitionsLead root cause analysis (RCA) for platform-level outages, implementing systemic fixes to prevent recurring failuresProactively identify and automate repetitive operational tasks—such as cluster upgrades and OS patching—to ensure the team spends at least 50% of their time on engineering improvementsInstitutionalize platform monitoring using Prometheus and Grafana, creating dashboards that surface the "Golden Signals" of cluster healthSkills5 years of experience operating production-grade Kubernetes clusters at scaleExpert-level knowledge of multi-cluster management, performance tuning and experience implementing observability tools such as Prometheus/Grafana, Dynatrace, Splunk, Datadog, etcDeep hands-on experience with AWS core services (EKS, EC2, ALB, S3, RDS, MSK)Proficiency in Terraform, Ansible, and Python or Go for infrastructure automation and deployment tools like ArgoCD or FluxStrong understanding and hands on experience of cloud networking concepts such as VPCs, routing, load balancing and security configurations such as encryption, certificate managementBachelor's Degree in Computer Science or 'STEM' Majors (Science, Technology, Engineering and Math) with advanced experience6–8 years in SRE or Platform Engineering roles supporting mission-critical, 24/7 cloud environmentsProven track record as a structured incident responder who can handle production down/break the glass scenarios in mission critical applicationsPractical knowledge of NERC CIP, SOC2, ISO 27001, or IEC 62443 compliance standards in a SaaS contextAWS Certified DevOps Engineer – Professional, CKA (Certified Kubernetes Administrator), or SRE Practitioner CertificationExperience supporting mission-critical systems in energy, utilities, or other high-stakes industrial sectorsBenefitsRelocation Assistance Provided: YesCompany OverviewGE Vernova provides energy consulting, gas power, and grid solutions. It was founded in 2024, and is headquartered in Boston, Massachusetts, USA, with a workforce of 10001+ employees. Its website is https://www.gevernova.com.

Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Experienced Data Entry and Form Filling Specialist – Remote Work Opportunity for Detail-Oriented Individuals with a Passion for Data Accuracy and Quality

Remote

Remote Success Coach & Training Facilitator - Flexible Schedule

Remote

Hiring Now: Require Speech Language Pathologist - Part Time in

Remote

[Remote-Position] (Remote Jobs No Experience) Disney Data Entry

Remote

Server / Waitress / Waiter – Amazon Store

Remote

FULL TIME Amazon Customer Support – Remote Work Hiring

Remote

Remote Travel Scheduling Coordinator

Remote

Clinical Research Associate II / Sr CRA - Full Service - ONC + Gen Med (Home-Based in Western US)

Remote

**Experienced Remote Amazon Customer Service Representative - $31/H - Work From Home Job Opportunity with Comprehensive Benefits and Growth Potential**

Remote

Virtual Care Assistant (Medical Assistant / CNA / LPN)

Remote
← Back