[Remote] Senior Site Reliability Engineer, GeForce NOW

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. NVIDIA is looking for a Senior Site Reliability Engineer (SRE) to join its GeForce Now (GFN) team. The SRE ensures that GPU cloud gaming services maintain reliability and uptime, while enabling developers to make changes to the system through careful planning. Responsibilities include improving service observability, automating tasks, and supporting production systems.ResponsibilitiesWorking on building tools to improve the SRE ObservabilityBe part of the Kubernetes migration journey with VMI setup and problem solvingRapidly debug and triage incidents and user-reported issuesTaking ownership of automating, scripting, and tooling of new/existing scripts to help the team achieve 100% automation of daily tasksSupport services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity management and launch reviewsBe part of an on call rotation to support production systemsSkillsMS or BS in Computer Science/Engineering or a related field or equivalent experience8+ year's Site reliability engineering experience working on large scale distributed micro services in a production environment with a real passion for automation and toolingVery strong Kubernetes background and ability to understand Kubernetes with complex and highly available VMI setup on K8'sLead significant production improvements including change management, post-mortem reviews, workflow processes, design and deliver software automation in various languagesConfirmed strengths in problem-solving and root causing issues, while continuously seeking ways to drive optimization, efficiency and the bottom linePrevious experience with Datadog, Prometheus, Alertmanager, or similar monitoring systemsExperience managing multi-region cloud deployments on hyperscalers like AWS, GCP, or AzureExperience designing and managing deployment pipelines using tools such as GitHub Actions, GitLab CI, or ArgoCDExcellent communication, presentation, social, and analytical skills; the ability to communicate complex interaction concepts clearly and persuasively across different audiences and varying levels of the organizationProduction-grade coding proficiency in languages like Go, Python, or robust Bash scriptingProduction on-call experience is a must. Should have served in a primary production on-call rotation, responding to and mitigating high-severity infrastructure alerts and service degradationsExperience working with automated anomaly detection, log clustering tools, or LLM-assisted debugging platformsComfortable using AI on a day-to-day basis as an SREPrior experience as an SRE or Service Engineer is a huge plusBenefitsEquityBenefitsCompany OverviewNVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. It was founded in 1993, and is headquartered in Santa Clara, California, USA, with a workforce of 10001+ employees. Its website is https://www.nvidia.com.Company H1B SponsorshipNVIDIA has a track record of offering H1B sponsorships, with 448 in 2026, 1872 in 2025, 1354 in 2024, 976 in 2023, 835 in 2022, 601 in 2021, 529 in 2020. Please note that this does not guarantee sponsorship for this specific role.

Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

LPN GIG - Flex/Per Diem/PRN - Mercy Joplin

Remote

Senior Account Manager - Healthcare (Remote)

Remote

Remote Customer Service Representative – Pet‑Lovers’ Support Specialist for careerzynith’s Online Pet Retail Experience

Remote

Finance Associate - Partner Investments

Remote

ULINE- Customer Service Representative

Remote

Customer Service Representative(Night Shift)- Remote

Remote

Teleperformance Customer Service Representative - Work from Home

Remote

Strategic Account Manager - Northeast

Remote

Senior Consultant, DFIR (Wed-Sun)

Remote

Escrow Officer

Remote
← Back