[Remote] Staff Site Reliability Engineer

Remote Full-time

Note: This is a remote position open to candidates in the USA.

Zscaler accelerates digital transformation to ensure customers are agile, efficient, resilient, and secure. They are seeking a Staff Site Reliability Engineer to be a key member of the Zero Trust Exchange team, responsible for the reliability of large-scale cloud services and for ensuring system performance and availability.

Responsibilities

Own the reliability of a large-scale cloud service (Linux/BSD, bare metal, Kubernetes, custom load balancing, SD-WAN) by partnering with Engineering and Network teams to define requirements early, conduct operability reviews, and contribute code and design docs for platform resilience

Develop and operate end-to-end observability (metrics/logs/traces, dashboards, alerting) and incident tooling to manage SLOs and error budgets, reduce noise, and improve system detection and diagnosis

Participate in an on-call rotation to lead full-cycle incident response; perform deep cross-stack troubleshooting (OS, networking, distributed systems, packet captures, core dumps) to drive permanent software fixes and codify learnings into runbooks and tests

Build and maintain everything-as-code for fleet and service lifecycle, driving provisioning, configuration, release automation, canary deployments, and complex rollout/rollback workflows

Continuously improve platform hygiene through consistent OS/app upgrades, dependency/vulnerability patching, capacity and performance tuning, and strict CI/CD validation prior to production rollouts

Skills

US Citizenship is required (due to the nature of assigned customers)

5+ years of industry experience in software engineering, infrastructure software, and/or platform engineering

Proficiency in at least one programming language (such as Python, Bash, or Go) with demonstrated ability to write production-quality code (testing, code reviews, CI, maintainable design, scripting for diagnostics)

Strong Linux/Unix systems fundamentals (process/memory, filesystems, networking stack basics, debugging/perf troubleshooting) and a solid understanding of networking protocols and components (e.g., HTTP, DNS, TCP/IP, ICMP, OSI model, subnetting, and load balancing/traffic concepts)

Proven experience operating production services (including incident response, troubleshooting, reducing toil) and the ability to participate in on-call rotations and support occasional after-hours or weekend deployments

Experience managing BSD in production, with a focus on driving systemic fixes through platform engineering

Proven expertise in operating Kubernetes at scale

Deep experience with the Prometheus/OpenTelemetry ecosystems, including instrumenting golden signals, defining SLOs, and performing alert tuning to ensure high-availability environments

Benefits

Various health plans

Time off plans for vacation and sick time

Parental leave options

Retirement options

Education reimbursement

In-office perks, and more!

Company Overview

Zscaler is a global cloud-based information security company that enables secure digital transformation for mobile and cloud. It was founded in 2008 and is headquartered in San Jose, California, USA, with a workforce of 5,001 to 10,000 employees. Its website is https://www.zscaler.com.

