[Remote] Staff Site Reliability Operations Engineer

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. Calix is a company focused on enabling Communication Service Providers to transform and future-proof their businesses through a cloud-first, AI-powered platform. They are seeking a Staff Site Reliability Engineer to lead their global platform reliability and observability strategy on Google Cloud Platform, leveraging advanced technologies to build intelligent infrastructure and provide technical leadership.ResponsibilitiesFull-Stack Network Architecture: Architect, optimize, and troubleshoot complex networking infrastructure spanning Layer 1 through Layer 7, ensuring low-latency data transport, secure edge routing, and seamless service mesh integrationGrafana Stack Architecture: Design, scale, and optimize our unified observability platform using the Grafana Labs suite (Grafana, Mimir, Loki, Tempo, and Beyla)AIOps & Intelligent Alerting: Deploy machine learning models and automated anomaly detection to cut through telemetry noise, reduce alert fatigue, and predict network or data pipeline bottlenecksGKE Platform Engineering: Drive the architecture, scaling, security, and networking of production Google Kubernetes Engine (GKE) clustersData & Event Streaming Reliability: Tune, and maintain high-throughput Apache Kafka clusters to guarantee low-latency event delivery and high availabilityLarge-Scale Database Management: Ensure the performance, scalability, and disaster recovery readiness of our transactional and analytical data tiers across PostgreSQL, AlloyDB, and BigQueryAutomated Incident Response: Integrate AIOps insights with Grafana workflows to automate triage, accelerate root-cause analysis, and trigger auto-remediation scriptsTechnical Leadership: Champion the long-term technical roadmap for distributed infrastructure engineering and GCP cloud-native observability standardsMentorship: Coach senior and junior engineers on advanced debugging techniques, distributed systems thinking, and intelligent operations across a distributed workforceSkillsProven track record of high autonomy and successful delivery in a 100% remote engineering environment8+ years in SRE, Production Engineering, or Distributed Systems infrastructure rolesDeep technical knowledge and debugging mastery across all OSI layers, including: L1-L3: Physical/fiber infrastructure awareness, switching, and advanced routing protocols (BGP, OSPF)Transport layer tuning (TCP congestion control algorithms, UDP, QUIC)Session management, TLS termination, DNS architecture, and advanced application protocols (HTTP/3, gRPC)Expert-level mastery of Google Kubernetes Engine (GKE) internals, custom controllers, multi-cluster networking, and GitOps workflowsProven track record managing high-throughput Apache Kafka pipelines and large-scale data environments across PostgreSQL, AlloyDB, and BigQueryDeep, hands-on experience deploying and managing Grafana Enterprise/Cloud, Prometheus/Mimir, Loki, and Tempo at scaleTrack record applying AI/ML techniques for time-series anomaly detection, log clustering, and correlation (e.g., Grafana Adaptive Metrics, BigPanda)Advanced, production-scale expertise utilizing HashiCorp Terraform exclusively to provision and manage multi-region GCP cloud architecturesHigh proficiency in Go and Python for building custom infrastructure tooling, Kubernetes operators, and data integration scriptsExceptional written and verbal communication skills, with an emphasis on creating clear documentation for asynchronous alignmentDeep knowledge of Google Cloud architectural best practices, Cloud SDN, Cloud Armor, Interconnect, Identity and Access Management (IAM), and cost optimizationDeep understanding of Linux internals, eBPF-based monitoring, kernel-level networking, and packet analysis tools (Wireshark, tcpdump)BenefitsAs a part of the total compensation package, this role may be eligible for a bonus.Company OverviewCalix provides the cloud, software, systems and services for service providers to simplify business, excite subscribers and grow value It was founded in 1999, and is headquartered in San Jose, California, USA, with a workforce of 1001-5000 employees. Its website is http://www.calix.com.Company H1B SponsorshipCalix has a track record of offering H1B sponsorships, with 11 in 2026, 36 in 2025, 22 in 2024, 24 in 2023, 31 in 2022, 19 in 2021, 7 in 2020. Please note that this does not guarantee sponsorship for this specific role.

Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Remote Medical Director, Appeals

Remote

Experienced Data Entry Associate – Remote Opportunity at careerzynith

Remote

bolthires Product Reviewer Jobs (Remote) $30/H – [Entry Level/No Experience]

Remote

Telehealth NP – Remote Cardiac Preventive Care (New Grads Welcome)

Remote

[Remote] Senior Supply Chain Analyst

Remote

Claims Examiner - Fast Track

Remote

**Experienced Customer Service Representative – Sports and Entertainment Industry**

Remote

**Experienced Live Chat Support Specialist – Work from Home Opportunity with arenaflex**

Remote

Senior People Data Scientist

Remote

Long Term Mission Opportunities

Remote
← Back