[Remote] Senior Site Reliability Engineer

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. Juul Labs is dedicated to transitioning adult smokers away from combustible cigarettes through innovation and quality. They are seeking a Senior Site Reliability Engineer to manage and ensure the operational stability of their hybrid cloud infrastructure, focusing on automation and reliability in critical incidents.ResponsibilitiesDesign, deploy, and maintain enterprise-scale Nutanix AHV clusters and Prism Central for multi-cluster managementExpert-level proficiency with Nutanix CLI (nCLI and acli) for advanced operations, troubleshooting, and automationDevelop automation scripts using Nutanix REST APIs, Python SDK, PowerShell, and Terraform for infrastructure-as-codeCreate and manage VM templates, golden images, and standardized deployment catalogs for consistent provisioningDesign disaster recovery solutions using Leap, Protection Domains, cross-cluster replication, and metro clusteringImplement network micro-segmentation using Nutanix Flow and configure RBAC, encryption, and security hardeningLead L3 troubleshooting using advanced diagnostics, log analysis (CVM, Genesis), NCC health checks, and cluster service resolutionConfigure high availability, VM affinity rules, QoS policies, and optimize performance for mission-critical workloadsManage AHV networking with OVS bridges, VLANs, bonds, LACP and implement resource reservations and workload balanceDesign, deploy, and maintain hybrid cloud infrastructure across Nutanix HCI, AWS, and GCP platformsArchitect and implement multi-cloud solutions ensuring high availability, scalability, and disaster recoveryArchitect and deploy enterprise-scale, highly available multi-cloud solutions across AWS and GCP with multi-region/multi-account strategiesExpert-level proficiency with AWS CLI, GCP CLI, SDK, boto3, and Python for advanced automation and infrastructure orchestrationDesign AWS Organizations and GCP Organization hierarchies with consolidated billing, IAM policies, and centralized governanceConfigure and manage AWS Systems Manager (SSM) including Session Manager, Run Command, State Manager, and Automation for centralized fleet operationsImplement centralized logging using CloudWatch/CloudTrail and GCP Cloud Logging with S3/Cloud Storage aggregationIntegrate AWS and GCP with Splunk using HEC, CloudWatch subscriptions, Pub/Sub, Dataflow, and cloud-specific add-ons for SIEM correlationDesign and deploy advanced load balancing solutions with AWS ALB/NLB/ELB and GCP Cloud Load Balancing including SSL termination and auto-scalingDevelop infrastructure-as-code using Terraform, CloudFormation, CDK for repeatable multi-cloud deployments and CI/CD pipelinesConfigure AWS SSO, cross-account IAM roles, GCP Workload Identity, and federated access for centralized identity managementDesign VPC architectures with AWS Transit Gateway/PrivateLink and GCP Shared VPC/VPC peering for hybrid connectivityManage containerized workloads using EKS, GKE, ECS, Cloud Run with service mesh, observability, and security best practicesImplement disaster recovery using AWS Backup, Cross-Region Replication, GCP snapshots, and multi-region failover strategiesLead L3 troubleshooting using CloudWatch Insights, GCP Cloud Trace, VPC Flow Logs, X-Ray, and vendor support escalationPerform cost optimization through Reserved Instances, Committed Use Discounts, rightsizing, and automated resource lifecycle managementAdminister and support Windows Server and Unix/Linux environments in production and non-production settingsPerform OS-level hardening, patch management, and security compliance across heterogeneous systemsAutomate routine administrative tasks using PowerShell, Bash, Python, or similar scripting languagesManage GitHub organization settings, user permissions, repository access controls, and monitor GitHub Actions workflows and repository health across multiple teamsConfigure Splunk forwarders, heavy forwarders and other integrations for data ingestion from cloud and on-premises sourcesSkills8-12+ years infrastructure experience with 8+ years in Nutanix HCI and enterprise cloud AWS/GCPExpert-level skills in Python, PowerShell, Bash scripting, infrastructure-as-code (Terraform/CloudFormation), and container orchestration (Kubernetes, EKS/GKE)Proven experience managing enterprise-scale environments, hybrid cloud migrations, disaster recovery, and L3 critical incident managementStrong networking knowledge (TCP/IP, VLANs, routing, VPN), security hardening, and compliance frameworks (ITIL)Strategic thinker with exceptional analytical and troubleshooting abilities for complex multi-layer infrastructure issuesExcellent communication skills to translate technical concepts to executives and non-technical stakeholdersCalm under pressure during critical outages with meticulous attention to security, compliance, and configuration managementSelf-motivated continuous learner committed to staying current with evolving cloud technologies and automation opportunitiesAvailable for on-call rotations with strong documentation skills and customer service orientationBachelor's or master's degree in computer science/ITCertifications (plus): Nutanix NCP/NCAP, AWS Solutions Architect Professional, AWS DevOps Professional, GCP Professional Cloud Architect, TerraformBenefitsPeople. Work with talented, committed and supportive teammatesEquity and performance bonuses. Every employee is a stakeholder in our successCell phone subsidy, commuter benefits and discounts on JUUL productsExcellent medical, dental and vision, disability, and life insurance, plus family support, wellness, legal, and employee assistance program benefits401(k) plan with company matchingPlus biannual discretionary performance bonusesCompany OverviewJuul Labs is a thriving team of scientists, engineers, designers and professionals who are committed to offering adult smokers alternatives to combustible cigarettes, while combating underage use of our products. It was founded in 2015, and is headquartered in San Francisco, California, USA, with a workforce of 1001-5000 employees. Its website is https://www.juul.com.

Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

[Work From Home] Executive Assistant Burbank, CA, USA

Remote

Sr. BI Engineer [Fixed - Term Contract]

Remote

Experienced Online Research Participant and Customer Service Representative – Flexible Remote Opportunity for Engaged Individuals

Remote

Delta Airlines Flight Attendant Immediate Start No Experience Training Provided

Remote

**Experienced Senior Manager of Customer Support – Global Operations & Team Leadership**

Remote

Remote Healthcare Claims Specialist

Remote

Hiring Now: Non-Phone Remote Roles | Flexible Opportunities

Remote

**Entry Level Chat Support Representative – Delivering Exceptional Customer Experiences at blithequark**

Remote

Security Engineer (L4) - Application Security

Remote

Associate Product Manager, Data – CRM & Personalization

Remote
← Back