Senior Site Reliability Engineer

Remote Full-time
The Role:
We are looking for a Senior Site Reliability Engineer to own the observability, reliability, and operational health of our platform. This person will serve as the primary steward of monitoring and alerting, driving automation to reduce toil and ensure our production environments run with minimal friction. You will partner closely with the Engineering team, acting as a thought leader in reliability engineering, cloud infrastructure, and operational excellence.
Our platform is built on Azure Cloud using C#, ASP.NET, React, MSSQL, and Redis.

Responsibilities:
Monitoring & Alerting Ownership: Own end-to-end monitoring and alerting for critical platform components using Datadog, Application Insights, and Grafana. Define and track SLOs/SLIs and drive proactive incident prevention.

Automation & Scripting: Triage and resolve day-to-day DevOps tickets through scripting and automation (PowerShell, Bash, Azure CLI) to reduce operational toil and accelerate engineering velocity.

Infrastructure as Code: Manage and evolve cloud infrastructure using Terraform. Build reusable modules and enforce infrastructure best practices across test and production environments.

On-Call & Incident Response: Cover early morning PST for on-call rotation. Lead incident response, conduct blameless post-mortems, and implement follow-up actions to prevent recurrence.

CI/CD & Release Reliability: Partner with engineering teams to improve CI/CD pipelines in Azure DevOps, reducing deployment risk and streamlining delivery.

Security & Compliance: Support SOC-2 compliant infrastructure design, assist with security hardening, vulnerability management, and regulatory requirements.

Continuous Improvement: Monitor industry trends β€” especially around AI-assisted operations β€” and evaluate new tools to continuously improve infrastructure reliability and performance.

Requirements:
5+ years of experience in a DevOps or SRE role, supporting business-critical, highly available systems.

Strong proficiency in Terraform (IaC) for provisioning and managing cloud infrastructure at scale.

Hands-on experience with Azure Cloud - including computing, networking, storage, and managed services.

Deep expertise in monitoring, alerting, and observability tooling using Datadog, Azure Insights, Azure Application Insights, Grafana, or equivalent.

Proficiency in scripting languages such as PowerShell, Python, or Bash for automation and incident remediation.

Experience with Azure DevOps for CI/CD automation, including Repos, Pipelines, and Releases.

Willingness to participate in early-morning on-call and respond to production incidents as they arise.

Strong communication and collaboration skills, with the ability to thrive in a small, fast-moving team environment.

Our Stack:
Azure Cloud,

Azure DevOps,

Terraform

Ansible

MSSQL

Redis

AKS / Helm

Datadog

Bonus Points:
BS/MS degree in Computer Science or a related field.

Experience with managed SOC platforms and/or endpoint protection tools (e.g., CrowdStrike, Microsoft Defender for Endpoint).

Experience with Kubernetes (AKS) for container orchestration and Helm chart management.

Background in Fintech, Mortgage, or other regulated industries.

Familiarity with MSSQL administration, including replication and single-tenant environments.

Experience with Azure Data Services (e.g., Azure Synapse, Delta Lake, Data Warehousing).

Apply To This Job
Apply Now β†’

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Experienced Data Entry Specialist – Detail-Oriented and Tech-Savvy Professional for Data Management and Analysis at arenaflex

Remote

**Experienced Part-Time Remote Data Entry Clerk – Flexible Hours and Competitive Pay**

Remote

Experienced Part-Time Remote Customer Support Specialist for Marketplace Operations – Delivering Exceptional Service and Driving Customer Satisfaction

Remote

[Hiring] Temporary Contract Analyst @Angi

Remote

Experienced Remote Customer Support Representative – Entry-Level Live Chat Assistant for Global Businesses with Immediate Start and Comprehensive Training

Remote

Firm Administrator

Remote

Dynamic Customer Service Associate – Multi‑Channel Support, Problem Solving & Growth Opportunities at careerzynith

Remote

Temporary Full Stack Data Entry Clerk – Remote Work Opportunity for Detail-Oriented Professionals with Excellent Organizational Skills

Remote

**Experienced Customer Service Representative – Sales Support and Customer Experience Expert**

Remote

Virtual Assistant at Clearpol San Jose, CA

Remote
← Back