[Remote] Site Reliability Engineer
Note: The job is a remote job and is open to candidates in USA. Talener is a fast-growing healthcare technology organization seeking a Site Reliability Engineer (SRE) to help scale and support a high-impact cloud platform focused on improving healthcare delivery nationwide. This role is critical for strengthening platform reliability, operational efficiency, observability, and automation across production environments.ResponsibilitiesEnsure the reliability, scalability, performance, and security of cloud-based infrastructure and applicationsMonitor, troubleshoot, and resolve production platform and application issues across distributed systemsLead incident response efforts, root cause analysis, and blameless post-mortemsBuild and maintain operational runbooks and automated remediation workflowsDevelop and enhance observability and telemetry solutions for proactive monitoring and alertingCollaborate closely with engineering, DevOps, QA, security, and operations teams to improve platform health and deployment processesSupport infrastructure automation and configuration management initiativesContribute to infrastructure-as-code (IaC) practices and CI/CD operational improvementsPromote best practices around reliability engineering, incident management, and operational excellenceParticipate in an on-call rotation supporting production systems, including occasional off-hours support for West Coast operationsSkills5+ years of experience in Site Reliability Engineering, DevOps, Cloud Infrastructure, or related disciplinesStrong experience troubleshooting and supporting production environmentsHands-on experience with observability and monitoring platforms such as Datadog, New Relic, or similar toolsExperience working within Azure-based cloud environments and modern containerized infrastructureKnowledge of Docker, Kubernetes, and cloud-native application hosting environmentsExperience with infrastructure-as-code tools such as Terraform, Terragrunt, or OpenTofuStrong scripting and automation experience using PowerShell, Python, JavaScript, or similar languagesExperience with source control and CI/CD tooling (Git, Azure DevOps, etc.)Understanding of cloud security principles, compliance frameworks, and operational best practicesStrong collaboration and communication skills within Agile engineering environmentsExperience improving operational visibility through telemetry, dashboards, reports, and alerting systemsExperience evolving incident response processes and operational toolingPassion for mentoring others and promoting operational excellence across teamsStrong problem-solving mindset with a focus on continuous improvement and automationBenefitsOpportunity to work on mission-driven technology with meaningful real-world impactCollaborative engineering culture focused on innovation, reliability, and continuous learningFlexible environment that supports work-life balance while maintaining operational excellenceCompany OverviewTalener is a staffing firm dedicated to finding great opportunities for technology professionals. It was founded in 2007, and is headquartered in New York, New York, USA, with a workforce of 11-50 employees. Its website is http://www.talener.com.