[Remote] Senior Site Reliability Engineer
Note: This is a remote job open to candidates in the USA.

Lean Tech is a rapidly expanding organization in the technology services sector, seeking a highly experienced Senior Site Reliability Engineer. The role focuses on evolving the reliability, security, observability, and operational maturity of their cloud platform, leveraging AI tools and practices to enhance operational efficiency.

Responsibilities
- Own and evolve the reliability, security, observability, and operational maturity of our cloud platform
- Use AI tools and agentic workflows to automate infrastructure and SRE tasks
- Manage production infrastructure for SaaS platforms, including senior AWS ownership
- Lead production incidents and drive root-cause analysis, creating remediation plans
- Ensure compliance with security best practices and maintain compliance controls

Skills
- Expert use of AI tools and agentic workflows to automate infrastructure and SRE tasks
- Hands-on experience using AI for Terraform development, incident triage, log analysis, runbook creation, postmortems, operational automation, CI/CD pipeline generation, and reducing repetitive operational work
- Strong understanding of AI capabilities, limitations, and necessary validation processes
- Ability to clearly articulate AI workflows, tooling choices, operational safeguards, and production outcomes
- 10+ years managing production infrastructure for SaaS platforms, including 5+ years of senior AWS ownership
- Deep expertise with AWS services such as ECS, VPC, IAM, RDS, S3, CloudFront, Route53, Lambda, API Gateway, CloudWatch, Secrets Manager, and related security and governance services
- Advanced Terraform experience managing multi-account environments, infrastructure state, drift remediation, and dependency management
- Advanced Terraform experience managing multi-account, multi-workspace infrastructure
- Strong understanding of provider versioning, state management, drift detection and remediation, dependency management, and infrastructure blast radius analysis
- Proven experience resolving production infrastructure drift safely
- Significant experience leading production incidents as the accountable owner
- Ability to operate calmly and effectively during high-severity outages
- Proven experience authoring detailed postmortems and operational remediation plans
- Strong understanding of operational risk management and production recovery procedures
- Proven experience leading production incidents, driving root-cause analysis, and creating remediation plans
- Strong background in observability, monitoring, logging, distributed tracing, and alerting using tools such as Grafana
- Experience owning CI/CD pipelines, deployment strategies, infrastructure automation, and operational workflows
- Strong Linux administration, containerization (Docker), networking, and scripting skills
- Experience with security best practices, identity management (SAML, OIDC, SCIM), and compliance frameworks such as SOC 2, ISO 27001, HIPAA, or PCI
- Comfortable working directly with auditors and maintaining compliance controls
- Experience supporting Spring Boot or JVM-based systems in production
- Experience with runtime security or EDR tooling such as Falco
- Experience automating joiner/mover/leaver identity workflows using SCIM and IdP tooling
- AWS certifications including AWS Solutions Architect Professional, AWS DevOps Engineer Professional, and AWS Security Specialty
- Ability to read and debug Kotlin or Java backend services from an SRE perspective
- React/NodeJS/Backstage developer experience
- MuleSoft API Management experience

Benefits
- Professional development opportunities with international customers
- Collaborative work environment
- Career path and mentorship programs that will lead to new levels

Company Overview
Global Technology Services (GTS) is the technology solution of Lean Solutions Group, helping companies scale faster through AI-driven automation, software development, and tech-powered talent.
It was founded in 2019 and is headquartered in Medellín, Antioquia, Colombia, with a workforce of 1001-5000 employees. Its website is https://www.lean-tech.io/.

Company H1B Sponsorship
Lean Tech has a track record of offering H1B sponsorships, with 1 in 2023 and 1 in 2022. Please note that this does not guarantee sponsorship for this specific role.