[Remote] Senior AIOps Engineer, Incident Response [Remote-US]
Note: The job is a remote job and is open to candidates in USA. Quanata is an insurance technology innovation company focused on creating context-based insurance solutions. They are seeking a Senior AIOps Engineer to lead production health, incident response, and operational reliability while collaborating with engineering and AI orchestration teams to enhance scalability and issue resolution.ResponsibilitiesOwn production health, reliability, and operational support processes across critical systems and servicesLead incident response efforts, stakeholder communication, root cause analysis, and post-incident reviewsIdentify patterns in production issues and drive improvements to reduce recurring incidents and operational overheadDesign and implement AI-driven agents and workflows that automate support and operational tasksPartner with engineering, product, and AI orchestration teams to improve system resilience and operational efficiencyBuild and maintain operational runbooks, documentation, and knowledge base content for both human and AI-assisted workflowsSupport observability, monitoring, and troubleshooting efforts across cloud-based production environmentsParticipate in on-call rotations and continuously improve operational readiness and response processesSkills6–8 years of experience in production operations, site reliability engineering, technical support engineering, or similar operational rolesStrong background in incident management, root cause analysis, and production system troubleshootingExperience working within modern SDLC, DevOps, and change management environmentsFamiliarity with operational tooling such as Jira, Confluence, and observability/monitoring platformsStrong analytical and problem-solving skills with the ability to identify trends and drive operational improvementsComfortable working cross-functionally with engineering, product, operations, and leadership teamsStrong communication skills and ability to operate effectively in fast-moving technical environmentsBachelor's degree in Computer Science, Engineering, or equivalent relevant experienceExperience building or working with AI/LLM-powered systems, intelligent agents, or workflow automation toolsFamiliarity with cloud platforms such as AWS and modern observability ecosystemsExperience with event-driven architectures, orchestration frameworks, or operational automation platformsBackground leading operational transformation or reliability improvement initiativesPassion for AI-native operations, automation, and improving developer/support experiencesBenefitsMedical, dental, vision, life insurance and supplemental income plans for you and your dependentsA Headspace app subscriptionMonthly wellness allowanceA 401(k) Plan with a company matchA one-time payment of $2K will be provided to cover the purchase of in-home office equipment and furniture at your discretionMacBook Pros, which we will deliver to you fully provisioned prior to your first dayAll employees accrue four weeks of PTO in their first year of employmentNew parents receive twelve weeks of fully paid parental leave which may be taken within one year after the birth and/or adoption of a childThe twelve weeks is applicable to both birthing and non-birthing parentAll employees receive up to $5000 each year for professional learning, continuing education and career developmentAll team members also receive LinkedIn Learning subscriptions and access to multiple different coaching opportunities through BetterUpCompany OverviewQuanata offers context-based insurance solutions with risk prediction and mitigation, backed by State Farm. It was founded in 2016, and is headquartered in San Francisco, California, USA, with a workforce of 201-500 employees. Its website is https://www.quanata.com.