[Remote] Sr. Site Reliability Engineer
Note: The job is a remote job and is open to candidates in USA. Tavily is building the infrastructure layer for agentic web interaction at scale, focusing on real-time reasoning in AI systems. They are seeking a Senior Site Reliability Engineer to manage Kubernetes clusters, own infrastructure as code, and maintain CI/CD pipelines while working closely with a small engineering team.ResponsibilitiesManaging Kubernetes clusters across multiple environments and regionsOwning infrastructure as code for all resourcesMaintaining and improving CI/CD pipelines and GitOps-based deploymentsMaintaining and optimize real-time data pipelines that process billions of events per day across distributed queues and stream processorsBuilding out monitoring, alerting, and observabilityDebugging production issues across servicesManaging cloud costs and capacity planningWorking closely with a small engineering team — you'd own infra, not a slice of itSkills5-8 years in a DevOps or SRE role, working in production environmentsProven experience designing and operating large-scale, distributed systems, with a solid understanding of API design, reliability, and performance at scaleStrong Kubernetes experience in a managed cloud environmentProficiency with infrastructure as code (Terraform or similar)Experience with GitOps-based deployment workflowsBuilt or maintained observability stacks (logging, metrics, alerting)Experience handling production incidents calmly and methodicallyMulti-region deploymentsSearch infrastructureData pipeline experience (streaming, warehousing)Proxy/networking infrastructure at scaleBenefitsHealth insurance: 100% company-paid medical, dental, and vision coverage for employees and families.401(k) plan: Up to 4% company match with immediate vesting.Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.Remote work reimbursement: Up to $85/month for mobile and internet.Disability & life insurance : Company-paid short-term, long-term and life insurance coverage.Company OverviewTavily is a search engine for LLMs and RAG that connects AI agents to the real-time web. It was founded in 2024, and is headquartered in New York, New York, USA, with a workforce of 51-200 employees. Its website is https://tavily.com.