Senior Reliability Engineer

Remote Full-time
Hive is a fast-growing SaaS company offering marketing solutions to live event promoters across North America. Our Engineering Team builds and maintains the systems that empower our customers to do powerful things simply and intuitively. We operate with agility: shipping minimum viable products, deploying multiple times daily, and rapidly iterating based on customer feedback.

At Hive, we handle impressive technical scale: ingesting high-volume data in real-time from 20+ integrations (including Ticketmaster and Eventbrite), storing and querying billions of customer data points, and delivering over 200 million emails and SMS messages monthly to our clients' customers. Our technology stack includes Python, React, Redis, MongoDB, SQL, Elasticsearch, Clickhouse, and various AWS services.

As we continue to scale, we're seeking a Senior Reliability Engineer to join our Reliability Team, the foundation that enables our product and engineering teams to deliver exceptional experiences while maintaining system performance, security, and cost efficiency.

The Role

As a Senior Reliability Engineer at Hive, you'll be part of a team responsible for the performance, reliability, and maintainability of our systems. This role bridges infrastructure, operations, and application engineering to ensure our services are scalable, performant, secure, and cost-effective as we tackle increasingly complex technical challenges.

Tech Stack

AWS, Docker, Kubernetes, Karpenter, Terraform, Python, Django, Redis, MySQL, Clickhouse, MongoDB, Elasticsearch, DataDog, Sentry

What You'll Do

- Champion system observability improvements through implementation, maintenance, process refinement, and automation for business-critical services
- Drive SLO adoption and improvement to ensure excellent customer satisfaction across key value streams
- Enhance application performance at every level, from infrastructure foundations to runtime environments
- Tackle and resolve complex technical challenges across the entire stack
- Partner with development teams to design and implement scalable, reliable solutions
- Lead security and compliance initiatives as integral components of our engineering practice
- Craft and refine developer tools that boost team productivity and efficiency
- Develop and implement strategies to optimize cloud infrastructure costs
- Collaborate with DevOps to maintain and enhance deployment pipelines in our cloud environments
- Contribute to incident management by defining meaningful metrics, executing against targets, and improving response times and overall system stability

What We're Looking For

- 7+ years of software engineering experience, with at least 5 years focused on reliability, infrastructure, or platform engineering
- 3+ years of experience with AWS and proven ability to build effective monitoring, alerting, and observability solutions
- Track record of implementing, maintaining, and improving SLOs and uptime KPIs for critical services
- Expert knowledge of Linux, Docker, and distributed systems principles with their real-world applications
- Solid programming skills in both application and infrastructure languages (Python, Go, etc.)
- Strong grasp of security best practices and a data-driven approach to enhancing stability and availability
- Excellent communication skills with the ability to collaborate effectively across teams and explain complex technical concepts clearly

Bonus points if you have...

- Proven experience scaling complex AWS environments and optimizing performance across the full technology stack during periods of significant growth
- Experience creating developer platforms and CI/CD pipelines that enhance team productivity
- Skillful approach to cloud cost optimization and resource management
- Experience in establishing and improving incident management processes

What We Offer

- Meaningful salary and equity. You're rewarded based on impact
- Work fully remote from the comfort of your home in Canada
- Opportunity to shape reliability practices at a rapidly scaling company
- Collaborative team of experienced engineers passionate about building reliable systems
- Flexible work hours with minimal meetings
- Health & Dental coverage
- Open vacation/PTO policy so you can be happy and healthy!
- Generous parental leave top-up with a flexible return-to-work plan

About Hive

Hive.co is a marketing platform for event marketers. We help brands personalize and automate their campaigns, using email and SMS, to empower them to sell out so they can focus on making their events unforgettable.

By integrating with ticketing partners like Ticketmaster and e-commerce partners like Shopify, we enable brands to access and act on all their customer data, so they can easily segment their list in thousands of ways, and send more customized, timely email campaigns that land in inboxes.

We started our company inside a University of Waterloo computer lab in early 2014, graduated from Y Combinator that summer (S14 batch) and have been growing ever since. Originally based in Kitchener, our team is now 100% remote and located all across Canada! We strive to provide an online work environment that allows team members to have a strong work-life balance while still feeling connected to their team and Hive’s mission.

To learn more about our team, check out our About Us page on our website: https://www.hive.co/company

