[Remote] Senior Platform Engineer
Note: The job is a remote job and is open to candidates in USA. Verve is a technology company focused on creating efficient and privacy-focused advertising solutions. They are seeking a Senior Platform Engineer to take ownership of their growing infrastructure, optimizing Kubernetes clusters and implementing CI/CD pipelines while mentoring engineers and ensuring platform reliability.ResponsibilitiesOwn and optimize Kubernetes clusters (GKE) at scale, including networking, autoscaling, and cost efficiencyLead the design and implementation of infrastructure provisioning and configuration using Infrastructure-as-Code (Terraform), establishing reusable patterns and enforcing standards across teamsArchitect, implement, and continuously improve CI/CD pipelines (GitHub Actions, ArgoCD) with a focus on reliability, security, and developer experienceDesign and maintain Helm charts for applications and services across multiple environmentsDrive platform reliability — owning scalability, availability, and disaster recovery strategy (load balancing, auto-scaling, multi-region resilience)Define and implement observability standards across metrics, logging, and tracing (Grafana, New Relic, OpenTelemetry)Serve as a technical lead during incidents on primary revenue-generating applications, owning resolution and contributing to post-mortemsMentor engineers across the team and act as a trusted advisor to development teams on infrastructure best practices, security standards, and platform adoptionContribute to the team’s on-call rotation with a proactive, systems-thinking approachSkillsOwn and optimize Kubernetes clusters (GKE) at scale, including networking, autoscaling, and cost efficiencyLead the design and implementation of infrastructure provisioning and configuration using Infrastructure-as-Code (Terraform), establishing reusable patterns and enforcing standards across teamsArchitect, implement, and continuously improve CI/CD pipelines (GitHub Actions, ArgoCD) with a focus on reliability, security, and developer experienceDesign and maintain Helm charts for applications and services across multiple environmentsDrive platform reliability — owning scalability, availability, and disaster recovery strategy (load balancing, auto-scaling, multi-region resilience)Define and implement observability standards across metrics, logging, and tracing (Grafana, New Relic, OpenTelemetry)Serve as a technical lead during incidents on primary revenue-generating applications, owning resolution and contributing to post-mortemsMentor engineers across the team and act as a trusted advisor to development teams on infrastructure best practices, security standards, and platform adoptionContribute to the team's on-call rotation with a proactive, systems-thinking approach7+ years of relevant platform, infrastructure, or DevOps engineering experienceDeep hands-on expertise with Kubernetes in production environments (GKE)Strong Terraform skills — module authoring and IaC architecture, not just usageExperience with GitOps workflows and CI/CD pipeline design at scaleSolid understanding of cloud networking, IAM, and security across GCPDemonstrated ability to work independently on complex, ambiguous problemsExperience mentoring engineers and influencing platform cultureBenefitsCompetitive salaryHealth, dental, and vision insurance, plus mental health resources401(k) match and generous PTOHybrid work environment (NYC office)Free lunch for onsite team members in NYCVolunteer OpportunitiesOpportunities for professional development in a high-growth ad tech companyCompany OverviewVerve’s omnichannel ad platform connects advertisers, agencies, brands, and publishers to people in real time. It is a sub-organization of Media and Games Invest. It was founded in 2005, and is headquartered in New York, New York, USA, with a workforce of 501-1000 employees. Its website is https://www.verve.com/.