[Remote] Platform Engineer
Note: The job is a remote job and is open to candidates in USA. Harrison Clarke is an early-stage company building advanced AI systems, and they are seeking a senior platform engineer to take ownership of their core platform. The role involves managing multi-region Kubernetes clusters, GPU orchestration, and ensuring infrastructure security while partnering closely with machine learning engineers.ResponsibilitiesDesign and manage multi-region Kubernetes clusters across cloud and GPU-focused providers using infrastructure-as-codeOwn the deployment lifecycle through GitOps practices (Helm, Kustomize, automated releases, continuous delivery)Manage GPU infrastructure, including scheduling efficiency, workload placement, and cold-start optimizationOversee networking systems such as ingress, gateways, load balancing, and cross-region connectivityBuild and maintain observability across metrics, logs, traces, and performance profilingEnsure infrastructure security across identity, secrets, and encryptionMaintain CI/CD workflows supporting a monorepo of services and deployment artifactsPartner closely with ML engineers to optimize model serving and GPU utilizationSkillsStrong experience operating Kubernetes in production environments, including troubleshooting, autoscaling, and upgradesProven background with infrastructure-as-code tools (e.g., Terraform, Pulumi)Hands-on experience running GPU workloads on Kubernetes and understanding resource optimizationFamiliarity with GitOps tooling such as ArgoCD or Flux, and Helm-based deploymentsExperience with in-memory data systems (e.g., Redis) and distributed architecturesSolid understanding of observability tooling and practicesStrong networking fundamentals, particularly in low-latency or distributed systemsExperience working in environments with broad ownership across infrastructureExposure to GPU cloud providers beyond major hyperscalersExperience with real-time or streaming infrastructureProficiency in Go or PythonFamiliarity with ML model deployment and optimizationExperience managing infrastructure cost, particularly for GPU-heavy workloadsCompany OverviewHarrison Clarke is the Leading Staffing & Recruiting Firm in XOps & Cybersecurity. It was founded in 2016, and is headquartered in New York, New York, USA, with a workforce of 11-50 employees. Its website is https://www.harrisonclarke.com/.