[Remote] Senior Platform Engineer
Note: The job is a remote job and is open to candidates in USA. MOXFIVE is building technologies that leverage AI to streamline response, recovery, and resilience from cyber attacks in enterprises. They are seeking a Senior Platform Engineer to enhance the reliability, security, and deployability of their platform, directly impacting their engineering team's efficiency. The role involves owning cloud infrastructure, improving CI/CD pipelines, and ensuring operational readiness while promoting security and developer velocity.ResponsibilitiesOwn and improve the platform foundation that helps a high-velocity engineering team ship safely across cloud infrastructure, Kubernetes, IaC, secrets, networking, access controls, CI/CD, observability, and production guardrailsBuild internal tooling for an AI-enabled engineering workflow, including automation, repo and CI feedback loops, agent-ready development environments, and safeguards that let engineers move quickly without weakening production disciplineStrengthen operational readiness through better logging, metrics, tracing, alerting, runbooks, and incident follow-upHarden production access with least-privilege IAM, secure secret management, auditability, and controlled break-glass pathsSet pragmatic platform standards that help a small team move quickly today while avoiding infrastructure, reliability, and security debt tomorrowSkills5+ years of experience in platform engineering, DevOps, SRE, infrastructure engineering, or backend-adjacent cloud operationsA track record of owning production systems where reliability, security, and developer velocity all matterHands-on experience with cloud infrastructure, Kubernetes, infrastructure-as-code, CI/CD, secrets management, access controls, and observabilityExperience building internal developer tooling, platform automation, or AI-assisted development workflowsComfort designing safe release processes with deployment gates, smoke tests, rollback paths, and clear ownershipPractical experience supporting relational databases and production data changesA security-minded approach to infrastructure, including least privilege, auditability, secret handling, and controlled production accessClear written communication for runbooks, deployment notes, incident follow-ups, and engineering decisionsFamiliarity with agent harness design, agent sandboxing, including tool access, environment setup, state management, permissions, and production guardrailsExperience managing production model inference across hosted providers such as Together AI or Fireworks.ai, GPU platforms such as RunPod or Lambda Cloud, Modal, or similar, or self-hosted serving stacks, including the tradeoffs between hosted APIs, dedicated deployments, serverless GPUs, and self-hosted inference stacksBenefitsOffers BonusCompany OverviewMOXFIVE specializes in the technical advisory service to minimize the business impact of cyberattacks. It was founded in 2019, and is headquartered in Mclean, Virginia, USA, with a workforce of 51-200 employees. Its website is https://www.moxfive.com.