[Remote] Senior Platform Engineer
Note: The job is a remote job and is open to candidates in USA. Profitmind is a retail analytics SaaS company that transforms competitive and customer data into actionable insights for retailers. They are seeking a Senior Platform Engineer to design and operate their multi-cloud infrastructure, implement automation and security tooling, and ensure the reliability of their platform as they expand into new regions.ResponsibilitiesOwn infrastructure as code. Design and operate our cloud infrastructure as Terraform across Azure, AWS, and GCP β landing zones, networking, and IAM β including bringing resources currently managed by partners and templates under our own automated, version-controlled ownershipRun GitOps delivery. Build and maintain CI/CD and GitOps pipelines (e.g., Argo, GitHub Actions / Azure DevOps) that deploy our services into non-prod and prod environments reliably and repeatablyOwn the release process. Define and run a disciplined SDLC and release process β versioning and tagging releases, promoting builds through environments, and tracking what is deployed where. Maintain traceability from commit to build to deployment (and to the Jira work behind it) so every change is auditable, including for SOC 2 and customer due diligenceOperate Kubernetes. Manage our Kubernetes / AKS environments β cluster setup, autoscaling, workload deployment, and the operators and platform services that run on themEnable EU and multi-region. Stand up EU-resident infrastructure and additional cloud regions, ensuring data residency, networking, and access controls are correct by designDrive SOC 2 and security. Implement and maintain org-wide security controls, vulnerability scanning, and access governance across our clouds, GitHub, and Kubernetes β and produce the evidence and discipline that SOC 2 and enterprise customer due diligence requireBuild platform tooling. Develop platform services and developer tooling (in Python or similar) that make the rest of engineering faster β self-service environments, internal automation, cost monitoring, and reusable building blocksSupport the data platform. Partner with data engineering on the infrastructure under our data pipelines and customer-cloud deployments β from competitive and customer data flows to customer ingestion paths (SFTP and direct warehouse reads), so the platform is reliable and portable across customer environmentsBuild secure integrations. Help evolve how we connect to customer and third-party systems β moving beyond todayβs SFTP file exchange toward more direct, flexible integrations (both push and pull) such as direct warehouse reads. Build the authentication, authorization, secrets, and key management that make those connections secure by defaultOwn observability and cost. Run our observability stack (metrics, logs, traces, alerting) and monitor cloud spend and egress across clouds, driving optimizationUnblock the team. Treat the engineering teams as your customers. Own access, permissions, and environment requests end to end and turn them around quickly β when someone needs cloud access or a resource to do their job, removing that blocker promptly is part of the role, not a side taskReduce single-points-of-knowledge. Document what you build and spread ownership. We are deliberately building a fungible, well-documented platform function, not concentrating critical knowledge in one personSkillsStrong software engineering. You write production-quality software β platform services, automation, and tooling in Python, Go or similar β and treat infrastructure as code you engineer, not just configureMulti-cloud infrastructure depth. Hands-on experience architecting and operating infrastructure as code with Terraform across more than one major cloud (Azure, AWS, and/or GCP), including per-provider IAM and account/environment segmentationGitOps and Kubernetes. Strong, production experience with Kubernetes and GitOps-based delivery (ArgoCD or equivalent), CI/CD pipelines, and deploying real services into real environmentsRelease and SDLC discipline. Experience defining release processes β versioning, tagging, environment promotion, and tracking what is deployed where β with traceability from change to deploymentSecurity and compliance instinct. Experience implementing security controls and supporting compliance efforts (SOC 2, ISO 27001, or similar), including the evidence and process discipline audits requireAuthentication and secure connectivity. Working knowledge of authentication and authorization (OAuth / OIDC, SSO, IdP federation), secrets and key management, and securing system-to-system integrationsObservability. Experience operating modern observability tooling (metrics, logs, traces, and alerting β e.g., OpenTelemetry, Prometheus, Grafana, Datadog, or similar) and using it to drive reliabilityOwnership and a service mindset. Self-directed, comfortable with ambiguity, and responsive to the people who depend on youExperience standing up EU data residency / GDPR-relevant infrastructureMLOps or model-serving infrastructure experience, supporting an ML or data science teamAI / LLM infrastructure experienceCloud cost optimization (FinOps) track recordService mesh or custom Kubernetes operator experienceCompany OverviewProfitmind enables retailers to auto-identify competitors across the internet and track their assortments, and optimize prices in real time. It was founded in 2022, and is headquartered in Pittsburgh, Pennsylvania, USA, with a workforce of 11-50 employees. Its website is https://www.profitmind.com/.