[Remote] Senior Cloud Platform Engineer
Note: This is a remote position open to candidates in the USA.

Waites is a leading provider of wireless IoT sensors for industrial environments, focused on empowering plants to conduct predictive maintenance. The Senior Cloud Platform Engineer will be responsible for designing and maintaining scalable cloud infrastructure, ensuring the reliability and security of cloud systems, and collaborating with various engineering teams to enhance platform capabilities.

Responsibilities
- Design, implement, and maintain scalable cloud infrastructure, primarily in AWS
- Architect and operate highly reliable APIs, web services, and data ingestion pipelines
- Lead infrastructure standardization using Infrastructure as Code (Terraform and related tooling)
- Design and optimize event-driven and queue-based systems for high-volume telemetry and distributed processing workloads
- Develop scalable and secure serverless and containerized architectures where appropriate
- Establish infrastructure best practices for reliability, observability, disaster recovery, security, cost optimization, and deployment automation
- Support compliance initiatives, including SOC2 and GDPR requirements
- Design and maintain secure networking architectures, IAM strategies, and secrets management practices
- Collaborate with firmware, backend, AI, and product engineering teams to support platform scalability and operational excellence
- Evaluate and recommend appropriate database technologies based on workload requirements and tradeoffs
- Improve CI/CD pipelines and engineering workflows
- Participate in operational troubleshooting and incident response for production systems
- Carry out additional responsibilities as needed to support business objectives

Skills
- 10+ years of professional software and infrastructure engineering experience
- Deep hands-on AWS experience operating production systems at scale
- Strong experience designing and maintaining cloud-native infrastructure
- Significant experience with Infrastructure as Code using Terraform
- Strong understanding of distributed systems, asynchronous processing, and event-driven architectures
- Experience building and operating data ingestion pipelines using queues, streams, and serverless services
- Strong understanding of networking, security, IAM, DNS, TLS, and API architecture
- Experience supporting SOC2-compliant environments and familiarity with GDPR requirements
- Experience operating and troubleshooting production APIs and web applications
- Strong understanding of database technologies and architectural tradeoffs, including relational databases, time-series databases, object storage systems, and caching systems
- Strong understanding of monitoring, logging, observability, and operational tooling
- Experience designing resilient and fault-tolerant systems
- Excellent communication and technical leadership skills
- Experience with Azure infrastructure and services
- Experience with high-scale IoT or telemetry platforms
- Experience with Kubernetes and container orchestration
- Experience with streaming and messaging technologies such as SQS, Kafka, Kinesis, RabbitMQ, or MQTT
- Experience with PostgreSQL, MySQL, DynamoDB, Redis, TimescaleDB, or similar systems
- Familiarity with edge computing or hybrid cloud architectures

Benefits
- An attractive base salary with a strong history of profit sharing
- Many opportunities for professional advancement and development

Company Overview
Waites works in the field of condition monitoring. It was founded in 2006 and is headquartered in Covington, Kentucky, USA, with a workforce of 201-500 employees. Its website is https://waites.net.