[Remote] Data Engineer, AQNav
Note: The job is a remote job and is open to candidates in USA. SandboxAQ is a high-growth company delivering AI solutions that address some of the world's greatest challenges. The AQNav team is looking for a highly-accomplished Data Engineer to help build infrastructure that empowers the team with data and accelerates their models.ResponsibilitiesData Pipeline Development & Maintenance: Work across a mixed-maturity pipeline environmentData Modeling: Build and optimize data models that serve a diverse set of consumers. You'll make the data accessible and trustworthy, not just availableSimulation Data Integration: Work within the in-house simulation suite to add data-capturing capabilities and ensure simulation outputs feed cleanly into downstream pipelines alongside real-world field dataData Quality & Observability: Instrument pipelines with quality checks, anomaly detection, and alerting so issues surface earlyCross-Functional Data Support: Translate ambiguous asks into well-defined requirements, repeatable datasets and lightweight Dashboards that the team can use independently going forwardData Platform Infrastructure Contribution: Improve the features and reliability of our internal data platform over timeDocumentation: Own the technical documentation for pipelines, data models, and schemas you touch. In a team this cross-functional, good documentation is a force multiplierSkillsUS citizenship (required for working with CUI data)3+ years of industry experience as a Data Engineer in a startup or fast-moving environmentStrong proficiency in Python and SQL, with hands-on experience building production-grade data solutionsExperience designing and maintaining data pipelines and data models/warehouses that process large, structured scientific or engineering datasetsHands-on experience building on AWS (e.g., S3, ECS, Lambda, IAM) combined with CI/CD and containerization (e.g., GitHub Actions or CircleCI, Docker) to automate, deploy, and maintain data and ML workloads in the cloudPractical MLOps experience: setting up and operating MLOps frameworks (e.g., MLFlow, DVC)A Master's or Ph.D. in a specialized technical field like computer science, data science, mathematics, etcExperience working with sensor data (100-1KHz range)Ability to build interactive dashboards in Hex or similarExperience working with standard ML libraries like PyTorch, scikit-learn and basic supervised/ unsupervised learning techniquesBenefitsPerformance-based incentives or bonuses (where applicable)Equity participationComprehensive medical, dental, and vision coverage for employees and dependents with generous employer premium contributionsRetirement savings with company matchingPaid parental leaveInclusive family-building benefitsFlexible paid time offCompany-wide seasonal breaksSupport for flexible work arrangements that enable sustainable performanceOpportunities for continuous learning and growth through on-the-job development, cross-functional collaboration, and access to internal learning and development programsCompany OverviewSandboxAQ develops AI and quantum technology solutions that enhance biopharma, cybersecurity, and materials science. It was founded in 2016, and is headquartered in Palo Alto, California, USA, with a workforce of 51-200 employees. Its website is https://www.sandboxaq.com.