[Remote] Senior Machine Learning Engineer
Note: This is a remote position open to candidates in the USA.

Anno.ai is a mission-focused defense technology startup dedicated to accelerating the safe and effective development of next-generation autonomous systems. As a Senior Machine Learning Engineer, you will design, develop, test, document, deploy, and maintain production machine learning software to automate processes and streamline mission operations.

Responsibilities
- Operationalize machine learning models by building and maintaining robust, scalable pipelines for training, evaluation, deployment, and lifecycle management across cloud, on-prem, and edge compute environments
- Work closely with autonomy researchers, software engineers, systems teams, and field operators to translate mission requirements into deployable ML capabilities
- Implement automated CI/CD workflows tailored to ML systems, ensuring repeatable experiments, reliable packaging, and continuous delivery of both up-to-date models and associated data pipelines
- Manage ML runtime infrastructure using containerization and orchestration frameworks (e.g., Docker, Kubernetes) and incorporate model-serving platforms (e.g., Seldon, KServe, BentoML)
- Develop monitoring systems to track model health, performance, data drift, system utilization, and mission relevance using tools such as Prometheus, Grafana, and ELK/EFK stacks
- Ensure ML deployments meet defense, customer, and platform security requirements, with emphasis on data integrity, traceability, and operational reliability
- Evaluate and integrate emerging MLOps, distributed training, and edge inference technologies to enhance reproducibility, extensibility, scalability, and deployment speed of ML systems

Skills
- Bachelor's degree in Computer Science, Electrical Engineering, Data Science, or a related technical field (Master's preferred)
- 5+ years of professional experience in software engineering, machine learning engineering, MLOps, or related roles
- Experience operationalizing ML systems at production scale, including model training, versioning, packaging, deployment, and monitoring
- Strong proficiency in Python and familiarity with at least one deep learning framework (e.g., PyTorch, TensorFlow)
- Hands-on experience with MLOps frameworks and workflow tooling (e.g., MLflow, Kubeflow, Airflow, DVC, BentoML)
- Experience deploying containerized ML services using Docker and orchestrating workloads using Kubernetes (including air-gapped or constrained deployments)
- Understanding of CI/CD workflows and DevOps practices applied to ML systems (e.g., Git, code review, metrics evaluation)
- Familiarity with monitoring, observability, and logging platforms (e.g., Prometheus, Grafana, ELK/EFK)
- Ability to obtain and maintain U.S. Government security clearance (U.S. citizenship required)
- Ability to travel up to 20%
- Experience deploying models and associated runtimes to edge devices
- Experience optimizing models for memory- and CPU-constrained systems (e.g., embedded systems, microcontrollers)
- Prior experience supporting U.S. Department of War programs, cUAS systems, or mission-critical autonomous platforms
- Experience working with diverse or atypical data sources (e.g., audio/acoustics, RF signals, EO/IR imagery)
- Experience deploying and optimizing ML inference on edge or resource-limited compute systems
- Experience with explainable/auditable AI/ML tools and interpretable model design
- Experience with AI software development tools (e.g., GitHub Copilot, Claude)

Benefits
- Competitive salary
- Equity
- Comprehensive benefits package
- 401(k) with a 5% company match
- Paid holidays and generous paid time off
- Paid leave programs
- Patent bonus program
- Employee referral bonus program
- Learning and development program
- Opportunity to work with a team of highly skilled, creative, and motivated colleagues

Company Overview
Bringing online retail metrics, insights, and visibility you care about into your brick and mortar locations.
It was founded in 2019, and is headquartered in Minnetonka, Minnesota, USA, with a workforce of 11-50 employees. Its website is https://www.onsights.io.