[Remote] Staff Machine Learning Engineer
Note: The job is a remote job and is open to candidates in USA. Monogram Health is a leading multispecialty provider of in-home, evidence-based care for complex patients with multiple chronic conditions. They are seeking a Staff Engineer in Machine Learning Operations to architect and scale machine learning infrastructure while mentoring teams and driving strategic decisions that impact patient outcomes.ResponsibilitiesArchitect and maintain enterprise-grade ML infrastructure, including model versioning, automated testing frameworks, containerization strategies, CI/CD pipelines, and comprehensive monitoring systems for model performance, data quality, and drift detectionDrive MLOps strategy and standards across the organization. Mentor data scientists and engineers on production best practices, system design, and scalable architecture patternsOwn the complete journey from model development through production deployment, including real-time and batch inference systems, A/B testing frameworks, and automated retraining pipelinesCollaborate with clinical leaders, product teams, and data scientists to translate complex healthcare requirements into robust, scalable ML solutions. Present technical strategies to executive stakeholdersBuild fault-tolerant, compliant systems that meet healthcare security and privacy standards. Establish SLAs, incident response protocols, and disaster recovery procedures for mission-critical ML servicesEvaluate and integrate cutting-edge MLOps tools and practices. Design systems that scale with Monogram's growth while reducing operational overhead and improving model iteration velocitySkillsBachelor's degree in computer science, engineering, or related field required; master's degree preferredMinimum of ten (10) years in software engineering with five (5) years focused on ML infrastructure, MLOps, or production ML systems and Python development with strong software engineering fundamentals and three (3) years architecting and deploying production ML systems on cloud platforms (Azure preferred)Proven track record building and scaling ML platforms from the ground upExpert-level proficiency with MLOps tooling (MLflow, Kubeflow, SageMaker, Azure ML, etc.)Deep experience with containerization (Docker, Kubernetes), orchestration tools (Airflow, Prefect), and infrastructure-as-code (Terraform, ARM templates)Advanced knowledge of CI/CD systems, automated testing strategies, and GitOps workflowsData engineering skills: SQL, Spark/PySpark, Databricks, data pipeline optimizationExpertise in model monitoring, observability, feature stores, and experiment tracking at scaleProduction experience with both batch and real-time inference architecturesDemonstrated ability to influence technical direction and mentor senior engineersProven communication skills with ability to distill complex technical concepts for diverse audiencesTrack record of driving consensus on architectural decisions across multiple stakeholdersSystems thinking skills with focus on reliability, scalability, and maintainabilityHealthcare or regulated industry experience strongly preferredUnderstanding of healthcare data standards (FHIR, HL7, claims data) is a plusUnderstanding of security, compliance, and privacy requirements in healthcare (HIPAA) preferredBias toward action with pragmatic approach to technical debt and iterative improvement preferredBenefitsMedical, dental, and vision insuranceEmployee assistance programEmployer-paid and voluntary life insuranceDisability insuranceHealth and flexible spending accountsCompetitive compensation401k with employer matchFinancial wellness resourcesPaid holidaysFlexible vacation time/PSSLPaid parental leaveWork life assistance resourcesPhysical wellness perksMental health supportEmployee referral programBenefitHub for employee discountsCompany OverviewMonogram Health is a specialty provider of evidence-based in-home care and benefit management services for polychronic patients. It was founded in 2019, and is headquartered in Brentwood, Tennessee, USA, with a workforce of 1001-5000 employees. Its website is https://www.monogramhealth.com.