[Remote] Senior Data Platform Engineer
Note: The job is a remote job and is open to candidates in USA. Ellipsis Health is creating cutting-edge AI/ML products that solve healthcare staffing issues and administrative burdens using conversation-based software and patented voice biomarker technology. They are seeking an experienced Senior Data Platform Engineer to lead the design and development of a scalable data platform that supports analytics and ML Ops while collaborating with various teams to implement end-to-end pipelines.ResponsibilitiesLead the design, development, and operation of a scalable and secure data platform to support analytics, ML Ops, and business intelligenceCollaborate closely with Data Science, Machine Learning, Application and DevOps teams to implement end-to-end ML Ops pipelinesArchitect and manage data warehousing solutions using Databricks, Dbt, and SparkDevelop and maintain ETL/data pipelines that handle structured and unstructured data across diverse sourcesOptimize data storage, access, and processing for cost-efficiency and performance in GCP and AWS Cloud environmentsBuild and maintain dashboards and analytics solutions using tools such as Sigma, Metabase, and other BI platformsEnsure compliance with data governance, security, and privacy best practices, including HIPAA, SOC-2, and other regulatory requirementsEvaluate and integrate third-party anonymization and security solutions to protect sensitive dataProvide strategic guidance on the evolution of the data platform to meet the company's growth and technical needsDesign and implement scalable infrastructure for Large Language Model (LLM) operations, including training, fine-tuning, and inference workflowsCollaborate with AI/ML teams to build and optimize LLM serving platforms for real-time and batch processingDevelop monitoring and observability solutions for LLMs, ensuring model performance, cost-efficiency, and compliance with ethical AI guidelinesEvaluate and integrate state-of-the-art LLM technologies into existing data platforms to enhance analytics and decision-makingSkillsBachelor's or Master's Degree in Computer Science or equivalent experience5+ years of industry experience in designing and building large-scale data platformsStrong expertise in SQL, Data Modeling, and Data Warehousing (Databricks, Snowflake, Redshift, BigQuery, etc.)Proficiency in writing Advanced SQLs and performance tuningStrong proficiency in Python for building, optimizing, automating and maintaining data pipelines and servicesDeep experience with Apache Spark and distributed data processing frameworksHands-on experience with modern ETL/Orchestration frameworks such as Airflow, dbt, and othersKnowledge of business intelligence tools such as Sigma, Metabase, Tableau, and LookerStrong familiarity with cloud-based infrastructure and managed data services in GCP and AWS CloudExperience with CI/CD pipelines to automate testing, deployment and release of data engineering and analytics workflows using GitLab, GitHub etcExperience with tools like Kubernetes, Terraform, Pubsub, DebeziumExposure building data quality frameworks and automationUnderstanding of data governance, privacy, and regulatory frameworks (HIPAA, SOC-2, HITRUST)Experience working with ML Ops platforms and supporting Data Science teamsExperience with ML Ops tools such as MLflow, Streamlit, and vector databasesFamiliarity with healthcare data standards (FHIR, HL7)Experience in real-time data processing and event-driven architecturesExpertise in implementing data access controls and anonymization techniquesBenefits401(k) matchingHealth, vision, and dental insuranceVery flexible paid time offCompany OverviewAI Nursing Care Manager It was founded in 2017, and is headquartered in San Francisco, California, USA, with a workforce of 11-50 employees. Its website is http://www.ellipsishealth.com.Company H1B SponsorshipEllipsis Health has a track record of offering H1B sponsorships, with 2 in 2026, 6 in 2025, 1 in 2024, 2 in 2023, 1 in 2021. Please note that this does not guarantee sponsorship for this specific role.