[Remote] Senior Data Engineer
Note: This is a remote position open to candidates in the USA.

Healthy Together is a fast-growing GovTech & healthcare platform seeking an experienced Senior Data Engineer. The role involves designing, building, and maintaining data pipelines and integrations to support analytics, reporting, and operational workflows while ensuring compliance with data governance standards.

Responsibilities
- Architect, develop, and operate scalable data pipelines in Python using frameworks such as Apache Airflow, AWS Glue, or similar
- Ingest and transform data from internal sources, microservices, and third-party APIs (REST, streaming, webhooks)
- Design and maintain dimensional and normalized schemas in cloud data warehouses (AWS Redshift, Snowflake, or equivalent)
- Optimize table structures, partitioning, and indexing for performance and cost efficiency
- Build and manage robust, fault-tolerant integrations with external systems (payment gateways, identity providers, data vendors)
- Develop monitoring, retries, and alerting to ensure integration reliability
- Implement data validation, anomaly detection, and reconciliation processes to guarantee accuracy
- Collaborate with Security and Compliance teams to enforce data governance, encryption, and access controls
- Partner with Analytics, ML, and Product teams to translate requirements into data solutions
- Provide self-service data access (views, dashboards) and documentation for stakeholders
- Monitor and tune pipeline and warehouse performance; identify opportunities to reduce AWS spend
- Introduce caching, batching, and parallelism as appropriate for large-scale workloads
- Evaluate and prototype emerging data technologies (Spark, Kafka, dbt, data mesh patterns)
- Leverage AI/ML tools to automate repetitive data tasks or anomaly detection

Skills
- 7+ years in data engineering or analytics engineering roles
- Expert-level Python for ETL scripting, API clients, and automation
- Hands-on experience with AWS data services (S3, Glue, Redshift, EMR, Lambda) and infrastructure-as-code (Terraform or CloudFormation)
- Deep expertise designing relational schemas, writing complex SQL, and building data marts
- Proven experience with Apache Airflow, AWS Glue, or equivalent orchestration tools
- Solid background integrating and transforming data from third-party APIs, streaming platforms, and message queues
- Familiarity with data handling requirements in HIPAA, FedRAMP, and SOC-2 environments
- Strong communication skills; able to partner effectively with cross-functional teams
- Experience with Apache Spark, Kafka, Kinesis, or similar
- Proficiency with dbt, Delta Lake, or Iceberg for versioned tables and transformations
- Knowledge of Docker and Kubernetes for data workloads
- Exposure to MLOps frameworks and feature stores
- Prior work on healthcare analytics or government data projects
- Engagement with data-engineering or analytics OSS communities

Benefits
- Competitive salary and equity packages
- Comprehensive health, dental, and vision benefits
- Monthly Wellness Stipend
- Generous PTO

Company Overview
Healthy Together's SaaS platform enables the future of health by bringing together the objectives of government programs. It was founded in 2020 and is headquartered in Miami, Florida, USA, with a workforce of 11-50 employees. Its website is https://www.healthytogether.co.