[Remote] New Grad Data Engineer (for Health Tech Startup)🤓

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. 1Phi Health is a health tech startup focused on making healthcare more accessible. They are seeking a New Grad Data Engineer to build and maintain data pipelines, ensuring data quality and collaborating with data scientists and product engineers in the healthcare data domain.ResponsibilitiesBuild and maintain data pipelines that ingest, transform, and validate large-scale Medicare claims data using SQL, Python, and Databricks (Spark). You'll work with patient-level records across billions of claim linesWrite and optimize complex SQL — multi-step transformations, window functions, joins across large datasets, aggregations with suppression rules. SQL is the primary language of the workAutomate and operationalize recurring data workflows — building reliable, repeatable pipelines that process CMS data extracts, dimension tables, and derived provider metricsEnsure data quality by designing validation checks, reconciling source data against expected schemas, and investigating anomalies when numbers don't add upCollaborate with data scientists and product engineers to define output schemas, deliver clean datasets, and support downstream analytics and application featuresWork in cloud infrastructure — primarily Databricks on AWS, with exposure to S3, Unity Catalog, and related servicesLearn the healthcare data domain — you'll develop working knowledge of claims data structures, medical coding systems (ICD-10, HCPCS, DRG), and CMS data programsSkillsBuild and maintain data pipelines that ingest, transform, and validate large-scale Medicare claims data using SQL, Python, and Databricks (Spark). You'll work with patient-level records across billions of claim linesWrite and optimize complex SQL — multi-step transformations, window functions, joins across large datasets, aggregations with suppression rules. SQL is the primary language of the workAutomate and operationalize recurring data workflows — building reliable, repeatable pipelines that process CMS data extracts, dimension tables, and derived provider metricsEnsure data quality by designing validation checks, reconciling source data against expected schemas, and investigating anomalies when numbers don't add upCollaborate with data scientists and product engineers to define output schemas, deliver clean datasets, and support downstream analytics and application featuresWork in cloud infrastructure — primarily Databricks on AWS, with exposure to S3, Unity Catalog, and related servicesLearn the healthcare data domain — you'll develop working knowledge of claims data structures, medical coding systems (ICD-10, HCPCS, DRG), and CMS data programsYou have strong SQL skills. Coursework, internships, or projects where you wrote non-trivial queries — joins, CTEs, window functions, aggregations. You can reason about query performanceYou're comfortable with Python. You've used it for data manipulation (pandas, PySpark, or similar). You don't need to be a software engineer, but you can write clean, functional codeYou understand data pipeline concepts — ETL/ELT, idempotency, schema management, data validation. Exposure through coursework, capstone projects, or internships countsYou're detail-oriented and methodical. Healthcare data has strict rules around suppression, privacy, and accuracy. You care about getting the numbers rightYou're a fast learner who's comfortable ramping up on unfamiliar domains. You'll be learning Medicare claims data, CMS programs, and healthcare coding systems on the jobYou have a BS or MS in Computer Science, Data Science, Information Systems, Statistics, or a related fieldYou've worked with Spark, Databricks, or other distributed compute environments (even in a class or personal project)You have exposure to cloud platforms (AWS, GCP, or Azure) — S3, IAM, or managed database servicesYou've touched healthcare data in any capacity — claims, EHR, public health datasets, MIMIC, CMS public use filesYou're familiar with version control (Git) and collaborative development workflowsYou've built a data project end-to-end — ingestion through delivery — even if it was smallBenefitsHealth insurance within 3 months of startingGenerous vacation policy + company holidays401K + profit share contributionsQuarterly evals and performance bonus (~10% at start, ~20% after 4 years)Company Overview It was founded in undefined, and is headquartered in , with a workforce of 2-10 employees. Its website is https://1phi.com/.

Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Aetna Jobs Jacksonville $25/Hour – mysmartpros

Remote

**Experienced Remote Data Entry Research Panelist – Work From Home Opportunity at arenaflex**

Remote

Sr. Lead Solutions Architect - Azure Delivery (RapidScale)

Remote

Utilizaton Management Nurse Associate LPN/LVN

Remote

Refrigeration HVAC Technician

Remote

Remote Customer Service Representative

Remote

REMOTE Sales & Portrait Appointment Booker (25 hours per week)

Remote

DVM Veterinary Partner & Hospital Equity Owner

Remote

Board Certified - Texas Licensed Physician Reviewers- Pediatric Neurology

Remote

Southwest Airlines Customer Service Job $26/Hour - DPS

Remote
← Back