GCP Data Engineer (Health Care Background Must)

Remote Full-time
Position :- GCP Data Engineer (Health Care Background Must)

Location: Across USA any Location

The pay range for this role is $130k - $140k per annum

Summary: Strong experience architecting enterprise data platforms on Google Cloud (GCP). The architect will work as a strategic technical partner to design and build a GCP BigQuery-based Data Lake & Data Warehouse ecosystem.

The role requires deep hands-on expertise in data ingestion, transformation, modeling, enrichment, and governance, combined with a strong understanding of clinical healthcare data standards, interoperability, and cloud architecture best practices.

Key Responsibilities:

1. Data Lake & Data Platform Architecture (GCP)
• Architect and design an enterprise-grade GCP-based data lakehouse leveraging BigQuery, GCS, Dataproc, Dataflow, Pub/Sub, Cloud Composer, and BigQuery Omni.
• Define data ingestion, hydration, curation, processing and enrichment strategies for large-scale structured, semi-structured, and unstructured datasets.
• Create data domain models, canonical models, and consumption-ready datasets for analytics, AI/ML, and operational data products.
• Design federated data layers and self-service data products for downstream consumers.

2. Data Ingestion & Pipelines
• Architect batch, near-real-time, and streaming ingestion pipelines using GCP Cloud Dataflow, Pub/Sub, and Dataproc.
• Set up data ingestion for clinical (EHR/EMR, LIS, RIS/PACS) datasets including HL7, FHIR, CCD, DICOM formats.
• Build ingestion pipelines for non-clinical systems (ERP, HR, payroll, supply chain, finance).
• Architect ingestion from medical devices, IoT, remote patient monitoring, and wearables leveraging IoMT patterns.
• Manage on-prem → cloud migration pipelines, hybrid cloud data movement, VPN/Interconnect connectivity, and data transfer strategies.

3. Data Transformation, Hydration & Enrichment
• Build transformation frameworks using BigQuery SQL, Dataflow, Dataproc, or dbt.
• Define curation patterns including bronze/silver/gold layers, canonical healthcare entities, and data marts.
• Implement data enrichment using external social determinants, device signals, clinical event logs, or operational datasets.
• Enable metadata-driven pipelines for scalable transformations.

4. Data Governance & Quality
• Establish and operationalize a data governance framework encompassing data stewardship, ownership, classification, and lifecycle policies.
• Implement data lineage, data cataloging, and metadata management using tools such as Dataplex, Data Catalog, Collibra, or Informatica.
• Set up data quality frameworks for validation, profiling, anomaly detection, and SLA monitoring.
• Ensure HIPAA compliance, PHI protection, IAM/RBAC, VPC SC, DLP, encryption, retention, and auditing.

5. Cloud Infrastructure & Networking
• Work with cloud infrastructure teams to architect VPC networks, subnetting, ingress/egress, firewall policies, VPN/IPSec, Interconnect, and hybrid connectivity.
• Define storage layers, partitioning/clustering design, cost optimization, performance tuning, and capacity planning for BigQuery.
• Understand containerized processing (Cloud Run, GKE) for data services.

6. Stakeholder Collaboration
• Work closely with clinical, operational, research, and IT stakeholders to define data use cases, schema, and consumption models.
• Partner with enterprise architects, security teams, and platform engineering teams on cross-functional initiatives.
• Guide data engineers and provide architectural oversight on pipeline implementation.

7. Hands-on Leadership
• Be actively hands-on in building pipelines, writing transformations, building POCs, and validating architectural patterns.
• Mentor data engineers on best practices, coding standards, and cloud-native development.

Required Skills & Qualifications

Technical Skills (Must-Have)
• 10+ years in data architecture, engineering, or data platform roles.
• Strong expertise in GCP data stack (BigQuery, Dataflow, Composer, GCS, Pub/Sub, Dataproc, Dataplex).
• Hands-on experience with data ingestion, pipeline orchestration, and transformations.
• Deep understanding of clinical data standards:
• HL7 v2.x, FHIR, CCD/C-CDA
• DICOM (for scans and imaging)
• LIS/RIS/PACS data structures
• Experience with device and IoT data ingestion (wearables, remote patient monitoring, clinical devices).
• Experience with ERP datasets (Workday, Oracle, Lawson, PeopleSoft).
• Strong SQL and data modeling skills (3NF, star/snowflake, canonical and logical models).
• Experience with metadata management, lineage, and governance frameworks.
• Solid understanding of HIPAA, PHI/PII handling, DLP, IAM, VPC security.

Cloud & Infrastructure
• Solid understanding of cloud networking, hybrid connectivity, VPC design, firewalling, DNS, service accounts, IAM, and security models.
• Cloud Native Data movement services
• Experience with on-prem to cloud migrations.

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Experienced Remote Data Entry and Typing Professional - Flexible Work from Home Opportunity for Career Growth and Development

Remote

Remote Compliance Risk Intern

Remote

**Experienced Part-Time Data Entry and Customer Service Representative – Flexible Remote Opportunity with blithequark**

Remote

**Experienced Data Entry Specialist – Remote Work Opportunity at arenaflex**

Remote

Software Engineer Consultant II- CoPilot

Remote

**Experienced Customer Service Representative – Remote Market Research and Data Entry Opportunities**

Remote

Experienced Data Entry Clerk for Remote Positions at blithequark - No Experience Necessary

Remote

Senior Partner, Advertising Sales, Walmart Connect - Pets - Mid Market (Chicago)

Remote

Staff System Architect, Benchmarking and Calibration

Remote

Experienced Customer Service Representative – Medicaid & Medicare Member Support at Blithequark, Virginia

Remote
← Back