[Remote] Senior Data Engineer
Note: This is a remote position open to candidates in the USA.

CitiusTech is a healthcare technology company that aims to solve the industry's greatest challenges through innovation and collaboration. They are seeking a Senior Data Engineer to design, build, and operate ETL/ELT pipelines on cloud platforms for healthcare data while upholding data governance and quality practices.

Responsibilities
Design, build, and operate scalable ETL/ELT pipelines across GCP, Azure, BigQuery, and SQL Server
Model data for analytical workloads, including dimensional modeling, SCDs, normalization, and schema design
Orchestrate pipelines using Airflow, Cloud Composer, Azure Data Factory, or similar frameworks
Ensure secure handling of PHI in alignment with HIPAA, covering data movement, de-identification, access controls, and audit readiness
Implement and enforce data governance practices, including metadata management, data lineage, cataloging, and stewardship workflows
Integrate with enterprise data governance platforms such as Microsoft Purview, Collibra, or Alation for:
  Data cataloging and classification
  Lineage tracking (end-to-end pipeline visibility)
  Glossary and business metadata management
Define and implement data quality frameworks, including validation rules, anomaly detection, and monitoring
Enable data discoverability and trust through proper tagging, classification, and governance standards
Deploy pipelines using Git-based workflows and CI/CD; monitor, troubleshoot, and optimize production pipelines
Collaborate with stakeholders (business, analytics, governance teams) to translate requirements into scalable technical solutions
Communicate technical tradeoffs, risks, and governance implications early in the design lifecycle

Skills
7+ years of experience in data engineering with strong exposure to senior-level ownership of production systems
Strong proficiency in SQL and either Python or Java for pipeline development
Hands-on experience across cloud platforms such as GCP and Azure, including BigQuery and SQL Server
Deep experience designing scalable and reliable ETL/ELT pipelines with performance optimization
Hands-on experience with orchestration tools such as Airflow, Cloud Composer, ADF, Dagster, or Prefect
Strong data modeling skills: dimensional modeling, normalization, and slowly changing dimensions
Data Governance Expertise: Experience working with tools like Microsoft Purview, Collibra, Alation, or Informatica EDC
Understanding of data cataloging, lineage, metadata management, and business glossaries
Exposure to data classification, data stewardship workflows, and governance frameworks
Experience implementing data quality frameworks (DQ rules, profiling, validation pipelines)
Working knowledge of HIPAA and PHI compliance requirements
Experience operating within enterprise security, governance, and compliance frameworks
Proficiency with Git, CI/CD pipelines, and production deployment practices
Experience integrating governance tools with cloud-native ecosystems (e.g., Purview with Azure data services, Collibra with multi-cloud pipelines)
Exposure to Master Data Management (MDM) and reference data systems
Familiarity with semantic layers, data mesh, or data fabric architectures
Experience with LLM-assisted development, data observability tools, or modern ELT frameworks (dbt, Dataform)
Knowledge of healthcare data standards (HL7, FHIR, OMOP, etc.)

Benefits
Medical, dental, and vision insurance
Paid time off
Parental leave
Comprehensive benefits package

Company Overview
CitiusTech is a major provider of technology services and solutions to healthcare technology companies, providers, payers, and life sciences organizations. It was founded in 2005 and is headquartered in Princeton, New Jersey, USA, with a workforce of 5001-10000 employees. Its website is http://citiustech.com.