Data Platform - Migration Engineer

Remote Full-time
ROLE SUMMARY We are looking for a Senior Data Platform / Migration Engineer to lead the modernization of an enterprise data ecosystem, including migration from Cloudera DataIQ DSS to MapR. This role requires deep expertise in large-scale distributed data systems, migration strategy, and performance optimization, with a strong focus on zero data loss, minimal downtime, and production stability. KEY RESPONSIBILITIES • Lead end-to-end migration of enterprise data lake from Cloudera (DataIQ, DSS, CDP) to MapR • Define and execute migration strategy ensuring data integrity, minimal downtime, and rollback readiness • Design and build scalable, production-grade data pipelines post-migration • Optimize cluster performance including compute, storage, and resource utilization • Partner with BI/reporting teams to ensure schema consistency and data availability • Implement data validation frameworks to ensure accuracy and completeness post-migration • Document architecture, runbooks, lineage, and operational procedures • Collaborate with governance teams on data quality, lineage, and compliance requirements REQUIRED SKILLS AND EXPERIENCE • 8+ years in Data Engineering / Data Platform Engineering • Strong hands-on experience with Cloudera (CDP, DSS, DataIQ) and/or MapR • Strong hands-on experience with Apache Spark, Hive, Hadoop, HDFS • Proven experience executing large-scale data lake migrations • Strong programming skills in Python, Scala, or SQL • Deep understanding of distributed data processing and storage systems • Experience with ETL/ELT frameworks (Informatica, Talend, dbt, or similar) PREFERRED QUALIFICATIONS • Prior MapR implementation or certification • Experience with streaming platforms (Kafka, Pulsar) • Exposure to cloud-native data platforms (AWS S3, Azure Data Lake, Google Cloud Platform) • Familiarity with data governance, lineage, and catalog tools • Experience working in high-scale enterprise environments (multi-terabyte/petabyte) CORE TECHNOLOGY STACK Cloudera DSS / DataIQ / CDP, MapR, Apache Spark, Hive, Hadoop, HDFS, Kafka, Python, SQL, dbt, Informatica / Talend
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Experienced Customer Service Representative - Fully Remote Opportunity at blithequark

Remote

[Remote] Offline Marketing Manager-Podcast

Remote

Experience Designer - Learning & People Development

Remote

Instructional Design Program Manager

Remote

[Hiring] RN - Registered Nurse Navigator Triage - Population Health @Geisinger

Remote

**Experienced Full Stack Data Entry Specialist – Content Management and Quality Assurance**

Remote

[Remote] Software Test Analyst (Data Audit & Analytics)

Remote

**Experienced Remote Data Entry Specialist – Flexible Work Schedule at blithequark**

Remote

**Experienced Full Stack Software Engineer – Web & Cloud Application Development at arenaflex**

Remote

Demand Planner

Remote
← Back