Lead Data Engineer (AWS Cloud)

Remote Full-time
Position: Lead Data Engineer (AWS Cloud)
Location: Remote
Type: Contract to Hire

Job Description
• Design, develop, and maintain ETL/ELT pipelines using PySpark on Databricks.
• Build and optimize batch and streaming data pipelines.
• Implement Delta Lake solutions (Delta tables, time travel, ACID transactions).
• Collaborate with data scientists, analysts, and architects to deliver analytics-ready datasets.
• Optimize Spark jobs for performance, scalability, and cost.
• Integrate data from multiple sources (RDBMS, APIs, files, cloud storage).
• Implement data quality checks, validation, and monitoring.
• Manage Databricks notebooks, jobs, clusters, and workflows.
• Follow data governance, security, and compliance standards.
• Participate in code reviews and contribute to best practices.

Qualifications
• Hands-on PySpark experience with DataFrames, RDDs, joins, transformations, and actions.
• Proven experience leading teams and mentoring engineers.
• Experience with job optimization, cluster configuration, repartitioning, and shuffle mechanics in Databricks.
• Working knowledge of AWS services (S3, IAM, CloudWatch) and their integration with Databricks.
• Strong SQL skills for analytics and ETL.
• Performance tuning: partitioning, caching, broadcast joins, and skew handling.
• Experience with Delta Lake, Medallion Architecture, Spark Streaming, Spark ML, and CI/CD pipelines.
• Knowledge of ETL/ELT design patterns.
• Experience handling large-scale structured and semi-structured data.
• Understanding of data warehousing concepts.
• Excellent communication and stakeholder management skills.
• Ability to work in Agile delivery environments.
• Ownership mindset and delivery-focused approach.
• Strong technical decision-making and problem-solving skills.
