Data Engineer (Python & PySpark)

Remote | Full-time
Key Responsibilities
Pipeline Development: Design, develop, and maintain end-to-end ETL/ELT pipelines using Python and PySpark.
Big Data Processing: Build large-scale data processing frameworks to handle structured and unstructured data, ensuring high performance and reliability.
Cloud Infrastructure: Architect and manage data solutions within the GCP ecosystem, focusing on cost-efficiency and security.
Data Modeling: Design and implement robust data warehouse models (Star/Snowflake schemas) and data lake architectures.
Optimization: Identify, design, and implement internal process improvements, such as automating manual processes and optimizing data delivery for greater scalability.
Collaboration: Work closely with stakeholders to understand data requirements and translate them into technical specifications.
Technical Qualifications
Core Programming: Strong proficiency in Python, including experience with libraries such as Pandas and NumPy and with logging frameworks.
Big Data: 3+ years of hands-on experience with Apache Spark (PySpark) for distributed data processing.
GCP Ecosystem: Practical experience with Google Cloud services, specifically:
BigQuery (Optimization, Partitioning, Clustering).
Cloud Dataproc or Dataflow.
Cloud Storage (GCS) and Cloud Functions.
Cloud Composer (Apache Airflow) for orchestration.
Data Warehousing: Solid understanding of relational databases and SQL (PostgreSQL, MySQL) as well as NoSQL environments.
DevOps & Tools: Experience with Git, Docker, and CI/CD pipelines. Familiarity with Terraform or other IaC tools is a significant plus.

