Data Engineer: Scalable Pipelines for ML Workflows

Remote Full-time
Roles and Responsibility - • Design, build, and maintain scalable and reliable data pipelines for dataset creation, transformation, and benchmarking • Own and optimize Airflow pipelines on AWS for data processing, orchestration, and evaluation workflows • Write efficient, production-grade SQL and Python code for large-scale data processing and analysis • Partner closely with ML engineers to enable model training, evaluation, and benchmarking pipelines • Improve pipeline performance, reliability, and observability, ensuring high data quality in production • Build and maintain systems to support model performance tracking and data drift monitoring • Troubleshoot and resolve data issues across pipelines, ensuring minimal impact on ML workflows • Contribute to data architecture decisions and best practices across the platform • Collaborate cross-functionally with ML, platform, and data teams to support scalable ML infrastructure What Were Looking For • 35 years of experience in Data Engineering, Data Platforms, or related roles • Strong proficiency in Python and SQL with experience in production systems • Hands-on experience with AWS services (S3, EC2, SageMaker or similar) • Solid experience building and managing Airflow (or similar orchestration tools) • Strong understanding of data engineering fundamentals (ETL/ELT, data modeling, pipeline design) • Experience working with large-scale datasets and distributed data systems • Experience supporting ML workflows, datasets, or evaluation pipelines • Strong problem-solving skills and ability to work independently in a fast-paced environment Nice to Have • Experience with ML infrastructure, MLOps, or model evaluation workflows • Exposure to biometric systems or computer vision datasets • Familiarity with data quality frameworks, monitoring, and observability tools • Experience working in SaaS or high-scale production environments
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Business Analytics Consultant – Remote Eligible in Albuquerque, NM

Remote

Influencer Manager

Remote

**Experienced Luxury Beauty Chat Consultant – Remote Customer Service Representative for Amazon.com Services LLC**

Remote

Data Warehouse Developer Senior or Lead

Remote

**Experienced Part-Time Data Entry Specialist – Remote Opportunity with arenaflex**

Remote

Senior Accountant job at Reliable Robotics in Mountain View, CA

Remote

Security Researcher, Data Ops (Remote)

Remote

Experienced Remote Data Entry Operator – Data Management and Entry Specialist for arenaflex

Remote

**Experienced Entry-Level Customer Relations Chat Agent – Remote Opportunity at blithequark**

Remote

VP Analyst, Cybersecurity Executive Product Management (Remote US)

Remote
← Back