AI Data Engineer (ML Data Pipelines)

Remote, Full-time
Position: AI Data Engineer (ML Data Pipelines)
• Work Experience: Python, SQL, Spark, Databricks, Airflow, Feature Engineering, Data Pipelines, Data Quality, Great Expectations, AWS, Azure, GCP, Kafka
Job Description

This is a remote position.

We are seeking an AI Data Engineer to design and build production-grade data pipelines that power machine learning systems. This role focuses on creating scalable ingestion, transformation, and feature engineering workflows that support model training, evaluation, and real‑time inference.

You will work closely with Data Scientists, Machine Learning Engineers, and Platform teams to ensure high‑quality, reliable, and efficient data flows across cloud environments. The ideal candidate understands both traditional data engineering and the unique data needs of ML systems.
Key Responsibilities
• Design and build scalable data pipelines for ML workflows
• Develop feature engineering and data preparation processes
• Implement batch and real‑time data ingestion systems
• Ensure data quality, validation, and monitoring
• Collaborate with ML engineers to support model training and deployment
• Integrate pipelines with orchestration tools (Airflow or similar)
• Optimize pipeline performance and cloud cost efficiency
• Maintain documentation and version control of data workflows
Requirements
• 4+ years of experience in Data Engineering
• Strong Python and SQL skills
• Experience building data pipelines for ML or analytics systems
• Hands‑on experience with Spark, Databricks, or similar distributed processing frameworks
• Experience with orchestration tools (Airflow or similar)
• Experience in AWS, Azure, or GCP environments
• Familiarity with data quality validation and monitoring frameworks
• Understanding of feature engineering and model data lifecycle
Preferred Qualifications
• Experience with streaming systems (Kafka, Kinesis, Pub/Sub)
• Experience supporting model deployment and MLOps workflows
• Experience with feature stores or vector databases
• Familiarity with ML frameworks (TensorFlow, PyTorch)
