Director of ML Engineering & Infrastructure

Remote Full-time
Requirements • 10+ years of industry experience spanning machine learning engineering and distributed systems, • 3+ years of leadership and management experience, with a proven ability to build and lead strong technical teams, • MSc or Ph.D. in Computer Science, Machine Learning, or related field, or equivalent practical experience, • Proven expertise in building and deploying end-to-end ML systems at scale, including recommendation and personalization systems, • Strong background in distributed systems architecture, including low-latency services, streaming platforms, and large-scale serving, • Hands-on experience with deep learning frameworks (e.g., TensorFlow, PyTorch) and ML infrastructure technologies, • Track record of delivering high-quality, scalable, and fault-tolerant systems, • Excellent communication skills and ability to influence product and technical strategy, • Proven experience deploying large-scale serving systems on AWS and demonstrated expertise in leveraging Databricks for large-scale data processing and ML workflows What the job involves • We are seeking a Director of Machine Learning Engineering and Infrastructure to lead a hybrid team bridging advanced ML engineering with world-class infrastructure design, • In this role, you will own the strategic direction and execution for scaling our machine learning capabilities while ensuring our distributed systems and infrastructure can support innovation at massive scale, • You will combine technical depth with leadership excellence to guide teams that deliver both foundational ML systems and high-performance distributed services, • Lead and manage high-performing teams across ML engineering and ML infrastructure, fostering a culture of innovation, collaboration, and growth, • Define and execute the strategic roadmap for ML systems, including recommendation, personalization, and ads optimization, • Oversee the design, development, and deployment of scalable ML pipelines: data ingestion, feature engineering, model training, evaluation, and serving, • Architect distributed systems to support ML workloads at scale, ensuring reliability, observability, and operational excellence, • Partner closely with Product, Engineering, and Content teams to align on business goals and deliver impactful ML-driven experiences, • Support best practices in experimentation, evaluation, and ML system monitoring, • Ensure cost efficiency, scalability, and performance in ML infrastructure investments Apply tot his job

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Medical Billing Associate - Patient Services

Remote

Call Center Representative - 3rd Shift

Remote

Experienced Remote Customer Service Chat Agent – Automotive Industry Expertise Required for Dynamic Engagement and Support

Remote

Doctor Referral Institute- Exclusive Territory Director Ownership

Remote

Director, Organizational Design & Executive Development

Remote

[Remote] TikTok & TYB Community Manager

Remote

Clinical Informatics Specialist / Healthcare (ON-SITE) – San Diego, CA

Remote

[Remote] Sales Development Representative

Remote

**Experienced Remote Customer Service Representative – Scheduling Operations Specialist at blithequark**

Remote

Dedicated and Results-Driven Customer Support Executive for a Fast-Growing B2B SaaS Company – blithequark

Remote
← Back