Machine Learning Researcher (Speech/Audio)

Remote Full-time
Brahma is a pioneering enterprise AI company developing Astras, AI-native products built to help enterprises and creators innovate at scale. Brahma enables teams to break creative bottlenecks, accelerate storytelling, and deliver standout content with speed and efficiency. Part of the DNEG Group, Brahma brings together Hollywood’s leading creative technologists, innovators in AI and Generative AI, and thought leaders in the ethical creation of AI content.

We are looking for a Machine Learning Researcher for Audio to join our team and help develop next-generation voice synthesis models. You'll research and build deep learning systems that can generate expressive, natural-sounding speech from text or audio prompts, and collaborate with cross-functional teams to integrate your work into production-ready pipelines.

Key Responsibilities
Research and develop state-of-the-art voice synthesis models (e.g., TTS, voice cloning, speech-to-speech).
Build and fine-tune models using frameworks like PyTorch and HuggingFace.
Design training pipelines and datasets for scalable voice model training.
Explore techniques for emotional expressiveness, multilingual synthesis, and speaker adaptation.
Work closely with product and creative teams to ensure models meet quality and production constraints.
Stay on top of academic and industrial trends in speech synthesis and related fields.

Must Haves
Strong background in machine learning and deep learning, with focus on speech/audio.
Hands-on experience with TTS, voice cloning, or related voice synthesis tasks.
Proficiency with Python and PyTorch; experience with libraries like torchaudio, ESPnet, or similar.
Experience training models at scale and working with large audio datasets.
Familiarity with vocoders and transformer-based architectures.
Strong problem-solving skills, ability to work autonomously in a remote-first environment.

Nice to Have
PhD degree in Computer Science/ Machine Learning and publications in top venues.
Contributions to open-source speech research or participation in relevant benchmarks.
Familiarity with adjacent areas like lip-syncing, audio-driven animation, or expressive speech control.
Experience with voice datasets or proprietary pipelines.

Apply To This Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Virtual Special Education Teacher - K-12

Remote

**Experienced Virtual Assistant – Data Entry Specialist for Remote Operations at blithequark**

Remote

Human Resources Generalist - Remote/Hybrid - Los Angeles, CA

Remote

Marketing Agency Account Director (Health Tech, Biotech)

Remote

Senior Product Manager

Remote

Overnight Media Journalist Remote / Telecommute Jobs

Remote

Inside Sales and Customer Service Representative

Remote

Staff Engineer – Mobile (Android) (REMOTE)

Remote

Fractional Chief Information Officer; in-office & remote

Remote

HIM Coder​/Part Time Remote

Remote
← Back