AI Researcher

Remote Full-time

About Toptal
Toptal is a global network of top talent in business, design, and technology that enables companies to scale their teams, on-demand. With $200+ million in annual revenue and team members based around the globe, Toptal is the world’s largest fully remote workforce.
We take the best elements of virtual teams and combine them with a support structure that encourages innovation, social interaction, and fun. We see no borders, move at a fast pace, and are never afraid to break the mold.
Job Summary
Toptal is building a dedicated AI Research team focused on advancing the frontier of agentic AI systems powered by proprietary real-world interaction data.
We are seeking AI Researchers who are excited to explore how large-scale, real-world signals can be transformed into better reasoning, improved generalization, and more capable multimodal agents.
In this role, you will work at the intersection of model development, multimodal representation learning, and reinforcement learning, designing new approaches that enable agents to learn from complex behavioral data, workflows, and multimodal inputs such as audio, logs, and structured interaction traces. You will focus on building and improving learning systems for agents, including methods for RAG, fine-tuning, reinforcement learning (RLHF, DPO, GRPO), and joint embedding spaces, as well as speech and audio intelligence capabilities such as STT, ASR, and audio signal modeling.
You will collaborate closely with engineering and product teams to ensure research breakthroughs are translated into scalable systems, and that feedback from production continuously improves model behavior.
This is a remote position. All communication and resumes must be in English.
Responsibilities:
The following information is intended to describe the general nature and level of work being performed. It is not intended to be an exhaustive list of all duties, responsibilities, or required skills.

Advance research on agentic AI systems trained on real-world interaction signals and multimodal data.
Design and experiment with learning paradigms for large-scale models, including RAG, supervised fine-tuning, RLHF, DPO, and GRPO-style methods.
Develop multimodal representation learning approaches, including joint embedding spaces across text, audio, logs, and structured interaction traces.
Improve speech and audio intelligence capabilities, including STT, ASR, and audio-driven learning signals.
Research methods for enhancing agent reasoning, planning, tool use, and adaptation in real-world environments.
Define how complex behavioral and interaction signals can be translated into effective training objectives for large-scale models.
Build and refine evaluation methodologies for agent performance in real-world, domain-specific scenarios.
Collaborate with engineering and product teams to bring research ideas into production systems.
Identify patterns in real-world workflows and convert them into generalizable modeling and representation strategies.
Contribute to the long-term research direction of Toptal’s agentic AI systems and multimodal capabilities.
Stay current with academic and industry research and integrate relevant advancements into internal systems.

In the first week, expect to:

Join the AI team and orient yourself with Toptal’s mission and strategy.
Access our existing datasets, agent stacks, and internal evaluation tools.
Map the landscape of raw data sources currently feeding our agentic systems.

In the first month, expect to:

Develop a deep understanding of our current architectures and evaluation methodologies.
Identify high-leverage gaps where data improvements can measurably increase agent capability.
Initiate concrete improvements to pipelines converting raw inputs into model-ready assets.
Shape feedback loops that utilize live performance as a training signal.

In the first three months, expect to:

Own a production data pipeline from ingestion through delivery into RL or fine-tuning workflows.
Define reusable schemas that abstract repeated workflows into queryable formats.
Drive measurable advancements in agent accuracy within a specific vertical, backed by metrics.
Integrate AI features into user-facing surfaces like browsers or enterprise tools.

In the first six months, expect to:

Lead the design of multimodal pipelines that unify text and real-time logs for agents.
Establish tooling for encoding institutional knowledge into scalable schemas for the team.
Define the team’s strategy for fine-tuning and capturing human feedback for RLHF.
Mentor teammates on data-centric approaches and influence the team’s technical direction.

In the first year, expect to:

Serve as a key technical leader in turning proprietary data into a durable competitive advantage.
Operate as a recognized expert across the team on knowledge representation and improvement loops.
Drive a step-change in agent capability across multiple verticals through clear performance metrics.
Shape the next generation of products by evolving data, agents, and applications together.

Qualifications and Job Requirements:

PhD in Computer Science, Machine Learning, AI, Electrical Engineering, or a related field.
5+ years of experience in applied AI research or ML systems with production impact.
Strong background in large-scale machine learning, LLMs, or multimodal AI systems.
Hands-on experience with:
RAG systems.
Fine-tuning large language models.
Reinforcement learning methods (RLHF, DPO, or GRPO-style approaches).
Experience with VLM.
Strong understanding of representation learning, embeddings, and joint embedding spaces.
Experience with speech and audio modeling, including STT, ASR, or audio signal processing.
Proficiency in Python and modern ML frameworks (PyTorch, Hugging Face ecosystem).
Experience designing or improving evaluation methodologies for LLMs or agentic systems.
Experience with agentic AI systems, including reasoning, planning, or tool-use architectures.
Background in multimodal AI systems (text, audio, vision, or structured logs).
Experience embedding AI into real-world products (browsers, IDEs, enterprise tools).
Experience with real-time or streaming AI systems.
Open-source contributions or publications in top-tier ML/AI conferences.
Strong ability to define research hypotheses from ambiguous, real-world problems.
Outstanding written and verbal communication skills in English.
You must be a world-class individual contributor to thrive at Toptal. You will not be here just to tell other people what to do.

Apply Now →

Experienced Data Analyst and Insights Specialist for Business Growth and Development – Full Time Opportunity in Louisiana

Remote

AI Researcher

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Mail Innovations Operations Supervisor

Member Service Representative (Call Center), Hybrid

Registered Nurse (RN) Home Health – Amazon Store

Airline Technical Representative - IAH Airport - Hybrid/Remote

Scheduling Specialist, Associate

Senior Manager, Marketing Analytics

Experienced Data Analyst and Insights Specialist for Business Growth and Development – Full Time Opportunity in Louisiana

Payer Contracting Analyst

Analytics Engineer Denver, CO

Experienced Customer Service Specialist for E-commerce and Digital Support – arenaflex Chat Support Team in UAE