[Remote] Software Engineer, Inference - Multi Modal

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are looking for a software engineer to help serve OpenAI’s multimodal models at scale, focusing on building reliable infrastructure for real-time audio and image processing. Responsibilities • Design and implement inference infrastructure for large-scale multimodal models • Optimize systems for high-throughput, low-latency delivery of image and audio inputs and outputs • Enable experimental research workflows to transition into reliable production services • Collaborate closely with researchers, infra teams, and product engineers to deploy state-of-the-art capabilities • Contribute to system-level improvements including GPU utilization, tensor parallelism, and hardware abstraction layers Skills • Experience building and scaling inference systems for LLMs or multimodal models • Worked with GPU-based ML workloads and understand the performance dynamics of large models, especially with complex data like images or audio • Enjoy experimental, fast-evolving work and collaborating closely with research • Comfortable dealing with systems that span networking, distributed compute, and high-throughput data handling • Familiarity with inference tooling like vLLM, TensorRT-LLM, or custom model parallel systems • Own problems end-to-end and are excited to operate in ambiguous, fast-moving spaces • Design and implement inference infrastructure for large-scale multimodal models • Optimize systems for high-throughput, low-latency delivery of image and audio inputs and outputs • Enable experimental research workflows to transition into reliable production services • Collaborate closely with researchers, infra teams, and product engineers to deploy state-of-the-art capabilities • Contribute to system-level improvements including GPU utilization, tensor parallelism, and hardware abstraction layers • Experience working with image generation or audio synthesis models in production • Exposure to distributed ML training or system-efficient model design Company Overview • OpenAI is an AI research and deployment company that develops advanced AI models, including ChatGPT. It is a sub-organization of OpenAI Foundation. It was founded in 2015, and is headquartered in San Francisco, California, USA, with a workforce of 201-500 employees. Its website is Company H1B Sponsorship • OpenAI has a track record of offering H1B sponsorships, with 1 in 2025, 1 in 2024, 1 in 2023, 18 in 2022, 10 in 2021, 6 in 2020. Please note that this does not guarantee sponsorship for this specific role. Apply tot his job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Experienced Remote Data Entry Specialist – Part-Time Entry-Level Opportunity for Detail-Oriented Individuals with Strong Typing Skills

Remote

COMPLIANCE COUNSEL

Remote

(Remote) Claims Examiner - Workers Compensation | California Experience

Remote

Experienced Customer Interaction Specialist – Live Chat (Remote) for Delivering Exceptional Client Experiences

Remote

**Experienced Customer Service Associate - Full-time Opportunity at arenaflex**

Remote

Senior Compliance Analyst, Duals Audit job at Centene Corporation in FL, MO

Remote

Bilingual Customer Support Specialist – Evening Shift (12:30 PM‑9 PM), Full‑Time, Pasadena, CA – Grow Your Career with arenaflex

Remote

Clinical and Statistical Programming Intern

Remote

Experienced Pediatric Clinical Nursing Instructor – Educating the Next Generation of Nursing Professionals in a Dynamic and Supportive Environment

Remote

[Hiring] Clinical Trial Manager @4D Molecular Therapeutics

Remote
← Back