[Remote] Software Co-Design AI HPC Systems

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. Microsoft is a leading technology company dedicated to empowering individuals and organizations. The Software Co-Design AI HPC Systems role focuses on architecting and optimizing next-generation AI systems, collaborating across hardware and software to enhance performance and efficiency.ResponsibilitiesLead the co-design of AI systems across hardware and software boundaries, spanning accelerators, interconnects, memory systems, storage, runtimes, and distributed training/inference frameworksDrive architectural decisions by analyzing real workloads, identifying bottlenecks across compute, communication, and data movement, and translating findings into actionable system and hardware requirementsCo-design and optimize parallelism strategies, execution models, and distributed algorithms to improve scalability, utilization, reliability, and cost efficiency of large-scale AI systemsDevelop and evaluate what-if performance models to project system behavior under future workloads, model architectures, and hardware generations, providing early guidance to hardware and platform roadmapsPartner with compiler, kernel, and runtime teams to unlock the full performance of current and next-generation accelerators, including custom kernels, scheduling strategies, and memory optimizationsInfluence and guide AI hardware design at system and silicon levels, including accelerator microarchitecture, interconnect topology, memory hierarchy, and system integration trade-offsLead cross-functional efforts to prototype, validate, and productionize high-impact co-design ideas, working across infrastructure, hardware, and product teamsMentor senior engineers and researchers, set technical direction, and raise the overall bar for systems rigor, performance engineering, and co-design thinking across the organizationSkillsBachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experienceMaster's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experienceStrong background in one or more of the following areas: AI accelerator or GPU architectures, Distributed systems and large-scale AI training/inference, High-performance computing (HPC) and collective communications, ML systems, runtimes, or compilers, Performance modeling, benchmarking, and systems analysis, Hardware–software co-design for AI workloadsProficiency in systems-level programming (e.g., C/C++, CUDA, Python) and performance-critical software developmentProven ability to work across organizational boundaries and influence technical decisions involving multiple stakeholdersExperience designing or operating large-scale AI clusters for training or inferenceDeep familiarity with LLMs, multimodal models, or recommendation systems, and their systems-level implicationsExperience with accelerator interconnects and communication stacks (e.g., NCCL, MPI, RDMA, high-speed Ethernet or InfiniBand)Background in performance modeling and capacity planning for future hardware generationsPrior experience contributing to or leading hardware roadmaps, silicon bring-up, or platform architecture reviewsPublications, patents, or open-source contributions in systems, architecture, or ML systems are a plusBenefitsCertain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-payCompany OverviewMicrosoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services. It was founded in 1975, and is headquartered in Redmond, Washington, USA, with a workforce of 10001+ employees. Its website is https://www.microsoft.com.Company H1B SponsorshipMicrosoft has a track record of offering H1B sponsorships, with 1317 in 2026, 9192 in 2025, 9343 in 2024, 7677 in 2023, 11403 in 2022, 7210 in 2021, 7852 in 2020. Please note that this does not guarantee sponsorship for this specific role.

Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

In-Store Shopper - Seasonal Part Time – Amazon Store

Remote

Remote Live Chat Specialist - Delivering Exceptional Customer Experience from Anywhere with blithequark

Remote

Redmond Costco – 25/hr to start +Commission in Redmond, WA

Remote

Experienced Full Stack Live Chat Support Specialist – Conversational AI Development

Remote

**Experienced Intern, Girl Up USA - Global Leadership Development Initiative**

Remote

Manager, Marketplaces and International Ecommerce

Remote

**Experienced Entry-Level Remote Data Entry Specialist – Customer Service Representative**

Remote

Require (USA) Coach/Ops Mgr Trainee in Erie, PA

Remote

Solutions Architect IV- Data and Analytics

Remote

Registered Nurse (RN) Visiting Home Health - Rancho Cucamonga (CA, Montrose)

Remote
← Back