[Remote] Data Research Engineer
Note: This is a remote position open to candidates in the USA.

Microsoft is seeking Data Research Engineers to join its Multimodal team, which focuses on building next-generation foundation models across various domains. The role involves designing and curating high-quality datasets to enhance AI models, and collaborating with diverse teams to ensure data quality and ethical standards.

Responsibilities
- Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data
- Develop and maintain scalable data pipelines for multimodal ingestion, preprocessing, filtering, and annotation
- Analyze real-world multimodal datasets to assess quality, diversity, and relevance, and to identify areas for improvement
- Build lightweight tools and workflows for dataset auditing, visualization, and versioning
- Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices
- Embody our culture and values

Skills
- Bachelor's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.), OR equivalent experience
- Master's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 8+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.), OR Bachelor's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 12+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.), OR equivalent experience
- 2+ years of experience in data analysis or data engineering, including work with large-scale datasets that are unstructured or semi-structured
- Proficiency in statistics and exploratory data analysis methods
- Familiarity with data processing frameworks such as Spark, Ray, or Apache Beam
- Ability to communicate technical findings clearly to research and product teams

Benefits
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft is an equal opportunity employer. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Company Overview
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services. It was founded in 1975 and is headquartered in Redmond, Washington, USA, with a workforce of 10001+ employees. Its website is https://www.microsoft.com.

Company H1B Sponsorship
Microsoft has a track record of offering H1B sponsorships, with 1317 in 2026, 9192 in 2025, 9343 in 2024, 7677 in 2023, 11403 in 2022, 7210 in 2021, and 7852 in 2020. Please note that this does not guarantee sponsorship for this specific role.