[Remote] Senior AI / Machine Learning Engineer
Note: The job is a remote job and is open to candidates in USA. Absentia Labs is building intelligent systems at the intersection of AI, biology, chemistry, and large-scale engineering. They are seeking a Senior AI/Machine Learning Engineer to lead the design, training, and deployment of large-scale machine learning models, with significant ownership over technical direction and collaboration with data engineers.ResponsibilitiesDesign, train, and evaluate large-scale models, including Large Language Models (LLMs), diffusion models, and Graph Neural Networks (GNNs)Own end-to-end training pipelines, from dataset interfaces and batching strategies to distributed training and checkpointingMake principled decisions about model architecture, objective functions, optimization strategies, and scaling lawsBuild and optimize distributed training systems (data parallelism, model parallelism, sharding, mixed precision)Collaborate closely with data engineers to define ML-ready datasets and streaming interfacesTranslate ambiguous scientific or product requirements into robust ML solutionsDrive model evaluation, ablation, and iteration with a focus on generalization, stability, and reproducibilityContribute to architectural decisions around model serving, inference efficiency, and lifecycle managementProvide technical leadership through design reviews, mentorship, and cross-team collaborationSkills5+ years of industry experience in machine learning or applied AI rolesDemonstrated experience training large-scale models in production settings, not just prototypesHands-on expertise with LLMs, diffusion models, and/or GNNsStrong proficiency in PyTorch (or equivalent deep learning frameworks)Deep understanding of distributed training, including parallelism strategies and performance optimizationExperience working with large datasets and high-throughput data pipelinesStrong software engineering fundamentals: clean code, testing, reproducibility, and debugging at scaleAbility to clearly communicate technical trade-offs to both technical and non-technical stakeholdersExperience with reinforcement learning, fine-tuning, or preference-based optimization (e.g., RLHF)Familiarity with model compression, distillation, or inference optimizationExperience deploying models in production inference systemsExposure to multimodal learning or foundation modelsPrior work in startups or fast-moving R&D environmentsContributions to open-source ML frameworks or research codebasesBenefitsOffers EquityOffers BonusCompetitive compensation, including meaningful equity participation, allows you to share directly in the long-term success and growth of the company.Flexible remote or hybrid work arrangements.Company OverviewAbsentia Labs is building the foundation models that will power the next generation of medicine. It was founded in 2024, and is headquartered in Boston, Massachusetts, US, with a workforce of 2-10 employees. Its website is https://www.absentia.bio/.