[Remote] Senior ML Engineer
Note: This is a remote position open to candidates in the USA.

Invoca is an AI-powered revenue execution platform that focuses on turning customer interactions into measurable growth. It is seeking a Senior ML Engineer to lead the productionization of its ML stack, including model serving and inference optimization, while collaborating with various teams to deliver impactful ML solutions.

Responsibilities
Lead End-to-End MLOps and Productionization: Architect, implement, and maintain CI/CD pipelines for ML artifacts, including model evaluation, versioning, and automated deployment. Serve as the primary SME for operational excellence across the Invoca ML stack.
Design and Optimize SLM/LLM Deployment: Own the full inference infrastructure: model serving on Triton Inference Server, Baseten, and Kubernetes-based GPU infrastructure. Profile and tune for low latency and high throughput, and build robust, scalable APIs for internal and external model access.
Fine-Tune Language Models: Apply parameter-efficient fine-tuning methods (LoRA, QLoRA, PEFT) to adapt transformer-based SLMs and LLMs for high-impact NLP applications in conversation intelligence.
Evolve ML Infrastructure: Contribute to model training infrastructure, data pipelines, and data lake foundations to keep the systems powering our models reliable and scalable.
Collaborate Across Teams: Partner closely with Data Scientists, Data Engineers, and Applied AI Engineers to build the foundational ML systems behind Invoca's agentic AI products.
Deliver Customer Value: Work with product and engineering to understand customer needs and ship ML solutions that make a measurable difference.

Skills
5+ years of ML Engineering experience with a strong production focus
Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy)
Demonstrated track record deploying and maintaining transformer-based NLP models in production
Hands-on experience fine-tuning SLMs/LLMs (LoRA, QLoRA, PEFT) and optimizing models via quantization, batching, and throughput tuning
Proficiency with inference infrastructure: Triton, Baseten, vLLM, TGI, SageMaker, Vertex AI, or similar
Experience building production-grade APIs that expose ML models to downstream consumers
Familiarity with MLOps tooling, model monitoring, and eval platforms (Braintrust, MLflow, or equivalent)
B.S. in Computer Science, Engineering, Statistics, or equivalent; advanced degree a plus
Familiarity with RLHF or preference training is a bonus

Benefits
Flexible Time Off: We encourage a healthy work-life balance. Our flexible paid time off policy allows you to recharge and take time away as needed.
Paid Holidays: Invoca provides 16 U.S. paid holidays, including a winter break, giving you ample opportunity to refresh and spend time with friends and family.
Health Benefits: Our healthcare program includes medical, dental, and vision coverage, with multiple plan options so you can choose what works best for you and your family. Fertility assistance is also included.
Retirement: Invoca offers a 401(k) plan through Fidelity with a company match of up to 4%.
Stock Options: All employees are invited to share in Invoca's success through stock options.
Mental Health Program: Well-being support on a broad range of issues is available through our SpringHealth program.
Paid Family Leave: Up to 6 weeks of 100% paid leave is provided for baby bonding, adoption, and caring for family members.
Paid Medical Leave: Up to 12 weeks of 100% paid leave is provided for childbirth and medical needs.
InVacation: As a thank-you to our long-term team members, we offer a bonus after 7 years of service.
Wellness Subsidy: We provide a subsidy that can be applied toward gym memberships, fitness classes, and more.

Company Overview
Invoca is the recognized leader in conversation intelligence AI for marketing, e-commerce, and contact center teams. It was founded in 2008 and is headquartered in Santa Barbara, California, USA, with a workforce of 201-500 employees.
Its website is http://www.invoca.com.