[Remote] Data Scientist
Note: The job is a remote job and is open to candidates in USA. Terzo is an innovative company that builds an AI-native enterprise data platform designed for modern companies. As a Data Scientist on the Applied Research team, you will develop intelligent systems for data extraction and classification, manage model pipelines, and collaborate with engineering and product teams to ensure high-quality outcomes.ResponsibilitiesBuild the intelligent systems that create the data our customers depend onDesign extraction and classification models that process enterprise-scale document corporaBuild and evolve the entity resolution and signal detection layers powering the Commercial Graph and Financial GraphDefine how AI capabilities surface as recommendations, agents, and search across the platformOwn the models, pipelines, and graph structures that are the productWork directly with engineering, product, and customers on problems where a single clause can represent tens of millions of dollars of exposure and where model accuracy has a contractual SLASkills5+ years of experience in data science, applied ML, or AI research with production-shipped systems, not just notebooks and prototypesStrong statistical foundations and the ability to define and evaluate success metrics for AI systems including precision, recall, coverage, latency, not just accuracyDeep experience building NLP, NLU, or document understanding models that operate on messy, real-world unstructured data at scaleStrong intuition for entity resolution, knowledge graph construction, or graph-based modeling and you've thought seriously about how to connect fragmented data into structured, queryable representationsHands-on proficiency in Python and modern AI frameworks, with experience deploying models into production pipelinesComfort with information extraction, classification, and retrieval-augmented generation patterns applied to real enterprise workloadsA track record of working cross-functionally with engineering and product to shape what gets built, not just executing on handed-down specsClear, structured communication where you can explain a model decision to a PM, defend an architectural choice to a staff engineer, and present results to leadership without hiding behind jargonHigh ownership mentality where you treat model quality, pipeline reliability, and customer outcomes as your responsibilityExperience building or evolving knowledge graphs, commercial ontologies, or financial data models in enterprise contextsPrior work on document AI, OCR pipelines, or hybrid extraction systems combining rule-based and learned approachesExposure to AI agent architectures, tool-use patterns, or autonomous reasoning systems in productionBackground in procurement, contract management, spend analytics, or financial operations domainsExperience with evaluation frameworks for AI systems (RAGAS, custom eval harnesses, human-in-the-loop QA pipelines)Familiarity with distributed data platforms, event-driven architectures, or streaming systems (Ray, Kafka, Azure Service Bus)Prior work at a high-growth startup or enterprise AI companyAn MS or PhD in a quantitative fieldBenefitsCompetitive salaryAnnual performance bonusEmployee stock option plan100% paid medical, dental, and vision coverage401(k) with employer contributionGenerous vacation and sick leaveFlexible work arrangementsHigh-quality equipment for home and officeStrong culture of collaboration, mentorship, and continuous improvementCompany OverviewTerzo is an AI-powered Contract and Spend Intelligence platform to help Finance Procurement teams optimize supplier expenses It was founded in 2020, and is headquartered in Los Angeles, California, USA, with a workforce of 51-200 employees. Its website is http://www.terzo.ai.