[Remote] Founding Data Engineer, AI Platform
Note: This is a remote role open to candidates in the USA.

GC AI is the fastest-growing legal AI platform for in-house legal teams. It is seeking a Founding Data Engineer to own the entire data stack. This role involves consolidating data from multiple sources into a single warehouse, building internal data tools, and defining data engineering practices as the first dedicated data hire.

Responsibilities
- Take ownership of the data warehouse in BigQuery: modeling, pipeline development, data quality, and performance
- Build pipelines that consolidate product usage data, CRM data, billing, customer contract data, and user analytics into a single source of truth
- Design and build internal data tools using applied AI, including natural-language query interfaces and automated reporting, so the rest of the company can self-serve without waiting on an analyst
- Set up the warehouse so business teams can run their own queries and pull their own numbers without filing a ticket
- Build toward a data lake architecture that supports personalization and model fine-tuning for the GC AI product
- Keep the stack lean. Use what's available in BigQuery and the broader GCP ecosystem, and make smart decisions to reduce complexity and cost without introducing tool sprawl.
- Define data engineering practices, tooling, and standards as the first hire on what will become a team

Skills
- 5+ years of experience in data engineering, with hands-on experience building and maintaining data warehouses and pipelines
- Strong SQL skills and deep experience with BigQuery or comparable analytical databases
- Proficiency in Python for pipeline development, scripting, and tooling
- Experience building ETL/ELT pipelines that consolidate data from multiple source systems (SaaS APIs, event streams, databases)
- Experience working within GCP or a comparable cloud ecosystem
- Ability to design data models that are clean, performant, and usable by non-engineers
- Experience building internal data tools or agents using LLMs (text-to-SQL, natural language interfaces, automated reporting). This is a strong differentiator.
- Experience as the first or early data hire at a startup, where you owned the full stack
- Familiarity with legaltech, legal operations, or SaaS product analytics
- Experience setting up self-serve analytics layers (semantic layers, BI tool configuration, data documentation)
- Experience with data infrastructure that supports ML workflows (feature stores, training data pipelines, data lakes)
- Experience with infrastructure as code, especially Terraform, for managing GCP data infrastructure

Company Overview
GC AI is the legal AI platform built for in-house teams, designed to solve the workflows that in-house lawyers and legal professionals face every day. Founded in 2023, it is headquartered in San Francisco, California, USA, with a workforce of 51-200 employees. Its website is https://gc.ai.