[Remote] Data Engineer - Seattle, WA
Note: The job is a remote job and is open to candidates in USA. Nscale is the GPU cloud engineered for AI, providing high-performance infrastructure for AI-focused companies. The Data Engineer will design, build, and operate data foundations to support Nscale’s platform and internal operations, working closely with various teams to create reliable, scalable data products.ResponsibilitiesDesign and build scalable, reliable data pipelines that ingest data from infrastructure, platform services, and business systemsDefine data models and schemas that support operational workflows and use cases across the business, monitoring, and analyticsClean, transform and structure the data to create a digital twin of NscaleImplement permissioning and manage access and security of the Foundry implementationCreate trusted datasets and metrics that power workflows and processes, internal tools, and customer-facing insightsEnable self-serve analytics by establishing clear data contracts, documentation, and semantic layersBuild use cases including but not limited to capacity planning, cost optimisation, reliability analysis, and customer reporting to drive our business forwardCollaborate with Product and Commercial teams to translate real-world questions into robust data solutionsImplement data quality checks, monitoring, and alerting to ensure data correctness and availabilityCodify data lineage, freshness, and consistency across systemsEstablish best practices around data versioning, access control, and governance appropriate for a fast-scaling companyContinuously improve system resilience and observabilityTake end-to-end ownership of projects, from design through to production and iterationHelp define standards, tooling, and ways of working for data at NscaleContribute to technical decision-making as the company scales its platform and customer baseAct as a thought partner to engineers and operators, not just a service functionSkillsDeep, hands-on experience building in Palantir Foundry, including ontology modelling, pipeline development, API integration, and large-scale data platform designStrong proficiency in Python, with experience applying data engineering libraries and frameworks (e.g. Spark, PySpark, Dask, pandas) to work with large, complex datasetsFamiliarity with API-driven data integration, including REST, GraphQL, and Foundry Action APIsPractical experience working in Git-based development workflows, including code reviews, version control, and CI/CD pipelinesComfort working in ambiguous, early-stage environments where requirements evolve quicklyStrong communication skills — able to explain data concepts clearly to technical and non-technical stakeholdersA bias toward ownership, pragmatism, and shipping useful solutionsExperience with cloud platforms (AWS, GCP, Azure) and infrastructure telemetryFamiliarity with distributed systems, monitoring data, or usage-based billing dataExperience supporting customer-facing data products or platformsBenefitsHighly competitive package (base + equity) with reviews every 12 monthsDynamic progression plan tailored to your ambitionsHuman-first flexibilityFlexible paid time offParental leaveRetirement plan participationMedicalDentalVisionBonusEquityCommission programsThriving remote-first teamCompany OverviewNscale builds AI data centers and provides GPU cloud infrastructure that companies use to train, run, and scale large AI models. It was founded in 2024, and is headquartered in London, England, GBR, with a workforce of 201-500 employees. Its website is https://www.nscale.com.