[Remote] Data Engineer
Note: The job is a remote job and is open to candidates in USA. Nscale is a GPU cloud company engineered for AI, providing infrastructure for AI start-ups and enterprises. They are seeking a Data Engineer to design and build data foundations that support their platform and operations, ensuring reliable and scalable data products.ResponsibilitiesDesign and build scalable, reliable data pipelines that ingest data from infrastructure, platform services, and business systemsDefine data models and schemas that support operational workflows and use cases across the business, monitoring, and analyticsClean, transform and structure the data to create a digital twin of NscaleImplement permissioning and manage access and security of the Foundry implementationCreate trusted datasets and metrics that power workflows and processes, internal tools, and customer-facing insightsEnable self-serve analytics by establishing clear data contracts, documentation, and semantic layersBuild use cases including but not limited to capacity planning, cost optimisation, reliability analysis, and customer reporting to drive our business forwardCollaborate with Product and Commercial teams to translate real-world questions into robust data solutionsImplement data quality checks, monitoring, and alerting to ensure data correctness and availabilityCodify data lineage, freshness, and consistency across systemsEstablish best practices around data versioning, access control, and governance appropriate for a fast-scaling companyContinuously improve system resilience and observabilityTake end-to-end ownership of projects, from design through to production and iterationHelp define standards, tooling, and ways of working for data at NscaleContribute to technical decision-making as the company scales its platform and customer baseAct as a thought partner to engineers and operators, not just a service functionSkillsDeep, hands-on experience building in Palantir Foundry, including ontology modelling, pipeline development, API integration, and large-scale data platform designStrong proficiency in Python, with experience applying data engineering libraries and frameworks (e.g. Spark, PySpark, Dask, pandas) to work with large, complex datasetsFamiliarity with API-driven data integration, including REST, GraphQL, and Foundry Action APIsPractical experience working in Git-based development workflows, including code reviews, version control, and CI/CD pipelinesComfort working in ambiguous, early-stage environments where requirements evolve quicklyStrong communication skills — able to explain data concepts clearly to technical and non-technical stakeholdersA bias toward ownership, pragmatism, and shipping useful solutionsExperience with cloud platforms (AWS, GCP, Azure) and infrastructure telemetryFamiliarity with distributed systems, monitoring data, or usage-based billing dataExperience supporting customer-facing data products or platformsBenefitsHighly competitive package (base + equity) with reviews every 12 months.Dynamic progression plan tailored to your ambitions.Human-First Flexibility: We treat you as humans first. Our flexible workplace trusts Nscalers to deliver, giving you the autonomy to shape your day around life's moments.Join our thriving remote-first team. Geography is no barrier to impact or connection. We build seamless virtual collaboration, empowering you, wherever you work.Company OverviewNscale builds AI data centers and provides GPU cloud infrastructure that companies use to train, run, and scale large AI models. It was founded in 2024, and is headquartered in London, England, GBR, with a workforce of 201-500 employees. Its website is https://www.nscale.com.