[Remote] Senior Data Engineer
Note: This is a remote job open to candidates in the USA.

Keyrock is a leading change-maker in the digital asset space, known for its innovative approach and diverse team. They are seeking a Senior Data Engineer to help build the Keyrock Data Platform, enabling various teams to access and utilize data efficiently for trading and asset management purposes.

Responsibilities
- Build streaming and batch pipelines that ingest, normalise, and distribute market, trading, and portfolio data, resilient to feed and exchange failures
- Build the self-serve tooling (SDKs, patterns, templates, AI agents) so other teams publish, consume, and build on data products without waiting on us
- Own data contracts and schema evolution. Keep schema changes from turning into multi-team coordination events
- Design the lakehouse and time-series layer around consumer query patterns
- Build and evolve the Data Governance and Data Quality Framework: stale-feed detection, schema validation, range checks, idempotent writes, lineage, ownership, self-healing
- Build the derived analytics the business runs on: cross-exchange spreads, VWAP at depth, order book microstructure for the desks; portfolio views, exposure, performance for wealth and asset management
- Make observability, cost, and performance first-class from day one
- Treat infrastructure as code (Docker, Terraform, CI/CD) alongside our Central Infrastructure Team
- Work in the open: write things down, partner closely with Architecture, Infrastructure, Platform, and the rest of the teams

Skills
- 8+ years of building production data systems that other people rely on
- Strong proficiency in Python and SQL: not just being able to write a query, but being able to reason about what the engine is doing with it
- Code that's easy for someone else to read, test, and delete later
- Strong understanding of data modelling for both streaming and analytical workloads
- Efficiency, quality, idempotency, and observability are taken seriously by default
- You've designed and operated streaming systems on Kafka, Redpanda, MSK, or Kinesis, and you have opinions about partitioning, consumer groups, offsets, and schema registries
- You've used a time-series store in production (ClickHouse ideally; TimescaleDB, QuestDB, or similar are fine too) and can talk about table design as a function of query patterns
- You've worked with a lakehouse architecture and reason about table layout, partitioning, and compaction as design choices that shape query performance and storage cost
- You build for self-healing and idempotency. Reprocessing is safe, retries don't double-write, and the system recovers without a human in the loop
- Docker, Terraform, and CI/CD are how you work, not a separate 'DevOps' thing
- You think about cost and performance early
- You instrument as you build: logs, metrics, and traces are part of the system from day one
- You design for data quality and governance up front, covering contracts, validation, lineage, and ownership
- You reason from first principles when a problem is new, stay pragmatic when it isn't, and update your view when you learn more
- You treat the trading desks, wealth and asset management, product, risk, finance, compliance, and research as customers of what you build, and communicate with them that way
- You optimise for outcomes over output. A smaller, simpler thing that ships and works beats a bigger thing that doesn't
- You take ownership end-to-end: design, ship, operate, improve
- You say what you think, including when it's an unpopular take. You change your mind when the argument is better
- You make the people around you better. Reviews are real, juniors grow from working with you, and peers want to work with you again
- You're curious about how markets work. Data engineering on its own won't keep you interested here
- You're honest about what you know and what you don't, and quick to close the gap
- You understand financial market data: order books, trades, reference data, portfolios, exposures. Crypto, TradFi, or both are a strong plus
- Lakehouse experience with Apache Iceberg or Delta Lake
- Familiarity with DataHub or similar metadata/lineage platforms
- Rust. Some of our performance-critical services are written in it. Interest is welcome; fluency isn't required

Benefits
- A competitive salary package, with various benefits.
- Flexible hours, remote-first, business-hours on-call shared across the team.
- Regular online get-togethers and a yearly onsite where everyone's in the same room.

Company Overview
Keyrock develops scalable, transparent proprietary algorithmic technologies to increase the liquidity of financial assets. It was founded in 2017, and is headquartered in Woluwé-saint-pierre, Brussels Hoofdstedelijk Gewest, BEL, with a workforce of 51-200 employees. Its website is https://keyrock.com.