[Remote] Staff Software Engineer, Data Platform - US (Remote)
Note: The job is a remote job and is open to candidates in USA. Luxury Presence is the leading digital platform revolutionizing the real estate industry for agents, teams, and brokerages. They are seeking a Staff Software Engineer to strengthen their real estate MLS data platform squad by building robust data pipelines and backend services that power high-quality MLS and property data across various feeds.ResponsibilitiesOwn the end-to-end architecture for MLS and property data: streaming and batch pipelines, microservices, storage layers, and APIsDesign and evolve event-driven, Kafka-based data flows that power listing ingestion, enrichment, recommendations, and AI use casesDrive technical design reviews, set engineering best practices, and make high-quality tradeoffs around reliability, performance, and costDesign, build, and operate backend services (Python or Java) that expose listing, property, and recommendation data via robust APIs and microservicesImplement scalable data processing with Spark or Flink on EMR (or similar), orchestrated via Airflow and running on Kubernetes where applicableChampion observability (metrics, tracing, logging) and operational excellence (alerting, runbooks, SLOs, on-call participation) for data and backend servicesBuild and maintain high-volume, schema-evolving streaming and batch pipelines that ingest and normalize MLS and third-party dataEnsure data quality, lineage, and governance are built into the platform from the start—supporting analytics, AI/ML, and customer-facing featuresPartner with analytics engineering and data science to make data discoverable and usable (e.g., semantic layers, documentation, self-service tooling)Collaborate with ML/AI engineers to design and scale AI agents that automate MLS feed onboarding, listing discrepancy triage, and other operational workflowsWork with frameworks such as PydanticAI, LangChain, or similar to integrate LLM-based agents into our data and service architectureHelp define and implement evaluation, logging, and feedback loops so these agents and data-driven products continuously improveCollaborate closely with Product, Engineering, and Operations to shape the roadmap for our data platform, MLS capabilities, and AI-powered experiencesTranslate ambiguous business and customer problems into clear technical strategies and phased delivery plansMentor and unblock other engineers; elevate the overall level of technical decision-making on the team via pairing, reviews, and design guidanceSkills10+ years of professional software engineering experience, including owning production systems end-to-endSignificant experience working with data-intensive or distributed systems at scale (high volume, high availability)Prior experience in a senior or staff/lead role where you influenced architecture, standards, and technical directionStrong programming skills in Python or Java, with experience building microservices and APIs (REST/GraphQL)Hands-on experience with Apache Kafka or similar event/messaging platforms (Kinesis, Pub/Sub, etc.)Deep experience with Spark or Flink for large-scale data processing, across streaming and batch pipelines (on EMR or similar big-data compute)Airflow (or equivalent orchestration tools)Kubernetes for running data/compute workloadsStrong SQL and data modeling skills; solid understanding of ETL/ELT patterns, data warehousing concepts, and performance tuningExperience building on AWS (preferred) or another major cloud provider, with a good grasp of cost, reliability, and security tradeoffsExperience building or integrating AI agents into production workflows (e.g., internal tools, support automation, operational triage, or data workflows)Familiarity with frameworks such as PydanticAI, LangGraph, Claude Code or similar, and how they interact with backend services, vector stores, and LLM APIsComfort working with logs, telemetry, and evaluation metrics to monitor, debug, and iteratively improve AI-driven systemsDemonstrated ability to lead technical initiatives across teams, from idea to production (alignment, design, implementation, rollout)Track record of mentoring other engineers and raising the bar on code quality, testing, and designStrong communication skills; able to clearly explain complex technical decisions to both engineers and non-technical stakeholdersCustomer and product mindset: you care about how the data and services you build improve the end-user and client experience, not just the internalsExperience with any of: Iceberg, Hive, or other table formats/data lake technologiesSnowflake, Athena, Redshift, or other cloud data warehousesDbt or similar transformation frameworksData quality / observability tools (e.g., Great Expectations, Monte Carlo, Datafold)Vector databases / retrieval (e.g., LanceDB, Pinecone, Elasticsearch/OpenSearch)Background in real estate, marketplaces, or other domains where data quality and freshness are highly visible to customersPrior experience in a startup or high-growth environment where you've built or significantly evolved a data platformCompany OverviewLuxury Presence is a website and marketing system used by the world’s leading real estate agents and brokers. It was founded in 2016, and is headquartered in Austin, Texas, USA, with a workforce of 501-1000 employees. Its website is https://www.luxurypresence.com/.