[Remote] Senior Software Engineer - Orchestration & Job Execution
Note: The job is a remote job and is open to candidates in USA. Alteryx is a leading company in data analytics, automation, and AI, seeking a Senior Software Engineer to join their cloud platform engineering team. The role involves designing and building backend services for orchestration and job execution, ensuring reliability and scalability in cloud-native systems.ResponsibilitiesDesign and deliver backend services, APIs, workers, and shared libraries that power cloud platform capabilitiesBuild and improve systems for orchestrating work across services, including job submission, execution tracking, status propagation, retries, cancellation, results, and operational visibilityDevelop reliable asynchronous and event-driven systems using queues, messaging, background workers, and durable stateWork on distributed execution flows across platform services, including service-to-service communication, routing, acknowledgements, and failure recoveryBuild and maintain cloud-native runtime infrastructure using containers, Kubernetes, deployment automation, and related platform toolingIntegrate platform services with persistence layers, event streams, REST APIs, and internal service contractsImprove production reliability through metrics, tracing, structured logging, health checks, dashboards, alerting, runbooks, and incident follow-upLead technical design for ambiguous or cross-service work, review code with a systems mindset, and mentor engineers on distributed-system and production-engineering practicesCollaborate with partner teams to turn product requirements into incremental, testable, and operable platform capabilitiesUse AI and modern development tools to improve engineering productivity, code quality, and delivery speedSkills5+ years preferred (4+ years minimum) of professional software development experience, with meaningful ownership of production backend services, cloud platform capabilities, or distributed systemsStrong experience building backend services using TypeScript/Node.js, or core systems languages like Go, Java, or Rust (with a willingness to code primarily in Node.js)Experience designing and operating asynchronous, queue-driven, or event-driven systems, including patterns such as retries, cancellation, idempotency, concurrency, ordering, timeouts, and failure handlingExperience working with durable persistence, service contracts, RESTful APIs, and integrations across multiple services or platform componentsExperience with production systems including containers, Kubernetes or similar orchestration platforms, service health, scaling behavior, and operational debuggingStrong production engineering ownership, including testing, observability, structured logging, metrics, tracing, incident response, and continuous reliability improvementAbility to lead design discussions, communicate technical tradeoffs clearly, mentor other engineers, and drive cross-team work through ambiguity with an ownership-oriented mindset3+ years of Python and C++ design, development, and debugging experience preferably leveraging open source and open standardsDesign, implement, and maintain embedded Python runtime integration in a predominantly C++ engine/host environmentOwn and evolve the Alteryx Python Tool including C++ plugin engines and process lifecycle (server startup, persistence, shutdown)Develop and troubleshoot SDK engine plugin components (e.g., gRPC-based engine plugins, streaming pipelines) in C++ with Python-facing APIsDebug complex native/runtime issues involving DLL/.pyd conflicts, OpenSSL and other native libraries across multiple Python versionsLead modernization work around virtualenv/venv management and installer/packaging plumbing, including WiX (installer) and native DLL exportsMaintain and extend native/compiled Python extensions, ensuring compatibility with modern NumPy/CPython ABIsCollaborate with security and platform teams to remediate native library vulnerabilities (e.g., c-ares, libxml2, SQLite, OpenSSL) and keep the SBOM healthyImprove and support developer SDKs (v1/v2), including debugging C++/Python streaming and serialization issues for 1P and 3P tool authorsDrive reliability and performance improvements in engine ↔ Python bridges, focusing on deadlocks, crashes, and high-throughput streaming scenariosContribute to and maintain CI/CD pipelines and code-quality tooling (e.g., C++ docs jobs, coverage, static analysis) affecting C++/Python hybrid reposAuthor and maintain architecture and operational runbooks for C++/Python integration points, including upgrade playbooks for new Python/OpenSSL versionsMentor other engineers in best practices for native–Python interop, debugging cross-language issues, and designing robust extension pointsExperience with (REST) API and/or SDK developmentMS/BS degree in Computer Science or equivalent experienceExperience with object oriented and functional design patternsExperience using Git and Git-based pipelines or equivalentExperience mentoring and developing othersStrong skills in critical thinking, decision making, problem solving, and attention to detailGrowth focused and curious about new challenges and experiencesExperience with workflow engines, orchestration systems, job runners, task execution platforms, or runtime infrastructureExperience with Kubernetes deployment patterns, GitOps, multi-cluster environments, cloud platform operations, or infrastructure automationExperience with messaging systems, event contracts, schema management, asynchronous notification pipelines, or distributed service coordinationExperience with real-time communication, long-lived connections, service-to-service messaging, or coordination across unreliable networksExperience with secure service-to-service authentication, token handling, secrets management, authorization boundaries, or customer-scoped execution contextsExperience with log collection, log streaming, distributed workload observability, operational dashboards, or production support toolingBachelor's degree in Computer Science, Engineering, or equivalent practical experienceNode.jsFamiliarity with cloud computing / managed services (GCP/Azure/AWS)Experience or familiarity with AI-driven development in a modern IDEFront end experience in React or a similar framework including Javascript and JSONExperience with optimizing protocols and building efficient RPC systemsNetworking & concurrency experienceKnowledge and experience with distributed computing, big data and stream processing systemsContainer experience: Docker, KubernetesRust and/or Golang familiarityExperience with a data prep and blend and predictive analytics workflow platform such as AlteryxBenefitsEmployees may also be eligible for a wide range of other benefits, such as a bonus or commission, medical, retirement, financial, wellness, time off, employee discounts, and others.Alteryx has amazing benefits for all Associates which can be viewed here.Company OverviewAlteryx is a leading provider of an end to end data science & analytics platform for the enterprise It was founded in 2011, and is headquartered in Irvine, California, USA, with a workforce of 1001-5000 employees. Its website is https://alteryx.com.