[Remote] Senior Data Engineer - AI
Note: The job is a remote job and is open to candidates in USA. Anaplan is a leader in AI-infused scenario planning and analysis platforms, helping global companies optimize their business decision-making. They are seeking a Senior Data Engineer who will set the technical direction for data ingestion, transformation, storage, and governance, building robust data pipelines for business users and supporting advanced analytics initiatives.ResponsibilitiesLead the data architecture, design, and deployment of scalable, high-throughput Big Data systems into production environmentsArchitect, deploy, and manage the foundational data systems that underlie modern AI infrastructure, including vector, NoSQL, and document databasesDevelop end-to-end data engineering solutions, including robust ETL/ELT pipelines, API services, and data ingestion frameworksDesign and build the storage and processing layers powering our analytics workloads: data lakes, data warehouses, distributed file systems, and real-time streaming architecturesEngineer feature-rich context pipelines that process large-scale enterprise data, balancing batch and streaming patterns seamlesslyOptimize and scale large distributed queries and data transformations to ensure high performance and low latency for end usersImplement data quality frameworks to measure and ensure data integrity, reliability, and governance across all data assetsCollaborate with analytics, product, and platform teams to build data models that capture the semantics of customer metrics, hierarchies, and relationshipsStay current with the modern data stack and big data landscape, evaluating new tools, distributed computing frameworks, and database technologies for potential adoptionSkills7+ years of dedicated data engineering experience, demonstrating a strong track record of hands-on execution and delivery in complex data environmentsDeep practical understanding of the database ecosystems that power AI and machine learning infrastructure (e.g., Vector databases, NoSQL, and Document stores)Hands-on experience building, scaling, and shipping large-scale data platforms in productionDeep practical experience with distributed data processing frameworks (e.g., Apache Spark, Flink, Hadoop)Strong expertise in message brokers and event streaming platforms (e.g., Apache Kafka, Kinesis)End-to-end exposure to data pipeline lifecycle development, including extensive experience with workflow orchestration tools (e.g., Apache Airflow, Dagster)Hands-on expertise with cloud data warehouses (e.g., Snowflake, BigQuery, Redshift) and data lake architectures (e.g., Databricks, Delta Lake, Apache Iceberg)Advanced SQL skills and proficiency in Python, Scala, or JavaStrong background in modern software development practices (testing, code review, CI/CD, Infrastructure as Code)Extensive, progressive experience leading technical projects and mentoring engineering teamsHands-on experience with cloud-native infrastructure (AWS, GCP, or Azure)Deep understanding of dimensional data modeling and warehouse optimization techniquesExperience implementing data observability, monitoring, and alerting frameworks at scaleBackground in enterprise software, planning, or financial analytics applicationsFamiliarity with Anaplan or similar enterprise planning platformsCompany OverviewAnaplan is a business planning software company that develops a cloud platform for decision-making with connected planning modeling. It was founded in 2006, and is headquartered in San Francisco, California, USA, with a workforce of 1001-5000 employees. Its website is https://www.anaplan.com.Company H1B SponsorshipAnaplan has a track record of offering H1B sponsorships, with 3 in 2026, 25 in 2025, 21 in 2024, 54 in 2023, 72 in 2022, 104 in 2021, 52 in 2020. Please note that this does not guarantee sponsorship for this specific role.