Principal Data Engineer

Remote Full-time
Octus

Octus is a leading global provider of credit intelligence, data, and analytics. Since 2013, tens of thousands of professionals across hedge fund, investment banking, management consulting, and law firm verticals have come to rely on Octus to make better, faster, and more confident decisions in pace with the fast-moving credit markets. For more information, visit:

Working at Octus

Octus hires growth-minded innovators and trailblazers across the globe to drive our business and culture. Our core values – Action Oriented, Customer First Mindset, Effective Team Players, and Driven to Excel – define an organizational ethos that’s as high-performing as it is human. Among other perks, Octus employees enjoy competitive health benefits, matched 401k and pension plans, PTO, generous parental leave, gym subsidies, educational reimbursements for career development, recognition programs, pet-friendly offices (US only), and much more.

Role

Octus is seeking a Principal Data Engineer to roll up their sleeves and build scalable, production-grade data pipelines and infrastructure. You'll be a hands-on technical leader: writing code daily, solving hard engineering problems, and helping elevate the team around you through doing. Spanning Snowflake, Databricks, and AWS, you'll be deeply involved in the day-to-day development of the data platform that powers Octus's products, data, and automation initiatives. The ideal candidate is an expert in Python and SQL who thrives in an execution-focused environment and has deep experience building modern data pipelines and lakehouse solutions.

Responsibilities

Build and maintain end-to-end data pipelines, from raw ingestion through transformation and delivery, across diverse data sources (APIs, web data, internal feeds, etc.).
Hands-on development of scalable, production-grade pipelines within Databricks, including Delta Lake table management, Workflows, and cluster optimization.
Build and maintain data models, schemas, and transformation logic in Snowflake, optimizing for performance and reliability.
Develop and manage Databricks environments including Unity Catalog, Delta Live Tables, and integration patterns that support both internal data consumers and external sharing use cases.
Build and manage orchestration workflows using AWS services (MWAA/Airflow, Lambda, ECS, SQS, MSK) and Databricks-native orchestration where appropriate.
Implement and maintain infrastructure as code (IaC) using Terraform, ensuring reproducibility and compliance with cloud standards.
Establish and enforce best practices in data modeling, schema design, and ETL/ELT processes for high-volume structured and semi-structured data across Snowflake and Databricks.
Ensure data quality, lineage, and observability through automated testing, monitoring, and alerting across all pipeline layers.
Collaborate closely with technology leadership to align data platform development with business strategy and product goals.
Stay at the forefront of industry trends in data engineering, lakehouse architecture, and cloud-native data platforms.

Requirements

Strong foundation in software engineering principles, including SOLID design, modularity, and scalability.
Expert proficiency in Databricks, including Delta Lake, Unity Catalog, Delta Live Tables, MLflow, and Databricks Workflows.
Deep experience with Snowflake, including data modeling, performance optimization, and integration with upstream/downstream pipeline tooling.
Expert proficiency in Python for data pipeline and automation development.
Advanced SQL skills with experience optimizing complex queries and data models at scale.
Proven experience designing and maintaining cloud-native data pipelines on AWS (e.g., MWAA/Airflow, Lambda, ECS, SQS, Glue, S3, Redshift, etc.).
Experience implementing and managing Terraform or similar IaC frameworks.
Strong understanding of lakehouse architecture patterns, data ingestion, transformation, and orchestration, including familiarity with ML/AI pipeline integration patterns.
Familiarity with CI/CD pipelines, automated testing, and modern DevOps practices.
8+ years of experience in data engineering or backend development, with a focus on scalable data solutions.
Demonstrated experience leading data infrastructure projects end-to-end and mentoring senior engineers.
Familiarity with containerization (Docker) and workflow orchestration best practices.
Excellent communication, collaboration, and problem-solving skills.

Nice to Have

Experience with streaming data technologies (Kafka, Kinesis, Flink).
Exposure to ML/AI pipeline patterns (feature stores, experiment tracking, model serving) and MLOps tooling, particularly in a cross-functional team environment.
Experience integrating data quality and observability tools.
Experience with Databricks as a data sharing and collaboration platform (Delta Sharing, Marketplace).
Familiarity with Claude Code or similar AI-powered developer tools for accelerating pipeline development and code workflows.

At Octus, we consider a range of factors in connection with compensation decisions, including experience, skills, location, and our business needs and limitations. As a result, compensation may vary within and across similar roles and positions. Please note that the salary range information below is a good faith estimate for this position and actual compensation for any individual may fall outside this range if warranted by the circumstances applicable to that individual. If we identify a role that would be suitable for a broader range of skills and experience such that we would consider hiring at multiple levels then the range listed below may reflect that breadth.

The salary range estimate for this position is $175,000 - $220,000. The actual compensation will be at Octus' sole discretion and will be determined by the aforementioned and other relevant factors.

Equal Employment Opportunity

Octus is committed to providing equal employment opportunities to all employees and applicants for employment without regard to race, colour, religion, sex, sexual orientation, gender identity, national origin, age, disability, genetic information, marital status, pregnancy, veteran status, or any other legally protected status. We strive to create an inclusive and diverse work environment where all individuals are valued, respected, and treated fairly. We believe that diversity enriches our workplace and enhances our ability to innovate and succeed.

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

**Experienced Part-Time Remote Data Entry Specialist – Web & Cloud Application Development**

Remote

Senior Supply Chain Engineering Manager – Remote Opportunity at PepsiCo: Transforming the Future of Food and Beverage Supply Chain Excellence

Remote

Kubernetes Engineer Remote

Remote

**Experienced Data Entry Clerk – High-Speed Data Management for blithequark**

Remote

[Remote] Senior Consultant, Government Assurance

Remote

HIPAA Compliance Assessor/Consultant (Remote)

Remote

**Experienced Remote Chat Support Specialist – Public Relations and Digital Strategy**

Remote

Administrative Professional

Remote

Centralized Monitor Tech | 24 hours per week | Telemetry

Remote

Blue Team Cybersecurity Consultant (EDR/IR)

Remote