[Remote] Senior Autonomy Data Engineer

Remote Full-time
Note: This is a remote position open to candidates in the USA.

Torc Robotics is a leader in autonomous driving technology, focusing on developing software for automated trucks. They are seeking a Senior Autonomy Data Engineer to design and operate the data infrastructure that supports their autonomy program, ensuring reliable data pipelines and effective collaboration with cross-functional teams.

Responsibilities
- Own the design and organization of the program’s data lake, including schema definitions, partitioning strategy, and metadata indexing
- Design and maintain end-to-end pipelines that reliably ingest high-bandwidth sensor logs from vehicles into cloud storage and tolerate ad-hoc and intermittent connectivity
- Develop data validation and integrity checks that detect corrupted information, missing sensors, and inconsistent calibration before the data is processed by downstream systems
- Implement retention, tiering, and lifecycle policies for data to balance storage costs with development value
- Build tooling to query raw logs to produce curated training and evaluation datasets
- Build automation to run cost-effective pseudo-labeling workflows at the scale of data ingest
- Implement data quality and model performance metrics that direct labeling effort toward the highest-value examples
- Deploy and maintain data visualization tooling to support log review, annotation QA, and autonomy debugging workflows
- Build integrations between the visualization tooling and the data lake so engineers can navigate from a dataset entry or model failure directly to the origin log data
- Work with autonomy engineers to define and surface custom visualization panels and implement metrics for analyzing unstructured operating environments
- Build dashboards that give autonomy engineers visibility into data coverage by terrain type, operating environment, and geographic region
- Establish and document data contracts between the data services and model training consumers
- Partner with perception, planning, and embedded engineers across the data lifecycle: from shaping the logging schemas and collection triggers to defining the dataset interfaces that supply model training and evaluation
- Define data engineering standards, best practices, and tooling choices for an innovative and fast-paced team
- Contribute to the data roadmap and provide input to technical leadership on investment priorities
- Mentor junior engineers and raise the team’s capabilities in data infrastructure scalability and operational hygiene

Skills
- Bachelor's degree in Computer Science, Computer Engineering, Software Engineering, Electrical Engineering, or a related field with 6+ years of data engineering experience, or a Master's with 4+ years
- Strong proficiency in Python and SQL, with demonstrated ability to build production-quality data pipelines
- Deep experience with cloud data infrastructure (AWS preferred: S3, Glue, Athena, Redshift, or equivalent) and infrastructure-as-code tools (Terraform, CloudFormation)
- Solid understanding of data partitioning strategies and columnar storage formats (Parquet, ORC, etc.)
- Experience building and operating data pipelines that process time-series and binary data
- Proven ability to judge when to integrate open-source tooling and when to build from scratch
- Strong instincts for delivering data quality through first-class implementations of monitoring, validation, and lineage tracking
- Experience with autonomous vehicles, robotics, or other sensor-driven autonomous systems
- Deep experience with Foxglove or Rerun beyond basic playback, e.g. building custom extensions or integrating them into a structured log review or annotation QA workflow
- Familiarity with the MCAP CLI and/or Python library, and experience converting MCAP data to columnar data formats for further querying and processing
- Experience with data curation for ML training, e.g. diversity sampling, pseudo-labeling, and dataset versioning

Benefits
- A competitive compensation package that includes a bonus component and stock options
- 100% paid medical, dental, and vision premiums for full-time employees
- 401(k) plan with a 6% employer match
- Flexibility in schedule and generous paid vacation (available
