Data Engineer, PCCTC

Remote, Full-time
About Us:


The people of Memorial Sloan Kettering Cancer Center (MSK) are united by a singular mission: ending cancer for life. Our specialized care teams provide personalized, compassionate, expert care to patients of all ages. Informed by basic research done at our Sloan Kettering Institute, scientists across MSK collaborate to conduct innovative translational and clinical research that is driving a revolution in our understanding of cancer as a disease and improving the ability to prevent, diagnose, and treat it. MSK is dedicated to training the next generation of scientists and clinicians, who go on to pursue our mission at MSK and around the globe.

Exciting opportunity at MSK: Join the Prostate Cancer Clinical Trials Consortium as a Data Engineer! The Prostate Cancer Clinical Trials Consortium (PCCTC), incubated within Memorial Sloan Kettering (MSK), is seeking a Data Engineer to join a collaborative team of clinical investigators and PCCTC staff working together on a single mission: to design, implement, and complete clinical trials and observational studies in prostate cancer, translating scientific discoveries to improve standards of care. We support this work through biostatistics and data science, data management, reporting, site management, and clinical operations across the clinical study lifecycle.

You will implement and maintain our data storage and access infrastructure. You will build the systems that make clinical trial data reliably available to our Data Science Team: standing up relational database structures in AWS S3, developing ETL pipelines from EDC systems, and creating programmatic access layers that integrate with our R-based analytic workflows. We are building a forward-thinking data warehouse designed around structured, relational data principles, including CDISC standards (SDTM/ADaM) for our clinical trials and well-organized schemas for observational and non-CDISC data sources. You will help make that vision operational.

Role Overview:
Implement and maintain relational database structures for clinical trial data storage in AWS S3, using tools such as DuckDB and/or DuckLake.
Build and maintain ETL pipelines that ingest data from clinical trial data systems (e.g. EDCs), transforming raw clinical data into organized, versioned, analysis-ready datasets.
Develop access layers (database connectors, internal R packages or utilities) that enable our R-focused Data Science Team to query and retrieve data efficiently.
Implement and maintain access management and permissioning structures across data systems, including SharePoint and Airtable, ensuring consistent and scalable controls as the team and trial portfolio grow.
Maintain data governance standards, including naming conventions, versioning, and documentation, across our active trial portfolio.
Collaborate with Clinical Operations and Data Management teams to understand data flows from sites and ensure upstream processes align with downstream analytic needs.
Use GitHub Enterprise for version control and contribute to CI/CD workflows for pipeline automation where infrastructure allows.

Additionally, we wanted to share a few other tools and concepts that we work with as a team. We are excited to collaborate with you on these areas and also support your continuous development!
CDISC data standards (SDTM, ADaM) and how clinical trial data is structured
DuckDB, DuckLake, or similar analytical database technologies
Complex and relational data sets
R (you won't need to be an R programmer, but understanding how R users consume data will make you more effective)
Airtable structure and maintenance (we have several operational data systems here)
SharePoint administration and file system permissioning at scale
CI/CD for orchestrating automated data pipelines
Clinical trial data lifecycle, from EDC capture through analysis-ready datasets

Key Qualifications:
An undergraduate degree, preferably in computer science, data engineering, information systems, or a related field.
2–4 years of experience building or maintaining data pipelines, ETL processes, or database systems.
Working knowledge of SQL and relational database concepts.
Familiarity with cloud storage (AWS S3 preferred) and infrastructure-as-code principles.
Experience with access management, permissioning, or user administration across collaborative platforms.
Exposure to version control (git/GitHub).
Passion for data and creating reliable systems that empower cancer care and clinical research.

Core Skills:
Strong problem-solving and analytical thinking skills with the ability to troubleshoot complex data and system issues.
Excellent collaboration and communication skills, with the ability to work effectively across technical and non-technical teams.
Highly organized with strong attention to detail and a commitment to data accuracy, quality, and documentation.
Ability to manage multiple priorities in a fast-paced environment while meeting deadlines.
Proactive, adaptable, and eager to learn new technologies and contribute to continuous process improvement.
Self-motivated and able to work independently in a fully remote environment while remaining an engaged team member.

Additional Information:
Location: Remote
Reporting to the Director, Data Science

Helpful Links:
Compensation Philosophy
Benefits

Learn more about PCCTC: The Prostate Cancer Clinical Trials Consortium (PCCTC) was initiated in 2005 by the Prostate Cancer Foundation (PCF) and the U.S. Department of Defense (DOD) Prostate Cancer Research Program (PCRP) in response to critically unmet needs in prostate cancer clinical research identified by physician investigators and patient advocates. To fulfill our mission, we developed a unique infrastructure which has fostered a culture of transparent project co-development between investigators, research sites and industry partners. Established as an independent entity in 2014, the PCCTC, LLC is now the nation’s premier multicenter clinical research organization specializing in cutting-edge prostate cancer research.


Pay Range: $92,700.00 - $148,400.00
FLSA Status: Exempt
Closing:

At MSK, we believe in fair, competitive pay that reflects your job, experience, and skills.

MSK is an equal opportunity and affirmative action employer committed to diversity and inclusion in all aspects of recruiting and employment. All qualified individuals are encouraged to apply and will receive consideration without regard to race, color, gender, gender identity or expression, sexual orientation, national origin, age, religion, creed, disability, veteran status or any other factor which cannot lawfully be used as a basis for an employment decision.

Federal law requires employers to provide reasonable accommodation to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job or to perform your job. Examples of reasonable accommodation include making a change to the application process or work procedures, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.
