Web Scraper & Data Automation for SAT Question Library (Short-Term Project, Long-Term Potential)

Remote Full-time
I'm looking for a developer or data engineer who can scrape SAT-style practice questions from publicly available sources (College Board & Khan Academy) and organize them into a clean, structured spreadsheet — with potential for long-term work building AI-powered workflows for tutors and students.

This project is the first phase of a bigger vision: helping students and tutors auto-generate homework, analyze test results, and create custom practice based on standards and difficulty levels.

Initial Scope (Week 1 Target):

Scrape question data from the following sites:

https://satsuitequestionbank.collegeboard.org/

https://www.khanacademy.org/test-prep/v2-sat-math

https://www.khanacademy.org/test-prep/sat-reading-and-writing

Data should include:

Question text

Answer choices

Correct answer

Difficulty level (if available)

Subject/category/standard (as listed on site)

Direct link to the source (where available)

Output should be a clean spreadsheet or database format, such as Google Sheets or CSV

(I’m most familiar with Google Sheets)

Future/Optional Scope (Not required for this task, but helpful context):

Tagging scraped questions to our assessment standards and scoring system

Connecting questions to external practice links (e.g., IXL/Khan Academy by standard)

Auto-generating practice sets for students based on assessment performance

Triggering scraping or homework generation via Google Sheets buttons or scripts

Using this question library as the foundation to train AI models (for custom question generation, next-step recommendations, and tutor workflows)

✅ What I'm Looking For:

Proven experience with web scraping (Python preferred, but open to other tools)

Familiarity with Google Sheets automation or Airtable integrations a plus

Clean, readable data formatting — I plan to build systems on top of this!

Ability to document the scraping script in case I want to re-run it later (e.g., next year)

Bonus: Understanding of K-12 or standardized test prep platforms

Timeline:

Ideal turnaround: 3–7 days

One-time scrape to start, but may need occasional re-runs or expansion

Budget:

Fixed

Please include a quote for:

Initial scrape and data cleanup

Optional: script/handoff for future use

To Apply:

Please include:

A short note on how you’d approach this scrape (mention any tools or languages)

Links to past scraping or automation projects (especially education-related if possible)

Whether you’d be interested in follow-up work on tutoring workflows, AI training data, or homework automation

Looking forward to building something great together.



Apply Now

Apply Now
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Mental Health Administrative Virtual Assistant - In Office and/or Remote

Remote

HR Generalist

Remote

Program Development and Operations Engineer – TE3

Remote

ASIC New Product Manager, Kuiper Silicon Development Engineering

Remote

Engineer I

Remote

RN Auditor, Clinical Services remote based in WA

Remote

Remote with Flexible Hours

Remote

GIS Analyst Spring 2025 Internship (paid/remote)

Remote

Experienced Remote Customer Experience Representative – Delivering Exceptional Support and Solutions from the Comfort of Your Own Home at blithequark

Remote

Sales - Nationwide Fully Remote Work From Home

Remote
← Back