[Remote] Senior Python Data Scraping Engineer (Freelance)
Note: The job is a remote job and is open to candidates in USA. Mindrift is looking for highly skilled Senior Python Data Scraping Engineers to join the Tendem project and drive specialized data scraping workflows within our hybrid AI + human system. In this freelance role, you'll handle data scraping tasks requiring technical precision for web extraction and processing, collaborating with Tendem Agents to ensure accurate and actionable results.ResponsibilitiesOwn end-to-end data extraction workflows across complex websites, ensuring complete coverage, accuracy, and reliable delivery of structured datasetsLeverage internal tools (Apify, OpenRouter) alongside custom workflows to accelerate data collection, validation, and task execution while meeting defined requirementsEnsure reliable extraction from dynamic and interactive web sources, adapting approaches as needed to handle JavaScript-rendered content and changing site behaviorEnforce data quality standards through validation checks, cross-source consistency controls, adherence to formatting specifications, and systematic verification prior to deliveryScale scraping operations for large datasets using efficient batching or parallelization, monitor failures, and maintain stability against minor site structure changesSkillsAt least 5+ years of relevant experience in data engineering, web scraping, automation, or software developmentEnglish proficiency: Upper-intermediate (B2) or aboveBachelor's or Master's Degree in Engineering, Applied Mathematics, Computer Science, or related technical fields is a plusStrong experience in Python web scraping (BeautifulSoup, Selenium or similar), including dynamic content (JS, AJAX, infinite scroll) and APIs via proxiesProven ability to extract data from complex structures (hierarchies, archived pages, inconsistent HTML)Solid background in data cleaning, normalization, and validation, delivering structured datasets (CSV, JSON, Google Sheets)Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scaleExperience with cloud infrastructure (AWS or equivalent) and containerization (Docker) as part of real workflowsHands-on experience with LLM frameworks (LangChain, OpenRouter, or similar) applied to automation tasksStrong attention to detail and commitment to data accuracySelf-directed work ethic with ability to troubleshoot independentlyA link to GitHub is a plusCompany OverviewMindrift is a company and a global leader in data services for AI. It was founded in undefined, and is headquartered in , with a workforce of 501-1000 employees. Its website is https://mindrift.ai.