Toloka logo
Toloka1 week ago

Senior Python Data Scraping Engineer (Freelance)

Remote · Pune, Maharashtra, India or Gurugram, Haryana, India

Type
Contract
Level
Senior Level
Education
Bachelors Degree
Company size
Unknown

Job Summary

Senior Python Data Scraping Engineer (Freelance) — Mindrift seeks an experienced, self-driven expert to own end-to-end data extraction workflows across complex websites for a Tendem project. You will leverage Apify/OpenRouter alongside custom workflows to accelerate data collection, validation, and delivery, handling JavaScript-rendered content and site changes while enforcing data quality and scalable batching. The role requires 5+ years in data engineering or web scraping, strong Python skills (BeautifulSoup/Selenium), experience with AWS/Docker, LLM frameworks (LangChain/OpenRouter), and the ability to work 10–20 hours/week remotely as a contractor. Knowledge of extracting data from complex structures, data cleaning/normalization, anti-bot handling, and delivering structured datasets (CSV/JSON) is essential. A Bachelor’s or Master’s in a technical field is preferred; GitHub presence is a plus; English proficiency B2+ is required. Project hours are estimated at 10–20 hours per week during active phases, with compensation up to $30/hour depending on level and pace.

Required Qualifications

  • At least 5+ years of relevant experience in data engineering, web scraping, automation, or software development (required)
  • Bachelor’s or Master’s Degree in Engineering, Applied Mathematics, Computer Science, or related technical fields (a plus)
  • Strong experience in Python web scraping (BeautifulSoup, Selenium or similar), including dynamic content (JS, AJAX, infinite scroll) and APIs via proxies
  • Proven ability to extract data from complex structures (hierarchies, archived pages, inconsistent HTML)
  • Solid background in data cleaning, normalization, and validation, delivering structured datasets (CSV, JSON)
  • Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scale
  • Experience with cloud infrastructure (AWS or equivalent) and containerization (Docker)
  • Hands-on experience with LLM frameworks (LangChain, OpenRouter, or similar) applied to automation tasks
  • Self-directed work ethic with ability to troubleshoot independently
  • A link to GitHub is a plus
  • English proficiency: Upper-intermediate (B2) or above (required)
  • Project time expectations: around 10–20 hours per week during active phases
  • Compensation: up to $30 per hour equivalent
Sorce

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started

Toloka

Senior Python Data Scraping Engineer (Freelance)

Apply on Sorce