Senior Python Data Scraping Engineer (Freelance)
Remote · Pune, Maharashtra, India or Gurugram, Haryana, India
Job Summary
Senior Python Data Scraping Engineer (Freelance) — Mindrift seeks an experienced, self-driven expert to own end-to-end data extraction workflows across complex websites for a Tendem project. You will leverage Apify/OpenRouter alongside custom workflows to accelerate data collection, validation, and delivery, handling JavaScript-rendered content and site changes while enforcing data quality and scalable batching. The role requires 5+ years in data engineering or web scraping, strong Python skills (BeautifulSoup/Selenium), experience with AWS/Docker, LLM frameworks (LangChain/OpenRouter), and the ability to work 10–20 hours/week remotely as a contractor. Knowledge of extracting data from complex structures, data cleaning/normalization, anti-bot handling, and delivering structured datasets (CSV/JSON) is essential. A Bachelor’s or Master’s in a technical field is preferred; GitHub presence is a plus; English proficiency B2+ is required. Project hours are estimated at 10–20 hours per week during active phases, with compensation up to $30/hour depending on level and pace.
Required Qualifications
- At least 5+ years of relevant experience in data engineering, web scraping, automation, or software development (required)
- Bachelor’s or Master’s Degree in Engineering, Applied Mathematics, Computer Science, or related technical fields (a plus)
- Strong experience in Python web scraping (BeautifulSoup, Selenium or similar), including dynamic content (JS, AJAX, infinite scroll) and APIs via proxies
- Proven ability to extract data from complex structures (hierarchies, archived pages, inconsistent HTML)
- Solid background in data cleaning, normalization, and validation, delivering structured datasets (CSV, JSON)
- Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scale
- Experience with cloud infrastructure (AWS or equivalent) and containerization (Docker)
- Hands-on experience with LLM frameworks (LangChain, OpenRouter, or similar) applied to automation tasks
- Self-directed work ethic with ability to troubleshoot independently
- A link to GitHub is a plus
- English proficiency: Upper-intermediate (B2) or above (required)
- Project time expectations: around 10–20 hours per week during active phases
- Compensation: up to $30 per hour equivalent
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.