Vibecode Specialist — Web Scraping & Data Extraction

Join a remote, part-time contractor role extracting structured data from complex, JS-heavy websites using Python, Apify/OpenRouter, and your own scripts. $20/hr, 20+ hours/week; B2+ English and 1+ year relevant experience required.

OpenTrain AI · Coding & Software · 100% Remote · Hourly

Compensation: $20/hr
Eligibility: Worldwide
Experience: Intermediate
Posted: Feb 26, 2026

About OpenTrain

OpenTrain is the #1 platform for people starting and building careers in AI training and data labeling. We connect contributors with projects that shape how modern AI systems behave, offering flexible, remote work across a wide range of domains.

This role is posted through OpenTrain and gives you direct access to impactful, hands-on work that feeds real model training pipelines.

Why AI Training Work Matters

AI training (data labeling/annotation) is the human foundation behind modern machine learning: people prepare, clean, and validate the examples models learn from. Contributors often work remotely, set flexible hours, and gain exposure to cutting-edge tools and workflows.

This project pairs automated agents with human experts: AI handles repetitive steps while you apply critical thinking, troubleshooting, and quality control to produce reliable structured datasets.

  • 100% remote — work from anywhere with an internet connection
  • Flexible, part-time work that fits around other commitments
  • Contribute to real datasets used for fine-tuning and evaluation

The Role

We’re hiring a Vibecode Specialist to own end-to-end web scraping and data extraction workflows for complex sites. You’ll extract, validate, normalize, and deliver structured text datasets (CSV/JSON/Sheets) while collaborating in a hybrid AI + human setup.

This is a contractor, part-time role: expect 20+ hours per week, paid hourly at $20 USD.

  • Contractor, part-time (20+ hours/week)
  • Pay: $20 USD per hour
  • Location: Global — any country; work remotely
  • Label types for this project: Fine-tuning, Evaluation/Rating, Computer Programming/Coding
  • Tools: project-specific tools plus your own scripts

What You’ll Do

You’ll design and execute resilient scraping strategies for dynamic, JavaScript-rendered websites, extract multi-level site data, and deliver clean, validated datasets. The work involves troubleshooting failures, building fallbacks and retries, and scaling jobs via batching or parallelization.

  • Scrape JS-heavy pages (infinite scroll, AJAX, dynamic content) using Python tools and browser automation
  • Extract across hierarchical structures (category → entity → details) and reconcile nested data
  • Clean, normalize, and validate outputs to meet formatting specs (CSV/JSON/Google Sheets)
  • Use internal tools (Apify, OpenRouter) alongside your own scripts and workflows
  • Leverage LLMs/AI tools to accelerate tasks such as extraction assistance and prompt-based automation
  • Document edge cases, failures, and remediation steps clearly
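
As a rough illustration of the "fallbacks and retries" mentioned above, here is a minimal Python sketch using only the standard library. Everything in it is an assumption made for illustration, not part of the project tooling: the helper names (with_retries, first_match), the regex patterns, and the page markup are hypothetical, and the regexes stand in for the BeautifulSoup or Playwright selectors a real job would use.

```python
import re
import time
from typing import Callable, Optional, Sequence


def with_retries(fetch: Callable[[], str], attempts: int = 3,
                 base_delay: float = 0.5,
                 sleep: Callable[[float], None] = time.sleep) -> str:
    """Call fetch() until it succeeds, doubling the wait after each failure."""
    last_error: Optional[Exception] = None
    for attempt in range(attempts):
        try:
            return fetch()
        except Exception as exc:  # a real job would catch narrower errors
            last_error = exc
            if attempt < attempts - 1:
                sleep(base_delay * (2 ** attempt))
    raise RuntimeError(f"gave up after {attempts} attempts") from last_error


def _search(pattern: str, html: str) -> Optional[str]:
    """Return the first capture group of pattern in html, or None."""
    match = re.search(pattern, html, flags=re.IGNORECASE | re.DOTALL)
    return match.group(1) if match else None


# Hypothetical extractors, ordered from most to least specific.
TITLE_EXTRACTORS = [
    lambda html: _search(r'<h1[^>]*class="product-title"[^>]*>(.*?)</h1>', html),
    lambda html: _search(r'<meta property="og:title" content="(.*?)"', html),
    lambda html: _search(r"<title>(.*?)</title>", html),
]


def first_match(html: str,
                extractors: Sequence[Callable[[str], Optional[str]]]) -> Optional[str]:
    """Try each extractor in order and return the first non-empty result."""
    for extract in extractors:
        value = extract(html)
        if value and value.strip():
            return value.strip()
    return None
```

In this sketch, a page that lacks the primary selector still yields a value from a later extractor, and a page that matches nothing returns None so the miss can be logged as an edge case instead of crashing the run.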

Requirements

You must meet the experience and language requirements listed below, which are taken in full from the project brief.

  • 1+ year experience in at least one: web scraping, data engineering, software development, automation, or data analysis
  • Strong Python web scraping skills (e.g., BeautifulSoup + Selenium/Playwright or equivalents)
  • Proven experience scraping dynamic/JS-heavy sites (infinite scroll, AJAX, JS-rendered content)
  • Experience extracting from multi-level/hierarchical site structures
  • Ability to implement resilient scraping strategies (selectors, fallbacks, retries)
  • Skill cleaning, normalizing, and validating data; deliverables in CSV, JSON, or Google Sheets
  • Experience batching/parallelization or other scaling approaches for large scraping jobs
  • Familiarity using LLMs/AI tools for prompting, automation, or extraction assistance
  • English level B2+ (upper-intermediate or fluent) with ability to follow detailed specs and document edge cases
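
To make the cleaning, validation, and batching items above more concrete, the following is a small Python sketch using only the standard library. The schema (category, name, price), the function names, and the batch and worker sizes are all hypothetical; actual field names and formatting rules come from the project specs.

```python
import csv
import io
from concurrent.futures import ThreadPoolExecutor
from itertools import islice

REQUIRED_FIELDS = ("category", "name", "price")  # hypothetical schema


def normalize(record):
    """Collapse whitespace in every field and format price to two decimals."""
    clean = {key: " ".join(str(value).split()) for key, value in record.items()}
    raw_price = clean.get("price", "").replace("$", "").replace(",", "")
    try:
        clean["price"] = f"{float(raw_price):.2f}"
    except ValueError:
        clean["price"] = ""  # leave blank so validate() flags it
    return clean


def validate(record):
    """Return the names of required fields that are missing or empty."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]


def batched(items, size):
    """Yield consecutive lists of at most `size` items."""
    iterator = iter(items)
    while chunk := list(islice(iterator, size)):
        yield chunk


def process_batch(batch):
    return [normalize(record) for record in batch]


def run(records, batch_size=2, workers=4):
    """Normalize records batch by batch on a thread pool, keeping input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(process_batch, batched(records, batch_size))
        return [row for batch in results for row in batch]


def to_csv(rows):
    """Serialize rows to CSV text, writing only the required columns."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=REQUIRED_FIELDS,
                            extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Threads are used here on the assumption that real batches would be dominated by network waits; rows for which validate() returns a non-empty list would be held back and documented rather than delivered.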

Who Should Apply

This role suits someone comfortable writing and maintaining Python scraping code, troubleshooting fragile pipelines, and producing production-ready datasets. You should enjoy a mix of engineering, data-cleaning, and process documentation within a hybrid AI-assisted workflow.

Applicants with domain experience in data engineering or software engineering who want flexible, remote, part-time work are encouraged to apply.

  • Intermediate-level professionals with practical scraping experience
  • People who can communicate technical issues and edge cases clearly in English
  • Contributors who prefer project variety and hands-on ownership of deliverables

How It Works

Apply through OpenTrain and attach examples of prior scraping projects or a brief technical summary of relevant work. Successful applicants will receive project specs, access to project tools, and onboarding instructions.

You will collaborate with AI agents and other contributors: follow detailed specs, report issues, and submit validated datasets on schedule.

  • Onboarding includes project-specific guidelines, data schemas, and QA checkpoints
  • Deliverables accepted as CSV/JSON/Google Sheets per spec
  • This is a contractor engagement — ensure you can commit to 20+ hours/week