Skip to content
OpenTrain AIFor AI Companies

Data Annotator for RL Environments

Join OpenTrain AI as a remote, entry-level data annotator performing actions and locating information inside apps to generate question-answer labels. Contract, part-time role at $11/hr with a 20+ hours/week commitment using internal proprietary tooling.

OpenTrain AI

Generative AI & RLHF

100% Remote Hourly · $11/hr

$11/hr

Compensation

Worldwide

Eligibility

Entry

Experience

Jul 22, 2025

Posted

Open worldwide

Interested in this role?

Create a free OpenTrain account and apply in minutes.

About OpenTrain

OpenTrain is the #1 platform for finding and building careers in AI training and data labeling. We connect people to projects where they can start and grow careers teaching AI—working on real tasks that shape how modern AI systems behave.

Why AI training work matters

AI training (also called data labeling or human feedback work) is the human side of building artificial intelligence. Contributors annotate data, create and verify examples, and provide feedback that helps models learn. Many projects are fully remote, flexible, and accessible to people without prior experience.

  • Flexible, remote work that fits around other commitments
  • Entry-level opportunities where attention to detail matters more than prior experience
  • Direct impact on how state-of-the-art AI systems behave

The role — what you'll do

You will perform short, discrete tasks inside a web or mobile application and record the results in our internal annotation tooling. Tasks fall into two types: perform an action (for example, create, update, or delete an invoice) or find information (for example, locate an invoice number for a specific customer).

  • Execute specified actions within the app (create/update/delete records) and document outcomes
  • Search the app for requested information and record findings as answers
  • Enter answers and task metadata into internal proprietary tooling using question-answering label formats

How you'll record labels

This project uses question-answering style labels and a text data type. For each task you will capture the requested answer or the result of the action in our internal proprietary tooling so it can be used to train and evaluate reinforcement learning environments and agent behavior.

  • Provide clear, accurate text responses to posed questions
  • Log action outcomes and any relevant details as instructed
  • Follow task instructions precisely to ensure consistent, high-quality labels

Requirements

This is an entry-level contract role designed for people who can follow instructions closely and maintain high attention to detail. Candidates must be available for at least 20 hours per week and able to work as a contractor in a part-time capacity.

  • High attention to detail (explicit requirement)
  • Entry-level candidates welcome
  • Available 20+ hours per week
  • Work type: contractor, part-time

Who should apply

Apply if you want flexible, remote work contributing directly to AI systems, and if you enjoy careful, task-oriented work. This role suits people new to AI training who can consistently follow step-by-step instructions and produce accurate text-based answers.

  • People looking for part-time, remote work
  • Those who enjoy structured tasks and attention-focused work
  • Contributors interested in building experience in AI training and RL environments

Pay, schedule, and tooling

Compensation is $11 USD per hour on a pay-per-hour basis. This is a contractor, part-time position with a minimum commitment of 20 hours per week. All labeling is done in internal proprietary tooling provided by the project.

  • Rate: $11 USD per hour (pay-per-hour)
  • Minimum: 20+ hours/week
  • Employment type: contractor, part-time
  • Labeling software: internal proprietary tooling
  • Data type: text; label type: question_answering