Data Annotator for RL Environments

Join OpenTrain AI as a remote, entry-level data annotator performing actions and locating information inside apps to generate question-answer labels. Contract, part-time role at $11/hr with a 20+ hours/week commitment using internal proprietary tooling.

Generative AI & RLHF

100% Remote · Hourly · $11/hr

Compensation: $11/hr
Eligibility: Worldwide
Experience: Entry
Posted: Jul 22, 2025


About OpenTrain

OpenTrain is the #1 platform for finding and building careers in AI training and data labeling. We connect people to projects where they can start and grow careers teaching AI by working on real tasks that shape how modern AI systems behave.

Why AI training work matters

AI training (also called data labeling or human feedback work) is the human side of building artificial intelligence. Contributors annotate data, create and verify examples, and provide feedback that helps models learn. Many projects are fully remote, flexible, and accessible to people without prior experience.

Flexible, remote work that fits around other commitments
Entry-level opportunities where attention to detail matters more than prior experience
Direct impact on how state-of-the-art AI systems behave

The role — what you'll do

You will perform short, discrete tasks inside a web or mobile application and record the results in our internal annotation tooling. Tasks fall into two types: perform an action (for example, create, update, or delete an invoice) or find information (for example, locate an invoice number for a specific customer).

Execute specified actions within the app (create/update/delete records) and document outcomes
Search the app for requested information and record findings as answers
Enter answers and task metadata into internal proprietary tooling using question-answering label formats

How you'll record labels

This project uses question-answering style labels and a text data type. For each task you will capture the requested answer or the result of the action in our internal proprietary tooling so it can be used to train and evaluate reinforcement learning environments and agent behavior.

Provide clear, accurate text responses to posed questions
Log action outcomes and any relevant details as instructed
Follow task instructions precisely to ensure consistent, high-quality labels
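
For illustration only, a question-answering text label might be represented along the lines of the sketch below. The project's internal tooling and its actual schema are not described in this posting, so the class name, field names, task-type values, and the sample invoice data are all hypothetical.

```python
import json
from dataclasses import dataclass, asdict

# Hypothetical task types, mirroring the two kinds of task described above.
TASK_TYPES = {"action", "find_information"}


@dataclass
class QALabel:
    """One question-answering label with a text answer (assumed shape)."""
    task_type: str   # "action" or "find_information"
    question: str    # the instruction or question given to the annotator
    answer: str      # the text answer, or the outcome of the action
    notes: str = ""  # any extra details the task instructions ask for

    def validate(self) -> None:
        # Reject unknown task types and empty text fields.
        if self.task_type not in TASK_TYPES:
            raise ValueError(f"unknown task_type: {self.task_type!r}")
        if not self.question.strip() or not self.answer.strip():
            raise ValueError("question and answer must be non-empty text")


def to_json(label: QALabel) -> str:
    """Validate a label and serialize it as a JSON string."""
    label.validate()
    return json.dumps(asdict(label), sort_keys=True)


# Made-up example data for a "find information" task.
example = QALabel(
    task_type="find_information",
    question="What is the invoice number for customer Acme Co.?",
    answer="INV-0042",
)
print(to_json(example))
```

The validation step reflects the emphasis on precise, consistent labels: an empty answer or an unrecognized task type is rejected before the record is saved.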

Requirements

This is an entry-level contract role designed for people who can follow instructions closely and maintain high attention to detail. Candidates must be available for at least 20 hours per week and able to work as a contractor in a part-time capacity.

High attention to detail (required)
Entry-level candidates welcome
Available 20+ hours per week
Work type: contractor, part-time

Who should apply

Apply if you want flexible, remote work contributing directly to AI systems, and if you enjoy careful, task-oriented work. This role suits people new to AI training who can consistently follow step-by-step instructions and produce accurate text-based answers.

People looking for part-time, remote work
Those who enjoy structured, detail-oriented tasks
Contributors interested in building experience in AI training and RL environments

Pay, schedule, and tooling

Compensation is $11 USD per hour, paid hourly. This is a part-time contractor position with a minimum commitment of 20 hours per week. All labeling is done in internal proprietary tooling provided by the project.

Rate: $11 USD per hour
Minimum: 20 hours/week
Employment type: contractor, part-time
Labeling software: internal proprietary tooling
Data type: text; label type: question_answering