Do I need a formal machine learning degree to do this work?

Not always. Many projects prioritize practical experience and the ability to apply ML reasoning to labeling and evaluation tasks. Demonstrable experience—such as work on datasets, model deployments, code review, or research—can be as persuasive as formal credentials. Highlight concrete examples on your OpenTrain profile to show your expertise.

Are these machine learning roles remote and flexible?

Yes. AI-training and labeling work on OpenTrain is typically remote and can often be done on a flexible schedule. Projects vary in their timing and responsiveness requirements—some allow asynchronous, part-time contributions while others require more consistent availability for quality control or collaboration.

How is work structured and how is pay typically set?

Work is usually project- or task-based and managed by the client. Compensation models vary and are set by each project; common structures include per-task, hourly, or milestone-based payments. Projects often use qualification tasks to evaluate contributors before assigning paid batches. OpenTrain lists role requirements and next steps for each opportunity.

How do I apply and stand out on OpenTrain?

Create a detailed profile that emphasizes your ML-relevant work: datasets you’ve curated, models you’ve evaluated, guideline authorship, or relevant code examples. When applying, tailor your responses to the listing, complete any qualification tasks carefully, and provide clear reasoning for your decisions. Reliable delivery and helpful feedback in trials lead to more invitations.

What tools or outputs should I expect to work with?

Projects use a range of annotation interfaces, spreadsheets, code snippets, and review dashboards. You may produce labeled examples, written justifications for labels, revised guidelines, or evaluation reports. Familiarity with common data formats (CSV, JSON) and basic tooling for viewing text, images, or audio is helpful.

Remote machine learning jobs

Machine Learning subject-matter roles apply technical knowledge to the human side of building AI. Work may include reviewing annotations, designing labeling instructions, evaluating model outputs, and giving domain-aware corrections that help models learn more accurately. OpenTrain connects ML specialists with short-term and ongoing AI-training projects. Create a free profile, highlight your expertise, and apply to projects that need ML-level judgment and domain context.

69 open positions

Integration Developer (API Specialist)

Join OpenTrain to train and evaluate AI systems focused on API integrations and interoperability, working remotely 20+ hrs/week as a contractor. Design prompts, assess AI-generated integration plans and payloads, and troubleshoot REST API and webhook workflows for $15–$45/hr.

Posted Mar 29, 2026

AI Workflow Engineer (LLM Integration & Prompt Engineering)

Join a remote, contract role (20+ hrs/week) building LLM-powered automation pipelines: design and refine prompts, integrate LLM APIs, and perform rubric-based evaluation and QA. Pays $15–$45/hr; intermediate-level work focused on evaluation, RLHF, and function-calling.

Posted Mar 29, 2026

Fine-Tune Qwen3-Coder on JUCE/C++ Audio DSP — Dataset Curation & QLoRA Training

Build a 3,000–5,000 example supervised fine-tuning dataset and run QLoRA on Qwen3‑Coder focused on JUCE/C++ audio DSP. Expert C++/JUCE and DSP background required; 20+ hrs/week, fixed-price contract of $1,000.

Posted Feb 27, 2026

Evaluation Scenario Writer - AI Agent Testing Specialist

Design structured evaluation scenarios and gold-standard behavior for LLM-based agents in a remote, part-time contractor role (20+ hrs/week) paying $18–$24/hr. Requires QA/test-case experience and basic Python and JavaScript skills.

Posted Jan 13, 2026

Senior Data Science AI Task Designer (Python & SQL, 5+ yrs)

Design realistic, end-to-end, computationally intensive data science problems to train and evaluate advanced AI systems; requires Master’s/PhD, 5+ years’ experience, expert Python and strong SQL. Remote contract, part-time (<20 hrs/week) at $50/hr.

Posted Dec 3, 2025

AI Red Team Engineer for LLMs (Security Certification Required)

Part-time remote red team role evaluating LLMs and AI agents for safety and security, requiring an advanced English level, a technical degree, and at least one verifiable cybersecurity/red-team certification. Flexible hours, contractor work, compensation varies by location up to $55/hr.

Posted Nov 17, 2025

AI Data Labeling for an Innovative Startup backed by 10M in Funding

Join a deeptech AI research startup as a part-time contractor testing an internal AI coding tool and earn $100/hour for up to 20 hours/week. You should have solid Python experience and familiarity with AI coding assistants like Cursor, Windsurf, or Claude Code.

Posted Oct 17, 2025

AI Red Team Engineer — LLM Security & Pentesting (C1 English)

Part-time contract role applying offensive security and LLM red-teaming skills to evaluate models, agents, and RAG pipelines; $40/hr, <20 hrs/week. Must have hands-on pentesting experience, Python/Bash/PowerShell skills, C1 English, and be able to take a HackerRank + platform test immediately.

Posted Oct 6, 2025

Freelance Software Developer – AI Trainer / QA (Python, JavaScript, TypeScript, Rust, Ruby) [AS-L]

Apply specifying which primary language you’re applying for (Python, JavaScript/TypeScript, Rust, or Ruby). Contract role: review model-generated code, annotate results, and write evaluation prompts — 20+ hrs/week at $15/hr.

Posted Aug 1, 2025

Python Infrastructure Engineer — LLM Training & Agent Tooling [US‑CA]

Build and own Python infrastructure for LLM training and agent evaluation as a remote part-time contractor (US & Canada). Requires 5+ years Python, Docker, CI/CD, FastAPI/Flask and a test-driven, security-aware mindset; pay tiers Junior $34, Mid $37, Senior $42/hr.

Posted Jul 25, 2025

Python Infrastructure Engineer — LLM Training & Agent Tooling [AS‑L]

Join an OpenTrain project building the infrastructure that powers LLM training and agent evaluation: design sandboxes, CI/CD, Dockerized services, and developer tooling. Remote contract work for Asia‑Low candidates, 20+ hrs/week, tiered hourly pay $9–$16.

Posted Jul 25, 2025

Snr Code Reviewer - TypeScript (React)

Audit AI-generated TypeScript + React code by installing dependencies, compiling with tsc, running snippets in a sandbox, and correcting mis-ratings with clear feedback; remote contractor role, 20+ hrs/week at $25/hr.

Posted Jul 8, 2025

Senior Python Code Reviewer (DOCKER PROFICIENCY REQUIRED)

Join a leading AI-training platform to audit and validate AI-generated Python code, running containerized proof-of-work checks, catching rating errors, and writing concise feedback. Part-time contractor role (under 20 hrs/week), remote, $18/hr—requires 7+ years Python experience and Docker proficien

Posted Jul 7, 2025

YOLOv7 Expert Needed to Review AI-Generated Object Detection Code

Experienced YOLOv7 developer needed to assess AI-generated prompts, code snippets, and recommendations for technical accuracy, efficiency, and deployment feasibility. Contract, remote role at $30/hr for under 20 hours/week focused on structured interviewing and code review.

Posted Mar 10, 2025

YOLO (OpenCV) Expert for Reviewing AI-Generated Code & Responses

Experienced OpenCV/YOLO developer needed to evaluate and improve AI-generated code, explanations, and recommendations for real-time object detection workflows. Remote, contract, part-time work at $30/hr reviewing correctness, performance, and best practices.

Posted Mar 10, 2025

PyTorch YOLO Developer Needed for AI Model Evaluation & Code Review

Remote, part-time contract reviewing AI-generated PyTorch/YOLO code and explanations; provide technical feedback, code reviews, and interview-style assessments. $25/hour, <20 hrs/week, worldwide.

Posted Mar 10, 2025

LangChain v2 Developers Needed for AI Code Review & Evaluation

Review and evaluate AI-generated LangChain v2 code, prompts, and workflows; provide structured feedback and run technical interviews. Part-time remote contractor role (under 20 hrs/week) at $20/hr — requires hands-on LangChain v2 experience and strong English.

Posted Mar 10, 2025

OpenAI (Cookbook) Developer Needed for AI Code Review & Evaluation

Analyze and label AI-generated code and explanations derived from OpenAI Cookbook patterns, provide structured technical feedback, and run focused technical interviews; $20/hr, remote, part-time (under 20 hrs/week). Ideal for developers with hands-on OpenAI API experience and strong English.

Posted Mar 10, 2025

OpenAI (Azure) Developer Needed for AI Code Review & Evaluation

Experienced Azure OpenAI developer needed to review AI-generated code and run technical interviews to label and improve model outputs. Part-time, remote contract work helping train AI to give accurate, Azure-specific guidance.

Posted Mar 10, 2025

Transformers (Hugging Face) Developer Needed for AI Code Evaluation

Experienced Transformers developer needed to evaluate AI-generated Transformers code, provide structured feedback, and run technical interviews to vet candidates. Part-time contractor role, remote, under 20 hrs/week at $27/hr focused on labeling and code-quality assessment.

Posted Mar 10, 2025

ChromaDB Developer Needed for AI Code Review & Evaluation

Review, label, and improve AI-generated ChromaDB code and responses to optimize vector search and retrieval; $25/hr, under 20 hrs/week, fully remote. Must have hands-on ChromaDB/vector DB experience and strong English communication for structured feedback.

Posted Mar 10, 2025

LlamaIndex Developers Needed for AI Code Review & Evaluation

Experienced LlamaIndex developer needed to review AI-generated LlamaIndex code, label outputs, and run structured technical interviews; $25/hr, under 20 hrs/week, fully remote. Use your RAG, indexing, and vector DB expertise to give clear, actionable feedback and screen candidates.

Posted Mar 10, 2025

LLM model trainer for a medical scoring system

Collect urine output and serum creatinine data from the MIMIC‑4 database to build a training dataset for an LLM-based event prediction model; fixed-price $1,000, contractor role, intermediate level, remote worldwide.

Posted Feb 19, 2025

Long-term Visual Data Labeling - Images/Video - USA

Remote contract for experienced computer-vision annotators in the US/Canada: long-term, part-time work labeling images and video (bounding boxes, polygons, keypoints, cuboids) with flexible hours and pay at $14/hr for ~15–20 hrs/week.

View job

Image Video Annotation

Posted Sep 3, 2024

Software Developer & AI Trainer (JSON, Tables, Lists - Native Languages)

Join a remote AI-training project creating and editing JSON, tables, and lists for model training; $25/hr, contractor role. You must be a native speaker of one specified language with strong software development skills (Python, JavaScript, SQL, HTML/CSS) and be available to start Aug 19.

Posted Aug 14, 2024

Typescript Coders - Ongoing LLM/AI Training

Join a long-term, remote TypeScript project training LLMs: write and debug TypeScript snippets, create accurate training datasets, and optimize AI-generated code. Part-time contractor role (20–40 hrs/week) paying $7.25/hr; applicants need 2+ years TypeScript experience.

Posted Jun 25, 2024

Scala Developers - AI/LLM Training (Long-term Project)

Remote contract position writing and debugging Scala code to build training datasets for AI models; $9.28/hr, long-term (6+ months), 20–40 hrs/week with flexibility. Requires 2+ years professional Scala experience and a CS/engineering degree.

Posted Jun 22, 2024

Advanced Swift/iOS Developers - Quality Assurance/Code Review

Remote contractor role reviewing Swift/iOS code used to train AI models. $14/hr, ongoing 3–6 month project; expect 20+ hours/week (many contributors work 30–40 hrs); must have 5+ years Swift experience and B2+ English.

Posted Jun 18, 2024

Python Coding - Data Labeling/AI Training Data

Join an ongoing contractor role creating and debugging Python code to train AI models; commit 20–40 hours/week for 3–6 months at $8/hr USD. Ideal for Python developers with 2+ years' experience, strong English, and LLM training experience preferred.

Posted Jun 18, 2024

Swift/iOS Programmer - LLM Coding

Join a remote, part-time contract role creating Swift coding prompts and responses to train LLMs — 20+ hrs/week at $12/hr. Must be highly proficient in Swift with 2+ years of iOS experience and strong English; LLM training experience is a plus.

Posted May 17, 2024

What this work involves

In AI-training projects that need machine learning expertise, your role is to bring model-aware judgment to labeling and evaluation tasks. Typical assignments include reviewing and correcting labeled data, writing or refining annotation guidelines, assessing model predictions for subtle errors, labeling complex examples that require ML context, and participating in pilot tasks that shape a project's workflow.

These tasks are focused on quality and nuance rather than building models from scratch. You may work with text, code, images, audio, or multimodal outputs and will often be asked to explain why a label is correct or to produce examples that teach a model a concept more reliably.

Review and correct annotations to improve dataset quality and consistency.
Design or refine labeling guidelines so annotators apply criteria uniformly.
Evaluate model outputs for edge cases, failure modes, and bias.
Create and label challenging examples that require ML domain knowledge.

Skills and knowledge that help

Successful contributors combine practical ML understanding with careful attention to detail. Knowledge of model behavior, common failure modes, and evaluation metrics helps you spot mistakes that non‑specialists might miss. Familiarity with data formats, basic statistics, and versioned datasets is useful when assessing dataset quality.

Communication skills are important: many projects require written feedback, clear justification for labels, and collaboration with project leads to improve guidelines. Experience teaching, tutoring, code review, or dataset curation transfers well.

Understanding of model outputs, common errors, and evaluation concepts.
Experience writing clear, testable labeling instructions or documentation.
Comfort with datasets, simple tooling, and quality-control workflows.
Ability to explain decisions and document ambiguous cases for reviewers.

Who tends to do well

People who excel in ML-focused training roles include practitioners who have used models in production, researchers who know typical pitfalls, and domain experts who can interpret difficult examples through a model-centric lens. You do not always need a formal ML degree; relevant experience, demonstrable expertise, and clear problem-solving judgment are often what projects look for.

These roles suit individuals who enjoy iterative, detail-oriented work and want to influence how models behave without taking on full-time engineering or research roles. They can be a good fit for academics, ML engineers, data scientists, and experienced annotators aiming to move into more technical oversight.

ML engineers and data scientists seeking flexible, part-time training work.
Researchers and graduate students with hands-on model experience.
Domain experts (finance, healthcare, law) who add subject-matter context.
Experienced annotators ready to lead guideline development or QA.

How hiring works on OpenTrain

OpenTrain is a central place to discover projects that need ML subject-matter expertise. Create a free profile that highlights your ML experience and relevant examples—this makes it easier for project leads to find and evaluate you. Listings will describe required skills, task types, and how the work is managed.

Most assignments are project- or task-based and run remotely. After you apply, project teams often use short qualification tasks or trial batches to verify fit and clarify instructions. Strong written feedback, reliable throughput, and consistent quality increase your chances of being invited to ongoing work or higher-responsibility tasks.

Build a clear profile with examples of ML work, datasets, or tool familiarity.
Expect qualification tasks or small pilots before being assigned large batches.
Communicate clearly in trial tasks and follow guideline revisions closely.
Consistency and thoughtful feedback lead to more and higher-level opportunities.

Frequently asked questions

Do I need a formal machine learning degree to do this work?: Not always. Many projects prioritize practical experience and the ability to apply ML reasoning to labeling and evaluation tasks. Demonstrable experience—such as work on datasets, model deployments, code review, or research—can be as persuasive as formal credentials. Highlight concrete examples on your OpenTrain profile to show your expertise.
Are these machine learning roles remote and flexible?: Yes. AI-training and labeling work on OpenTrain is typically remote and can often be done on a flexible schedule. Projects vary in their timing and responsiveness requirements—some allow asynchronous, part-time contributions while others require more consistent availability for quality control or collaboration.
How is work structured and how is pay typically set?: Work is usually project- or task-based and managed by the client. Compensation models vary and are set by each project; common structures include per-task, hourly, or milestone-based payments. Projects often use qualification tasks to evaluate contributors before assigning paid batches. OpenTrain lists role requirements and next steps for each opportunity.
How do I apply and stand out on OpenTrain?: Create a detailed profile that emphasizes your ML-relevant work: datasets you’ve curated, models you’ve evaluated, guideline authorship, or relevant code examples. When applying, tailor your responses to the listing, complete any qualification tasks carefully, and provide clear reasoning for your decisions. Reliable delivery and helpful feedback in trials lead to more invitations.
What tools or outputs should I expect to work with?: Projects use a range of annotation interfaces, spreadsheets, code snippets, and review dashboards. You may produce labeled examples, written justifications for labels, revised guidelines, or evaluation reports. Familiarity with common data formats (CSV, JSON) and basic tooling for viewing text, images, or audio is helpful.

Explore the Machine Learning career path →

Integration Developer (API Specialist)

AI Workflow Engineer (LLM Integration & Prompt Engineering)

Fine-Tune Qwen3-Coder on JUCE/C++ Audio DSP — Dataset Curation & QLoRA Training

Evaluation Scenario Writer - AI Agent Testing Specialist

Senior Data Science AI Task Designer (Python & SQL, 5+ yrs)

AI Red Team Engineer for LLMs (Security Certification Required)

AI Data Labeling for an Innovative Startup backed by 10M in Funding

AI Red Team Engineer — LLM Security & Pentesting (C1 English)

Freelance Software Developer – AI Trainer / QA (Python, JavaScript, TypeScript, Rust, Ruby) [AS-L]

Python Infrastructure Engineer — LLM Training & Agent Tooling [US‑CA]

Python Infrastructure Engineer — LLM Training & Agent Tooling [AS‑L]

Snr Code Reviewer - TypeScript (React)

Senior Python Code Reviewer (DOCKER PROFICIENCY REQUIRED)

YOLOv7 Expert Needed to Review AI-Generated Object Detection Code

YOLO (OpenCV) Expert for Reviewing AI-Generated Code & Responses

PyTorch YOLO Developer Needed for AI Model Evaluation & Code Review

LangChain v2 Developers Needed for AI Code Review & Evaluation

OpenAI (Cookbook) Developer Needed for AI Code Review & Evaluation

OpenAI (Azure) Developer Needed for AI Code Review & Evaluation

Transformers (Hugging Face) Developer Needed for AI Code Evaluation

ChromaDB Developer Needed for AI Code Review & Evaluation

LlamaIndex Developers Needed for AI Code Review & Evaluation

LLM model trainer for a medical scoring system

Long-term Visual Data Labeling - Images/Video - USA

Software Developer & AI Trainer (JSON, Tables, Lists - Native Languages)

Typescript Coders - Ongoing LLM/AI Training

Scala Developers - AI/LLM Training (Long-term Project)

Advanced Swift/iOS Developers - Quality Assurance/Code Review

Python Coding - Data Labeling/AI Training Data

Swift/iOS Programmer - LLM Coding

What this work involves

Skills and knowledge that help

Who tends to do well

How hiring works on OpenTrain

Frequently asked questions

Python Infrastructure Engineer — LLM Training & Agent Tooling [US‑CA]

Python Infrastructure Engineer — LLM Training & Agent Tooling [AS‑L]