Software Engineer — AI Model Evaluation
Join OpenTrain as a remote contractor to evaluate and improve AI systems by reviewing, debugging, and rating production code. The role pays $20–$75/hr and requires a commitment of 20+ hours per week. You'll work on backend, full-stack, and infrastructure tasks that directly shape model behavior.
Coding Software
Compensation: $20–$75/hr
Eligibility: Worldwide
Experience: Intermediate
Posted: Jun 27, 2026
About OpenTrain
OpenTrain is the #1 platform for finding and building careers in AI training and data labeling. We help people discover AI training projects, build a unified portfolio of their work, and grow durable freelance careers in a fast-growing industry. Creating an OpenTrain account is free.
This opening is posted through OpenTrain, with Micro1 as the hiring organization. You will join a distributed, contractor-based workflow that connects your software expertise to real AI model evaluation work.
Why AI training matters
AI training (data labeling / human feedback) is the human side of how modern AI systems learn. Contributors prepare and review examples—code, text, audio, and images—that guide model behavior. This is accessible, cutting-edge work that often offers flexible, remote hours and the chance to influence production systems.
- Work remotely from anywhere with an internet connection.
- Flexible, part-time contractor work that fits around other commitments.
- Contribute directly to how AI systems perform and make decisions.
The role
Micro1 and OpenTrain are recruiting a Software Engineer for AI Model Evaluation. This is a part-time contractor role in which you will evaluate and improve next-generation AI systems by working on real-world software engineering tasks.
Compensation is $20–$75 per hour, depending on experience and qualifications, and the role requires a commitment of at least 20 hours per week. Work is fully remote.
- Employment type: Contractor, Part-time.
- Hours: 20+ hours/week (remote).
- Pay: $20–$75 per hour, based on experience.
What you'll do
You will apply your software engineering skills to evaluate code, propose improvements, and explain technical tradeoffs. Your assessments will be used to train and evaluate AI models that reason about software and engineering decisions.
- Review, debug, improve, and explain code across backend, full-stack, infrastructure, and systems contexts.
- Design or evaluate practical solutions involving APIs, databases, services, integrations, testing, and deployment workflows.
- Identify tradeoffs around scalability, maintainability, performance, reliability, security, and developer experience.
- Communicate technical reasoning clearly in writing and provide evaluation ratings or assessments.
- Collaborate with project teams on technical reviews, implementation decisions, and problem-solving exercises.
- Adapt quickly to new codebases, frameworks, and technical requirements.
Requirements
You must meet the core experience and technical expectations below.
- 3+ years of hands-on software engineering experience.
- Strong experience in at least one backend or full-stack environment: Python, JavaScript/TypeScript, Node.js, Java, C++, Go, or Ruby.
- Experience building, maintaining, or reviewing production-level applications, APIs, services, databases, or integrations.
- Strong understanding of software engineering fundamentals: debugging, testing, code quality, architecture, and technical tradeoffs.
- Ability to explain complex engineering decisions clearly and objectively in writing.
- Comfort reading and reasoning through unfamiliar code or technical requirements.
Helpful background (not required)
The following skills and experiences make you especially competitive for this work but are not strict requirements.
- Experience with cloud platforms such as AWS, GCP, or Azure.
- Familiarity with CI/CD, DevOps workflows, containers, monitoring, or production operations.
- Experience with frontend frameworks like React, Next.js, Angular, Vue, or React Native.
- Open-source contributions, public GitHub work, technical writing, or strong examples of past engineering projects.
- Experience mentoring engineers, reviewing code, or making architecture decisions.
- Exposure to cybersecurity or SecOps practices.
How this work is evaluated
You will produce written evaluations and ratings of code and technical solutions. Label types include evaluation ratings and programming/coding assessments. Clear, objective technical reasoning is central: explain why a solution works, which alternatives you considered, and what tradeoffs are involved.
Work is carried out in a contractor workflow through OpenTrain. Assignments vary in scope and technical stack, so you will need to adapt to different repositories and environments.
How to apply
Create an OpenTrain account (free) if you don't already have one, complete your profile, and submit your application to this Micro1 posting. Include examples of past engineering work—links to GitHub, public projects, technical writing, or a brief summary of relevant experience.
Applications should highlight your primary languages, relevant production experience, and examples that demonstrate your ability to reason about unfamiliar codebases and explain engineering decisions clearly.