Software Developer & AI Trainer — JSON & Structured Data
Join a remote AI-training project creating and editing JSON, tables, and lists for model training; $25/hr, contractor role. You must be a native speaker of one specified language with strong software development skills (Python, JavaScript, SQL, HTML/CSS) and be available to start Aug 19.
Coding & Software
$25/hr
Compensation
Worldwide
Eligibility
Intermediate
Experience
Aug 14, 2024
Posted
Open worldwide
About OpenTrain
OpenTrain is the #1 platform for finding and building careers in AI training and data labeling. We connect people with projects where they teach and shape AI by preparing, labeling, and curating examples that modern models learn from.
This role is offered through OpenTrain’s network of AI-training projects, which provide flexible, remote work across a range of domains and skill levels.
About AI Training Work
AI training (data labeling/data annotation) is the human side of building machine learning systems: people create and verify the examples models learn from. Work in this industry is often remote, flexible, and accessible — contributors directly influence how state-of-the-art AI behaves.
This project focuses on structured text data generation and engineering-aligned prompts, a high-impact area of AI training where precise, well-formed JSON and table content helps models understand software and programming contexts.
The Role
We’re hiring Software Developers & AI Trainers to design, generate, and edit structured data formats (JSON, tables, lists) in specified native languages. You will produce prompts and example content that reflect real software development and engineering scenarios so models can be trained and evaluated accurately.
This is a remote contractor position, part-time, paid hourly at USD 25/hr. The project expects contributors to be available to start the week of August 19 and commit at least 15 hours per week (20+ hours/week preferred).
- Pay: $25 per hour (USD) — contractor, part-time.
- Start date: must be able to start the week of August 19.
- Minimum commitment: 15 hours per week; 20+ hours/week preferred.
What You’ll Do
Create and edit structured text artifacts — JSON files, tables, and lists — that capture programming tasks, code snippets, and developer-focused prompts in your native language.
Design prompts and example inputs/outputs for language models, generate high-quality textual content, and validate the structure, accuracy, and relevance of the created data against engineering contexts and project guidelines.
- Write and validate JSON prompts and schema-conformant data.
- Produce tables and lists that represent code examples, API outputs, or debugging scenarios.
- Ensure linguistic clarity and technical correctness in your native language.
Requirements
This role requires both software development experience and native-level fluency in one of the specified languages. You must meet every listed requirement to be considered.
- Native speaker of ONE of: Korean (South Korea), Japanese (Japan), German (Germany), Italian (Italy), Spanish (Spain or Mexico only), Chinese (Simplified, Hong Kong, or Taiwan), Portuguese (Brazil), or French (France).
- Proficiency in English at minimum B2 level.
- 2+ years of software development experience OR 4+ years of formal software development education.
- Proficiency with structured data formats (JSON, tables, lists) and ability to create/manipulate JSON prompts for model training.
- Proficiency in programming languages: Python, JavaScript, SQL, and HTML/CSS.
- Attention to detail and ability to produce precise, well-structured data.
- Ability to start week of August 19 and commit the required hours each week.
Preferred Qualifications
The following are not required but will strengthen your application and qualification score for this project.
- Experience writing LLM training or data-labeling content (writing tasks) on platforms such as OpenTrain, OpenTrain, Appen, etc.
- 2+ years of creative or technical writing experience in your native language.
- Prior experience designing prompts and evaluation examples for language models.
How to Apply / Next Steps
Apply with a brief summary of your software development background, which native language you speak natively (specify country/dialect where required for Spanish or Chinese), and your earliest start date. Include examples or descriptions of past work that show your ability to create JSON or structured text for technical contexts.
Qualified applicants will be asked to complete short screening tasks that assess your ability to produce accurate JSON prompts, write technical examples in your native language, and follow detailed instructions. We score candidates on language quality, technical accuracy, and attention to detail.
- Worldwide applicants accepted, but you must be a native speaker of one listed language as specified.
- Employment type: contractor, part-time. Work is remote.
- /custom platform tools will be used.