Hebrew–English Bilingual Language Expert
Join OpenTrain to evaluate and improve AI-generated Hebrew and English content using rubrics, error taxonomies, and high-quality revisions; part-time contract, remote, under 20 hrs/week at $32/hr. Ideal for experienced translators, editors, and localization specialists with native Hebrew and C1+ Eng
Generative AI & RLHF
$32/hr
Compensation
Worldwide
Eligibility
Entry
Experience
Dec 26, 2025
Posted
Open worldwide
About OpenTrain
OpenTrain is the #1 platform for building careers in AI training and data labeling. We connect language professionals with projects that shape how modern AI systems behave and provide flexible, remote opportunities to work on cutting-edge model evaluation and improvement.
About AI training work
AI training (also called data labeling or human feedback work) is the human side of building artificial intelligence. Tasks include evaluating model outputs, applying annotation rubrics, writing explanations and revisions, and verifying factual claims—work that directly affects model quality.
This role focuses on bilingual evaluation and localization inside generative-AI workflows: you’ll assess accuracy, reasoning, clarity, and adherence to prompts, and produce model solutions and explanatory feedback.
The role
We’re hiring bilingual Hebrew–English language experts to evaluate and improve AI-generated text. You will review model responses for accuracy, clarity, and reasoning; identify methodological or factual errors; and write high-quality explanations and corrected model outputs.
This is a contract, part-time position with remote, asynchronous work and expected commitment of less than 20 hours per week.
- Work type: Contractor, part-time, remote, asynchronous
- Time commitment: Less than 20 hours/week
- Data type: Text; label types: Evaluation/Rating, RLHF, Translation/Localization
- (project-specific platform)
What you’ll do day-to-day
You will apply structured rubrics and an error taxonomy to rate and rank multiple AI responses, flagging language errors, factual inaccuracies, and reasoning gaps. When a response is incorrect or incomplete, you will write clear model solutions or revisions and explain the issues using concise, actionable feedback.
Occasional fact-checking against reputable sources will be required. Consistent application of severity and category labels within the rubric is essential.
- Rate and rank AI outputs using rubric-based evaluation and error taxonomy
- Identify reasoning gaps, methodological errors, and unclear explanations
- Produce high-quality model solutions, revisions, and written explanations
- Fact-check claims and validate content with reliable sources when needed
- Maintain consistent labeling and documentation for quality and reproducibility
Requirements
You must meet all core requirements below. Preferred qualifications are listed separately.
- Native or near-native Hebrew proficiency with strong writing/editing across formal and informal registers
- English proficiency at C1+ (reading/writing) for bilingual evaluation and guideline adherence
- Bachelor’s degree or higher in Linguistics, Translation, Hebrew Language, Communications, Journalism, or related field
- 3+ years professional experience in translation/localization, editorial QA, content QA, or linguistic review (or equivalent)
- Experience applying linguistic QA rubrics and error taxonomies with severity/category consistency
- Strong fact-checking habits and comfort validating claims with reputable sources
- Ability to identify reasoning gaps and methodological errors beyond surface language issues
- Comfortable working independently in a remote, asynchronous workflow with reliable availability/communication
Preferred experience
These are not required but will increase your competitiveness for the role.
- Familiarity with Israeli cultural context and terminology norms across common domains
- Prior experience with AI data training, annotation, or evaluation workflows
Compensation, schedule, and how to apply
Pay is $32 USD per hour. The role is contract, part-time, and remote. Projects are typically asynchronous—choose shifts or tasks that fit your availability while meeting deadlines and quality standards.
To apply, prepare examples of translation, localization, or QA work and be ready to demonstrate consistent rubric application. Successful applicants will complete a short qualification task to confirm rubric understanding and bilingual evaluation skills.
- Hourly rate: $32 USD per hour
- Workload: Less than 20 hours/week; schedule flexible within remote project constraints
- Application may include a qualification task to evaluate rubric use and bilingual skills