Skip to content
OpenTrain AIFor AI Companies

Hebrew–English Bilingual Language Expert

Join OpenTrain to evaluate and improve AI-generated Hebrew and English content using rubrics, error taxonomies, and high-quality revisions; part-time contract, remote, under 20 hrs/week at $32/hr. Ideal for experienced translators, editors, and localization specialists with native Hebrew and C1+ Eng

OpenTrain AI

Generative AI & RLHF

100% Remote Hourly · $32/hr

$32/hr

Compensation

Worldwide

Eligibility

Entry

Experience

Dec 26, 2025

Posted

Open worldwide

Interested in this role?

Create a free OpenTrain account and apply in minutes.

About OpenTrain

OpenTrain is the #1 platform for building careers in AI training and data labeling. We connect language professionals with projects that shape how modern AI systems behave and provide flexible, remote opportunities to work on cutting-edge model evaluation and improvement.

About AI training work

AI training (also called data labeling or human feedback work) is the human side of building artificial intelligence. Tasks include evaluating model outputs, applying annotation rubrics, writing explanations and revisions, and verifying factual claims—work that directly affects model quality.

This role focuses on bilingual evaluation and localization inside generative-AI workflows: you’ll assess accuracy, reasoning, clarity, and adherence to prompts, and produce model solutions and explanatory feedback.

The role

We’re hiring bilingual Hebrew–English language experts to evaluate and improve AI-generated text. You will review model responses for accuracy, clarity, and reasoning; identify methodological or factual errors; and write high-quality explanations and corrected model outputs.

This is a contract, part-time position with remote, asynchronous work and expected commitment of less than 20 hours per week.

  • Work type: Contractor, part-time, remote, asynchronous
  • Time commitment: Less than 20 hours/week
  • Data type: Text; label types: Evaluation/Rating, RLHF, Translation/Localization
  • (project-specific platform)

What you’ll do day-to-day

You will apply structured rubrics and an error taxonomy to rate and rank multiple AI responses, flagging language errors, factual inaccuracies, and reasoning gaps. When a response is incorrect or incomplete, you will write clear model solutions or revisions and explain the issues using concise, actionable feedback.

Occasional fact-checking against reputable sources will be required. Consistent application of severity and category labels within the rubric is essential.

  • Rate and rank AI outputs using rubric-based evaluation and error taxonomy
  • Identify reasoning gaps, methodological errors, and unclear explanations
  • Produce high-quality model solutions, revisions, and written explanations
  • Fact-check claims and validate content with reliable sources when needed
  • Maintain consistent labeling and documentation for quality and reproducibility

Requirements

You must meet all core requirements below. Preferred qualifications are listed separately.

  • Native or near-native Hebrew proficiency with strong writing/editing across formal and informal registers
  • English proficiency at C1+ (reading/writing) for bilingual evaluation and guideline adherence
  • Bachelor’s degree or higher in Linguistics, Translation, Hebrew Language, Communications, Journalism, or related field
  • 3+ years professional experience in translation/localization, editorial QA, content QA, or linguistic review (or equivalent)
  • Experience applying linguistic QA rubrics and error taxonomies with severity/category consistency
  • Strong fact-checking habits and comfort validating claims with reputable sources
  • Ability to identify reasoning gaps and methodological errors beyond surface language issues
  • Comfortable working independently in a remote, asynchronous workflow with reliable availability/communication

Preferred experience

These are not required but will increase your competitiveness for the role.

  • Familiarity with Israeli cultural context and terminology norms across common domains
  • Prior experience with AI data training, annotation, or evaluation workflows

Compensation, schedule, and how to apply

Pay is $32 USD per hour. The role is contract, part-time, and remote. Projects are typically asynchronous—choose shifts or tasks that fit your availability while meeting deadlines and quality standards.

To apply, prepare examples of translation, localization, or QA work and be ready to demonstrate consistent rubric application. Successful applicants will complete a short qualification task to confirm rubric understanding and bilingual evaluation skills.

  • Hourly rate: $32 USD per hour
  • Workload: Less than 20 hours/week; schedule flexible within remote project constraints
  • Application may include a qualification task to evaluate rubric use and bilingual skills