Mandeep K.

AI/LLM Trainer (RLHF)

Jalandhar, India

Key Skills

Software

Label Studio

Top Subject Matter

No subject matter listed

Top Data Types

Document
Image
Text

Top Task Types

Action Recognition
Data Collection
RLHF
Text Summarization
Translation/Localization

Freelancer Overview

I specialize in training and refining large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF) to improve model performance and alignment with user intent. I also evaluate AI-generated responses against user prompts, API calls, and structured rubrics to assess accuracy, relevance, and overall quality. Additionally, I assign Likert-scale ratings to responses and provide detailed, logical justifications for each quality assessment to ensure transparency and continuous improvement. I am also proficient in multi-dimensional evaluation of AI outputs, focusing on coherence, factual accuracy, and alignment with user needs. I contribute to the iterative refinement of LLMs by providing actionable feedback and insights drawn from systematic evaluation.

Education

Lovely Professional University

MBA, Business Administration - International Business
2017 - 2019

Work History

San Global Research

Assistant Manager - Market Research

Pune
2023 - 2024

Kia Biz

Senior Research Analyst

Pune
2020 - 2024