For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
A
Akshat Jain

Akshat Jain

Skilled in AI training across various software languages, databases, cloud

India flagBareilly, India
$25.00/hrEntry LevelRemotasksScale AIOther

Key Skills

Software

RemotasksRemotasks
Scale AIScale AI
Other

Top Subject Matter

Computer Programming/Coding
Databases and SQL
Evaluation

Top Data Types

Computer Code ProgrammingComputer Code Programming
DocumentDocument
ImageImage

Top Task Types

Computer Programming/CodingComputer Programming/Coding
Evaluation/RatingEvaluation/Rating
Function CallingFunction Calling
Prompt + Response Writing (SFT)Prompt + Response Writing (SFT)
RLHFRLHF

Freelancer Overview

In data labeling and AI training data, my expertise is anchored in crafting intricate prompts that challenge AI models, ensuring their resilience and accuracy across diverse domains. I specialize in categorizing and annotating complex data science tasks like data visualization, manipulation, cleaning, feature engineering, statistical analysis, and SQL query optimization. My approach involves adversarial prompt generation to systematically expose model vulnerabilities, refining prompts iteratively until they pinpoint failure points. Additionally, I have led SQL projects that integrate advanced query design and database optimization techniques, contributing to comprehensive data handling and analysis solutions. Through multi-model evaluations, I assess and enhance model performance, leveraging dynamic prompt generation algorithms to adapt strategies based on real-time feedback. These experiences underscore my ability to deliver high-quality AI training data and drive continuous improvement in model training and deployment strategies.

Entry LevelEnglish

Labeling Experience

Scale AI

Challenging AI Models through Complex Data Science Prompts: Adversarial Generation and Multi-Model Evaluation

Scale AIComputer Code ProgrammingPrompt Response Writing SFT
Scope: The project involves generating complex data science prompts to challenge AI models until they fail. Categories include data visualization, manipulation, cleaning, feature engineering, matrix operations, statistical analysis, and model fine-tuning. Prompts are refined until they are specific and difficult enough to cause model errors. Data Labelling: Annotations cover data visualization, manipulation, cleaning, feature engineering, matrix operations, statistical analysis, and model fine-tuning, ensuring precise and accurate prompts. Size and Complexity: Large-scale project with numerous complex prompts across various data science categories, requiring advanced understanding and detailed annotations. Quality Measures: Ensures specificity and relevance of prompts, validates their complexity, continuous refinement based on model performance, peer reviews, automated checks, and regular feedback loops to maintain high standards.

Scope: The project involves generating complex data science prompts to challenge AI models until they fail. Categories include data visualization, manipulation, cleaning, feature engineering, matrix operations, statistical analysis, and model fine-tuning. Prompts are refined until they are specific and difficult enough to cause model errors. Data Labelling: Annotations cover data visualization, manipulation, cleaning, feature engineering, matrix operations, statistical analysis, and model fine-tuning, ensuring precise and accurate prompts. Size and Complexity: Large-scale project with numerous complex prompts across various data science categories, requiring advanced understanding and detailed annotations. Quality Measures: Ensures specificity and relevance of prompts, validates their complexity, continuous refinement based on model performance, peer reviews, automated checks, and regular feedback loops to maintain high standards.

2024 - 2024
Scale AI

Multi-Language Query Generation

Scale AIComputer Code ProgrammingPrompt Response Writing SFT
Scope: The project involves generating prompts to fetch data from Google Cloud Platform based on given database details, query types, and sub-query types. Responses are written in BigQuery, SQL, or PostgreSQL, followed by converting the response to another language from the set. Data Labelling: Annotated data includes a variety of query and sub-query types for Google Cloud Platform databases, with detailed labelling for implementations in BigQuery, SQL, and PostgreSQL. Size and Complexity: Extensive project involving large-scale data and complex query types, with comprehensive annotation and multi-language code conversion. Quality Measures: Ensures prompt accuracy, query consistency, response correctness, and efficient code conversion. Quality is maintained through peer reviews, automated validations, and continuous feedback.

Scope: The project involves generating prompts to fetch data from Google Cloud Platform based on given database details, query types, and sub-query types. Responses are written in BigQuery, SQL, or PostgreSQL, followed by converting the response to another language from the set. Data Labelling: Annotated data includes a variety of query and sub-query types for Google Cloud Platform databases, with detailed labelling for implementations in BigQuery, SQL, and PostgreSQL. Size and Complexity: Extensive project involving large-scale data and complex query types, with comprehensive annotation and multi-language code conversion. Quality Measures: Ensures prompt accuracy, query consistency, response correctness, and efficient code conversion. Quality is maintained through peer reviews, automated validations, and continuous feedback.

2024 - 2024
Scale AI

Iterative Prompt Generation and Evaluation

Scale AIComputer Code ProgrammingEvaluation Rating
Scope: The project entails an advanced system for generating prompts and conducting comparative evaluations of two responses in real-time. Following the selection of the optimal response, the system iteratively refines the prompt to elicit deeper and more nuanced responses Data Labelling: Annotated data includes Python libraries crucial to data science (pandas, numpy, scikit-learn, matplotlib, seaborn) for precise response generation. Size and Complexity: Moderate scale with emphasis on Python libraries, demanding advanced algorithms and model architectures. Quality Measures: Ensures prompt clarity, response relevance, adherence to criteria, depth of information, and contextual appropriateness for high-quality outputs.

Scope: The project entails an advanced system for generating prompts and conducting comparative evaluations of two responses in real-time. Following the selection of the optimal response, the system iteratively refines the prompt to elicit deeper and more nuanced responses Data Labelling: Annotated data includes Python libraries crucial to data science (pandas, numpy, scikit-learn, matplotlib, seaborn) for precise response generation. Size and Complexity: Moderate scale with emphasis on Python libraries, demanding advanced algorithms and model architectures. Quality Measures: Ensures prompt clarity, response relevance, adherence to criteria, depth of information, and contextual appropriateness for high-quality outputs.

2024 - 2024

Education

D

Dr. A.P.J. Abdul Kalam Technical University,

Bachelor's of Technologu, Computer Science

Bachelor's of Technologu
2017 - 2021

Work History

F

Freelance

Software Engineer

Remote
2024 - Present
M

Mark Intel

Founder

India
2023 - Present