Pricing Find Work Managed Service For Large Projects

Platform Overview

Hire, manage, and pay top AI Trainers & Data Labelers in one place while working in the tools you already use

How It Works

Learn how we make hiring and managing AI Trainers simple.

Data Labeling Tool Integrations

Hire experts for any labeling tool, including your custom platform.

Pricing

Get transparent pricing and start hiring with scalable costs.

Solutions

Find specialists for any LLM and labeling workflow you can imagine.

Find Data Labeling Vendors

Browse vetted agencies and BPOs for large-scale projects.

List your data labeling company

Create a free company profile, receive matched RFPs, and submit proposals with your pricing, capacity, and timeline.

JOIN AS Freelancer

The #1 Platform for Finding AI Training Jobs

We bring AI training and data labeling jobs from 20+ platforms into one place.

Work With Us

Hire Freelancers

Post a Job to the #1 Network for AI Training Talent Now

Post your job and find pre-vetted AI Trainers & Data Labelers across any domain, language, or tool.

FOR LARGE PROJECTS / MANAGED SERVICE

Done-for-You AI Data Teams

For large or complex projects. We recruit, train, manage, and QA your team inside your tools.

JOIN AS Freelancer

The #1 Platform for Finding AI Training Jobs

We bring AI training and data labeling jobs from 20+ platforms into one place.

LLM & Agents

LLM Evaluation

Red Teaming

Hallucination Audits

RLHF & Preference Data

Supervised Fine-Tuning

Code Generation Review

Function Calling

View All LLM & Agent Solutions

Structured Data Labeling

Speech and Audio Labeling

Time Series Annotation

View Data Labeling Solutions

Work With Us

Hire Freelancers

Post a Job to the #1 Network for AI Training Talent Now

Post your job and find pre-vetted AI Trainers & Data Labelers across any domain, language, or tool.

FOR LARGE PROJECTS / MANAGED SERVICE

Done-for-You AI Data Teams

For large or complex projects. We recruit, train, manage, and QA your team inside your tools.

Work With Us

Hire Freelancers

Post a Job to the #1 Network for AI Training Talent Now

Post your job and find pre-vetted AI Trainers & Data Labelers across any domain, language, or tool.

FOR LARGE PROJECTS / MANAGED SERVICE

Done-for-You AI Data Teams

For large or complex projects. We recruit, train, manage, and QA your team inside your tools.

Multimodal

Vision

Text

Bring Your Own Platform

We're the talent layer, not the tool. Hire AI Trainers and Data Labelers into any platform - commercial, open-source, or your own internal tooling.

Researcher Tools

Paper Explorer (HFEPX)

Browse high-signal papers for RLHF, human feedback datasets, and LLM/agent evaluation workflows.

Paper2Code Finder

Find the best implementation and artifacts for any paper by arXiv ID, DOI, URL, or title.

AI & ML Glossary

Browse 500+ AI and machine learning terms with definitions, examples, and explanations.

Platform Overview

Hire, manage, and pay top AI Trainers & Data Labelers in one place while working in the tools you already use

How It Works

Learn how we make hiring and managing AI Trainers simple.

Data Labeling Tool Integrations

Hire experts for any labeling tool, including your custom platform.

Pricing

Get transparent pricing and start hiring with scalable costs.

Solutions

Find specialists for any LLM and labeling workflow you can imagine.

Find Data Labeling Vendors

Browse vetted agencies and BPOs for large-scale projects.

List your data labeling company

Create a free company profile, receive matched RFPs, and submit proposals with your pricing, capacity, and timeline.

JOIN AS Freelancer

The #1 Platform for Finding AI Training Jobs

We bring AI training and data labeling jobs from 20+ platforms into one place.

Work With Us

Hire Freelancers

Post a Job to the #1 Network for AI Training Talent Now

Post your job and find pre-vetted AI Trainers & Data Labelers across any domain, language, or tool.

FOR LARGE PROJECTS / MANAGED SERVICE

Done-for-You AI Data Teams

For large or complex projects. We recruit, train, manage, and QA your team inside your tools.

JOIN AS Freelancer

The #1 Platform for Finding AI Training Jobs

We bring AI training and data labeling jobs from 20+ platforms into one place.

LLM & Agents

LLM Evaluation

Red Teaming

Hallucination Audits

RLHF & Preference Data

Supervised Fine-Tuning

Code Generation Review

Function Calling

View All LLM & Agent Solutions

Structured Data Labeling

Speech and Audio Labeling

Time Series Annotation

View Data Labeling Solutions

Work With Us

Hire Freelancers

Post a Job to the #1 Network for AI Training Talent Now

Post your job and find pre-vetted AI Trainers & Data Labelers across any domain, language, or tool.

FOR LARGE PROJECTS / MANAGED SERVICE

Done-for-You AI Data Teams

For large or complex projects. We recruit, train, manage, and QA your team inside your tools.

Work With Us

Hire Freelancers

Post a Job to the #1 Network for AI Training Talent Now

Post your job and find pre-vetted AI Trainers & Data Labelers across any domain, language, or tool.

FOR LARGE PROJECTS / MANAGED SERVICE

Done-for-You AI Data Teams

For large or complex projects. We recruit, train, manage, and QA your team inside your tools.

Multimodal

Vision

Text

Bring Your Own Platform

We're the talent layer, not the tool. Hire AI Trainers and Data Labelers into any platform - commercial, open-source, or your own internal tooling.

Researcher Tools

Paper Explorer (HFEPX)

Browse high-signal papers for RLHF, human feedback datasets, and LLM/agent evaluation workflows.

Paper2Code Finder

Find the best implementation and artifacts for any paper by arXiv ID, DOI, URL, or title.

AI & ML Glossary

Browse 500+ AI and machine learning terms with definitions, examples, and explanations.

Pricing Find Work Managed Service For Large Projects

Refusal Steering: Fine-grained Control over LLM Refusal Behaviour for Sensitive Topics

Iker García-Ferrero, David Montero, Roman Orus

2025-12-18T14:43:04Z

arXiv

Abstract

We introduce Refusal Steering, an inference-time method to exercise fine-grained control over Large Language Models refusal behaviour on politically sensitive topics without retraining. We replace fragile pattern-based refusal detection with an LLM-as-a-judge that assigns refusal confidence scores and we propose a ridge-regularized variant to compute steering vectors that better isolate the refusal--compliance direction. On Qwen3-Next-80B-A3B-Thinking, our method removes the refusal behaviour of the model around politically sensitive topics while maintaining safety on JailbreakBench and near-baseline performance on general benchmarks. The approach generalizes across 4B and 80B models and can also induce targeted refusals when desired. We analize the steering vectors and show that refusal signals concentrate in deeper layers of the transformer and are distributed across many dimensions. Together, these results demonstrate that activation steering can remove political refusal behaviour while retaining safety alignment for harmful content, offering a practical path to controllable, transparent moderation at inference time.

Full analysis loading… Code implementations, benchmark data, and reproduction guides are being assembled. Please check back shortly.

Browse all papers

Need human evaluators for your AI research? Scale annotation with expert AI Trainers.

Post a Job Get a Quote

The #1 platform for sourcing AI Trainers and Data Labelers. 100,000+ pre-vetted domain experts.

Platform

How It Works
Pricing
Managed Service
Solutions
Integrations

Company

Contact
contact@opentrain.ai
Get a Quote

Get Started

Create Account Log In