Are there pretrained models available for "SynBullying: A Multi LLM Synthetic Conversational Dataset for Cyberbullying Detection"?

Yes, 1 Hugging Face model found. The top result is tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1 with 11,778 downloads.

SynBullying: A Multi LLM Synthetic Conversational Dataset for Cyberbullying Detection

Q: How reproducible is "SynBullying: A Multi LLM Synthetic Conversational Dataset for Cyberbullying Detection"?

Estimated time to first reproduction: a few days. Risk flags: No repository-level reproducibility signals are currently available, Estimate is based on paper-only reproduction flow. No direct maintained implementation was found. Use the paper PDF and citation graph to design a baseline reproduction.

Arefeh Kazemi, Hamza Qadeer, Joachim Wagner, Hossein Hosseini, Sri Balaaji Natarajan Kalaivendan, Brian Davis

Published: Oct 30, 2025

No direct paper-linked artifacts found; showing strongest related artifacts

Evidence: Curated Related

Domain fit: AI-core

Verified repos: 0

Core AI workload signals detected from paper context and implementation/artifact evidence.

Time to first repro: a few days

2 risk flags

arXiv PDF

We introduce SynBullying, a synthetic multi-LLM conversational dataset for studying and detecting cyberbullying (CB). SynBullying provides a scalable and ethically safe alternative to human data collection by leveraging large language models (LLMs) to simulate realistic bullying interactions. The dataset offers (i) conversational structure, capturing multi-turn exchanges rather than isolated posts; (ii) context-aware ...

Read full abstract

annotations, where harmfulness is assessed within the conversational flow considering context, intent, and discourse dynamics; and (iii) fine-grained labeling, covering various CB categories for detailed linguistic and behavioral analysis. We evaluate SynBullying across five dimensions, including conversational structure, lexical patterns, sentiment/toxicity, role dynamics, harm intensity, and CB-type distribution. We further examine its utility by testing its performance as standalone training data and as an augmentation source for CB classification.

Technical details

Canonical key: arxiv-2511.11599

Cache status: Fresh

Generated at: Jun 18, 2026, 5:37 AM

Artifact coverage: curated_related

HF provider: ok (token)

PWC source used: No

LLM status: blocked (invalid_model_output:researcher_extraction:OpenRouter request failed)

LLM model: openai/gpt-5.1

LLM generated: Jun 15, 2026, 6:04 AM

LLM content type: sparse_repro_blueprint

HF policy: hf-relevance-v27

LLM evidence refs: paper.title, summary.hasReliableImplementation, guidance.mode

context only

Benchmarks: missing

Time to repro: a few days

2 risk flags

Results & Benchmarks

Freshness tier: hot

Direct + Inferred Evidence

No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.

We introduce SynBullying, a synthetic multi-LLM conversational dataset for studying and detecting cyberbullying (CB).

Implementation Evidence Summary

Confidence: low

Recommendation evidence is currently too limited for a maintained-repo choice. Use Implementation Status and Reproduction Path for a practical baseline plan.

Reproduction Risks

Estimate is based on paper-only reproduction flow

Hardware Notes

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Evidence disclosure

LLM evidence refs: paper.title, summary.hasReliableImplementation, guidance.mode

Evidence graph: 3 refs, 2 links.

Utility signals: depth 60/100, grounding 68/100, status medium.

Implementation Status

No verified maintained repo

There is no verified maintained implementation yet. Use this baseline plan to decide whether to prototype now or defer.

No direct maintained implementation was found. Use the paper PDF and citation graph to design a baseline reproduction.
Track assumptions and missing details in an experiment log before coding.

Time to first repro: a few days

Best available artifact: tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1

Reproduction readiness

No Repo

Time to first repro: days

Last checked: Jun 18, 2026

Hardware requirements

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

No verified implementation available

· No maintained repository has been identified for this paper. Check adjacent implementations or HF artifacts below.

No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Models

tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1

Curated Related

Downloads: 11,778

Likes: 16

Broaden model search

Transformer Natural Language Processing Transformer synbullying multi synthetic conversational

Datasets

No trustworthy dataset matches right now.

Search datasets on Hugging Face

Spaces

No trustworthy demo spaces right now.

Search spaces on Hugging Face

Explore on Hugging Face

Search models Search datasets Search spaces

Research context

Tasks

None detected

Methods

Transformer

Domains

Natural Language Processing, Large Language Models

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.

Open in HFEPX

Explore Similar Papers

Jump to Paper2Code search queries derived from this paper's research context.

Transformer Natural Language Processing Large Language Models

Need human evaluators for your AI research? Scale annotation with expert AI Trainers.

Post a Job Get a Quote