Matched via arXiv identifier search · Community adoption signal (59 stars)
- Stars
- 59
- Last push
- Jun 8, 2026 (10d ago)
Risk flags
- Dependency manifest missing
- Low confidence match
Advik Raj Basani, Pin-Yu Chen
Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.
Detecting AI-generated text is an increasing necessity to combat misuse of LLMs in education, business compliance, journalism, and social media, where synthetic fluency can mask misinformation or deception. While prior detectors often rely on token-level likelihoods or opaque black-box classifiers, these approaches struggle against high-quality generations and offer little interpretability. In this work, we propose D ...
ivEye, a novel detection framework that captures how unpredictability fluctuates across a text using surprisal-based features. Motivated by the observation that human-authored text exhibits richer variability in lexical and structural unpredictability than LLM outputs, DivEye captures this signal through a set of interpretable statistical features. Our method outperforms existing zero-shot detectors by up to 33.2% and achieves competitive performance with fine-tuned baselines across multiple benchmarks. DivEye is robust to paraphrasing and adversarial attacks, generalizes well across domains and models, and improves the performance of existing detectors by up to 18.7% when used as an auxiliary signal. Beyond detection, DivEye provides interpretable insights into why a text is flagged, pointing to rhythmic unpredictability as a powerful and underexplored signal for LLM detection.
Audit each benchmark finding before selecting an implementation path. Evidence refs map to the disclosure section below.
| Task | Dataset | Metric | Value | Source | Evidence refs |
|---|---|---|---|---|---|
| Generation | Falcon-7B | AI Accuracy . | 98.2 | paper-derived | No explicit refs |
| Generation | Llama-3.1-8B | AI Accuracy . | 96.1 | paper-derived | No explicit refs |
| Generation | GPT-2-XL | AI Accuracy . | 98.8 | paper-derived | No explicit refs |
| Detection | Llama-3.1-8B | DivEye Accuracy | 90.8 | paper-derived | No explicit refs |
| Detection | GPT-4o | DivEye Accuracy | 92.4 | paper-derived | No explicit refs |
Detecting AI-generated text is an increasing necessity to combat misuse of LLMs in education, business compliance, journalism, and social media, where synthetic fluency can mask misinformation or deception.
ICTMCG/Awesome-Machine-Generated-Text is the closest maintained adjacent implementation (Title overlap with paper keywords (40%)). It is not paper-verified; validate algorithm and evaluation setup against the paper before trusting reported metrics. Community adoption signal: 233 GitHub stars.
Hardware Notes
Expect multi-day setup/compute for meaningful reproduction based on current guidance.
Evidence graph: 3 refs, 3 links.
Utility signals: depth 100/100, grounding 85/100, status high.
Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.
Matched via arXiv identifier search · Community adoption signal (59 stars)
Risk flags
There is no verified maintained implementation yet. Use this baseline plan to decide whether to prototype now or defer.
Hardware requirements
No verified implementation available
These are not paper-verified. Use them as reference points when no direct implementation is available.
Title overlap with paper keywords (40%)
No additional verified repositories beyond the primary recommendation.
These repositories had low-confidence matching signals and are hidden by default.
No trustworthy direct or curated related Hugging Face artifacts were found yet.
Continue with targeted Hugging Face searches derived from the paper title and method context:
Datasets
Tip: start with models, then check datasets/spaces if you need evaluation data or demos.
Direct artifact matches are currently sparse. Use targeted Hugging Face searches to quickly locate candidate models, datasets, and demos.
Tasks
Generation, Detection
Methods
Transformer
Domains
Large Language Models
Evaluation & Human Feedback Data
Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.
Open in HFEPXExplore Similar Papers
Jump to Paper2Code search queries derived from this paper's research context.
Need human evaluators for your AI research? Scale annotation with expert AI Trainers.