Featured Papers
Popular high-signal papers with direct links to full protocol pages.
- Don't Lose Focus: Activation Steering via Key-Orthogonal Projections
May 7, 2026 · Citations: 0
Across multiple steering benchmarks, we show that SKOP achieves the best joint steering-utility trade-off, reducing utility degradation by 5-7x while retaining over 95% of vanilla steering efficacy.
- MANTRA: Synthesizing SMT-Validated Compliance Benchmarks for Tool-Using LLM Agents
May 7, 2026 · Citations: 0
To overcome these limitations, we present MANTRA, a framework for automatically synthesizing machine-checkable compliance benchmarks from natural-language manuals and tool schemas.
- Measuring Evaluation-Context Divergence in Open-Weight LLMs: A Paired-Prompt Protocol with Pilot Evidence of Alignment-Pipeline-Specific Heterogeneity
May 7, 2026 · Citations: 0
Safety benchmarks are routinely treated as evidence about how a language model will behave once deployed, but this inference is fragile if behavior depends on whether a prompt looks like an evaluation.
- Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning
May 7, 2026 · Citations: 0
Paradoxically, we observe that tool-enabled evaluation can degrade reasoning performance even when the strong thinking models make almost no actual tool calls.
- Improving the Efficiency of Language Agent Teams with Adaptive Task Graphs
May 7, 2026 · Citations: 0
In contrast, fully unstructured teams enable adaptability and exploration but suffer from inefficiencies such as error propagation, inter-agent conflicts, and wasted resources (measured in time, tokens, or file operations).
- Who and What? Using Linguistic Features and Annotator Characteristics to Analyze Annotation Variation
May 7, 2026 · Citations: 0
Human label variation has been established as a central phenomenon in NLP: the perspectives different annotators have on the same item need to be embraced.
- MultiLinguahah: A New Unsupervised Multilingual Acoustic Laughter Segmentation Method
May 7, 2026 · Citations: 0
Laughter is a social non-vocalization that is universal across cultures and languages, and is crucial for human communication, including social bonding and communication signaling.
- Log-Likelihood, Simpson's Paradox, and the Detection of Machine-Generated Text
May 7, 2026 · Citations: 0
The ability to reliably distinguish human-written text from that generated by large language models is of profound societal importance.
- LatentRAG: Latent Reasoning and Retrieval for Efficient Agentic RAG
May 7, 2026 · Citations: 0
Agentic RAG extends this paradigm by replacing single-step retrieval with a multi-step process, in which the large language model (LLM) acts as a search agent that generates intermediate thoughts and subqueries to iteratively interact with…
- Quantifying the Statistical Effect of Rubric Modifications on Human-Autorater Agreement
May 7, 2026 · Citations: 0
Autoraters, also referred to as LLM-as-judges, are increasingly used for evaluation and automated content moderation.
- Linear Semantic Segmentation for Low-Resource Spoken Dialects
May 7, 2026 · Citations: 0
In this paper, we introduce a new multi-genre benchmark (more than 1000 samples) for semantic segmentation in conversational Arabic, focusing on dialectal discourse.
- Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning
May 7, 2026 · Citations: 0
Across three model families, six scales, and six math reasoning benchmarks, ReasonMaxxer matches or exceeds full RL performance while requiring only tens of problems and minutes of single-GPU training, a reduction in training cost of…