Shashank Aggarwal, Ram Vikas Mishra, Amit Awekar · Feb 19, 2026
- In multi-agent IR pipelines for tasks such as search and ranking, LLM-based agents exchange intermediate reasoning with one another in the form of Chain-of-Thought (CoT).
- Current CoT evaluation focuses narrowly on target-task accuracy.