Uncertainty-Aware LLMs Fail to Flag Misleading Contexts
Tianyi Zhou, Johanne Medina, Sanjay Chawla
NeurIPS 2025 – Reliable ML Workshop (2025)
Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models
Tianyi Zhou, Johanne Medina, Sanjay Chawla
Proceedings of the AAAI Conference on Artificial Intelligence, 40(44), 38164–38172