Beyond Confidence Scores: How LLMs Are Learning to Know What They Don’t Know
A new wave of research is transforming uncertainty quantification from a diagnostic tool into a powerful control signal for large language models, improving reasoning and enabling more robust AI agents.

![Sectoral distinctions in company risk profiles emerge organically from the data, as demonstrated by the substantial divergence in similarity distributions-companies sharing two-digit Standard Industrial Classification codes exhibit markedly higher risk profile similarity [latex] (5,263 pairs) [/latex] than those in different industries [latex] (101,228 pairs) [/latex], despite the taxonomy mapping process being explicitly devoid of industry-specific information.](https://arxiv.org/html/2601.15247v1/x1.png)


![UAV swarm architectures are explored through three distinct deployments-standalone, edge-enabled, and edge/cloud-enabled-each offering varying levels of computational resource allocation to individual agents and collectively enabling a range of applications dependent on optimized agent behavior as defined by [latex] agent_{i} [/latex].](https://arxiv.org/html/2601.14437v1/x1.png)