Teaching models to express their uncertainty in words

May 28, 2022

Abstract

We show that a GPT‑3 model can learn to express uncertainty about its own answers in natural language, without using model logits. When given a question, the model generates both an answer and a level of confidence (e.g., "90% confidence" or "high confidence"). These levels map to probabilities that are well calibrated. The model also remains moderately calibrated under distribution shift, and is sensitive to uncertainty in its own answers, rather than imitating human examples. To our knowledge, this is the first time a model has been shown to express calibrated uncertainty about its own answers in natural language. For testing calibration, we introduce the CalibratedMath suite of tasks. We compare the calibration of uncertainty expressed in words ("verbalized probability") to uncertainty extracted from model logits. Both kinds of uncertainty are capable of generalizing calibration under distribution shift. We also provide evidence that GPT‑3's ability to generalize calibration depends on pre-trained latent representations that correlate with epistemic uncertainty over its answers.
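To make "levels map to probabilities that are well calibrated" concrete, here is a minimal sketch of how verbalized confidence could be scored. It is not the paper's evaluation code. The word-to-probability table, the function names, and the binned calibration error are all assumptions chosen for illustration; the abstract does not specify the mapping or the metric.

```python
import re

# Hypothetical mapping from confidence words to probabilities.
# The abstract does not give these values; they are illustrative only.
WORD_TO_PROB = {
    "lowest": 0.1,
    "low": 0.3,
    "medium": 0.5,
    "high": 0.7,
    "highest": 0.9,
}


def parse_confidence(text):
    """Turn '90% confidence' or 'high confidence' into a probability in [0, 1]."""
    text = text.strip().lower()
    match = re.match(r"(\d+(?:\.\d+)?)\s*%", text)
    if match:
        return float(match.group(1)) / 100.0
    # Fall back to the first word, e.g. 'high' in 'high confidence'.
    return WORD_TO_PROB[text.split()[0]]


def binned_calibration_error(probs, correct, n_bins=10):
    """Weighted mean of |accuracy - mean confidence| over equal-width bins."""
    bins = [[] for _ in range(n_bins)]
    for p, c in zip(probs, correct):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes in the last bin
        bins[idx].append((p, c))
    total = len(probs)
    error = 0.0
    for items in bins:
        if not items:
            continue
        mean_conf = sum(p for p, _ in items) / len(items)
        accuracy = sum(c for _, c in items) / len(items)
        error += (len(items) / total) * abs(accuracy - mean_conf)
    return error


# Toy usage: four answers all stated with "90% confidence", three correct.
stated = ["90% confidence"] * 4
probs = [parse_confidence(s) for s in stated]
correct = [1, 1, 1, 0]
print(binned_calibration_error(probs, correct))
```

A model is well calibrated in this sense when, among answers it labels with a given confidence, the fraction that are correct is close to that confidence, so the error above is near zero.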

  • GPT
  • Language
  • Learning Paradigms

Source

OpenAI News - openai.com
