A User-Focused Approach to Evaluating Probabilistic and Categorical Forecasts

Nicholas Loveday, Bureau of Meteorology, Melbourne, Victoria, Australia

Robert Taggart, Bureau of Meteorology, Sydney, New South Wales, Australia

and Mohammadreza Khanarmuei, Bureau of Meteorology, Brisbane, Queensland, Australia

Online Publication: 13 May 2024

DOI: https://doi.org/10.1175/WAF-D-23-0201.1

Received: 20 Nov 2023

Final Form: 21 Mar 2024

Accepted: 08 May 2024

Abstract

A user-focused verification approach for evaluating probability forecasts of binary outcomes (also known as probabilistic classifiers) is demonstrated that (i) is based on proper scoring rules, (ii) focuses on user decision thresholds, and (iii) provides actionable insights. It is argued that when categorical performance diagrams and the critical success index are used to evaluate overall predictive performance, rather than the discrimination ability of probabilistic forecasts, they may produce misleading results. Instead, Murphy diagrams are shown to provide a better understanding of overall predictive performance as a function of the user's probabilistic decision threshold. It is illustrated how to select a proper scoring rule, based on the relative importance of different user decision thresholds, and how this choice impacts scores of overall predictive performance and supporting measures of discrimination and calibration. These approaches and ideas are demonstrated using several probabilistic thunderstorm forecast systems as well as synthetic forecast data. Furthermore, a fair method for comparing the performance of probabilistic and categorical forecasts is illustrated using the FIxed Risk Multicategorical (FIRM) score, which is a proper scoring rule directly connected to values on the Murphy diagram. While the methods are illustrated using thunderstorm forecasts, they are applicable for evaluating probabilistic forecasts for any situation with binary outcomes.
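To make the idea of a Murphy diagram concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes one common convention for the elementary score of a probability forecast p of a binary outcome y at decision threshold θ: a user acts when p > θ, a false alarm costs θ, a miss costs 1 − θ, and all other cases cost nothing. The function name `murphy_curve`, the tie-handling rule (p > θ rather than p ≥ θ), and the toy data are illustrative assumptions.

```python
import numpy as np


def murphy_curve(probs, outcomes, thresholds):
    """Mean elementary score at each decision threshold (lower is better).

    Assumed convention: the user acts when prob > threshold; a false alarm
    costs `threshold`, a miss costs `1 - threshold`, anything else costs 0.
    """
    probs = np.asarray(probs, dtype=float)
    outcomes = np.asarray(outcomes, dtype=int)
    thresholds = np.asarray(thresholds, dtype=float)

    # Broadcast to shape (n_thresholds, n_cases).
    act = probs[None, :] > thresholds[:, None]
    false_alarm = act & (outcomes[None, :] == 0)
    miss = ~act & (outcomes[None, :] == 1)

    scores = (thresholds[:, None] * false_alarm
              + (1.0 - thresholds[:, None]) * miss)
    return scores.mean(axis=1)


# Toy data (hypothetical): four forecasts and their observed outcomes.
probs = [0.1, 0.8, 0.4, 0.9]
outcomes = [0, 1, 1, 0]
thresholds = np.array([0.25, 0.5, 0.75])

curve = murphy_curve(probs, outcomes, thresholds)
for theta, score in zip(thresholds, curve):
    print(f"threshold={theta:.2f}  mean elementary score={score:.4f}")
```

Plotting `curve` against `thresholds` for two or more forecast systems gives a Murphy diagram; a system whose curve is lower at a given threshold serves users with that decision threshold better. Under the normalisation assumed here, twice the area under the curve over θ in (0, 1) equals the mean Brier score.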

© 2024 American Meteorological Society. This is an Author Accepted Manuscript distributed under the terms of the default AMS reuse license. For information regarding reuse and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Nicholas Loveday, [email protected]

Weather and Forecasting
