Do your evaluations have enough power?

This is a great question. My sense from the hub work etc. is that evaluations are likely underpowered, but it would be good to think about this more carefully, and even more so to develop some guidance on how to address it when reporting forecast scores.

One idea I think we discussed in the past is Model Confidence Sets, i.e. sets of models whose forecast performance is statistically indistinguishable at a given confidence level, so that no single "best" model can be identified. There seems to be some active work on this, with applications to COVID forecasts in Sequential model confidence sets and to forecasts during particular phases in Conditional model confidence sets.
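To make the idea concrete, here is a minimal sketch of an MCS-style elimination procedure in Python. It is a simplified, illustrative version (not the full studentized statistic from Hansen et al., 2011, and not from either of the papers above): given a matrix of loss values per model, it bootstraps the distribution of the maximum deviation from the average loss under the hypothesis of equal forecast ability, and eliminates the worst model until no further rejection occurs. The function name and all details are my own for illustration.

```python
import numpy as np

def model_confidence_set(scores, alpha=0.1, n_boot=2000, seed=0):
    """Simplified Model Confidence Set via bootstrap elimination.

    scores: (n_forecasts, n_models) array of losses (lower = better),
            e.g. WIS or log-score values per forecast target.
    Returns the column indices of models that survive at level alpha.
    """
    rng = np.random.default_rng(seed)
    n, _ = scores.shape
    surviving = list(range(scores.shape[1]))
    while len(surviving) > 1:
        s = scores[:, surviving]                  # (n, k) losses of remaining models
        dbar = s.mean(axis=0) - s.mean()          # each model's deviation from the overall mean loss
        # bootstrap the max deviation under the null of equal ability
        # (centering removes each model's own mean, imposing the null)
        centered = s - s.mean(axis=0)
        t_boot = np.empty(n_boot)
        for b in range(n_boot):
            idx = rng.integers(0, n, n)           # resample forecast targets with replacement
            sb = centered[idx]
            t_boot[b] = (sb.mean(axis=0) - sb.mean()).max()
        p = (t_boot >= dbar.max()).mean()
        if p < alpha:
            # reject equal ability: drop the model with the largest mean loss
            surviving.remove(surviving[int(np.argmax(s.mean(axis=0)))])
        else:
            break
    return surviving
```

In a real application you would want the studentized statistic and a block bootstrap to handle the serial dependence typical of epidemic forecast scores; the `arch` Python package ships a proper implementation (`arch.bootstrap.MCS`).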
