Baseball Stats, Model Cards, and Forecasting Performance

samabbott · 10 September 2025 12:12

@kath-sherratt and I met today to talk about A basket of baselines - #12 by samabbott. As an offshoot of that conversation, we thought it would be great to write a tightly scoped piece on the idea raised there: evaluating models by their value above replacement.

This would be a remix and extension of some of the ideas from @nickreich and co.'s model importance work, with the extension being replacement. It would show when replacement might be needed by looking at different ensemble sizes, under the assumption that as performance asymptotes with the addition of more models, so should the difference between leave-one-out performance and performance above replacement.

We also thought there was an interesting discussion to be had about what to replace with, i.e. all other models as a permutation, a baseline (hence the link to the basket of baselines), or the ensemble of all other models (we thought this might end up being the same as the all-permutation idea). There is a clear extension here to weighted replacements, i.e. replacing with more similar models, or the extreme categorical version of this, i.e. models from the same “family”. For now we thought we would leave that to the discussion.
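
To make the distinction concrete, here is a minimal sketch of the two quantities for the simplest replacement choice, a single baseline. Everything in it is an assumption for illustration rather than the planned analysis: point forecasts scored with absolute error, an equally weighted mean ensemble, made-up toy numbers, and hypothetical function names (`ensemble_error`, `leave_one_out_importance`, `value_above_replacement`).

```python
from statistics import mean


def ensemble_error(forecasts, truth):
    """Mean absolute error of an equally weighted mean ensemble.

    forecasts: dict mapping model name -> list of point forecasts.
    truth: list of observed values, same length as each forecast list.
    """
    errors = []
    for t, observed in enumerate(truth):
        ensemble_value = mean(f[t] for f in forecasts.values())
        errors.append(abs(ensemble_value - observed))
    return mean(errors)


def leave_one_out_importance(forecasts, truth, model):
    """Change in ensemble error when `model` is dropped (positive = model helps)."""
    without = {k: v for k, v in forecasts.items() if k != model}
    return ensemble_error(without, truth) - ensemble_error(forecasts, truth)


def value_above_replacement(forecasts, truth, model, replacement):
    """Change in ensemble error when `model` is swapped for `replacement`.

    The ensemble size is held fixed, unlike leave-one-out.
    """
    swapped = dict(forecasts)
    swapped[model] = replacement
    return ensemble_error(swapped, truth) - ensemble_error(forecasts, truth)


# Toy data (invented for illustration only).
truth = [10, 12, 14]
forecasts = {
    "a": [10, 12, 14],  # matches the observations
    "b": [12, 14, 16],  # biased high
    "c": [8, 10, 12],   # biased low
}
baseline = [13, 13, 13]  # hypothetical flat baseline forecast

loo_a = leave_one_out_importance(forecasts, truth, "a")
var_a = value_above_replacement(forecasts, truth, "a", baseline)
print(f"leave-one-out importance of a: {loo_a:.3f}")
print(f"value above replacement of a:  {var_a:.3f}")
```

In this toy case the errors of models b and c cancel, so dropping model a costs the ensemble nothing, while swapping it for the baseline makes the ensemble worse. The two measures can therefore disagree in a small ensemble. The other replacement choices above (all other models as a permutation, or the ensemble of the others) would slot in by changing what is passed as `replacement`.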

The plan is that we will draw up a short analysis plan over the next week or so and then reach out to interested parties from here to set up a meeting. Keen to hear people's thoughts.
