Transform and aggregate scoring rules

(This all relates to ideas in Do your evaluations have enough power? - #17 by samabbott about having a common standard of best practice with some clear mechanism for iterating on it). Currently, that is in some sense via software but of course not completely (i.e how you aggregate etc etc).

1 Like