How to Use Independent Validation in Python

Thede von Oertzen; Hannes Diemerling; Timo von Oertzen

doi:10.5964/meth.18873

How to Use Independent Validation in Python

Thede von Oertzen
Thomas Bayes Institute, Berlin, Germany
Hannes Diemerling
Thomas Bayes Institute, Berlin, Germany; Humboldt University, Berlin, Germany
Timo von Oertzen
Thomas Bayes Institute, Berlin, Germany

Abstract

To statistically test whether two groups or models differ, classifier accuracy is compared. However, common accuracy estimates like cross-validation have unknown distributions, making them unsuitable for statistical inference. Alternatives like permutation tests or train-test splits are computationally expensive and limited to frequentist tests against chance. Independent Validation (IV) is a more flexible alternative providing a known estimate distribution. This enables both conventional hypothesis testing and Bayesian analysis of classifier performance. Although Python is most widely used for machine learning, a Python implementation of IV has been lacking so far. This article introduces such an implementation; beyond the core IV algorithm, the package allows to: (1) plot accuracy against training set size, (2) estimate the posterior distribution of the asymptotic accuracy, and (3) query the posterior for statistics and credible intervals. This makes it easy to apply IV when comparing accuracy posteriors across classes, datasets, or classifiers on the same data.

PDF HTML XML

Published at

30. June 2026
https://doi.org/10.5964/meth.18873
Issue:

Vol. 22 No. 2 (2026)
Section:

Original Article
Keywords:

machine learning classification Python implementation independent validation cross validation
Share:

von Oertzen, T., Diemerling, H., & von Oertzen, T. (2026). How to Use Independent Validation in Python. Methodology, 22(2), Article e18873. https://doi.org/10.5964/meth.18873

Download Citation

This work is licensed under a Creative Commons Attribution (CC BY) 4.0 International License.

PlumX

Dimensions

Views:

Total	Abstract	PDF	HTML	XML
209	115

Authors

Abstract