Data science and machine learning algorithms can help us form probabilistic forecasts of things like sporting events.
Are two sets of data genuinely different, or is it because of randomness? This question, known as the two-sample testing problem, becomes notoriously difficult in modern datasets, because they are ...