This is a small project to re-examine statistical analyses from public papers and websites (chosen by interest rather than by any systematic method) using modern techniques and Python tools, with much of the data selection, visualisation and interaction done from Microsoft Excel.

Why? – The modern tools

Many of the techniques used here are not new. What has changed is the tooling: modern tools make them quick and easy to apply. What would have taken significant computing and coding time 15 years ago can now be done in only a few lines of code.

How? – Python machine learning tools, slightly repurposed

The analysis will rely mostly on the Python machine-learning ecosystem, repurposed for fairly traditional hypothesis-based data analysis. The speed, efficiency and programmability of these tools are used to make simple things simple to do.
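As a rough illustration of what "a few lines of code" can look like, here is a minimal sketch of a two-sided permutation test for a difference in means, written with NumPy only. The function name permutation_test, its parameters and the two synthetic groups are assumptions made up for this example. They are not taken from any of the papers or sites examined in this project.

```python
import numpy as np


def permutation_test(a, b, n_permutations=10_000, seed=0):
    """Two-sided permutation test for a difference in means (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    observed = a.mean() - b.mean()
    pooled = np.concatenate([a, b])

    count = 0
    for _ in range(n_permutations):
        # Reassign group labels at random by shuffling the pooled sample
        shuffled = rng.permutation(pooled)
        diff = shuffled[:a.size].mean() - shuffled[a.size:].mean()
        if abs(diff) >= abs(observed):
            count += 1

    # Add-one correction keeps the estimated p-value strictly positive
    p_value = (count + 1) / (n_permutations + 1)
    return observed, p_value


# Synthetic data, generated purely for illustration
rng = np.random.default_rng(42)
group_a = rng.normal(loc=10.0, scale=2.0, size=50)
group_b = rng.normal(loc=12.0, scale=2.0, size=50)

diff, p = permutation_test(group_a, group_b)
print(f"difference in means: {diff:.2f}, p-value: {p:.4f}")
```

A permutation test is used here because it needs no distributional assumptions and its cost is pure computation, which is exactly the kind of work that has become cheap. A classical t-test from a statistics library would be an equally reasonable choice.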





