
One of the core statistical values to be promoted as part of the forthcoming World Statistics Day is integrity. The recent rise of user generated content and DIY internet research (all elements of what is known as “Web 2.0”) means that awareness of this particular value is especially important.
Traditionally, it has generally been the case that organisations such as governments and universities are the outlets which drop statistical bombshells. Now quite literally anyone has access to the tools to do this.
In recent years it has become possible for anyone with basic IT knowledge (and often limited statistical ability) to either host an internet survey or place a poll on a website. Users of such research tools (most of the time) do not show the same integrity statistics professionals a) emit themselves or b) would like to see within their profession. Gone are robust sampling methods, as anyone can partake in “Web 2.0” research exercises, killing notions of representativeness and non-probability sampling. Additionally, there are no mechanisms to stop people omitting their opinion on numerous occasions, causing data sets to be filled with duplicated information. However, and most importantly, those utilising such research tools do not treat the information they gather in a manner which is forthright. Often the data generated by such research will be presented with no regard for statistical reporting protocols. Furthermore, and most worryingly, despite its (often) unrobust nature, the information gathered by such methods may feed into the decision making process.
So what does this all mean? It means that when we promote the integrity of statistics, we must also seek to make “Web 2.0” DIY researchers aware that increasing their statistical integrity will not only improve their work, but also the reputation of all statistical investigations as well. Additionally, it also means we must raise awareness of these methods amongst the untrained readers of statistics, so as to avoid true, integrity driven statistical research being tarnished with any brush which may criticise “statistics 2.0”.
Teaching Old Dogs Old Tricks






