Data Journalism : Do we trust the numbers ?

Data journalism can be rephrased as data-aided journalism. The ability to extract information from various sources is at the heart of the big data process called Data Mining. The current trend in journalism is to rely more on the data mining techniques to gather data that allows for meaningful exploitation of facts and insights that would be otherwise difficult to analyze manually. A lot of news stations have started investing in big data over the past few years but this has led to another problem.
         How can we trust the analysis performed by the new channels ? Not only does the technique of big data create a myth that data analysis serves as conclusive evidence, but also disengages the audience from conducting its own inquiry. I’m not implying here the curation of viewers minds, rather the false trust generated by statistical analysis in general. Statistical analysis which is at the heart of big data is only used to indicate there presence of trends in data. The problem arises when we claim certain properties hold in a system in general only based on trends. In some cases, this is a valid approach.
     Take the example of historical meteorological data. It’s not out of the world to assume that if it rained yesterday and that if the humidity is sufficiently high that it will rain again today. But this kind of predictive analysis cannot be used out of the box to predict the number of crimes that will happen and certainly not to decide whether a person living in a particular area is more likely to commit a crime than other. While there is some truth to it, this decision cannot be made without sufficient analysis of the situation and evidence.
Journalism is "the profession of the skeptic” and should stay that way. It is one of those few usages of big data where too many cooks spoil the broth.
I hope you liked reading about this topic. This writeup is entirely mine and is inspired from the following two web sources

Comments

Popular posts from this blog

Why computers aren't amazingly smart

Making Machines : Introduction

Data Mining and R/Rattle : First Experiment