Reddit 10-year growth analysis and visualization

In this article we will analyze the 10-year growth history of Reddit using simple visualizations. The data for these visualizations is available from this source [thanks to Justin @ Reddit]. Let us compare the traffic stats. The above data is available as monthly aggregates since 2008. So, the next question is how much of … Read more

Marriage and Divorce Rates Analysis Dashboard – CDC Data set

In this analysis we gather CDC data on marriage and divorce rates across all US states for the past several years. As usual, the bulk of the time is consumed by the data gathering, munging, and preparation steps of the analysis. All the data for each year is manually copy/pasted from the website … Read more

InfoCaptor does Big Data with Cloudera Impala and Hive

InfoCaptor now officially works with, and is certified for, Cloudera’s Hadoop distribution, specifically Hive and Impala. Earlier, InfoCaptor supported only the JDBC protocol; now, along with CDH integration, it has introduced ODBC support to take advantage of efficient drivers from Cloudera (Cloudera – Simba drivers) and provide more platform connectivity options. The integration has been … Read more

American football players who died during their career – Data Visualization

This is a data analysis and visualization of American football players who died while still on a team roster or as a free agent. Included are players in the NFL, arena football, and college who have died as a result of team bus and plane accidents such as the plane crashes by Marshall and Wichita … Read more

Venture Capital Investment Analytics on 20 years of investment data

In this article we will take an analytical approach and perform straightforward analytics on a huge dataset available from https://www.pwcmoneytree.com/. PWC Money tree provides a data dump of all venture capital investments from 1995 onwards. Having data that goes this far back in history should give us enough to extract the necessary analytical juice out … Read more

Impact of sports in movies : Simple Data visualization Analysis – Basketball, Football, Hockey, Lacrosse, Miscellaneous sports

The website http://www.the-numbers.com/keywords/ contains a lot of statistics related to movies, and one juicy piece of data is that it categorizes the movie listings by keyword. Using this list, we try to analyze the popularity of sports in movies. 1. How has the featuring of sports as a backdrop or prominent theme increased over the … Read more

Data Visualization : Who made money for the Government on the bailout money?

Here is a quick data analysis of the bailout money disbursement. Many of the entities have already paid the amount back to the government with interest, and the government has made a profit! The government has also lost money on many other organizations/companies that failed to repay. The top … Read more

Thanksgiving Sales – shopping survey

Data source: https://nrf.com/ (PDF used for the visualizations). Analysing the survey results: How Likely Are You Going To Shop On Thanksgiving? Which Of The Following Days Do You Plan To Shop Thanksgiving Weekend? Did you shop on Thanksgiving Day last year? Do You Plan To Shop Specifically For “Small Business Saturday” On Saturday, … Read more

Don’t Expect A Large Salary Increase If You Didn’t Go To College

What is the importance of education and getting a formal degree? As per US Census data, the following chart illustrates that if you have less than a “college degree”, the jumps in your salary are very small as you age. Notice the big jump in median salary with better than college degrees. This is … Read more