Using a lot of data wrangling and NLP, I created a movie and TV show recommender. The user selects a movie or TV show of interest from the drop-down list, and the algorithm recommends a combination of movies and TV shows that the user is most likely to be interested in. I tried to include […]
Tag: Data Analytics

By: Dr. Ikjyot Singh Kohli I wrote an extensive script in R that takes the most recent available data on the number of new/confirmed COVID-19 cases per day by location and uses statistical learning to compute the probability that a selected location will observe a new COVID-19 case (that is, the probability of observing a non-zero daily percentage return). […]

By: Dr. Ikjyot Singh Kohli The conventional wisdom among the political pundits and analysts seeking to explain Joe Biden’s massive win in the 2020 South Carolina primary is that Jim Clyburn’s endorsement was the sole reason Biden won. (Here is just one article describing this.) I wanted to analyze the data behind this and […]
By: Dr. Ikjyot Singh Kohli Election season is upon us again, and a number of people, from political analysts to campaign advisors, are making a huge deal about winning the Iowa caucuses. This seems to be the standard “wisdom”. I decided to run some analysis on the data to see if it was true. I […]
Trump has once again put the U.S. on the world stage, this time at the expense of innocent children whose families are seeking asylum. The Trump administration’s justification is that: “They want to have illegal immigrants pouring into our country, bringing with them crime, tremendous amounts of crime.” I decided to try to […]
By: Dr. Ikjyot Singh Kohli It seems that one cannot turn on ESPN or any YouTube channel nowadays without running into the ongoing debate over whether Michael Jordan is better than LeBron, what would happen if Michael Jordan played in today’s NBA, etc. However, I have not seen a single scientific approach to this question. Albeit, it […]
By: Dr. Ikjyot Singh Kohli So, I came across this article today by Matt Moore on CBSSports, who has once again taken to the web to bash the Triangle Offense. Of course, much of what he claims (like much of the Knicks media) is flat-out wrong, based on very primitive and simplistic analysis, and […]
Where is LeBron James most effective on the court? For the 2015-2016 season, we obtained from NBA.com the following data, which tracks LeBron’s FG% based on defender distance: From Basketball-Reference.com, we then obtained data on LeBron’s FG% based on his shot distance from the basket: Based on this data, we generated tens of thousands of […]
An interesting machine learning problem: Can one figure out the relationship between the popular vote margin, voter turnout, and the percentage of electoral college votes a candidate wins? Going back to the election of John Quincy Adams, the raw data looks like this: Electoral College Party Popular vote Margin (%) Turnout Percentage of EC John […]
In this article, I will use Data Science / Machine Learning methodologies to break down the real factors separating the playoff from non-playoff teams. In particular, I used the data from Basketball-Reference.com to associate 44 predictor variables with each team: “FG” “FGA” “FG.” “X3P” “X3PA” “X3P.” “X2P” “X2PA” “X2P.” “FT” “FTA” “FT.” “ORB” “DRB” “TRB” […]