# Category: Statistics

Using a lot of data wrangling and NLP, I created a movie and tv show recommender: Basically, the user selects from the drop-down list their movie/tv show of interest, and the algorithm will recommend a combination of movies and tv shows that the user is most likely to be interested in. I tried to include […]

## Investigating Prime Numbers

In this post, I attempt to investigate the distribution of prime numbers and the famous Riemann hypothesis.

I wanted to test out R on Apple’s new M1/ARM architecture. Here’s how it went:

Many textbooks when introducing ordinary least squares as a method to do regression (usually via introducing linear regression) don’t explain / derive the coefficients. Here, I explain how to do it using tensor/index notation.

Using mathematics to analyze who is a “better” player By: Dr. Ikjyot Singh Kohli It seems that nowadays one cannot escape the never-ending debate of “Who is better? Michael Jordan or Lebron James?” The situation is made worse by organizations like ESPN that have endless debates on their various shows combined with releasing lists of […]

By: Dr. Ikjyot Singh Kohli I wrote an extensive script in R that takes the most recent data available for the number of new/confirmed COVID-19 cases per day by location and computes the probability using statistical learning that a selected location will observe a new COVID-19 case (probability of observing a non-zero daily percentage return). […]

I wrote an extensive application using NLP and TensorFlow/Keras in Python that looks at all of the current and upcoming Hollywood releases for 2020 and tracks the online Twitter sentiment for each of them. The model output was then displayed in a PowerBI dashboard. In essence, we are predicting the classification probability . You can […]

By: Dr. Ikjyot Singh Kohli The conventional wisdom by the political pundits/analysts who are seeking to explain Joe Biden’s massive win in the 2020 South Carolina primary is that Jim Clyburn’s endorsement was the sole reason why Biden won. (Here is just one article describing this.) I wanted to analyze the data behind this and […]

Abstract: It is shown that the standard/common definition of team offensive rating/offensive efficiency implies that a team’s offensive rating increases as its opponent’s offensive rebounds increase, which, in principle, should not be the case. Over the past number of years, the advanced metric known as Offensive Rating has become the standard way of measuring a basketball […]