Categories
Data Science

2025 MLB Win Projections

We have designed and implemented a new machine learning model that projects the total number of wins for each team in a given season. We will update this each day. As of: Wednesday, August 13, 2025 Team Projected Wins (+/-4 Wins) Philadelphia Phillies 95 Milwaukee Brewers 92 Boston Red Sox 90 Los Angeles Dodgers 89 […]

Categories
Basketball Data Science Mathematics NBA Sports Statistics

NBA 2024-2025 Win Predictions

Below, are the predicted NBA team wins for the 2024-2025 seasons (within a margin of error of +/- 5 wins) based on our novel ML algorithm to predict team wins. We collected over 20 years of NBA team data and applied advanced nonparametric, nonlinear models to uncover the relationship between various performance indicators and a […]

Categories
Data Analytics Data Science Mathematics Physics Statistics

Relating the Ridge and Lasso to Physics

Categories
Data Analytics Data Science Hollywood Mathematics Movies Statistics

Movie and TV Show Recommender

Using a lot of data wrangling and NLP, I created a movie and tv show recommender: Basically, the user selects from the drop-down list their movie/tv show of interest, and the algorithm will recommend a combination of movies and tv shows that the user is most likely to be interested in. I tried to include […]

Categories
Data Analytics Data Science education Mathematics Statistics

Deriving Least Squares Coefficients Using Tensor/Index Notation

Many textbooks when introducing ordinary least squares as a method to do regression (usually via introducing linear regression) don’t explain / derive the coefficients. Here, I explain how to do it using tensor/index notation.

Categories
Biology Data Analytics Data Science Mathematics Statistics

Coronavirus Predictions

By: Dr. Ikjyot Singh Kohli I wrote an extensive script in R that takes the most recent data available for the number of new/confirmed COVID-19 cases per day by location and computes the probability using statistical learning that a selected location will observe a new COVID-19 case (probability of observing a non-zero daily percentage return). […]

Categories
Basketball Mathematics NBA Sports Statistics

A Problem With Offensive Rating

Abstract: It is shown that the standard/common definition of team offensive rating/offensive efficiency implies that a team’s offensive rating increases as its opponent’s offensive rebounds increase, which, in principle, should not be the case. Over the past number of years, theย advanced metric known as Offensive Rating has become the standard way of measuring a basketball […]

Categories
Basketball Data Analytics Data Science Mathematics NBA NBA Playoffs Sports Statistics

The Risk of The 3-Point Shot

As more and more teams are increasing the number of threes they attempt based on some misplaced logical fallacy that this somehow leads to an efficient offense, we show below that it is in fact in a team’s opponent’s interest for a team to attempt as many three point shots as possible. Looking at this […]

Categories
Basketball Data Analytics Data Science Mathematics Science Sports Statistics

An Equation to Predict NBA Playoff Probabilities

Based on a previous paper I wrote that used machine learning to determine the most relevant factors for teams making the NBA playoffs, I did some further analysis in an attempt to come up with an equation that outputs the probability of an NBA team making the playoffs in a given season. From the aforementioned […]

Categories
Basketball Mathematics NBA Statistics

When is it optimal to shoot a 3-Point Shot

A very interesting result: computing payoffs of players, the following is a diagram that shows when it is optimal for a player to shoot a 2 point or a 3-point shot. One sees that it is hardly ever optimal for a player to shoot a 3-point shot, since the region corresponding to 3-point optimality is […]