Posts by Category

Data Analysis Fundamentals

Neural Network Regression

17 minute read

The following is a subsection of a tutorial I co-authored that I wanted to reproduce here to work out how to get the figures to render properly. Plus, it nev...

Random Forests

25 minute read

As with support vector machines, and as the name implies, classification and regression trees (CART) can be used for either classification or regression task...

Support Vector Machines

9 minute read

Support Vector Regression (SVR) attempts to include as many data points as possible in the area between two lines. The following figure demonstrates this usi...

Generalized Additive Models

6 minute read

Recall that if there is a non-linear relationship between predictor and response, we can attempt to transform the predictor using a known function (log, reci...

Non-Parametric Analysis Of Variance

6 minute read

As we’ve seen in the last few posts, linear models can be successfully applied to many data sets. However, there may be times when even after transforming va...

Advanced Experimental Designs

25 minute read

So far, we have considered designs with just two levels per factor, which is fine for inherently binary factors. However, if our study requires us to include...

Logistic Regression

7 minute read

So far, we’ve seen regression techniques for continuous and categorical response variables. There is a different form of regression called logistic regressio...

Transforming Non-Linear Variables

8 minute read

If we have data that includes factors with more than two levels, we have the ability to evaluate non-linear relationships between predictor and response vari...

Model Selection

20 minute read

This post presents methods for finding a balance between underfitting and overfitting a model. Underfitting is when the model is a poor predictor of the r...

Multiple Linear Regression

16 minute read

In the previous section we considered just one predictor and one response. The linear model can be expanded to include multiple predictors by simply adding t...

Simple Linear Regression

7 minute read

The purpose of regression is to describe a relationship that explains one variable (the response, or the “y” variable) based on one or more other variables (...

Generative Art

Maze Generator

7 minute read

This generative art seems less artsy to me, but the idea of generating a maze was interesting. I found the mazegenerator GitHub repo that described the algor...

KNN Art

4 minute read

The generative art journey continues with images produced from machine learning models. In this post, I’ll use k-nearest neighbors to produce two types of ar...

Perlin Noise Flow Fields

8 minute read

In this post, I’ll generate art based on the idea of a material flowing through an area. Again, this was inspired by the aRtsy package, which has a canvas...

Space Colonization

8 minute read

Continuing the series of generative art posts, I liked the look of the canvas_petri() images from the aRtsy package. That led me to this article that describ...

The Recaman Sequence

3 minute read

The next stop on my generative art journey took me to the Recaman Sequence. As opposed to many other algorithms, there’s no randomness in this sequence - the...

The Collatz Conjecture

6 minute read

The inspiration for a series of posts on generative art was a post I read announcing the aRtsy package for producing generative art. A number of the example ...

Sports Analytics

College Football Predictions

44 minute read

Continuing the streak of college football posts, here I finally get into developing some models to predict the winning team and margin of vi...

Simple Rating System

10 minute read

This, to me, is the coolest part. I had no idea this is how some rating systems work, and it’s pretty slick. It’s just one big system of equations that you s...

Oregon Football

20 minute read

I’m not a huge football fan. I can’t even remember the last time I watched an NFL game. However, I do sort of follow college football. And being from Oregon,...

General Analysis

TriPeaks Solver

11 minute read

During COVID-19 brain fog, I had just enough mental capacity to play a lot of Microsoft Solitaire games. I found that TriPeaks on master and grandmaster diff...

Best Wordle Starting Word

10 minute read

I was scrolling through the RStudio blog page, and a post by Arthur Holtz caught my attention. He wrote some code to identify what the optimal starting word ...

GitHub Personal Pages

Plotly & Markdown

4 minute read

In a previous post, I created a scatterplot using ggplot2 that displayed college football team logos as the markers using geom_image() from the ggimage packa...
