inf-428-data-analytics-online

Videos

2021

2021 is the second year INF428/528 has been taught online, and the course continues to evolve. We will reuse some of last year's videos as needed, and I will produce new videos based on the questions I get. So if you have a question, please post it right away to the “ask a question” forum. If a question can’t be answered concisely, it will become a video.

Module 1 - Introduction

Some tips on finding videos, GitHub Desktop, and Jupyter Notebook

Loading Sports Stats from the ‘reference.com’ sites

Module 2 - Statistics

Some tips for the Module 2 assignment, and an example about a hypothetical Covid vaccine

Open KNIME, connect to KNIME Server, get the example workflow, and modify it to use Jayson Tatum’s stats

KNIME T-test (from last year, but still helpful for assignment 2)

Some helpful tips on GitHub and confidence intervals, relevant for assignment 2

Module 3 - Data Wrangling

Wrangling airline data

Python Data Wrangling

KNIME BAR Plot

Conditional Indexing with Covid data

Group Obesity Stats

Troubleshooting for assignment 3

Write a CSV from Python, read it from KNIME (useful for assignment 3)

Module 4 - Data Visualization

Heatmaps and converting categorical data to numeric (Advanced)

Heatmaps and barcharts in KNIME (Intermediate)

Visualize Time Series with KNIME

Visualizing data using KNIME JavaScript nodes

Visualizing data using Python, Matplotlib, and Pandas

Module 5 - Machine Learning

Machine Learning Terminology

Machine Learning with KNIME

Machine Learning Metrics

Machine Learning with Python

Bayes’ Rule and Naive Bayes with KNIME

Self Prediction

Hints on converting string data to numeric for assignment 5

2020

2020 is the first year INF428/528 has been taught online, and it has been a learning experience for me in finding the best way to teach the material. I have been adding videos as needed, based on the questions I get, and I will continue to do so and summarize them here. If you are struggling with something, ask, and I will make a video.

Oh yeah… to top it all off, all of a sudden we also got hit by a pandemic and everybody’s life got disrupted a bit (this class was online anyway so I guess we’ve been disrupted less than others). Stay safe.

Videos for assignment 2

KNIME T-test

Videos for assignment 3

Python Data Wrangling

KNIME BAR Plot

Python help + Python barplots for groups

Videos for assignment 4

Visualize Time Series with KNIME

Cleaning numerical data (Adirondack Mountain data)

Videos for assignment 5

Machine Learning Terminology

Machine Learning with KNIME

Machine Learning Metrics

Machine Learning with Python

Bayes’ Rule and Naive Bayes with KNIME

Self Prediction