This lesson is in the early stages of development (Alpha version)

Introduction to Machine Learning with Scikit Learn: Setup

Software Packages Required

You will need to have an installation of Python 3 with the matplotlib, pandas, numpy and opencv packages. If you can’t successfully install opencv, you may use Goolge Colab on day 2 of the workshop.

The Anaconda Distribution includes all of these except opencv by default.

Installing OpenCV with Anaconda

Installing from the Anaconda command line

From the Anaconda terminal run the command conda install -c conda-forge opencv

Download the code and data

  1. Please create a directory called IntroML located on your desktop. In this directory, create a subfolder called data (i.e., /Users/username/Desktop/IntroML/data)

  2. Download the following python scripts and place them in the IntroML directory located on your Desktop (e.g., /Users/username/Desktop/IntroML/regression_helper_functions.py)
  3. Download the the following files and place them in the data subfolder you just created (e.g., /Users/username/Desktop/IntroML/data/gapminder-life-expectancy.csv):

If you are using a Mac or Linux system the following commands will download this:

mkdir data
cd data
wget https://scw-aberystwyth.github.io/machine-learning-novice/data/worldbank-gdp.csv
wget https://scw-aberystwyth.github.io/machine-learning-novice/data/worldbank-gdp-outliers.csv
wget https://scw-aberystwyth.github.io/machine-learning-novice/data/gapminder-life-expectancy.csv