It recently occurred to me that it's not obvious how to run the code for the projects I post here on your own computer. If you don't already know, all of the code for the projects I post to this website is completely open to the public; in fact, the source code for this website is, too. This website was built with GitHub Pages, so all of the assets and code are in this GitHub repo (I originally built the site with Python/Flask, but I got tired of paying AWS hosting fees).
Running Python code starting from zero can be daunting. This guide is meant to help you through the process.
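There are six main steps to running code from a Python project:
1. Install Python
2. Install a package manager
3. Install Git
4. Clone the project's repository
5. Install the dependencies
6. Run the code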
Now, if you have used Python before, you have probably already done steps one and two. I am going to cover those anyway, but feel free to skip to step three.
Python is available through many different distributions, and one of the most common is Anaconda. Anaconda is a free distribution of Python that is available for Windows, Mac, and Linux. The Anaconda distribution is primarily geared toward scientific computing, but it can be used for many other purposes. I highly recommend using Anaconda over other distributions.
Anaconda has Commercial, Team, and Enterprise versions, but all you need is the free individual edition which you can get here. Just download the installer and run it. Chances are you can use all of the default install options.
Installing Anaconda will actually install a few more things than just Python, but we will get to that later.
Before we install a package manager, let me explain why we need one in the first place.
One of the most frustrating parts of getting started with coding (especially with Python) is dealing with package management. I don't think there is nearly enough easily digestible documentation on how to install packages and maintain usable virtual environments, which makes it even more difficult to get started.
Packages are the libraries that a script loads with lines such as import pandas or import matplotlib.pyplot as plt.
For each of my projects, I set up a separate virtual environment to avoid package conflicts and ensure that the packages and versions used for that project are saved as a snapshot in time.
Python is known to have a dedicated and active developer community, which can be a double-edged sword sometimes. Constant new releases of packages can be a headache if you are not careful about maintaining separate virtual environments.
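To make the idea of "packages and versions" concrete, here is a minimal sketch that reports which version of a package is installed in whatever environment is currently running Python. The helper name installed_version is made up for this illustration, and pandas and matplotlib are only example package names; it relies on the standard library module importlib.metadata (Python 3.8 or newer).

```python
from importlib import metadata


def installed_version(package_name):
    """Return the installed version string of a package, or None if it is missing.

    installed_version is a hypothetical helper written for this example.
    """
    try:
        return metadata.version(package_name)
    except metadata.PackageNotFoundError:
        return None


# Example package names only; swap in whatever your project imports.
for name in ("pandas", "matplotlib"):
    version = installed_version(name)
    print(name, version if version else "is not installed in this environment")
```

Running this in two different environments will often print two different sets of versions, which is the kind of drift that separate environments keep under control.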
Technically, a package manager/installer and virtual environment manager are two different things. The package manager is used to install and manage packages, while the virtual environment manager is used to manage the virtual environments. For example, pip is a package manager, while virtualenv is a virtual environment manager. However, we are not going to use those tools here.
My preferred package manager is conda, which is also an environment manager (two for one!). Even better, installing Anaconda will install conda as well, automatically. So, if you did step 1, then you can skip to step 3.
Just to reiterate: Anaconda is a distribution of Python, while conda is the package & virtual environment manager that comes with Anaconda.
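If you want to double-check that conda really came along with Anaconda, one rough way is to ask Python whether a conda executable is visible on your PATH. This is only a sketch: find_conda is a made-up helper name, and on Windows the check may only succeed when Python is started from Anaconda Prompt, since that is where conda is usually put on the PATH.

```python
import shutil


def find_conda():
    """Return the full path of the conda executable, or None if it is not on PATH.

    find_conda is a hypothetical helper written for this example.
    """
    return shutil.which("conda")


conda_path = find_conda()
if conda_path is None:
    print("conda was not found on PATH; try again from Anaconda Prompt or a new terminal")
else:
    print("conda found at", conda_path)
```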
Luckily, great documentation already exists about Git, how it works, and how to use it, so I won't explain too much here.
The key takeaways are:
If you have never installed or used Git before, I recommend that you simply install GitHub Desktop, which will also install Git for you. GitHub Desktop has a nice interface that makes it easy to start using, as opposed to using Git in the command line interface, which can be daunting. I also recommend that you take the time to read the GitHub documentation to learn more about Git and hosting services like GitHub.
First, you are going to want to navigate to the project's repository. This is where you will be cloning the code from. For any Python project on my website, you can navigate to its main repo with the GitHub link at the top of the project page.
If you are using GitHub Desktop, then follow the instructions to clone a repo here.
This will copy the code from the repository to your computer. You get to choose where you want to save the code. By default, GitHub Desktop will save the code in Users/<username>/Documents/GitHub/<repo_name>. Changing the code in this directory will not affect the code in the remote repository unless the changes are committed and pushed to it, and pushing to my repository requires access granted from my side. So, feel free to play around with the code all you want.
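If you are not sure where the clone ended up, the default location described above can be rebuilt in Python. This is a small sketch that assumes you kept the GitHub Desktop default; default_clone_path is a made-up helper name and PySquiggleDraw is used only as an example repo name.

```python
from pathlib import Path


def default_clone_path(repo_name):
    """Build <home>/Documents/GitHub/<repo_name>, the default location described above.

    default_clone_path is a hypothetical helper written for this example.
    """
    return Path.home() / "Documents" / "GitHub" / repo_name


repo_dir = default_clone_path("PySquiggleDraw")
if repo_dir.is_dir():
    print("Found the cloned repo at", repo_dir)
else:
    print("No folder at", repo_dir, "(was it cloned somewhere else?)")
```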
Okay, so now that you have the code, you need to install the dependencies before you can get it to run. This is where the package manager comes in.
Each of my Python projects comes with an environment specification (or "env spec" for short) file, which is the set of instructions that conda uses to recreate the environment I used to develop the project.
For example, here is the env spec for the PySquiggleDraw project:
The env spec…
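To give a feel for the general shape of an env spec, here is a made-up example together with a tiny reader for it. Everything in it is an assumption for illustration: the environment name, channel, and package list are hypothetical and are not the contents of the PySquiggleDraw file (the real one lives in that project's repo), and read_simple_spec only handles flat files like this one rather than YAML in general.

```python
# A HYPOTHETICAL env spec, written only to show the typical layout.
# It is NOT the real PySquiggleDraw environment.yml.
EXAMPLE_ENV_SPEC = """\
name: example-env
channels:
  - conda-forge
dependencies:
  - python=3.10
  - pandas
  - matplotlib
  - jupyterlab
"""


def read_simple_spec(text):
    """Read a flat env spec into a dict; not a general YAML parser."""
    spec = {}
    current_key = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line.startswith("- "):
            # A list item belongs to the most recent "key:" line.
            if current_key is not None:
                spec[current_key].append(line[2:].strip())
        elif ":" in line:
            key, _, value = line.partition(":")
            value = value.strip()
            if value:
                spec[key] = value      # e.g. "name: example-env"
                current_key = None
            else:
                spec[key] = []         # e.g. "dependencies:" starts a list
                current_key = key
    return spec


spec = read_simple_spec(EXAMPLE_ENV_SPEC)
print("Environment name:", spec["name"])
print("Dependencies:", ", ".join(spec["dependencies"]))
```

The name field is what conda uses as the environment name by default, and the dependencies list is what it has to install.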
Now that we have the instructions, let's recreate the environment.
First, open the Anaconda Prompt application, which should have been installed with Anaconda. (If you are using macOS, you can just use Terminal.)
Navigate to the directory where the repo was cloned. If you are using GitHub Desktop with the default location, this will look like:
Windows (using Anaconda Prompt)
(base) C:\Users\jackson>cd C:\Users\jackson\Documents\GitHub\PySquiggleDraw
(base) C:\Users\jackson\Documents\GitHub\PySquiggleDraw>
macOS (using Terminal, zsh)
(base) Jackson@Jacksons-Computer ~ % cd /Users/jackson/Documents/GitHub/PySquiggleDraw
(base) Jackson@Jacksons-Computer PySquiggleDraw %
Obviously, you would need to replace the username with your own. Here we are using PySquiggleDraw as an example, but you can use any project.
Then, run conda env create -f environment.yml. This will create a new environment based on the instructions in the env spec, which is located in the folder you just navigated to. The name of the environment will be the one specified in the env spec. If you want to name it something else, you can run conda env create -f environment.yml --name <your-env-name-here> instead.
Before conda can create the environment, it first has to solve the environment (figure out non-conflicting dependencies) based on the instructions in the env spec. This can take a while, so be patient.
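If you ever want to script this step instead of typing it, the two forms of the command can be assembled in Python. This is only a sketch: build_create_command is a made-up helper name and my-env is a placeholder environment name. It just builds and prints the command; actually running it (for example with subprocess.run) is left to you.

```python
import shlex


def build_create_command(spec_file="environment.yml", env_name=None):
    """Return the conda command (as a list) that recreates an environment.

    build_create_command is a hypothetical helper written for this example.
    """
    command = ["conda", "env", "create", "-f", spec_file]
    if env_name:
        # Optional: override the name given inside the env spec.
        command += ["--name", env_name]
    return command


# Use the name from the env spec:
print(shlex.join(build_create_command()))
# Use your own name instead ("my-env" is just a placeholder):
print(shlex.join(build_create_command(env_name="my-env")))

# To actually run it from Python, something like this should work:
#   import subprocess
#   subprocess.run(build_create_command(), check=True)
```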
Once the environment is created, run conda activate <env-name>. This will activate the environment and allow you to run the project code.
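A quick way to confirm that the activation worked is to ask Python which environment it thinks it is running in. The sketch below reads the CONDA_DEFAULT_ENV environment variable, which conda sets when an environment is activated; active_conda_env is a made-up helper name.

```python
import os
import sys


def active_conda_env(environ=None):
    """Return the name of the active conda environment, or None if there is none.

    active_conda_env is a hypothetical helper written for this example.
    """
    environ = os.environ if environ is None else environ
    return environ.get("CONDA_DEFAULT_ENV")


print("Active conda environment:", active_conda_env() or "none detected")
# The interpreter path should point inside the environment's folder.
print("Python interpreter:", sys.executable)
```

If the name printed is base rather than the project's environment, the conda activate step probably has not been run in this terminal yet.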
Now that we have the dependencies installed and the environment activated, we can run the code.
Depending on the project, this might mean running Python scripts from the command line, but most of the time I like to use Jupyter Notebooks. Jupyter Notebooks allow you to run code in an interactive cell-based interface.
If Jupyter is part of the env spec, then you can launch JupyterLab by running jupyter lab (or jupyter notebook) on the command line once the environment is activated.
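If you are not sure whether Jupyter is part of the environment you just activated, one rough check is to see whether its Python modules can be found. This sketch assumes the usual module names jupyterlab (for jupyter lab) and notebook (for jupyter notebook); is_importable is a made-up helper name.

```python
import importlib.util


def is_importable(module_name):
    """Return True if a top-level module can be found in the current environment.

    is_importable is a hypothetical helper written for this example.
    """
    return importlib.util.find_spec(module_name) is not None


for module in ("jupyterlab", "notebook"):
    status = "available" if is_importable(module) else "not installed"
    print(module, "is", status)
```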
If you have not used Jupyter before, then I recommend you download and install VS Code. VS Code is an all-around great IDE that has a lot of features and extensions, and it can also run Jupyter notebook files (.ipynb). VS Code will also help by installing Jupyter dependencies for you if they are not there already, letting you choose which environment to use, and taking care of other technical aspects that might not be clear to a beginner.
Also, VS Code has fantastic resources to help you get started with Jupyter.
The basic steps to run the code from my projects are:
That's it! Thanks for reading!