Digital Education Resources - Vanderbilt Libraries Digital Lab
These lessons will introduce the key features of pandas with an emphasis on reusable code examples and typical ways to create visualizations from DataFrames. This is an intermediate level series that assumes you have a basic understanding of how Python operates and of basic Python objects like lists and dictionaries.
Working session date | Lesson | Topic | Web page (times are total video length) |
---|---|---|---|
Nov 2 | 1 | Introduction to pandas Series | optional lesson on NumPy (53 min) / lesson videos (34 min) |
Nov 9 | 2 | Introduction to pandas DataFrames: loading data and easy operations | lesson videos (56 min) |
Nov 16 | 3 | Extracting and changing data in DataFrames | lesson videos (25 min) / optional lesson on summarizing and rearanging DataFrames (18 min) |
Nov 30 | 4 | Introduction to Matplotlib | lesson videos (22 min, 42 min with optional videos) |
Date | Session | Topic | Notes |
---|---|---|---|
Mar 10 | 1 | Getting started | intro web page, presentation, and video / Anaconda / install Thonny / install Python / install editor / code examples / colab notebook / example Jupyter notebook |
Mar 17 | 2 | Python scripting basics | lesson webpage |
Mar 24 | 3 | Object-oriented Python | lesson webpage / video |
Mar 31 | 4 | Lists and loops | lesson webpage / video |
Apr 7 | 5 | Dictionaries and JSON | lesson webpage / video |
Apr 14 | 6 | Input and output from files | lesson webpage / video |
Apr 21 | 7 | Interacting with the Internet | lesson webpage / video |
These lessons were taught by Dr. Sanjay Mishra, Staff Scientist in the Vanderbilt Ingram Cancer Center and former Data Science instructor at the Nashville Software School. They focused on analyzing and visualizing large data sets with Python and used Pandas and Matplotlib. The lessons assume that you have basic Python skills. The examples use Microsoft Azure hosted Jupyter notebooks, which you can access them with any Microsoft compatible login credentials, including Vanderbilt / VUMC logins. If you have a functioning Anaconda installation on your local computer you can download the Jupyter notebooks and run them locally.
Dr. Mishra recommends the book Python for Data Analysis - Data Wrangling with Pandas, NumPy, and IPython by William (Wes) McKinney as a reference for this lesson. Vanderbilt users can access the eBook through the Heard Libraries subscription to O’Reilly For Higher Education media (VUNet ID and password required) at this link. Non-Vanderbilt users can access a free PDF of the first edition of the similar pandas: powerful Python data analysis toolkit at the Pandas home page. To access the code examples, go to Wes McKinney’s GitHub site and click on the appropriate IPython Notebook for the chapter. Note: IPython is an older name for Jupyter notebooks.
Date | Session | Topic | Notes |
---|---|---|---|
Mar 19 | 1 | Importing data (Pandas, slicing) | lesson video / Jupyter notebook / Azure project link After clicking, clone the project to your account in order to run it in the cloud. Click on the Download Project link if you want to download the project and open it in a local Jupyter notebook. The notebook for this lesson is intermediate_python_1.ipynb / Review of modules and packages |
Mar 26 | 2 | Creating figures with Matplotlib | lesson video / Jupyter notebook / Azure project link The notebook for this lesson is intermediate_python_2.ipynb. See the first part of the lesson video for more information about how to access the data in the notebook on a local Jupyter notebook or in the cloud through Azure / See notes above to access the O’Reilly book mentioned in the lesson |
Apr 2 | 3 | Curve fitting | lesson video / Jupyter notebook / Azure project link After clicking, clone the project to your account in order to run it in the cloud. Click on the Download Project link if you want to download the project and open it in a local Jupyter notebook. The notebook for this lesson is Intermediate_Python_3.ipynb / starting page for LaTex information |
Apr 9 | 4 | Basic image processing | lesson video / Jupyter notebook / Azure project link After clicking, clone the project to your account in order to run it in the cloud. Click on the Download Project link if you want to download the project and open it in a local Jupyter notebook. The notebook for this lesson is Intermediate_python_4.ipynb |
Apr 16 | 5 | Reproducible results: notebooks, LaTex, and presentations | lesson video / Jupyter notebook / Azure project link After clicking, clone the project to your account in order to run it in the cloud. Click on the Download Project link if you want to download the project and open it in a local Jupyter notebook. The notebook for this lesson is Intermediate_python_5.ipynb |
Jupyter notebook links for lesson 5
Extensions for Jupyter notebooks
Code folding for Jupyter notebooks
nbconvert for supressing input code in final rendered display
RISE system for presentations from notebooks
Date | Session | Topic | Notes |
---|---|---|---|
Nov 1 | 1 | HTTP and APIs | Jupyter notebook, Presentation |
Nov 8 | 2 | API search and authentication | Jupyter notebook |
Nov 15 | 3 | HTML and web page structure | Jupyter notebook, Presentation |
Nov 22 | 4 | Scraping with Beautiful Soup | Jupyter notebook |
Dec 6 | 5 | Project | Jupyter notebook for Twitter location search project |
University of Alabama Libraries Scholarly API Cookbook
Working session date | Lesson | Topic | Web pages |
---|---|---|---|
Oct 13 | 1 | Data from files | lesson videos (estimated 55 min) |
Oct 20 | 2 | Complex data structures and functions | lesson videos (30 min) |
Oct 27 | 3 | Reading and writing CSV files | lesson videos (34 min) |
Nov 3 | 4 | Pandas Series and DataFrames: vectorized programming | lesson videos (61 min but not all videos apply to every user) |
Nov 10 | 5 | Basic data wranging with Pandas DataFrames (first half) | lesson videos (40 min) |
Nov 17 | 6 | Basic data wranging with Pandas DataFrames (second half) | lesson videos (35 min) |
Questions? contact Steve Baskauf
Revised 2023-08-03
Questions? Contact us
License: CC BY 4.0.
Credit: "Vanderbilt Libraries Digital Lab - www.library.vanderbilt.edu"