This is a private internal preview of DISCOVERY 2.0. Do not share externally.

GPA Dataset (All Courses since Spring 2010)


The "GPA Dataset" contains the GPAs for courses at The University of Illinois over a ten year period from Spring 2010 through Spring 2020.

  • Dataset Format: Well-formatted CSV with column headers as the first row
  • Dataset Size: 53,933 rows ร— 21 columns
  • CSV File Location: https://waf.cs.illinois.edu/discovery/gpa.csv
  • Dataset Variables:
    • Year : number โžœ The year the course section was offered
    • Term : string โžœ The term the course section was offered (Fall, Spring, Summer, or Winter)
    • YearTerm : string โžœ A formatted combination of the year and term
    • Subject : string โžœ 2-4 letter subject code for the course; ex: HIST for History, CS for Computer Science, etc.
    • Number : number โžœ 1-3 digit course number
    • Course Title : string โžœ A short title describing the course
    • A+ : number โžœ The number of A+ grades given in the course section
    • A : number โžœ The number of A grades given in the course section
    • A- : number โžœ The number of A- grades given in the course section
    • B+, B, B-, C+, C, C-, D+, D, D-: number โžœ The number of grades given in the course section
    • F : number โžœ The number of F grades given in the course section
    • Primary Instructor : string โžœ The primary instructor of record in the course section
    • Students: The number of students per course section

Using the GPA Dataset in Python

The dataset can be loaded using the pandas library in Python:

import pandas as pd
df = pd.read_csv("https://waf.cs.illinois.edu/discovery/gpa.csv")
df
YearTermYearTermSubjectNumberCourse TitleA+AA-B+BB-C+CC-D+DD-FWPrimary Instructor
02020Spring2020-spAAS100Intro Asian American Studies1212102101000010Boonsripaisal, Simon
12020Spring2020-spAAS100Intro Asian American Studies512414120000010Boonsripaisal, Simon
22020Spring2020-spAAS100Intro Asian American Studies08736320200000Uhall, Michael B
32020Spring2020-spAAS100Intro Asian American Studies86204112100000Uhall, Michael B
42020Spring2020-spAAS100Intro Asian American Studies54314112110110Kang, Yoonjung
..................................................................
539282010Summer2010-suSTAT410Statistics and Probability II510225101300021Stepanov, Alexei G
539292010Summer2010-suSTAT440Statistical Data Management412813000000000Unger, David
539302010Summer2010-suTAM212Introductory Dynamics01325751102010Morgan, William T
539312010Summer2010-suTAM251Introductory Solid Mechanics12215033200110Ott-Monsivais, Stephanie
539322010Summer2010-suTAM335Introductory Fluid Mechanics58945104001020Keane, Richard D

The full GPA dataset stored in a DataFrame (53,933 rows).

Pages Using the GPA Dataset

  1. Video Walk-Through & Worksheet: Learn Page: GPA Dataset and Operations on Groups
  2. Video Walk-Through & Worksheet: Learn Page: Aggregation Functions for groupby Groups
  3. Video Walk-Through & Worksheet: Learn Page: Examples Aggregating groupby Groups
  4. Video Walk-Through & Worksheet: Learn Page: Reading Histograms (Part 1)
  5. Video Walk-Through & Worksheet: Learn Page: Histograms, Bar Charts, and Box Plots
  6. Learn Page: Python for Data Science: Introduction to DataFrames