Big Data Analytics

This page contains links to html renders of Databricks notebooks that I developed in Summer 2020 for the following two courses offered at Maryville University:

  • DSCI 417 - Big Data Analytics (Undergraduate)
  • DSCI 617 - Big Data Analytics (Graduate)

These courses cover the use of Apache Spark for performing distributed data analysis. The courses emphasize the use of PySpark to develop applications within a Databricks environment.