Skip to content

ilovemanu/ds504_big_data_analytics

Repository files navigation

ds504_big_data_analytics

In this individual project, I applied both classification and regression approaches to predict the processing time of BOS:311 service request. Analysis were first written in Python with pandas in Jupyter Notebook, then rewritten in PySpark in Databricks notebook as practice.

The complete story is described in the pdf file. Code implementation can be found in the three ipynb notebooks. If the notebooks won't load, open each link with https://nbviewer.jupyter.org/.

Data source:

About

DS504 Big Data Analytics Individual Final Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published