Skip to content

kherath6/Recommender-Systems-Projects

Repository files navigation

Recomender Systems using Python

  • Collaborative Filtering
  • Content Based Filteirng

This project was created for demo purposes for MIS 753 - Independant study class that I took in Fall 2021 on Recommender Systems.

The porpose of this project is to provide an introduction to recommender systems using the python programming language. The specific emphasis is given to collaborative filtering and content based filtering. Within the repo you can find 2 jupyter notebooks (one for each topic).

Each notebook goes through data cleaning, text processing, text vectorization, modeling and recommendation generation.

Data

  • The main data source for this project is: yelp open dataset - https://www.yelp.com/dataset
  • This dataset contains inforamtion about 160,000 businessess and 0ver 8 mission customer reviews for those businessess.
  • This dataset contains 5 json files (business, tip, reviews, user, checkin).
  • I downloaded all of these files and queried these files using mongoDb and Studio3T.
  • I then created subsets of these datasets via sql queries on Studio3T and saved subsets of the data using Stuidio3T as csv files.
  • business.csv file contains information about restaurants and reviews.
  • business250k.csv file contains information about 250k reataurents and their reviews.
  • All data files can be found in the Data directory of the repo.

Jupyter Notebooks

  • The repo contans 2 jupyter notebooks,
  • One for collaborative filtering - dataset used: business.csv
  • The second for content based filtering - dataset used: business250k.csv

Sources:

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published