Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters

Version 1.0.0

A quick guide on installation of important libraries and running the code.

The project has three .ipynb files – Data Scraper.ipynb, cosine-similarity-wo-tf-idf.ipynb, and cosine-similarity-w-tf-idf.ipynb.


Data Scraper

For the Data Scraper python script, we need to import the following three libraries – requests, BeautifulSoup, and pandas. The installation process can be viewed by clicking on the respective library names.

Splash

In this project, instead of using the default web browser to scrape data, we have created a splash container using docker. Splash is a light-weight javascript rendering service with an HTTP API. For easy installation, you can watch this amazing video by John Watson Rooney on YouTube.

 

 

 

To finish reading, please visit source site