Terminal application using natural language processing and web-scraping via Python/PyCharm

smhussain5/Politico-NLP-Python

POLITICO NLP PYTHON

Politico NLP Python GIF Demonstration

Problem 🤔

Utilizing multiple libraries to develop innovative solutions is key to being a competent software engineer. In today's fast-paced landscape, staying informed is important but difficult to do!

Solution 💡

This terminal application uses Beautiful Soup to scrape the Politico website for the day's top stories. After collecting the article links, it uses natural language processing (NLP) libraries to summarize each article's text and calculate a polarity score to further inform the reader. The results are displayed in the terminal for convenient reading.
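
The flow described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: it assumes Newspaper3k produces the summary and TextBlob produces the polarity score, and the names `BASE_URL`, `extract_article_links`, `summarize_article`, and `describe_polarity`, as well as the 0.1 threshold, are hypothetical. The link filter simply keeps every same-site link, since Politico's real markup is not described here.

```python
from urllib.parse import urljoin, urlparse

BASE_URL = "https://www.politico.com"  # assumed entry point for scraping


def extract_article_links(html, base_url=BASE_URL):
    """Return unique absolute links in the HTML that stay on the base site."""
    # Third-party import kept local so the pure helper below works without it.
    from bs4 import BeautifulSoup

    soup = BeautifulSoup(html, "html.parser")
    host = urlparse(base_url).netloc
    links = []
    for anchor in soup.find_all("a", href=True):
        url = urljoin(base_url, anchor["href"])  # resolve relative links
        if urlparse(url).netloc == host and url not in links:
            links.append(url)
    return links


def summarize_article(url):
    """Download one article and return (title, summary, polarity)."""
    from newspaper import Article  # provided by the newspaper3k package
    from textblob import TextBlob

    article = Article(url)
    article.download()
    article.parse()
    article.nlp()  # builds article.summary; needs NLTK tokenizer data
    polarity = TextBlob(article.text).sentiment.polarity  # -1.0 to 1.0
    return article.title, article.summary, polarity


def describe_polarity(score, threshold=0.1):
    """Map a polarity score to a label (the threshold is an arbitrary choice)."""
    if score > threshold:
        return "positive"
    if score < -threshold:
        return "negative"
    return "neutral"
```

A caller would fetch the front page HTML, pass it to `extract_article_links`, and then call `summarize_article` on each link before printing the title, summary, and `describe_polarity` label.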

Quick Start ⚡

If you have Docker installed, you can run this application on your own machine with just 2 steps!

Pull the image from Docker Hub

docker pull smhussain5/politico-python

Then run the image as an interactive Docker container

docker run --rm -it smhussain5/politico-python

Technologies Used ⚙

  • Beautiful Soup
  • Newspaper3k
  • NLTK
  • PyCharm
  • Python
  • TextBlob

Challenges 💢

This was a straightforward application, but it required proper organization to keep the code clean. Furthermore, the Newspaper3k library was unable to collect every article, and the NLP, in its current state, provides only adequate summaries.
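
One way to cope with articles that cannot be collected is to skip them and keep going. The sketch below is a generic pattern under that assumption, not a description of how this application handles failures. `collect_articles` and its `fetch` parameter are hypothetical names, where `fetch` stands for whatever function downloads and summarizes a single article.

```python
def collect_articles(urls, fetch):
    """Apply fetch to each URL, recording failures instead of aborting."""
    collected, skipped = [], []
    for url in urls:
        try:
            collected.append((url, fetch(url)))
        except Exception as error:
            # Broad on purpose: any download or parsing problem skips the URL.
            skipped.append((url, str(error)))
    return collected, skipped
```

Returning the skipped URLs alongside the results makes it possible to tell the reader how many stories could not be retrieved.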

Insights 💭

In fewer than 100 lines of code, I was able to scrape Politico and use NLP techniques to summarize the scraped articles. This is a great feat and demonstrates the power of these Python libraries. Potential refactoring may include using more accurate NLP libraries and a web framework like Django for better presentation.

