Async Web Crawler for Website Title and Favicon

This project is a fully asynchronous web server built with FastAPI and HTTPX. Given a website URL, it retrieves the site's title and favicon.

Features

Async Architecture: Fully asynchronous web server using FastAPI.
HTTP Requests: Handled using HTTPX.
Caching: Implemented using Redis to cache the results for a limited time, avoiding duplicate requests.
Favicon: The favicon is fetched, saved, and returned as a file URL.
Design Pattern: Follows the MVC (Model-View-Controller) design pattern.
Database: PostgreSQL is used as the database.
Docker: Docker is used to run the application, Redis, and PostgreSQL.
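
The caching feature can be illustrated with a minimal sketch. It is not the project's actual code: an in-memory `TTLCache` class stands in for Redis so the example runs with the standard library alone, and the names `TTLCache`, `get_site_info`, and the `fetch` callback are hypothetical. It only shows the idea of answering repeat requests for the same URL from the cache until the entry expires.

```python
import asyncio
import time


class TTLCache:
    """In-memory stand-in for Redis (an assumption): entries expire after ttl_seconds."""

    def __init__(self, ttl_seconds):
        self._ttl = ttl_seconds
        self._entries = {}  # key -> (expires_at, value)

    def get(self, key):
        entry = self._entries.get(key)
        if entry is None:
            return None
        expires_at, value = entry
        if time.monotonic() >= expires_at:
            # The entry is too old: drop it and report a cache miss.
            del self._entries[key]
            return None
        return value

    def set(self, key, value):
        self._entries[key] = (time.monotonic() + self._ttl, value)


async def get_site_info(url, cache, fetch):
    """Return cached data for url, calling the async fetch(url) only on a miss."""
    cached = cache.get(url)
    if cached is not None:
        return cached
    result = await fetch(url)
    cache.set(url, result)
    return result
```

With a real Redis instance, the same effect is usually obtained by writing each key with an expiry time so that Redis discards stale entries itself.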

How It Works

Request: The server accepts a website URL as input.
Title: It retrieves and returns the title of the website.
Favicon: The server fetches and saves the website's favicon, returning a file URL.
Caching: The URL and its corresponding data are cached in Redis for a limited time to prevent redundant requests.
Docker: The server runs in a Docker container along with Redis and PostgreSQL.
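
The title and favicon steps above could be sketched roughly as follows. This is an illustration under assumptions, not the project's implementation: it uses only the standard library, the names `TitleFaviconParser` and `extract_title_and_favicon` are hypothetical, and falling back to `/favicon.ico` when the page declares no icon is a common convention rather than something this README states. Downloading the page and saving the icon file are left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class TitleFaviconParser(HTMLParser):
    """Collects the first <title> text and the first <link rel="... icon ..."> href."""

    def __init__(self):
        super().__init__()
        self.title = None
        self.favicon_href = None
        self._in_title = False
        self._title_parts = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "title" and self.title is None:
            self._in_title = True
        elif tag == "link" and self.favicon_href is None:
            # rel may hold several tokens, e.g. "shortcut icon".
            rel_tokens = (attr.get("rel") or "").lower().split()
            if "icon" in rel_tokens and attr.get("href"):
                self.favicon_href = attr["href"]

    def handle_data(self, data):
        if self._in_title:
            self._title_parts.append(data)

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self.title = "".join(self._title_parts).strip()


def extract_title_and_favicon(page_url, html):
    """Return (title, absolute favicon URL) for an already downloaded page."""
    parser = TitleFaviconParser()
    parser.feed(html)
    parser.close()
    # Assumed fallback: /favicon.ico at the site root when no icon link exists.
    favicon_url = urljoin(page_url, parser.favicon_href or "/favicon.ico")
    return parser.title, favicon_url
```

In the server itself, the HTML would come from an HTTPX request, and the resolved favicon URL would be fetched and saved before a file URL is returned.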

Getting Started

Prerequisites

Git, Docker, and Docker Compose (the installation steps use the git and docker-compose commands).

Installation

Clone the repository:

git clone https://github.com/yourusername/yourproject.git
cd yourproject

Start the application, Redis, and PostgreSQL in the background:

docker-compose up -d

Technologies Used

Python: Core programming language
FastAPI: Web framework
Pydantic: Data validation and JSON serialization
SQLAlchemy: ORM
HTTPX: Asynchronous HTTP requests
Redis: Caching layer
PostgreSQL: Database
Docker: Containerization

Repository: AmirEspahbodi/url_crawler