
Handover Optimization in 5G Networks Using Q-Learning 🛜

This project addresses the challenge of optimizing handover management in 5G networks using Q-Learning, a reinforcement learning technique. In a 5G network, efficient handover management is crucial for ensuring low latency, high throughput, and seamless connectivity. The project models a dynamic grid-based system in which an agent learns to navigate while minimizing penalties (handover failures) and maximizing quality-of-service (QoS) metrics.

Additionally, the project includes a separate Q-Learning implementation (Q_learning.py) to benchmark and compare the handover optimization results with classical reinforcement learning tasks, demonstrating the model's versatility and effectiveness.


Associated Video Link: Click here

Report PDF File: Click here

Brief Project Description: Click here


🧾 Course: EC431 - 5G Communication and Network

👤 Faculty: Dr. Bhupendra Kumar


📚 Table of Contents

  1. Overview
  2. Project Structure
  3. Setup and Installation
  4. How It Works
  5. Modules
  6. Usage
  7. Visualization
  8. Key Features
  9. Future Work
  10. Contributors

✨ Overview

In a 5G network, a handover is the critical operation in which user equipment (UE) switches from one cell tower (antenna) to another to maintain connectivity. Improper handovers can result in dropped connections and degraded QoS. This project models the scenario using reinforcement learning:

  • 5G Antenna Cells: Represented as grid cells with overlapping signal ranges.
  • Rewards and Penalties: Based on the agent’s decisions regarding handovers.
  • Reinforcement Learning: The Q-Learning algorithm trains an agent to optimize the handover process dynamically.

The project also includes a benchmarking module (Q_learning.py) to compare the results of the handover optimization system with a simpler, well-known reinforcement learning task, such as MountainCar.


🔖 Project Structure

.
├── agent.py                    # Implements the Q-Learning algorithm for handover decisions.
├── gridworld.py                # Models the 5G grid environment with antennas.
├── main.py                     # Main script to train and evaluate the handover optimization agent.
├── plotter.py                  # Generates plots for performance analysis.
├── Q_learning.py               # A standalone implementation of Q-Learning on a classic task for comparison.
├── requirements.txt            # Lists the dependencies required for the project.
├── metrics_plots/              # Stores the training metrics for the handover optimization system.
│   ├── rewards_per_episode.png # Tracks rewards earned per episode.
│   ├── handovers_per_episode.png # Tracks the number of handovers.
│   ├── cumulative_success.png  # Tracks the cumulative success rate.
│   ├── steps_taken_per_episode.png # Tracks the steps taken per episode.
│   ├── cumulative_rewards.png  # Tracks the cumulative rewards over episodes.
│   ├── histogram_of_handovers.png # Displays the distribution of handovers.
│   └── epsilon_decay.png       # Shows the epsilon decay over episodes.
├── qlearning_plots/            # Stores plots for Q-Learning experiments.
│   └── plot.png                # Summarizes Q-Learning performance for standalone tasks.


🛠️ Setup and Installation

Prerequisites

  • Python 3.8+
  • Required libraries: numpy, matplotlib, gym, pyglet

Installation Steps

  1. Clone the repository:

    git clone <repository_url>
    cd <repository_name>
  2. Install dependencies:

    pip install -r requirements.txt
  3. Run the training script:

    python main.py
  4. Run the benchmarking script (optional):

    python Q_learning.py

❓ How It Works

Environment Setup

  • 5G Grid Model:

    • A grid where each cell represents a 5G antenna range.
    • Antennas have overlapping coverage zones.
  • Agent Movement:

    • The agent navigates through the grid while switching between antenna cells.
    • Penalties are applied for unnecessary handovers to incentivize efficient behavior.
  • Dynamic States:

    • Signal strength varies dynamically.
    • The agent must adapt to changing conditions to maintain optimal connectivity.
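
To make this concrete, the following is a minimal sketch of what such an environment might look like. The class and method names (HandoverGrid, signal_strength, step), the reward values, and the fading model are illustrative assumptions, not the actual gridworld.py API:

    import numpy as np

    class HandoverGrid:
        def __init__(self, size=10, n_antennas=4, seed=0):
            self.size = size
            self.rng = np.random.default_rng(seed)
            # Fixed antenna positions; coverage overlaps because signal
            # strength decays smoothly with distance rather than cutting off.
            self.antennas = self.rng.integers(0, size, size=(n_antennas, 2))
            self.pos = np.array([0, 0])
            self.serving = 0  # index of the currently serving antenna

        def signal_strength(self, antenna):
            # Toy model: strength decays with distance plus random fading,
            # so conditions change dynamically as the agent moves.
            d = np.linalg.norm(self.pos - self.antennas[antenna])
            return 1.0 / (1.0 + d) + self.rng.normal(0, 0.05)

        def step(self, move, target):
            # Move the agent one cell, then serve from the chosen antenna.
            moves = {0: (0, 1), 1: (0, -1), 2: (1, 0), 3: (-1, 0)}
            self.pos = np.clip(self.pos + moves[move], 0, self.size - 1)
            reward = self.signal_strength(target)
            if target != self.serving:
                reward -= 0.5  # penalty for every handover
                self.serving = target
            return tuple(self.pos), self.serving, reward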

Training with Q-Learning

  • Q-Values: Represent the expected rewards for each action in a given state.
  • Rewards: Encourage successful handovers and sustained connectivity.
  • Penalties: Applied for failed or unnecessary handovers.
  • Epsilon Decay: Balances exploration and exploitation during training.
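
In tabular form, the core of this training loop boils down to the standard Q-Learning update and a multiplicative epsilon schedule, sketched below. The hyperparameter values and action count are illustrative, not the project's actual settings:

    import numpy as np
    from collections import defaultdict

    n_actions = 4                        # illustrative action count
    alpha, gamma = 0.1, 0.99             # learning rate and discount factor
    epsilon, eps_min, eps_decay = 1.0, 0.05, 0.995
    Q = defaultdict(lambda: np.zeros(n_actions))

    def q_update(state, action, reward, next_state):
        # Q-Learning target: r + gamma * max_a' Q(s', a').
        target = reward + gamma * np.max(Q[next_state])
        Q[state][action] += alpha * (target - Q[state][action])

    # After each episode, decay epsilon so the agent gradually shifts
    # from exploration toward exploitation.
    epsilon = max(eps_min, epsilon * eps_decay)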

Benchmarking with Q_learning.py

  • Implements Q-Learning for the MountainCar environment to evaluate the algorithm's adaptability and performance in a simpler, static setup.
  • Generates comparative performance insights, highlighting the unique challenges of dynamic 5G environments.
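
A benchmark of this kind typically discretizes MountainCar's continuous state so that tabular Q-Learning applies. The sketch below assumes the classic Gym API (pre-0.26, where reset returns the observation and step returns four values); the bin count and hyperparameters are illustrative, not taken from Q_learning.py:

    import gym
    import numpy as np

    env = gym.make("MountainCar-v0")
    n_bins = 20
    low, high = env.observation_space.low, env.observation_space.high

    def discretize(obs):
        # Map the continuous (position, velocity) pair to grid indices.
        ratios = (obs - low) / (high - low)
        return tuple(np.clip((ratios * n_bins).astype(int), 0, n_bins - 1))

    Q = np.zeros((n_bins, n_bins, env.action_space.n))
    alpha, gamma, epsilon = 0.1, 0.99, 0.1

    for episode in range(2000):
        state = discretize(env.reset())
        done = False
        while not done:
            # Epsilon-greedy action selection.
            if np.random.random() < epsilon:
                action = env.action_space.sample()
            else:
                action = int(np.argmax(Q[state]))
            obs, reward, done, _ = env.step(action)
            next_state = discretize(obs)
            Q[state][action] += alpha * (
                reward + gamma * np.max(Q[next_state]) - Q[state][action]
            )
            state = next_state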

🛡️ Modules

1. agent.py

Defines the Q-Learning agent responsible for:

  • Selecting actions using an epsilon-greedy policy.
  • Updating Q-values dynamically.
  • Handling antenna selection and handover decisions.
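
For reference, an epsilon-greedy selection of the kind described above can be written in a few lines (the function name and signature here are hypothetical):

    import numpy as np

    def choose_action(Q, state, n_actions, epsilon):
        # Explore with probability epsilon; otherwise exploit the
        # highest-valued action for this state.
        if np.random.random() < epsilon:
            return np.random.randint(n_actions)
        return int(np.argmax(Q[state]))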

2. gridworld.py

Models the 5G network environment, including:

  • Antenna configurations and signal ranges.
  • State transitions and movement rules.
  • Reward and penalty mechanisms.

3. main.py

  • Initializes the environment and agent.
  • Runs the training process.
  • Logs metrics for analysis.

4. plotter.py

Generates plots for:

  • Episode rewards.
  • Handover efficiency.
  • Steps taken per episode.
  • Epsilon decay and other metrics.
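
As an illustrative sketch, one such plot could be generated as follows; the function name is hypothetical, but the output path matches the metrics_plots/ layout shown in the project structure:

    import os
    import matplotlib.pyplot as plt

    def plot_rewards(rewards_per_episode, out_dir="metrics_plots"):
        # Save the per-episode reward curve alongside the other metrics.
        os.makedirs(out_dir, exist_ok=True)
        plt.figure()
        plt.plot(rewards_per_episode)
        plt.xlabel("Episode")
        plt.ylabel("Total reward")
        plt.title("Rewards per Episode")
        plt.savefig(os.path.join(out_dir, "rewards_per_episode.png"))
        plt.close()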

5. Q_learning.py

A standalone Q-Learning implementation for benchmarking on tasks like MountainCar.


💻 Usage

  1. Run the Training Script:

    python main.py
  2. Run the Benchmarking Script (optional):

    python Q_learning.py
  3. View Plots:
    Analyze the generated plots in the metrics_plots/ and qlearning_plots/ folders for performance insights.


🖼️ Visualization

Generated plots include:

  • Rewards per Episode: Tracks agent performance improvements.
  • Handovers per Episode: Tracks the number of handovers, highlighting unnecessary ones.
  • Steps Taken per Episode: Monitors navigation efficiency.
  • Cumulative Success: Tracks overall success rate across episodes.
  • Epsilon Decay: Observes the exploration-exploitation strategy.

From Q_learning.py:

  • A single plot showing the reward progression for a classical RL task, aiding in comparing results with the 5G optimization system.

⚡ Key Features

  1. Dynamic Handover Management

    • Models realistic 5G network scenarios with overlapping antenna coverage.
  2. Optimized Performance

    • Uses Q-Learning to minimize unnecessary handovers and optimize connectivity.
  3. Benchmarking Flexibility

    • Includes a classical RL task (MountainCar) for performance comparison.
  4. Detailed Visualization

    • Comprehensive plots for evaluating the agent’s performance.
  5. Reinforcement Learning Adaptability

    • Explores the application of RL to both dynamic 5G problems and static benchmark environments.

⏭️ Future Work

  • Multi-agent extension for collaborative tasks.
  • Integration of Deep Q-Networks (DQN) for deep reinforcement learning.
  • Obstacle dynamics to simulate real-world grid environments.

🙋‍♂️ Contributors

For queries or discussions, feel free to reach out via email. Happy coding! ❤️

