GitHub - almog-gueta/VQA: VQA project- Deep Learning course assignment @Technion

This project is an assignment for Deep Learning course. The topic of the project is multimodel problems- specifically, visual question answering (VQA).

In the repo you can find the instructions for the assignment, and a report of what we have done in the project.

Our proposed model is an ensemble of 3 models:

no pretrained model with 8 CNN layers
pretrained autoEncoder with 4 CNN layers
pretrained autoEncoder with 8 CNN layers

main.py is reproducing the train of all 3 models.

evaluate_hw2.py initializes all 3 models, loads the trained model_dicts, creates the dataset and calculates the soft accuracy of the ensemble.

Note 1: main.py and evaluate_hw2.py are running the entire preprocess on creating and preprocessing the images and texts- it takes some time.. Note 2: for convenience, the saved models are inside the folder 'saved models'. evaluate_hw2.py loads the model from this folder

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
autoencoder_saved_models		autoencoder_saved_models
models		models
nets		nets
saved_models		saved_models
utils		utils
HW2_VQA_instructions.pdf		HW2_VQA_instructions.pdf
README.md		README.md
cfg.yaml		cfg.yaml
convolution_autoencoder.py		convolution_autoencoder.py
dataset.py		dataset.py
evaluate_hw2.py		evaluate_hw2.py
main.py		main.py
report.pdf		report.pdf
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Contributors 2

Languages

almog-gueta/VQA

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages