This repository contains the authors' demonstration implementation of the group equivariant Ambisonic signal processing DNNs described in [1].
This repository (except submodules) is released under a specific license. Read the LICENSE file in this repository before you download and use this software.
The submodule `seld-dcase2019` is distributed under its own license. The script `fCGModule.py` is taken from zlin7/CGNet, which was originally released under the MIT License.
```
.
├── LICENSE
├── README.md
├── adversarial_attack.py
├── article_figure
│   └── taslp
├── boot_tensorboard.sh
├── checkpoints
├── dcase19_dataset.py
├── docker
│   ├── Dockerfile
│   └── build.sh
├── evaluation.py
├── fCGModule.py
├── feature_extraction.py
├── login_torch_sh.sh
├── main.py
├── math_util.py
├── models.py
├── modules.py
├── parameter.py
├── render_taslp_fig3.py
├── render_taslp_fig4.py
├── result
├── ret_adv
├── ret_eval
├── run_adversarial_attack.sh
├── run_experiment.sh
└── seld-dcase2019
```
We assume an environment in which `docker/Dockerfile` works appropriately.
- Clone this repository.

  ```sh
  git clone --recursive https://github.com/nttrd-mdlab/group-equiv-seld
  cd group-equiv-seld
  ```
- Build the Docker environment.

  ```sh
  $ cd docker
  $ ./build.sh
  > ...
  > Successfully built 31cc484c9976
  > Successfully tagged cgdcase:0.2
  $ cd ../
  ```
- Download the dataset files from the link on this website. You need `foa_dev.z**`, `metadata_dev.zip`, `foa_eval.zip`, and `metadata_eval.zip`. Then generate the normalized dataset using `feature_extraction.py` (do not forget to rewrite the path to the downloaded files in `feature_extraction.py`; see the sketch after the commands below).

  ```sh
  ./login_torch_sh.sh
  python3 feature_extraction.py
  exit
  ```
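  As a rough guide, the path rewrite in `feature_extraction.py` amounts to pointing a couple of directory variables at the downloaded data. The sketch below is hypothetical: the variable names (`dataset_dir`, `feature_dir`) are placeholders, and the actual names in the script may differ.

  ```python
  # Hypothetical sketch only -- the variable names in the real feature_extraction.py may differ.

  # Directory containing the unzipped foa_dev / foa_eval / metadata_dev / metadata_eval data.
  dataset_dir = '/path/to/dcase2019_task3_dataset'

  # Directory where the normalized features will be written.
  feature_dir = '/path/to/normalized_features'
  ```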
- Start model training.

  ```sh
  ./run_experiment.sh 0  # Specify the GPU number (0-origin) as the argument
  ```

  The trained model is saved to `./checkpoints`, and the log is saved to `./result`.
- Change the experiment conditions by rewriting `parameter.py` and re-running `./run_experiment.sh` (a sketch of the relevant settings follows this list):
  - Toggle `model=['Conventional', 'Proposed'][1]` to `[0]` to test the baseline model.
  - Toggle `scale_equivariance=True` to `False` to disable the scale equivariance of the proposed method.
  - Switch `train_rotation_bias=['virtual_rot', 'azi_random', None][0]` to `[1]` to enable rotational data augmentation.
  - Rewrite `feature_phase_different_bin=0` to `None` to disable the time translation invariance of the proposed method.
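  Taken together, the switches described above correspond roughly to the following lines in `parameter.py`. This is a sketch based only on the settings listed here; the rest of the file is not shown and may differ.

  ```python
  # Sketch of the experiment switches in parameter.py described above.
  # Only these four assignments are taken from this README; surrounding code may differ.

  # Index 1 ('Proposed') selects the group equivariant model; use index 0 for the baseline.
  model = ['Conventional', 'Proposed'][1]

  # Set to False to disable scale equivariance of the proposed method.
  scale_equivariance = True

  # Index 0 = virtual rotation; switch to index 1 ('azi_random') for rotational data augmentation.
  train_rotation_bias = ['virtual_rot', 'azi_random', None][0]

  # Set to None to disable time translation invariance of the proposed method.
  feature_phase_different_bin = 0
  ```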
- Check and compare performance.

  Evaluate a trained model:

  ```sh
  $ ./login_torch_sh.sh 0
  $ python3 evaluation.py --resume ./checkpoints/(name of checkpoint file).checkpoint
  $ exit
  ```

  Compare the progress of trained (or currently training) models:

  ```sh
  $ ./boot_tensorboard.sh
  ```

  Then view http://localhost:6006 with your browser.
- Render the figures in the paper:

  ```sh
  $ ./login_torch_sh.sh 0
  $ python3 render_taslp_fig3.py
  $ python3 render_taslp_fig4.py
  $ exit
  ```
- Run the adversarial attack experiment.

  ```sh
  $ ./run_adversarial_attack.sh 0 ./checkpoints/(name of checkpoint file).checkpoint (output file name)
  ```
- [1] R. Sato, K. Niwa, and K. Kobayashi, "Ambisonic Signal Processing DNNs Guaranteeing Rotation, Scale and Time Translation Equivariance," IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021 (to be published).