This work was published in Aquatic Conservation: Marine and Freshwater Ecosystems (link) and can be cited as follows:
Morand, G., Dixon, S. & Le Berre, T. (2022). Identifying key factors for coral survival in reef restoration projects using deep learning. Aquatic Conservation: Marine and Freshwater Ecosystems, 32(11), 1758–1773. https://doi.org/10.1002/aqc.3878
*A typical coral frame. Photo credit: Simon Dixon*
- Introduction
- Training the fragment detection model
- Training the frame structure model
- Frame analysis
- Notes
- Statistics and Results
*Illustration of the analysis process*
With hundreds of thousands of monitoring pictures, we have one of the most extensive databases on coral restoration in the world. The goal of this project is to extract the information contained in these pictures, so that we can analyze the data to better understand coral growth and resilience.
This repository contains all the code we wrote to analyze our coral frame pictures and produce the results for the paper.
All necessary files are in the Fragment detection folder.
- Create annotations with the Sloth tool.
- Open the Annotations formatting Jupyter notebook, check the file paths and run all cells (except the last JSON file handling part).
- Copy all TFRecord files from the TFRecords folder to your training data folder in data/frags.
- Run the Start training bash file.
- When the loss stabilizes, terminate training. Check the number of the last checkpoint in models/fgvc (a sketch of this step follows the list) and update the Fragment detection/Export model file with this number.
- Run the Export model bash file.
- Update the model number in Frame analysis/main.py.
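For the checkpoint step above, here is a minimal sketch (assuming TensorFlow 1.x and the models/fgvc folder mentioned in the steps) of retrieving the last checkpoint number programmatically; the checkpoint path layout follows TensorFlow's defaults and may differ in your setup.

```python
# Minimal sketch: find the number of the last checkpoint in models/fgvc (TF 1.x).
# tf.train.latest_checkpoint returns a path such as "models/fgvc/model.ckpt-123456".
import re
import tensorflow as tf

ckpt_path = tf.train.latest_checkpoint("models/fgvc")
if ckpt_path is None:
    print("No checkpoint found in models/fgvc")
else:
    last_step = int(re.search(r"ckpt-(\d+)", ckpt_path).group(1))
    print("Last checkpoint:", ckpt_path, "-> step", last_step)
```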
All necessary files are in the Frame segmentation folder.
- Annotate pictures on labelbox.com, then export and download the JSON file containing links to the masks.
- Open the "Train Unet" Jupyter notebook and, after creating the necessary folders, run the first cell. It might need to be run several times if HTTP errors arise.
- Check that all masks are there and run the second cell.
- Check that all pictures and masks are present in "unet_padded".
- In the "Train" cell, manually update the "next step" (0 for a fresh run, or last_step + 1 to resume). The last_step should be visible at the end of the output, or can be obtained in TensorBoard. Then run the cell.
- Follow the training by running TensorBoard.
- When the loss stabilizes, terminate training and export the model by copying the three model.ckpt.* files to models_folder/unet-XXXX, updating the name with the last step number (a sketch of this step follows the list).
- Update the model number in Frame analysis/main.py.
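For the export step above, here is a minimal sketch of copying the three model.ckpt.* files into models_folder/unet-XXXX. The source folder name ("unet_trained") is an assumption; adapt both paths to wherever tf_unet wrote its checkpoints.

```python
# Minimal sketch of the export step above. "unet_trained" is an assumed source
# folder; replace last_step with the value shown by the training output or TensorBoard.
import glob
import os
import shutil

last_step = 25000  # placeholder value
src_dir = "unet_trained"
dst_dir = os.path.join("models_folder", "unet-{}".format(last_step))
os.makedirs(dst_dir, exist_ok=True)

for ckpt_file in glob.glob(os.path.join(src_dir, "model.ckpt.*")):
    shutil.copy(ckpt_file, dst_dir)
    print("copied", ckpt_file, "->", dst_dir)
```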
All the data is stored in the database, accessible on SEANOE. It contains seven tables (a small example of reading them follows the list):
- FrameParams: Camera parameters which define the position of the frame on each monitoring picture.
- Annotations: All the coral fragments detected on every monitoring picture. This is specific to a frame, a date, and a view.
- Observations: The annotations, consolidated for the whole frame. Specific to a frame and a date, but not to a view. Every observation is linked to one or two annotations.
- Fragments: The observations, consolidated across all dates. Specific to a frame only. Every fragment is linked to one or more observations.
- Status: The status of each fragment at each monitoring date (starting from its first detection).
- FSFrames: This is the list of coral frames.
- FSMonitoring: This is the list of monitoring sessions, with associated picture filenames.
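Once the SEANOE database has been imported into a local MySQL server, the tables can be inspected with any MySQL client. The sketch below uses mysql-connector-python; the credentials and database name are placeholders, and only the table names come from the list above.

```python
# Minimal sketch: count the rows in each of the seven tables listed above.
# Host, user, password and database name are placeholders; adjust to your import.
import mysql.connector

TABLES = ["FrameParams", "Annotations", "Observations", "Fragments",
          "Status", "FSFrames", "FSMonitoring"]

conn = mysql.connector.connect(host="localhost", user="user",
                               password="password", database="coral_frames")
cur = conn.cursor()
for table in TABLES:
    cur.execute("SELECT COUNT(*) FROM {}".format(table))
    print(table, cur.fetchone()[0])
cur.close()
conn.close()
```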
- Fragment detection: We run the fgvc-101 CNN on all pictures to detect coral colonies, each classified as Pocillopora, Acropora, or Dead coral. The resulting sets of annotations are inserted into the database.
- Frame detection: We run the Unet model on all pictures to detect the frames. This outputs binary masks, which are temporarily stored in unet_masks.npy.
- Frame position: We try random camera parameters to find the best match with the masks. The best-fit parameters are inserted into the database.
- Observation creation: We match the annotations with specific bars of the frame. We start with the bars in front of each view: H00: H00, H03: H02 & H04, H06: H06, H09: H08 & H10. We then add the potentially missing annotations by analyzing secondary viewpoints: H00: H10 & H02, H06: H04 & H08 (this mapping is written out as a sketch after this list). The resulting observations are inserted into the database.
- Fragment creation: We analyze all observations from a frame to match them and link successive observations of the same fragment. The resulting fragments are inserted into the database.
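The viewpoint-to-bar mapping used in the observation creation step can be written out as plain Python dictionaries. This is purely illustrative; the variable names below are ours and do not appear in the code.

```python
# Illustrative only: the mapping described in the observation creation step.
# Keys are viewpoints, values are the frame bars matched from that viewpoint.
PRIMARY_BARS = {
    "H00": ["H00"],
    "H03": ["H02", "H04"],
    "H06": ["H06"],
    "H09": ["H08", "H10"],
}
# Secondary viewpoints used to recover potentially missing annotations.
SECONDARY_BARS = {
    "H00": ["H10", "H02"],
    "H06": ["H04", "H08"],
}
```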
All the necessary files are in the Frame Analysis folder.
The Python functions are divided into eight files (a sketch of how they fit together follows the list):
- sqlfunctions.py: handles all operations with the database
- inference.py: handles all operations with the deep learning models
- detectframe.py: runs the algorithm to calculate the frame position
- datanalysis.py: all functions necessary to analyze the contents of the frames
- main.py: contains the function to run everything at once
- tools.py: miscellaneous functions for file handling, etc.
- parallel_ops.py: main file that should be called from the command line to run operations
- analyze1by1.py: file that should be called from the command line to automatically run the algorithm on a large number of frames
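As a rough orientation, the sketch below shows how these modules relate in a single-frame run. The imports match the files listed above, but the function names in the comments are hypothetical; the real entry points are main.py, parallel_ops.py and analyze1by1.py.

```python
# Orientation sketch only: how the modules above fit together for one frame.
# Function names in the comments are hypothetical; see main.py for the real flow.
import sqlfunctions   # database I/O
import inference      # fgvc-101 fragment detection and Unet frame detection
import detectframe    # camera parameter fitting for the frame position
import datanalysis    # consolidation into observations and fragments

# annotations = inference.detect_fragments(pictures)                      # hypothetical
# masks = inference.detect_frame(pictures)                                # hypothetical
# params = detectframe.fit_position(masks)                                # hypothetical
# observations, fragments = datanalysis.consolidate(annotations, params)  # hypothetical
# sqlfunctions.insert_results(observations, fragments)                    # hypothetical
```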
To run everything except the Results notebook used for the statistics, significant adaptation work is required, mostly to match the folder paths to your own folder structure.
In this process, we have to edit the pictures both before training and before running the detection models. We have to be careful to edit each picture only once, otherwise the image quality degrades quickly.
As we cannot tell from the database which pictures have already been edited, we assume they have not been; as a result, some pictures end up edited twice.
To prevent overediting, all the pictures are stored unedited, except in two places:
- Final pictures used to visualize results may be edited
- For fragment detection, the folders used to create the TFRecords must contain edited files (handled by the annotation formatting functions).
The bars are named with successive characters according to the following rules:
- V for vertical or H for horizontal
- Face number from 00 to 11. Face 00 is the one with the tag, the next face to the left is 02, and so on. Vertical bars take the odd number between the two faces they are adjacent to.
- Letter: A is the lowest bar, B the next one up, and so on.
Some examples: H02B, V07A, H10C.
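A small helper, not part of the repository, that validates a bar name against these rules and splits it into its parts:

```python
# Illustrative helper (not part of the repository): validate and split a bar name
# such as "H02B" or "V07A" according to the naming rules above.
import re

def parse_bar_name(name):
    match = re.fullmatch(r"([HV])(\d{2})([A-Z])", name)
    if not match:
        raise ValueError("invalid bar name: {}".format(name))
    orientation, face, level = match.group(1), int(match.group(2)), match.group(3)
    if not 0 <= face <= 11:
        raise ValueError("face number must be between 00 and 11")
    if orientation == "H" and face % 2:
        raise ValueError("horizontal bars sit on even-numbered faces")
    if orientation == "V" and not face % 2:
        raise ValueError("vertical bars take the odd number between two faces")
    return orientation, face, level

print(parse_bar_name("H02B"))  # ('H', 2, 'B')
print(parse_bar_name("V07A"))  # ('V', 7, 'A')
```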
We are using Python 3.6 with packages installed with pip, with a few exceptions:
- TensorFlow 1.15 was compiled from source to get the best possible performance on this computer.
- Line 208 of object_detection/utils/visualization_util.py was changed to adapt the font size to the image size.
- tf_unet was downloaded from GitHub.
All the results mentioned in the paper can be calculated from the database using the Results Jupyter notebook. To reproduce our results, we recommend using the database we published on SEANOE rather than generating it again, as regeneration takes several weeks. To do this, you will need to set up a MySQL server, import our database, and update the credentials in the Statistics/sql.py file.
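As an example of what needs adapting, the credentials would look something like the snippet below; the variable names are ours, and the actual names used in Statistics/sql.py may differ.

```python
# Hypothetical example of the credentials to adapt in Statistics/sql.py;
# the variable names and database name are placeholders.
HOST = "localhost"
USER = "your_mysql_user"
PASSWORD = "your_mysql_password"
DATABASE = "coral_frames"
```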