Semi-Supervised Learning for Glacier Front Delineation

Type: MA thesis

Status: running

Date: June 1, 2024 - December 1, 2024

Supervisors: Nora Gourmelon, Vincent Christlein, Andreas Maier

Synthetic aperture radar (SAR) imagery allows year-round monitoring of glacier
movements, regardless of weather conditions. This results in huge amounts of data, making human
evaluation of every image infeasible. Advances in deep learning enable automatic image
segmentation with a variety of models, and the CaFFe benchmark dataset [3]
allows for a proper comparison of different model architectures. The problem of limited labeled
data persists, however: the CaFFe training dataset covers only five different glaciers,
some of which are underrepresented.
To take advantage of unlabeled data, this thesis will apply semi-supervised learning techniques.
Semi-supervised learning is, as the name suggests, a hybrid of supervised
and unsupervised learning that uses both labeled and unlabeled SAR images. After training, the model
calculates, for each pixel in the image, the probability of belonging to each of four classes (glacier,
ocean, rock outcrop, and areas with no information available), resulting in a zone prediction. The
glacier front is then derived from this zone prediction during post-processing. Two of the most
well-known self-supervised learning schemes are iBOT [6] and DINOv2 [5], both of which will be analyzed
and evaluated. Both frameworks will have to be modified according to the guidelines laid out by
Gourmelon et al. [2] so that they can be properly compared with each other and with other models
trained on the CaFFe dataset.
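The step from per-pixel class scores to a zone prediction, and from there to a front, can be illustrated with a minimal sketch. It is not the post-processing of Gourmelon et al. [2]. The class order, the function names, and the rule "a front pixel is a glacier pixel with a 4-connected ocean neighbour" are all assumptions chosen for illustration, and the sketch uses only the Python standard library.

```python
import math

# Assumed class order for this sketch only; not taken from the CaFFe dataset.
CLASSES = ("glacier", "ocean", "rock outcrop", "no information")
GLACIER, OCEAN = 0, 1


def softmax(scores):
    """Turn raw class scores into probabilities that sum to 1."""
    highest = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def zone_prediction(score_map):
    """score_map[row][col] holds four class scores; return the most probable class index per pixel."""
    zones = []
    for row in score_map:
        zone_row = []
        for scores in row:
            probs = softmax(scores)
            zone_row.append(probs.index(max(probs)))
        zones.append(zone_row)
    return zones


def front_pixels(zones):
    """Hypothetical front rule: glacier pixels that touch an ocean pixel (4-connectivity)."""
    height, width = len(zones), len(zones[0])
    front = []
    for r in range(height):
        for c in range(width):
            if zones[r][c] != GLACIER:
                continue
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                rr, cc = r + dr, c + dc
                if 0 <= rr < height and 0 <= cc < width and zones[rr][cc] == OCEAN:
                    front.append((r, c))
                    break
    return front
```

In practice the scores would come from the trained segmentation model, and a real post-processing pipeline would likely also have to handle noise, small misclassified regions, and glacier boundaries that are not part of the calving front.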
The backbone used in both cases is the HookFormer, a model based on the Swin Transformer [4]
that has shown better performance when used for glacier front delineation [1]. It employs two
Transformer models with a cross-resolution interaction between them, using images of different
resolutions that cover the same area.
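How two inputs of different resolution can cover the same area is sketched below, under stated assumptions: a full-resolution target patch and a larger context window around the same centre, average-pooled down to the same pixel size. The function names, the `patch` and `scale` parameters, and the use of average pooling are hypothetical and are not taken from the HookFormer implementation [1].

```python
def crop(image, top, left, size):
    """Cut a size x size window out of a 2D list of pixel values."""
    return [row[left:left + size] for row in image[top:top + size]]


def average_pool(image, factor):
    """Downsample a square 2D list by averaging non-overlapping factor x factor blocks."""
    size = len(image) // factor
    pooled = []
    for r in range(size):
        pooled_row = []
        for c in range(size):
            block = [
                image[r * factor + i][c * factor + j]
                for i in range(factor)
                for j in range(factor)
            ]
            pooled_row.append(sum(block) / len(block))
        pooled.append(pooled_row)
    return pooled


def target_and_context(image, center_row, center_col, patch, scale):
    """Return a full-resolution target patch and a coarser context patch around the same centre.

    Both outputs are patch x patch pixels; the context covers a window that is
    `scale` times larger in each direction (an assumption of this sketch).
    """
    window = patch * scale
    top = center_row - window // 2
    left = center_col - window // 2
    if top < 0 or left < 0 or top + window > len(image) or left + window > len(image[0]):
        raise ValueError("context window does not fit inside the image")
    half = patch // 2
    target = crop(image, center_row - half, center_col - half, patch)
    context = average_pool(crop(image, top, left, window), scale)
    return target, context
```

The target patch keeps fine detail near the front, while the context patch trades resolution for a wider field of view; the model would then exchange information between the two branches.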

