Index

H2OArmor: A Dynamic Data-driven Leak Detection Framework for Varied Digital Maturity Levels in Water Utilities

In response to the pressing need for advanced leak detection in water distribution networks, this research endeavors to develop a sophisticated machine-learning pipeline named H2OArmor. The pipeline is designed to leverage various methods for detecting leakages by utilizing diverse data sources. Crucially, the ensembled opinions of these methods will be intelligently integrated to generate a confidence score for precise event detection.

H2OArmor’s development will be anchored on a robust framework. This framework not only streamlines the implementation of machine learning algorithms but also offers flexibility in onboarding different water utilities. The methodology of the thesis should include multiple machine learning models contributing towards a final informed decision on identifying leak events at DMA level. Furthermore, the thesis scope includes implementation of an end-to-end automated ML Pipeline, which can be used at scale to deploy with minimal manual intervention.

The thesis encompasses several key work packages:

  1. Framework Implementation: Utilization of a robust ML framework to build the Machine Learning pipeline, ensuring efficiency and compatibility. Either there would be a need to develop such a framework from scratch or there would be utilization of components of a pre-built framework.
  2. Development of ML-based Methods: Creation of machine learning methods ensuring accuracy and adaptability.
  3. Automated Onboarding Process: Designing an automated onboarding process for new methods, enhancing the scalability and versatility of H2OArmor as additional techniques are incorporated.
  4. Scoring Mechanism Development: Creation of a scoring mechanism that synthesizes the ensemble opinions of the various methods, providing a unified confidence score for leak detection events.

H2OArmor aims to revolutionize leak detection in water distribution networks by tailoring its approach to the digital maturity levels of water utilities, ensuring optimal performance and reliability across a spectrum of operational contexts.

[1]Fan, X., Zhang, X. & Yu, X.(.B. Machine learning model and strategy for fast and accurate detection of leaks in water supply network. J Infrastruct Preserv Resil 2, 10 (2021). https://doi.org/10.1186/s43065-021-00021-6

Multimodal fusion of pose and visual information for gesture recognition in historical artworks

Multimodal fusion of pose and visual information for gesture recognition in historical artworks

 

Gestures in historical artwork can communicate the underlying human experiences, offering a broad outlook on the past sensory worlds. To explore this domain, we use the SensoryArt [1] – a dataset of multisensory gestures in historical artworks that comes with person pose estimation key points and gesture labels. The goal of the thesis is to perform gesture classification of the persons’ actions depicted on the paintings. We aim to investigate how additional information on the body posture, such as annotated skeleton information, can affect the performance of the models.

 

Mandatory Goals:

  • Train a model for a multi-label gesture classification on the cropped images with fused ground truth heatmaps of the SensoryArt dataset + evaluate on validation split.
  • Selection and training of a well-performing keypoint estimation model.
  • Evaluate the performance of the end-to-end pipeline on the cropped images consisting of predicting the heatmaps first and then classifying.
  • Train another model for a multiperson gesture classification problem on the image level with fused ground truth heatmaps of the uncropped images + evaluate on validation split.
  •  Perform an inference test of the model on original images with machine-generated heatmaps.

 

Optional Goals:

  • Test incorporating additional information on body position, not as heatmaps but as skeleton key point coordinates/angles.
  •  Conduct additional ablations such as cropping the humans out of the images in square size.
  • Integrate a multi-label approach into the detection pipeline.
  • Test human pose estimation on artwork using the additionally provided gesture labels.

[1] Zinnen, M., Christlein, V., Maier, A., & Hussian, A. (2024). SensoryArt (1.0.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.10889613

A Comparative Study of Deep Learning Models for Brain Metastases Autosegmentation

CT Field-of-View Extension Dataset Simulation

Create a simulated dataset for CT FOV extension task using PYRONN

Improving manual annotation of 3D medical segmentation dataset using SAM2

In many medical scenarios, physicians need to annotate pixelwise objects in CT images, whole slide images (WSI), or cellular images. This annotation process often requires a significant amount of time and effort, especially when dealing with large datasets. To address this challenge, a web-based tool capable of automatically segmenting 3D and 2D medical images are widely expected.
EXACT is an existing web-based annotation platform and has already certain user base. Exact supports interdisciplinary collaboration and allows for both online and offline annotation and analysis of images across various domains. Physicians can annotate images directly through the platform’s web interface, which is intuitive and efficient. [1]
To enhance the functionality of Exact, an automatic segmentation plugin is explored and implemented in this thesis and integrate it with Exact. This plugin will enable physicians and researchers to automatically generate high-quality segmentation masks while annotating and save these masks for future use. This approach can significantly improve the efficiency of medical image annotation, reduce manual effort, and optimize medical imaging workflows.
A critical aspect of this project is selecting a segmentation model that is both efficient and accurate. I plan to adopt Segment Anything Model 2 (SAM2), as it has demonstrated robust performance in handling diverse medical imaging tasks (including CT, WSI, and cellular images) while ensuring segmentation precision and reliability. [2]

[1] Christian Marzahl, Marc Aubreville, Christof A. Bertram, Jennifer Maier, Christian Bergler, Christine Kröger, Jörn Voigt, Katharina Breininger, Robert Klopfleisch, and Andreas Maier. Exact: a collaboration toolset for algorithm-aided annotation of images with annotation version control. Scientific Reports, 11(1):4343, Feb 2021.

[2] Nikhila Ravi, Valentin Gabeur, Yuan-Ting Hu, Ronghang Hu, Chaitanya Ryali, Tengyu Ma, Haitham Khedr, Roman Rädle, Chloe Rolland, Laura Gustafson, Eric Mintun, Junting Pan, Kalyan Vasudev Alwala, Nicolas Carion, Chao-Yuan Wu, Ross Girshick, Piotr Doll´ar, and Christoph Feichtenhofer. Sam 2: Segment anything in images and videos. arXiv preprint arXiv:2408.00714, 2024.

Searching for evidence of world models in reinforcement learning agents

Advanced Machine Learning-Based High Demand Forecasting of Household Energy Consumption for Enhancing Grid Operations

This thesis explores forecasting techniques for household energy consumption to help Distribution System Operators (DSOs) manage high demand loads and ensure grid stability. It focuses on predicting when demand exceeds critical thresholds and for how long, enabling proactive energy management. The study analyzes how different data aggregation levels affect forecast accuracy and investigates methods to restore altered load signals for better predictions. By comparing forecasting models and evaluating their performance, the research aims to improve energy management, support automation in grid operations, and enhance data-driven decision-making for a more stable and efficient power distribution system.

Longitudinal Analysis of Parkinson’s Disease Patients Using Natural Language Processing Methods

Removing age bias in the context of pathological speech

Anomaly Detection of Industrial Products using Large Vision Language Models