Index

Improving AFFGANwriting by Exploring Deep Learning Models for Style Encoders and Image Generation of Sentence-Level Handwriting

Handwriting generation is a fundamental task in computer vision and natural language processing, with applications in personalized content generation and so on. The AFFGANwriting model presents a generative framework for synthesizing word-level handwritten images by fusing multistyle features using a GAN-based approach with a VGG-style encoder. However, its scope is limited in two ways:
• It only generates individual word images
• It used a fixed VGG backbone which may not capture style semantics as effectively as more modern alternatives such as CNN and transformer models (e.g. EfficientNet, ResNet, DINO).

With an increasing demand for personalized handwriting synthesis across longer text spans, there’s a clear motivation to explore if advanced backbone models can improve the feature extraction of the style. In addition, there is need to extend the generative capacity from words to full sentences and to interact ideally in a user-friendly interactive system.

Research questions

• Can more recent feature extractors like CNN and transformers (EfficientNet, ResNet, DINO) outperform VGG in capturing style-relevant features for handwriting generation?
• What are the architectural or training modifications required to extend AFFGANwriting from word-level to sentence-level image synthesis?
• How can the model be integrated into an intuitive web application that allows users to select a writing style and input arbitrary text for sentence-level generation?

Goal

To enhance AFFGANwriting’s quality and flexibility in handwriting image generation by:
• Upgrading the style encoder
• Enabling sentence-level synthesis
• Deploying the system as a web app for user interaction

Plug-and-Play Diffusion Models for Magnetic Resonance Imaging

Development of an AI-Based Algorithm for the Correction of Moiré Artifacts in Digital Radiography

Synthetic Non-Contrast CT Angiography Image Generation using Deep Learning Methods

RPA-Bots zur Prozessautomatisierung im Workflow Management der DATEV eG

Advanced Machine Learning Models for Leakage Detection and Localization in Water Distribution Networks Using Real-System Data

 

Reinforcement Learning for Centralized Fault Coordination in Power Systems

Latent Space Modeling for Event Detection in Power Grid Data

This project explores how latent representations learned from raw grid waveforms can reveal underlying structure and enable early detection of abnormal events. By modeling high-frequency voltage and current signals, we aim to distinguish critical disturbances from normal behavior with minimal delay.

Report Generation in pathology using WSIs

This project focuses on developing methods for processing large-scale digital pathology datasets and extracting meaningful features from whole slide images to support automated report generation. Emphasis is placed on efficient handling of gigapixel image data and preparing it for use in vision-language models for clinical applications.

Few-Shot Adaptation of Generalist Vision Models for Gastrointestinal Medical Image Analysis