Index
Improving AFFGANwriting by Exploring Deep Learning Models for Style Encoders and Image Generation of Sentence-Level Handwriting
Handwriting generation is a fundamental task in computer vision and natural language processing, with applications such as personalized content generation. The AFFGANwriting model is a generative framework that synthesizes word-level handwritten images by fusing multi-style features in a GAN-based approach with a VGG-style encoder. However, its scope is limited in two ways:
• It generates only individual word images.
• It uses a fixed VGG backbone, which may not capture style semantics as effectively as more modern CNN and transformer backbones (e.g. EfficientNet, ResNet, DINO).
With increasing demand for personalized handwriting synthesis across longer text spans, there is clear motivation to explore whether more advanced backbone models can improve style feature extraction. In addition, the generative capacity needs to be extended from words to full sentences, ideally within a user-friendly interactive system.
Research questions
• Can more recent CNN and transformer feature extractors (EfficientNet, ResNet, DINO) outperform VGG in capturing style-relevant features for handwriting generation?
• What architectural or training modifications are required to extend AFFGANwriting from word-level to sentence-level image synthesis?
• How can the model be integrated into an intuitive web application that allows users to select a writing style and input arbitrary text for sentence-level generation?
Goal
To enhance AFFGANwriting’s quality and flexibility in handwriting image generation by:
• Upgrading the style encoder
• Enabling sentence-level synthesis
• Deploying the system as a web app for user interaction
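As a point of reference for the sentence-level goal, one naive baseline is to generate each word separately and stitch the word images into a line. The sketch below illustrates only that baseline; it is not the AFFGANwriting method and not a proposed architecture. The function names, the list-of-rows image representation, the bottom alignment, and the fixed gap width are all assumptions made for illustration.

```python
from typing import List

# Assumed convention: an image is a list of rows of grayscale values,
# with 0 = ink and 255 = paper.
Image = List[List[int]]


def pad_to_height(img: Image, height: int, background: int = 255) -> Image:
    """Add background rows on top so the image reaches `height` (bottom-aligned)."""
    width = len(img[0])
    missing = height - len(img)
    return [[background] * width for _ in range(missing)] + [row[:] for row in img]


def compose_sentence(words: List[Image], gap: int = 8, background: int = 255) -> Image:
    """Concatenate word images left to right, separated by `gap` background columns."""
    if not words:
        return []
    height = max(len(w) for w in words)
    padded = [pad_to_height(w, height, background) for w in words]
    line: Image = [[] for _ in range(height)]
    for i, word in enumerate(padded):
        for r in range(height):
            if i > 0:
                # Insert the inter-word space before every word except the first.
                line[r].extend([background] * gap)
            line[r].extend(word[r])
    return line
```

A baseline like this ignores inter-word consistency (baseline drift, slant, spacing habits of the writer), which is the gap a sentence-level model would need to close.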
Plug-and-Play Diffusion Models for Magnetic Resonance Imaging
Development of an AI-Based Algorithm for the Correction of Moiré Artifacts in Digital Radiography
Synthetic Non-Contrast CT Angiography Image Generation using Deep Learning Methods
RPA Bots for Process Automation in Workflow Management at DATEV eG
Advanced Machine Learning Models for Leakage Detection and Localization in Water Distribution Networks Using Real-System Data
Reinforcement Learning for Centralized Fault Coordination in Power Systems
Latent Space Modeling for Event Detection in Power Grid Data
This project explores how latent representations learned from raw grid waveforms can reveal underlying structure and enable early detection of abnormal events. By modeling high-frequency voltage and current signals, we aim to distinguish critical disturbances from normal behavior with minimal delay.
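The general idea of scoring a waveform window by how far its representation lies from normal behavior can be sketched as follows. This is a toy illustration under stated assumptions, not the project's model: `embed` is a hypothetical hand-crafted stand-in (RMS and peak magnitude) for a learned latent encoder, and the reference statistics and distance-based score are likewise assumptions.

```python
import math
from typing import List, Sequence, Tuple


def embed(window: Sequence[float]) -> Tuple[float, float]:
    """Hypothetical stand-in for a learned encoder: maps a window to (RMS, peak magnitude)."""
    rms = math.sqrt(sum(x * x for x in window) / len(window))
    peak = max(abs(x) for x in window)
    return (rms, peak)


def fit_reference(normal_windows: Sequence[Sequence[float]]) -> Tuple[List[float], List[float]]:
    """Per-dimension mean and standard deviation of the embeddings of normal windows."""
    points = [embed(w) for w in normal_windows]
    n = len(points)
    means = [sum(p[d] for p in points) / n for d in range(2)]
    # Fall back to a tiny value if a dimension has zero spread, to avoid division by zero.
    stds = [
        math.sqrt(sum((p[d] - means[d]) ** 2 for p in points) / n) or 1e-9
        for d in range(2)
    ]
    return means, stds


def anomaly_score(window: Sequence[float], means: List[float], stds: List[float]) -> float:
    """Euclidean distance from the normal reference in standardized embedding coordinates."""
    z = embed(window)
    return math.sqrt(sum(((z[d] - means[d]) / stds[d]) ** 2 for d in range(2)))


def sine(amplitude: float, n: int = 64) -> List[float]:
    """One period of a sine wave, used here as a synthetic voltage window."""
    return [amplitude * math.sin(2 * math.pi * k / n) for k in range(n)]


# Synthetic example: fit on near-nominal windows, then score a voltage sag.
means, stds = fit_reference([sine(a) for a in (0.98, 1.0, 1.02)])
sag_score = anomaly_score(sine(0.5), means, stds)
normal_score = anomaly_score(sine(1.0), means, stds)
```

In this synthetic setup the sag window scores far higher than the nominal one; with a learned encoder, the same scoring step would operate on the latent vector instead of the two hand-crafted features.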
Report Generation in Pathology Using WSIs
This project focuses on developing methods for processing large-scale digital pathology datasets and extracting meaningful features from whole slide images to support automated report generation. Emphasis is placed on efficient handling of gigapixel image data and preparing it for use in vision-language models for clinical applications.