This project focuses on developing methods for processing large-scale digital pathology datasets and extracting meaningful features from whole slide images to support automated report generation. Emphasis is placed on efficient handling of gigapixel image data and preparing it for use in vision-language models for clinical applications.