Feature Extraction and Dimensionality Reduction Techniques for Assessing Similarity in Large-Scale 3D CAD Datasets

Type: BA thesis

Status: running

Supervisors: Linda-Sophie Schneider

Work description
The research presented in this thesis explores the application of feature extraction and dimensionality reduction techniques to assess model similarity within large-scale 3D CAD datasets. It investigates how different geometric and topological descriptors can be quantified and utilized to measure the similarity between complex 3D models. Therefore, the study employs advanced machine learning algorithms to analyze and cluster 3D data, facilitating a better understanding of model characteristics and relationships.

During the thesis, the following questions should be considered:

  • What metrics can effectively quantify the variance in a training dataset?
  • How does the variance within a training set impact the neural network’s ability to generalize to new, unseen data?
  • What is the optimal balance of diversity and specificity in a training dataset to maximize NN performance?
  • How can training datasets be curated to include a beneficial level of variance without compromising the quality of the neural network’s output?
  • What methodologies can be implemented to systematically adjust the variance in training data and evaluate its impact on NN generalization?

Prerequisites
Applicants should have a solid background in machine learning and deep learning, with strong technical skills in Python and experience with PyTorch. Candidates should also possess the capability to work independently and have a keen interest in exploring the theoretical aspects of neural network training.

For your application, please send your transcript of record.