C01

C01

Uncertainty quantification in classification with applications in forensic genetics

Project summary

The task of classifying a human DNA trace from a crime spot into (sub-continental) regions is usually referred to as the analysis of biogeographical ancestry. This analysis is complicated by admixture, i.e., the possibility that every genetic marker may originate from a different region. The corresponding amount of uncertainty can be expected to vary depending on the trace, and on the quality of the reference database. Most importantly, test data may have a different distribution than the reference database. To address this small data challenge, we aim for a general theoretical framework for local uncertainty quantification in classification in situations with unequal though suitably similar distributions of training and test data. To allow for this, we will formalize the notion of local uncertainty of decision boundaries, for subsequently developing approaches for obtaining uncertainty estimates and analyzing their properties.

Our methods

  • Data-driven modeling
  • Local perspective

Principal investigator 1

Doctoral researcher position 1

(supervised by PI Lutz-Bonengel)

Tasks

Work related to admixed samples, and all data applications of the developed methods.

Requirements

  • Master’s degree or equivalent in biology or molecular medicine
  • Ideally, experience with NGS/MPS and analysis of data

Principal investigator 2

Doctoral researcher position 2

(supervised by PI Pfaffelhuber)

Tasks

Developing mathematical theory for guiding methods development, in a filtering framework, which will, e.g., provide error bounds and goodness-of-fit approaches for model selection.

Requirements

  • Master’s degree or equivalent in mathematics or statistics
  • Programming skills in, e.g., R or Python
  • Ideally, experience with research in applied probability and/or population genetics
  • Interest in applications of forensic genetics

Principal investigator 3

Doctoral researcher position 3

(supervised by PI Rohde)

Tasks

Fundamental statistical questions, such as transfer learning and nonparametric classifiers.

Requirements

  • Master’s degree or equivalent in mathematics with profound knowledge in mathematical statistics

Administrative Manager

Marc Schumacher

Institute of Medical Biometry and Statistics,
Faculty of Medicine and Medical Center –
University of Freiburg