Calendar

Here you can find a list of SmallData events. All events are open to the public unless otherwise specified. If hybrid options are available, they will be listed in the individual event. All events are in Central European Time (CET) or Central European Summer Time (CEST).

Follow these instructions to join our event mailing list!

Internal Event Only
Any events marked with internal event only are unfortunately only for our SmallData members.

Directions to the IMBI Lecture Hall
When you arrive at Stefan-Meier-Straße 26, enter the building and go up the stairs to the first floor. At the first floor, you will see a glass door. If the door is locked, press the small button located to your left for entry. After entering through the glass door, turn left and walk down the corridor. You will arrive at one of the SmallData Offices. The IMBI Lecture Hall is located to the left of this SmallData Office.

Date
Tags
Search
 

2025

Wednesday, June 25
13:00 – 
14:00

SmallData Seminar Series: Research Presentations from Jana Naue & Martin Wolkewitz

Location: IMBI Lecture Hall, 1st Floor, Stefan-Meier-Straße 26, 79104 Freiburg

In this seminar, our SmallData Associated Researchers will give a 30 minute presentation of their work on small data and explore how it connects with other projects across the Collaborative Research Center. Each seminar will feature an open Q&A session.

Forensic applications using DNA methylation analysis and mathematical prediction models
Jana Naue (SmallData Associated Researcher)
Institute of Forensic Medicine, Medical Center – University of Freiburg

Analyzing trace material is one of the main tasks in a laboratory of Forensic Molecular Biology. The standard method is to determine the individual DNA profile (‘genetic fingerprint’) to connect a trace to a person. However, there is additional information in the DNA that can reveal more information about the circumstances of the crime or help to narrow down the pool of possible contributors if a trace cannot be connected to a reference profile.

In this seminar, I will first shortly introduce general methods and requirements to perform casework in a forensic laboratory.

In the main part, I will present my primary research area, ‘Forensic Epigenetics’, with a closer look at two applications: chronological age prediction and body fluid identification. I will explain the biological background and the current state of the art of analysis and data interpretation for application in a forensic case. Next to the wet lab measurements of DNA methylation levels, dry lab work, including bioinformatics pipelines and mathematical models/ machine learning (e.g., Random Forest regression and classification), is an important step of inferring chronological age and body fluid from trace material.

I will also discuss the performed research on limitations and challenges due to the analysis of forensic material. Especially, technical limitations and stochastic effects due to low quantity and quality of sample materials have to be considered. Furthermore, analyzing only a few markers, having unknown background information, and having missing or challenging reference data can also influence the reliability of data and model interpretation.

Methodological aspects of the emulated target trial approach to optimize treatment strategies for a rare pediatric disease
Martin Wolkewitz (SmallData Associated Researcher) & Derek Hazard
Institute of Medical Biometry and Statistics, Division Methods in Clinical Epidemiology, Faculty of Medicine and Medical Center – University of Freiburg

A significant challenge in the treatment of rare diseases is the need to understand the effects of medical interventions and to identify the subgroups that stand to benefit most from these treatments. The additional complexities introduced by time-varying treatment initiation further compound the challenges inherent to this field.  Randomized controlled trials represent the gold standard for causal inference; however, they are often infeasible due to ethical, logistical, or financial constraints, in addition to a lack of available patients with a rare disease. In view of this challenge, we demonstrate how emulated target trials methods provide an alternative for analyzing such scenarios using data from a study into children with a rare combined immunodeficiency.

The example study is predicated on the assumption that the treatment is beneficial for all patients, provided that they survive the adverse effects of the procedure. This introduces an additional layer of complexity, as some patients may be too ill to derive benefit from the procedure, while others may be too healthy to justify the potential risks associated with the procedure itself. The emulated target trial will be contrasted with multi-state and machine learning approaches in order to compare analysis strategies to identify the most suitable patients for treatment in this highly complex setting.

Friday, June 27
11:00 – 
12:00

SMART Tandem Lecture Series: Rolf Backofen & Toni Cathomen

Location: IMBI Lecture Hall, 1st Floor, Stefan-Meier-Straße 26, 79104 Freiburg

The Tandem Lecture Series, hosted by SMART, highlights collaborative research performed by the Prinicpal Investigators across different disciplines. In each session, PIs will present their joint project, discuss developments since the project began, and share their disciplinary approaches. They will also reflect on their experience of working together, including what has helped and what challenges they faced in building a shared understanding. The sessions will conclude with a Q&A, giving researchers, both early career and beyond, the chance to ask questions and join the conversation.

In this collaborative lecture, Rolf Backofen and Toni Cathomen of Project A05 will be presenting.

Friday, July 04
09:00 – 
12:00

Workshop on approaching uncertainty from the foundations

Location: IMBI Lecture Hall, 1st Floor, Stefan-Meier-Straße 26, 79104 Freiburg

In this workshop we approach uncertainty in various task from the exploratory viewpoint – on the one side we want to gather problems and questions in the small data setting with inherent uncertainty, offering you the possibility to show your questions and research. And on the other side we want to present already existing approaches and methodologies to help you solving your problems and enhancing existing approaches.

Moderator: Thorsten Schmidt (A03)

Wednesday, July 09
13:00 – 
14:30

SmallData Seminar Series: Dealing with heterogeneity and low signal to noise ratio when extracting information from biomedical textual data

Location: IMBI Lecture Hall, 1st Floor, Stefan-Meier-Straße 26, 79104 Freiburg

Large language models (LLM) can deal with information which for long time has been thought to only be processable by humans with a given contextual knowledge. Being neural networks, LLMs learn to quantify context sensitive similarity of textual information, also referred to as attention. Specifically by employing this similarity mechanism on multiple layers of abstraction, LLMs allow to address a certain level of a lack in structure and heterogeneity, e.g., in writing style and employed nomenclature, because similarity can be computed on the level of the semantics.

Still, a lack of structure and heterogeneity poses problems when small amounts of information are hidden in a large text, i.e., a low signal to noise ratio, which is the case for scientific literature on rare diseases or when a large heterogeneity is observed such as in doctors’ reports.

In this workshop we want to identify/develop specific use cases which represent the above described challenges from SmallData and develop approaches which (1) address the challenges and (2) quantify the uncertainty of the extracted information.

We give clinical researchers the opportunity to present their challenge of extracting information from heterogeneous text sources. These can be scientific publications such case reports or doctors’ reports describing the state of individual patients.

Participants from biomedicine describe the data, the specific problem and indicate the hurdles with respect to data protection. In addition they also indicate a panel of experts which is willing to validate the results retrieved from the LLMs.

Participants from computer science and statistical data science then infer feasible solutions for the challenge and develop a methodological research question that is in line with the SmallData objectives.

Goal of the workshop is to establish availability of datasets, specific biomedical use cases and strategies for validation of extracted information.

Moderator: Fabian Kabus (F)

Friday, July 11
11:00 – 
12:00

SMART Tandem Lecture Series: Speakers TBC

Location: IMBI Lecture Hall, 1st Floor, Stefan-Meier-Straße 26, 79104 Freiburg

The Tandem Lecture Series, hosted by SMART, highlights collaborative research performed by the Prinicpal Investigators across different disciplines. In each session, PIs will present their joint project, discuss developments since the project began, and share their disciplinary approaches. They will also reflect on their experience of working together, including what has helped and what challenges they faced in building a shared understanding. The sessions will conclude with a Q&A, giving researchers, both early career and beyond, the chance to ask questions and join the conversation.

The speakers for this collaborative lecture will be announced shortly.

Friday, July 18

No Rules Science Exchange

Location: Internal Event Only

On July 18, we’re hosting the SmallData No Rules Science Exchange focused on networking with the opportunity to check out the SmallData Companion and Cubie, our new chatbot. We’ll kick off with a Companion and Cubie Meet & Greet, followed by informal exchange. This is a space for all SmallData members to connect and share ideas for the next funding period.

Friday, July 25
13:00 – 
16:00

Workshop on synthetic data generation and synthetic expert opinions

Location: IMBI Conference Room, 1st Floor, Stefan-Meier-Straße 26, 79104 Freiburg

This workshop discusses different application settings and concepts of synthetic data in a small data context. We will focus on data types such as temporal and longitudinal data, data with a hierarchical structure, and disease gene prioritization. In addition to discussing whether synthetic data underestimates uncertainty and how to use synthetic experts to assess the quality of generated data, we will demonstrate the models and tools currently in use to generate these data types.

Moderator: Clemens Schächter (A03)

Monday, August 04
09:00 – 
16:00

SMART Master Lab Kick Off (Cohort 2.0)

Location: Internal Event Only

On 4 August, 2025, participants of the SMART Master Lab Cohort 2.0 will gather at the University of Freiburg for the kick-off event. This event offers an opportunity to meet, network, and build connections with fellow SmallData researchers from diverse fields, including biomedical sciences, computer sciences, mathematics, and systems modeling. It’s a chance to lay the foundation for future collaborations, foster interdisciplinary relationships, and kick-start the exciting work ahead in the SMART Master Lab!

Administrative Manager

Marc Schumacher

Institute of Medical Biometry and Statistics,
Faculty of Medicine and Medical Center –
University of Freiburg