Oliver Frank

ANNs as models of human cognition
in mental health and disorder
2026

Keywords: Deep Learning, Digital twin, Mental disorder, EMA, Neuro-AI

This PhD project in theoretical neuroscience was conducted as a collaboration between the Hector Institute for AI in Psychiatry at the Central Institute of Mental Health and Heidelberg University. The objective is to develop a tool that enables objective and precise diagnosis and longitudinal tracking of psychiatric disorders by capturing their dynamic and complex nature, with the ultimate goal of enabling personalized, effective therapy and treatment. The work focused on modeling individual cognitive behavior and its neural correlates using longitudinal, high-resolution behavioral and neuroimaging data.

A central component of the project was the design and implementation of a gamified smartphone application containing a cognitive task battery of twelve tasks spanning decision making, executive functioning, relational processing, and working memory. A one-year longitudinal experiment was conducted with five self-recruited patients who engaged with the application on a near-daily basis. This study was carried out in compliance with the German Medical Device Regulation and resulted in dense, individualized behavioral datasets.

Task difficulty and complexity were adapted on a monthly basis according to each participant’s performance, enabling the systematic capture of inter-individual differences and intra-individual learning dynamics. The tasks yielded detailed behavioral measures, including reaction times as well as qualitative and quantitative error patterns across varying task demands. In parallel, five MRI sessions per participant were conducted throughout the study, and the resulting neuroimaging data were preprocessed using adapted and extended versions of established pipelines (Figure 1).

To support model-based analyses, an extensive preprocessing pipeline for smartphone-based cognitive data was developed, allowing flexible encodings of trial structure and task information tailored to different modeling approaches. Building on this, a complete machine learning pipeline for Artificial Neural Network (ANN) training was implemented in TensorFlow. This pipeline supported a wide range of network architectures, loss and activation functions, initialization schemes, regularization strategies, and other tunable hyperparameters. Extensive hyperparameter grid searches were performed across participants and datasets, enabling the successful fitting of highly complex, real-world human cognitive behavior in a multitask setting (Figure 2).

Beyond predictive performance, the internal representations of the trained networks were systematically analyzed from multiple methodological perspectives to reduce model opacity and identify general organizational principles. Network robustness, internal structure, and autocorrelative dynamics were examined in detail. The analyses revealed robust within-subject results and significant between-subject differences. High test performance was consistently associated with biologically plausible internal representations, particularly under moderate to high sparse L1 regularization. Under appropriate conditions, similarities emerged between global organizational principles of cognitive artificial neural networks and brain imaging–based biological models, providing converging evidence for shared structural constraints across artificial and biological systems.

Manuscript in preparation.
https://github.com/oliver-frank/beRNN_v1
https://github.com/oliver-frank/beRNN_ex

Longitudinal social sensing in patients
with psychosis spectrum disorders
2024

Keywords: LMM, Social interaction marker, Schizophrenia

A study, in which I mainly contributed as a statistical advisor during my time as a research scientist at the Hector Institute for AI in Psychiatry, investigated social interactions between seven psychotic adolescent patients and 18 psychiatric staff members and examined how these interactions were associated with disorder severity over a three-month period. Participants wore time-synchronized sensors that exchanged information every four seconds via radio signals, including participant IDs, proximity, signal strength, and precise timestamps. Based on these data, social interactions were modeled as individual adjacency matrices representing weighted social network graphs, in which nodes corresponded to all involved participants and edges reflected the duration of interactions aggregated over a defined time period (Figure 1). From these networks topological markers of social structure were derived.

To examine the relationship between social interaction variables, network topological measures, and disorder severity, assessed using the Positive and Negative Syndrome Scale (PANSS), were fitted into a linear mixed-effects model with nested participant structure. The analysis revealed that relative daily interaction time, average interaction duration, and the clustering coefficient were each significantly associated with PANSS total scores (Figure 2).

View publication

HMM classification for behavioral user evaluation
2022

Keywords: HMM, Gaze behavior, VR

The project was conducted in the scope of my Master thesis at TU Berlin at the Institute for Biopsychology and Neuroergonomics as part of the Human Factors Data Science track. The examined data was collected from 14 participants using a virtual reality setup with integrated eye tracking (HTC Vive Pro Eye). Participants observed a virtual grid-based environment in which an avatar moved randomly either toward or away from a fixed object. For each movement, participants were instructed to label the avatar’s behavior as either goal-directed or non-goal-directed.

The objective of the study was to determine whether these subjective labels could be differentiated based on gaze behavior fitted to Hidden Markov Models (HMMs). The gaze behavior was specified separately for each participant and label, and model parameters (Priors, Emissions, Transition Matrices) were initialized based on theoretically motivated assumptions.

Parameter optimization was performed using the Expectation Maximization (EM) algorithm, which iteratively maximizes the marginal likelihood of the model given the observed gaze sequences. In this process, posterior state probabilities (computed from the product of prior probabilities and likelihoods) were used to update model parameters according to the expected transition and emission frequencies until convergence (Figure 1).

Finally, the resulting HMM parameters served as features for classification. A Linear Discriminant Analysis (LDA) was conducted using repeated 5-fold cross-validation (10 repetitions) to distinguish between goal-directed and non-goal-directed evaluations. The best average classification accuracy achieved was .63, indicating a measurable differentiation between the two conditions (Figure 2).

https://github.com/oliver-frank/gzHMM

Embedded CNN for smart lock control
2021

Keywords: CNN, Facial detection, Lock control, Rasperry Pi, API

During a semester project at TU Berlin in my master’s program, I developed and trained a Convolutional Neural Network (CNN) to classify whether a person was wearing a face mask. The model enabled real-time image classification and was integrated with a Raspberry Pi connected to a webcam and an electronic lock. When the camera detected a person wearing a mask, the system unlocked the mechanism, and it automatically locked again when a person was detected without a mask. The trained model was compressed using TensorFlow Lite to enable efficient deployment on the Raspberry Pi, which operated locally under limited computational resources.

https://github.com/oliver-frank/eCNN

Data-driven AR application analysis
2020

Keywords: AR application, Frontend development, Data-driven analysis

As a Data Analyst, I designed and prototyped frontend features for a smartphone-based AR application and analyzed user behavior data-driven to enable more intuitive, efficient, and faster human-device interactions. 3DQR develops immersive and interactive augmented reality (AR) experiences for simulation and training, primarily in the medical and industrial machinery sectors.

https://3dqr.de/

ANNs as models of human cognition in mental health and disorder 2026

Longitudinal social sensing in patients with psychosis spectrum disorders 2024

HMM classification for behavioral user evaluation 2022

Embedded CNN for smart lock control 2021

Data-driven AR application analysis 2020

ANNs as models of human cognition
in mental health and disorder
2026

Longitudinal social sensing in patients
with psychosis spectrum disorders
2024

HMM classification for behavioral user evaluation
2022

Embedded CNN for smart lock control
2021

Data-driven AR application analysis
2020