About me

I am currently a postdoc at the division of information science and engineering of KTH Royal Institute of Technology in Stockholm, Sweden. I work in the research group of Prof. Ming Xiao, in collaboration with Prof. Volker Lauschke at the department of Physiology and Pharmacology at Karolinska Institutet. I work as an associated researcher with the neonatal transfusion network.

My interest in research is the design of prediction models contributing to more reliable and trustworthy predictions in clinical contexts. This encompasses the design, implementation and training of deep architectures working from different data modalities, possibly integrating multiple ones. I am also interested in enabling health informatics research with information systems integrating various sources of data generated in hospital contexts.

My main project focuses on the development of graph-based prediction models to quantify the impact of genetic variants of drug transporter proteins. This project is part of an effort toward personalized medicine, ultimately aiming at a fast and reliable identification of suitable chemotherapeutic treatments for cancer patients based on their genetic profile.

I obtained my PhD under the supervision of Assoc. Prof. Saikat Chatterjee, in a collaborative project with Prof. Eric Herlenius at the department of Women’s and Children’s Health at Karolinska Institutet. My thesis is available online.

Projects

On-going

  • Variant effect predictions

  • Neored

  • ITNet: Irregular Timeseries Data Fusion with Attention Mechanisms

(submitted to FUSION 2025) [paper], [code]

Abstract: Timeseries regression from multimodal data is a difficult task when modalities have missing data or different sampling frequencies. Current approaches such as feature fusion often rely on interpolation and might result in either too simplistic or overly computationally expensive algorithms. We propose ITNet, an end-to-end trainable multihead causal cross-attention model adapted for irregularly sampled multimodal timeseries data. We show the performances of the model on synthetic data from 2d linear state-space models with a varying number of modalities, varying data missingness and varying signal-to-measurement-noise ratio (SMNR). We compare with Kalman filters exploiting either one or all available modalities. Provided a sufficient number of modalities and high enough SMNR, ITNet outperforms the closed form Kalman Filter. Importantly, our model achieves these results without assuming knowledge of the state transition matrix. This is of particular importance towards the use of ITNet for practical cases.

Finished

  • DREs: DWC Research Extraction scripts

[code]

Abstract: Data Warehouse Connect (DWC) databases are setup in hospital subnetworks inaccessible to outside research organisations for security reasons. Buying hardware and training inhospital IT staff that can maintain and administrate these databases requires investments not all hospitals can afford. This in turn limits the research that can be performed on the data. To enable data extraction for research for the large number of institutions that use DWC, we developed simple scripts to continuously extract, encrypt and transfer monitor data from DWC. The scripts are designed to be robust to network interruption and un-expected machine shutdown. It is assumed that standard Windows software are provided on the database servers and intermediade machines.

  • DANSE: Data-driven Non-linear State Estimation of Model-free Process in Unsupervised Bayesian Setup

[paper], [code]

Abstract: We propose DANSE – a data-driven non-linear state estimation method. DANSE provides a closed-form posterior of the state of a model-free process, given linear measurements of the state in a Bayesian setup, like the celebrated Kalman filter (KF). Non-linear dynamics of the state are captured by data-driven recurrent neural networks (RNNs). The training of DANSE combines maximum-likelihood and gradient-descent in an unsupervised framework, i.e. only measurement data and no process data are required. Using simulated linear and non-linear process models, we demonstrate that DANSE - without knowledge of the process model - provides competitive performance against model-based approaches such as KF, unscented KF (UKF), extended KF (EKF), and a hybrid approach such as KalmanNet.

  • Convolution based Variational Bayes for Factorial HMM

[manuscript], [code]

Abstract: We propose a method to scale factorial Hidden Markov Models (HMMs) to long and complex time series. Factorial HMMs are suited for time series modeling tasks because of the distributed structure of their latent space. We use a continuous state space relaxation of a factorial HMM and propose a convolution-based variational distribution that can be learned with a gradient-based variational expectation-maximization (EM) algorithm. The number of trainable parameters in our model is independent of the length of the input data. Our model can accommodate different kinds of emission distributions, simple to complex, such as Gaussian and conditional normalizing flows (CNFs). The proposed model is evaluated for synthetic and real world data modeling and generation.