MM-KPNN: Multimodal Knowledge-Primed Neural Network for Interpretable Integration of Single-Cell Data
A reproducible framework for interpretable multimodal learning in single-cell biology
MM-KPNN is an interpretable multimodal deep-learning framework that integrates scRNA-seq, scATAC-seq, and spatial transcriptomics data through biologically constrained neural architectures. Building upon existing multimodal models, MM-KPNN introduces pathway- and transcription-factor-aware connectivity within the network topology, enabling transparent interpretation of how molecular programs and regulatory networks drive predictions. This repository generalizes the original single-cell prototype into a reproducible benchmarking platform for multimodal integration and explainability, supporting expansion to real datasets, regulatory priors, and biological validation.
-
Concept-Bottleneck Architecture
MM-KPNN embeds biological concepts—pathways, transcription factors, and regulatory modules—directly into the network topology. This design constrains latent representations to interpretable biological units, ensuring that every prediction can be traced through meaningful mechanisms. -
Multimodal Integration
Jointly models gene expression, chromatin accessibility, and spatial features within a unified graph framework, linking regulatory and phenotypic layers of single-cell data. -
Mechanistic Interpretability
Provides transparent attribution at both node and edge levels using gradient × input, integrated gradients, and GNNExplainer methods, revealing which molecular programs and regulatory interactions drive model decisions. -
Reproducible and Scalable Infrastructure
Built as a modular, end-to-end pipeline compatible with HPC and cloud environments, supporting standardized data handling, evaluation metrics, and reproducible experiment tracking. -
Extensible Framework Family
Serves as the foundation for derivative architectures—including SpatialMMKPNN, Perturbation-MMKPNN, and DrugResponse-GNN—which apply the same interpretable design to spatial, perturbational, and pharmacogenomic contexts.
The core framework is implemented in:
scripts/Pipeline_mm-kpnn.ipynb– complete MM-KPNN training and evaluation pipelinescripts/Prototype.ipynb– compact version for experimentation or adaptation
Model Architecture
- Encoder: modality-specific encoders (RNA, ATAC, spatial) projecting inputs to a shared latent representation.
- Knowledge-Constrained Decoder: biologically structured layers mapping latent features to Reactome/KEGG pathways, then to TFs and genes.
- Attribution Module: computes node- and edge-level relevance using gradient × input, integrated gradients, and GNNExplainer methods.
- Evaluation Suite: includes benchmarking tools for interpretability fidelity, modality alignment, and biological consistency.
The notebooks in this repository are written to maximize clarity and interpretability, not just execution. Each pipeline includes detailed in-line documentation explaining the scientific rationale, assumptions, and decisions behind each step — from data preprocessing to model interpretation.
Key design principles:
- Transparent logic – Each analytical step links computational choices to biological motivation.
- Integrated results – Figures, metrics, and outputs are embedded directly in the workflow to ensure interpretability.
- Modular organization – Code follows the structure of the stepwise guide in
docs/README.pdf, enabling selective reuse or adaptation.
This approach transforms the repository into a reproducible and interpretable record of model reasoning, adding scientific value beyond conventional scripts.
MM-KPNN demonstrates how biologically informed architectures can achieve accurate, interpretable, and transferable multimodal integration across diverse single-cell datasets.
Model performance and interpretability outcomes include:
- Multimodal alignment – joint RNA–ATAC embeddings preserve cell-type topology and improve separation by molecular identity.
- Pathway-level attribution – interpretable activation patterns reveal dominant signaling and regulatory programs (e.g., TGF-β, WNT, and interferon responses).
- Regulatory driver identification – edge- and node-level attributions highlight key transcription factors and regulatory connections shaping cell-type predictions.
- Cross-dataset robustness – embeddings and attributions remain stable across random seeds, feature subsets, and batch-corrected inputs.
- Explainability benchmarking – results include quantitative metrics for attribution fidelity and overlap with curated pathway databases.
All analyses are fully reproducible through the main pipeline notebooks, which integrate quantitative metrics, visualizations, and interpretive commentary at each step.
The framework has been validated across multiple datasets, showing consistent multimodal alignment, robust interpretability metrics, and reproducible feature attributions. It is designed for scalability and extension, supporting additional modalities (e.g., methylation or spatial data) and integration of regulatory priors to expand biological interpretability. The repository also includes concise guidance for parameter tuning and graph adaptation to facilitate deployment across datasets and experimental contexts.
For full documentation and methods, see:
docs/README.pdfscripts/Pipeline_mm-kpnn.ipynbscripts/Prototype.ipynb
Each release of this repository is archived and citable via Zenodo DOI.
Yepes, S. “MM-KPNN: Interpretable Multimodal Neural Networks for Single-Cell Integration.” GitHub Repository, 2025.
DOI: 10.5281/zenodo.17194732