Data4Cure's AI-driven drug target discovery and cancer subtyping at AACR 2025

Data4Cure presented two new research abstracts at the 2025 American Association for Cancer Research (AACR) Annual Meeting, highlighting AI-based approaches for drug-target discovery and large-scale cancer subtyping using foundation models applied to large collections of sequencing data.

Knowledge-graph AI-based prioritization of drug target candidates

In the first study (Abstract LB112/13), Data4Cure researchers leveraged the company's large-scale biomedical knowledge graph encompassing over 4 billion relations across more than 1 million entities — including genes, diseases, drugs, pathways, cell types, and tissues. To prioritize candidate cancer drug targets, they trained and optimized 17 state-of-the-art knowledge-graph embedding (KGE) and graph neural network (GNN) models. The best models significantly outperformed standard machine-learning approaches, uncovered clusters of predicted targets enriched for specific biological functions, and identified novel candidates — including within previously underexplored functional classes.

Foundation-model integration of >180,000 bulk RNA-seq samples

The second study (Abstract LB343/6) describes RNA1, a novel transformer-based foundation model trained on over 180,000 bulk RNA-seq profiles. RNA1 enables systematic assignment of cancer samples to molecular subtypes and the discovery of new subtypes across more than 36,000 tumor samples. RNA1-based subtyping shows significant associations with survival and drug-response outcomes on published clinical-trial datasets, outperforming previously published subtyping approaches across a majority of evaluated benchmarks.

4B+

KG relations

KGE/GNN models trained

180,000+

RNA-seq profiles (RNA1)

Both projects build on Data4Cure's Biomedical Intelligence Cloud platform and its mission to help turn complex multi-modal data into knowledge that drives biomedical research and therapeutic innovation.

Data4Cure's AI-driven drug target discovery and cancer subtyping at AACR 2025

Knowledge-graph AI-based prioritization of drug target candidates

Foundation-model integration of >180,000 bulk RNA-seq samples

A foundation model for forward & reverse translation at AACR 2026

The AI & Insights Layer: synthesizing knowledge, powering discovery