AI-Driven Integration of Histological and Genomic Data for Understanding Breast Cancer Survival

Abstract

Breast cancer (BC) poses a significant threat to public health. Understanding its complex nature using high-dimensional data remains a challenge. Here, we explored the predictive capabilities of feature extraction algorithms on genomic and image data within the context of BC. Firstly, we trained AI models on the TCGA dataset to identify cancer signatures. Secondly, we evaluated the predictive power of these image and omics features using a survival framework on an independent dataset (METABRIC). The genomic models achieved AUC values of 0.99 (expression), 0.78 (SNPs), and 0.72 (CNVs). The image segmentation model distinguished histological structures with a dice score of 0.63. Finally, survival models showed significant associations between BC and features such as image-derived cell stroma % (p=0.002) and genetic-derived CNV status (p=4.34×10-4). Our study highlights the capabilities of AI to capture relevant features in high-dimensional data, enhancing BC survival predictions.

BIB_text

@Article {
title = {AI-Driven Integration of Histological and Genomic Data for Understanding Breast Cancer Survival},
keywds = {
breast cancer; genomics; histological image; image segmentation; Survival model
}
abstract = {

Breast cancer (BC) poses a significant threat to public health. Understanding its complex nature using high-dimensional data remains a challenge. Here, we explored the predictive capabilities of feature extraction algorithms on genomic and image data within the context of BC. Firstly, we trained AI models on the TCGA dataset to identify cancer signatures. Secondly, we evaluated the predictive power of these image and omics features using a survival framework on an independent dataset (METABRIC). The genomic models achieved AUC values of 0.99 (expression), 0.78 (SNPs), and 0.72 (CNVs). The image segmentation model distinguished histological structures with a dice score of 0.63. Finally, survival models showed significant associations between BC and features such as image-derived cell stroma % (p=0.002) and genetic-derived CNV status (p=4.34×10-4). Our study highlights the capabilities of AI to capture relevant features in high-dimensional data, enhancing BC survival predictions.


}
isbn = {979-835031333-8},
date = {2024-05-27},
}
Vicomtech

Parque Científico y Tecnológico de Gipuzkoa,
Paseo Mikeletegi 57,
20009 Donostia / San Sebastián (Spain)

+(34) 943 309 230

Zorrotzaurreko Erribera 2, Deusto,
48014 Bilbao (Spain)

close overlay

Behavioral advertising cookies are necessary to load this content

Accept behavioral advertising cookies