Pregnancy Associated Breast Cancer gene expressions : new insights on their regulation based on Rare Correlated Patterns
Résumé
Each year breast-cancer (BC) is the most common invasive cancer in women, causing a considerable death rate. Given that, BC is classified as a hormone-dependent cancer, when it collides with pregnancy, different questions may arise for which there are still no convincing answers. To deal with this issue, two new frameworks are proposed within this paper: CORAM and DIST-CORAM. The former is the first unified framework dedicated to the extraction of a generic basis of Correlated-Rare association rules from gene expression data. The proposed approach has been successfully applied on a breast-cancer Gene Expression Matrix (GSE1379) with very promising results. The latter, the DIST-CORAM approach, is a big-data processing system based on Apache spark, which deals with correlation mining from streams of micro-array pregnancy associated breast-cancer (PABC). It is successfully applied on the (GSE31192) gene expression matrix (GEM). The correlated patterns of gene-sets affirm that PABC exhibits heightened aggressiveness compared to cancers for Non-PABC women. In fact, the high levels of estrogen and progesterone hormones unfortunately contribute to the tumor aggressiveness and the proliferation of the cancer.