Researchers at Vanderbilt University Medical Center and the University of Calgary have established an analytical framework that integrates genomic, proteomic and electronic health record data to identify cancer risk proteins and therapeutics for cancer prevention.
Their study, reported Dec. 2 in the American Journal of Human Genetics, identifies previously unreported protein biomarkers and candidate drug targets across six major cancer types and highlights approved drugs with potential cancer preventive effects.
To date, genome-wide association studies (GWAS) have identified several hundred genetic variants associated with increased risk for breast, colorectal and prostate cancer, and several dozen risk variants for other cancers, including lung, pancreatic and ovarian cancer.

“Previous research, including our work, has identified hundreds of putative cancer susceptibility genes that could be regulated by these risk variants; however, most dysregulated gene expression has not been thoroughly investigated at the protein level,” said Xingyi Guo, PhD, associated professor of Medicine in the Division of Epidemiology at VUMC.
Guo is a co-senior author of the current study with Zhijun Yin, PhD, MS, associate professor of Biomedical Informatics at VUMC, and Quan Long, PhD, associate professor of Biochemistry and Molecular Biology at the University of Calgary.

“To deepen the understanding of causal mechanisms and enhance drug discovery efforts, it is imperative to explore data from transcriptomic to proteomic studies,” Yin said.
In the current study, the investigators integrated large GWAS data for breast, colorectal, lung, ovarian, pancreatic and prostate cancers and population-scale proteomics data from over 75,000 participants (combined from the Atherosclerosis Risk in Communities study, deCODE genetics, and UK Biobank Pharma Proteomics Project) to identify risk proteins associated with each cancer.
They identified 365 proteins associated with cancer risk, and through further analysis narrowed the list to 101 proteins, including 74 not reported in previous studies. Using a variety of pharmaceutical databases, the researchers comprehensively annotated the risk proteins as therapeutic targets of approved drugs or drugs in clinical testing. The idea, they said, is to find drugs that can potentially be repurposed for cancer prevention.
“Traditional drug discovery faces challenges of escalating costs, lengthy timelines, and high failure rates. Drug repurposing is a promising strategy to identify new applications for existing drugs with well-documented characteristics,” Guo said.
Among the 101 risk proteins, the researchers identified 36 druggable proteins potentially targeted by 404 drugs already approved or in clinical trials. Nineteen of the druggable proteins were targeted by drugs approved or in trials to treat cancer. The researchers compared drug effects using data from more than 3.5 million electronic health records (EHRs) from VUMC. They demonstrated in simulated trials with EHR data that several approved drugs, for example the diuretic medication acetazolamide, were associated with reduced colorectal cancer risk.
“Our findings offer additional insights into therapeutic drugs targeting risk proteins for cancer prevention and intervention. It is essential to evaluate the effects of the reported candidate drugs through both in vitro and in vivo assays in future research,” Yin said.
Co-first authors of the AJHG paper are Qing Li, PhD; Qingyuan Song; and Zhishan Chen, PhD. The research was supported by the National Institutes of Health (grants R37CA227130, R01CA269589, R01CA297582).