Prioritization of Positional Candidate Genes Using Multiple Web-Based Software Tools

Tobias A. Thornblad; Kate S. Elliott; Jeremy Jowett; Peter M. Visscher

doi:10.1375/twin.10.6.861

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

The prioritization of genes within a candidate genomic region is an important step in the identification of causal gene variants affecting complex traits. Surprisingly, there have been very few reports of bioinformatics tools to perform such prioritization. The purpose of this article is to investigate the performance of 3 positional candidate gene software tools available, PosMed, GeneSniffer and SUSPECTS. The comparison was made for 40, 20 and 10 Mb regions in the human genome centred around known susceptibility genes for the common diseases breast cancer, Crohn's disease, age-related macular degeneration and schizophrenia. The known susceptibility gene was not always ranked highly, or not ranked at all, by 1 or more of the software tools. There was a large variation between the 3 tools regarding which genes were prioritized, and their rank order. PosMed and GeneSniffer were most similar in their prioritization gene list, whereas SUSPECTS identified the same candidate genes only for the narrowest (10 Mb) regions. Combining 2 or all of the candidate gene finding tools was superior in terms of ranking positional candidates. It is possible to reduce the number of candidate genes from a starting set in a region of interest by combining a variety of candidate gene finding tools. Conversely, we recommend caution in relying solely on single positional candidate gene prioritization tools. Our results confirm the obvious, that is, that starting with a narrower positional region gives a higher likelihood that the true susceptibility gene is selected, and that it is ranked highly. A narrow confidence interval for the mapping of complex trait genes by linkage can be achieved by maximizing marker informativeness and by having large samples. Our results suggest that the best approach to classify a minimum set of candidate genes is to take those genes that are prioritized by multiple prioritization tools.

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Seelow, Dominik Schwarz, Jana Marie Schuelke, Markus and Awadalla, Philip 2008. GeneDistiller—Distilling Candidate Genes from Linkage Intervals. PLoS ONE, Vol. 3, Issue. 12, p. e3874.

Makita, Yuko Kobayashi, Norio Mochizuki, Yoshiki Yoshida, Yuko Asano, Satomi Heida, Naohiko Deshpande, Mrinalini Bhatia, Rinki Matsushima, Akihiro Ishii, Manabu Kawaguchi, Shuji Iida, Kei Hanada, Kosuke Kuromori, Takashi Seki, Motoaki Shinozaki, Kazuo and Toyoda, Tetsuro 2009. PosMed-plus: An Intelligent Search Engine that Inferentially Integrates Cross-Species Information Resources for Molecular Breeding of Plants. Plant and Cell Physiology, Vol. 50, Issue. 7, p. 1249.

Day, Allen Dong, Jun Funari, Vincent A. Harry, Bret Strom, Samuel P. Cohn, Dan H. Nelson, Stanley F. and Creighton, Chad 2009. Disease Gene Characterization through Large-Scale Co-Expression Analysis. PLoS ONE, Vol. 4, Issue. 12, p. e8491.

Raychaudhuri, Soumya Plenge, Robert M. Rossin, Elizabeth J. Ng, Aylwin C. Y. Purcell, Shaun M. Sklar, Pamela Scolnick, Edward M. Xavier, Ramnik J. Altshuler, David Daly, Mark J. and Storey, John D. 2009. Identifying Relationships among Genomic Disease Regions: Predicting Genes at Pathogenic SNP Associations and Rare Deletions. PLoS Genetics, Vol. 5, Issue. 6, p. e1000534.

Chen, J. Bardes, E. E. Aronow, B. J. and Jegga, A. G. 2009. ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Research, Vol. 37, Issue. Web Server, p. W305.

Yoshida, Y. Makita, Y. Heida, N. Asano, S. Matsushima, A. Ishii, M. Mochizuki, Y. Masuya, H. Wakana, S. Kobayashi, N. and Toyoda, T. 2009. PosMed (Positional Medline): prioritizing genes with an artificial neural network comprising medical documents to accelerate positional cloning. Nucleic Acids Research, Vol. 37, Issue. Web Server, p. W147.

Ortutay, Csaba and Vihinen, Mauno 2009. Identification of candidate disease genes by integrating Gene Ontologies and protein-interaction networks: case study of primary immunodeficiencies. Nucleic Acids Research, Vol. 37, Issue. 2, p. 622.

Sha, Y Liu, Q Wang, Y Dong, C and Song, L 2010. Exploring Candidate Genes for Epilepsy by Computational Disease-Gene Identification Strategy. Balkan Journal of Medical Genetics, Vol. 13, Issue. 2, p. 35.

Qiao, Y. Harvard, C. Tyson, C. Liu, X. Fawcett, C. Pavlidis, P. Holden, J. J. A. Lewis, M. E. S. and Rajcan-Separovic, E. 2010. Outcome of array CGH analysis for 255 subjects with intellectual disability and search for candidate genes using bioinformatics. Human Genetics, Vol. 128, Issue. 2, p. 179.

Kaimal, Vivek Sardana, Divya Bardes, Eric E. Gudivada, Ranga Chandra Chen, Jing and Jegga, Anil G. 2011. Disease Gene Identification. Vol. 700, Issue. , p. 241.

Oti, Martin Ballouz, Sara and Wouters, Merridee A. 2011. In Silico Tools for Gene Discovery. Vol. 760, Issue. , p. 189.

Xiao, Yun Xu, Chaohan Ping, Yanyan Guan, Jinxia Fan, Huihui Li, Yiqun and Li, Xia 2011. Differential expression pattern-based prioritization of candidate genes through integrating disease-specific expression data. Genomics, Vol. 98, Issue. 1, p. 64.

Zhu, Cheng Kushwaha, Akash Berman, Kenneth and Jegga, Anil G 2012. A vertex similarity-based framework to discover and rank orphan disease-related genes. BMC Systems Biology, Vol. 6, Issue. S3,

O'Brien, M.A. Costin, B.N. and Miles, M.F. 2012. Bioinformatics of Behavior: Part 2. Vol. 104, Issue. , p. 91.

Börnigen, Daniela Tranchevent, Léon-Charles Bonachela-Capdevila, Francisco Devriendt, Koenraad De Moor, Bart De Causmaecker, Patrick and Moreau, Yves 2012. An unbiased evaluation of gene prioritization tools. Bioinformatics, Vol. 28, Issue. 23, p. 3081.

Moreau, Yves and Tranchevent, Léon-Charles 2012. Computational tools for prioritizing candidate genes: boosting disease gene discovery. Nature Reviews Genetics, Vol. 13, Issue. 8, p. 523.

Jegga, Anil G. Zhu, Cheng and Aronow, Bruce J. 2012. Pediatric Biomedical Informatics. Vol. 2, Issue. , p. 287.

Britto, Ramona Sallou, Olivier Collin, Olivier Michaux, Grégoire Primig, Michael and Chalmel, Frédéric 2012. GPSy: a cross-species gene prioritization system for conserved biological processes—application in male gamete development. Nucleic Acids Research, Vol. 40, Issue. W1, p. W458.

Bromberg, Yana and Lewitter, Fran 2013. Chapter 15: Disease Gene Prioritization. PLoS Computational Biology, Vol. 9, Issue. 4, p. e1002902.

Makita, Yuko Kobayashi, Norio Yoshida, Yuko Doi, Koji Mochizuki, Yoshiki Nishikata, Koro Matsushima, Akihiro Takahashi, Satoshi Ishii, Manabu Takatsuki, Terue Bhatia, Rinki Khadbaatar, Zolzaya Watabe, Hajime Masuya, Hiroshi and Toyoda, Tetsuro 2013. PosMed: ranking genes and bioresources based on Semantic Web Association Study. Nucleic Acids Research, Vol. 41, Issue. W1, p. W109.

Download full list

Article contents

Prioritization of Positional Candidate Genes Using Multiple Web-Based Software Tools

Abstract

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests