data descriptor: sequence data and association statistics from 12,940 type 2 diabetes cases and controls

Jason Flannick, Christian Fuchsberger, Anubha Mahajan, Tanya M. Teslovich, Vineeta Agarwala, Kyle J. Gaulton, Lizz Caulkins, Ryan Koesterer, Clement Ma, Loukas Moutsianas, Davis J. McCarthy, Manuel A. Rivas, John R. B. Perry, Xueling Sim, Thomas W. Blackwell, Neil R. Robertson, N. William Rayner, Pablo Cingolani, Adam E. Locke, Juan Fernandez TajesHeather M. Highland, Josee Dupuis, Peter S. Chines, Cecilia M. Lindgren, Christopher Hartl, Anne U. Jackson, Han Chen, Jeroen R. Huyghe, Martijn Van De Bunt, Richard D. Pearson, Ashish Kumar, Martina Muller-Nurasyid, Niels Grarup, Heather M. Stringham, Eric R. Gamazon, Jaehoon Lee, Yuhui Chen, Robert A. Scott, Jennifer E. Below, Peng Chen, Jinyan Huang, Min Jin Go, Michael L. Stitzel, Dorota Pasko, Stephen C. J. Parker, Tibor V. Varga, Todd Green, Nicola L. Beer, Aaron G. Day-Williams, Teresa Ferreira, Tasha Fingerlin, Momoko Horikoshi, Cheng Hu, Iksoo Huh, Mohammad Kamran Ikram, Bong-Jo Kim, Yongkang Kim, Young Jin Kim, Min-Seok Kwon, Juyoung Lee, Selyeong Lee, Keng-Han Lin, Taylor J. Maxwell, Yoshihiko Nagai, Xu Wang, Ryan P. Welch, Joon Yoon, Weihua Zhang, Nir Barzilai, Benjamin F. Voight, Bok-Ghee Han, Christopher P. Jenkinson, Teemu Kuulasmaa, Johanna Kuusisto, Alisa Manning, Maggie C. Y. Ng, Nicholette D. Palmer, Beverley Balkau, Alena Stancakova, Hanna E. Abboud, Heiner Boeing, Vilmantas Giedraitis, Dorairaj Prabhakaran, Omri Gottesman, James Scott, Jason Carey, Phoenix Kwan, George Grant, Joshua D. Smith, Benjamin M. Neale, Shaun Purcell, Adam S. Butterworth, Joanna M. M. Howson, Heung Man Lee, Yingchang Lu, Soo-Heon Kwak, Wei Zhao, John Danesh, Vincent K. L. Lam, Kyong Soo Park, Danish Saleheen, Wing Yee So, Claudia H. T. Tam, Uzma Afzal, David Aguilar, Rector Arya, Tin Aung, Edmund Chan, Carmen Navarro, Ching-Yu Cheng, Domenico Palli, Adolfo Correa, Joanne E. Curran, Dennis Rybin, Vidya S. Farook, Sharon P. Fowler, Barry I. Freedman, Michael Griswold, Daniel Esten Hale, Pamela J. Hicks, Chiea-Chuen Khor, Satish Kumar, Benjamin Lehne, Dorothee Thuillier, Wei Yen Lim, Jianjun Liu, Marie Loh, Solomon K. Musani, Sobha Puppala, William R. Scott, Loic Yengo, Sian-Tsung Tan, Herman A. Taylor, Farook Thameem, Gregory Wilson, Tien Yin Wong, Pal Rasmus Njolstad, Jonathan C. Levy, Massimo Mangino, Lori L. Bonnycastle, Thomas Schwarzmayr, Joao Fadista, Gabriela L. Surdulescu, Christian Herder, Christopher J. Groves, Thomas Wieland, Jette Bork-Jensen, Ivan Brandslund, Cramer Christensen, Heikki A. Koistinen, Alex S. F. Doney, Leena Kinnunen, Tonu Esko, Andrew J. Farmer, Liisa Hakaste, Dylan Hodgkiss, Jasmina Kravic, Valeriya Lyssenko, Mette Hollensted, Marit E. Jorgensen, Torben Jorgensen, Claes Ladenvall, Johanne Marie Justesen, Annemari Karajamaki, Jennifer Kriebel, Wolfgang Rathmann, Lars Lannfelt, Torsten Lauritzen, Narisu Narisu, Allan Linneberg, Olle Melander, Lili Milani, Matt Neville, Marju Orho-Melander, Lu Qi, Qibin Qi, Michael Roden, Olov Rolandsson, Amy Swift, Anders H. Rosengren, Kathleen Stirrups, Andrew R. Wood, Evelin Mihailov, Christine Blancher, Mauricio O. Carneiro, Jared Maguire, Ryan Poplin, Khalid Shakir, Timothy Fennell, Mark DePristo, Martin Hrabe de Angelis, Panos Deloukas, Anette P. Gjesing, Goo Jun, Peter M. Nilsson, Jacquelyn Murphy, Robert Onofrio, Barbara Thorand, Torben Hansen, Christa Meisinger, et al.

Research output: Contribution to journalArticlepeer-review

22 Scopus citations

Abstract

AbstractTo investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1–5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.
Original languageAmerican English
JournalScientific Data
Volume4
DOIs
StatePublished - 2017
Externally publishedYes

Funding Agency

  • Kuwait Foundation for the Advancement of Sciences

Fingerprint

Dive into the research topics of 'data descriptor: sequence data and association statistics from 12,940 type 2 diabetes cases and controls'. Together they form a unique fingerprint.

Cite this