Robustness of genome-wide scanning using archived dried blood spot samples as a DNA source
- Mads V Hollegaard1,
- Jakob Grove2, 3,
- Jonas Grauholm4,
- Eskil Kreiner-Møller5,
- Klaus Bønnelykke5,
- Mette Nørgaard6,
- Thomas L Benfield7, 8,
- Bent Nørgaard-Pedersen1,
- Preben B Mortensen9,
- Ole Mors10,
- Henrik T Sørensen6,
- Zitta B Harboe11,
- Anders D Børglum2,
- Ditte Demontis2,
- Torben F Ørntoft12,
- Hans Bisgaard5 and
- David M Hougaard1
© Hollegaard et al; licensee BioMed Central Ltd. 2011
Received: 11 February 2011
Accepted: 4 July 2011
Published: 4 July 2011
The search to identify disease-susceptibility genes requires access to biological material from numerous well-characterized subjects. Archived residual dried blood spot (DBS) samples, also known as Guthrie cards, from national newborn screening programs may provide a DNA source for entire populations. Combined with clinical information from medical registries, DBS samples could provide a rich source for productive research. However, the amounts of DNA that can be extracted from these precious samples are minute and may be prohibitive for numerous genotypings. Previously, we demonstrated that DBS DNA can be whole-genome amplified and used for reliable genetic analysis on different platforms, including genome-wide scanning arrays. However, it remains unclear whether this approach is workable on a large sample scale. We examined the robustness of using DBS samples for whole-genome amplification followed by genome-wide scanning, using arrays from Illumina and Affymetrix.
This study is based on 4,641 DBS samples from the Danish Newborn Screening Biobank, extracted for three separate genome-wide association studies. The amount of amplified DNA was significantly (P < 0.05) affected by the year of storage and storage conditions. Nine (0.2%) DBS samples failed whole-genome amplification. A total of 4,586 (98.8%) samples met our criterion of success of a genetic call-rate above 97%. The three studies used different arrays, with mean genotyping call-rates of 99.385% (Illumina Infinium Human610-Quad), 99.722% (Illumina Infinium HD HumanOmni1-Quad), and 99.206% (Affymetrix Axiom Genome-Wide CEU). We observed a concordance rate of 99.997% in the 38 methodological replications, and 99.999% in the 27 technical replications. Handling variables such as time of storage, storage conditions and type of filter paper were shown to significantly (P < 0.05) affect the genotype call-rates in some of the arrays, although the effect was minimal.
Our study indicates that archived DBS samples from the Danish Newborn Screening Biobank represent a reliable resource of DNA for whole-genome amplification and subsequent genome-wide association studies. With call-rates equivalent to high quality DNA samples, our results point to new opportunities for using the neonatal biobanks available worldwide in the hunt for genetic components of disease.
Identifying genetic effects in complex disorders usually requires genome studies in large cohorts. Access to DNA from well-characterized patients and healthy controls represents a major bottleneck. This problem may be circumvented by using archived residual blood samples from newborn screening programs, which encompass the entire population under a certain age in several countries. The blood is usually collected by heel-prick and applied to special filter paper, a proven robust and convenient medium for transport and storage. Storage policies for residual neonatal dried blood spot (DBS) samples vary internationally, but several countries store residual samples in repositories for research purposes [2–8]. Stored DBS samples combined with relevant clinical information from medical registries are an ideal resource for large studies representing an entire population under a given age without selection bias. In addition, availability of previously collected samples allows substantial savings in research-related costs and time.
The Danish Neonatal Screening Biobank (DNSB) contains nearly two million DBS samples collected from almost every Dane born after 1981. It has recently been updated to meet new general guidelines for the establishment and operation of biobanks. Approval from the Scientific Ethical Committee System, the Data Protection Agency, and the DNSB Steering Committee is needed to obtain access to samples for research.
In Denmark, all citizens have a unique personal identification number used in all public registration systems, including the DNSB. Denmark also has a well-established public health care system with equal treatment offered to all citizens. These resources allow researchers to study the entire country as a cohort, and make the DNSB an ideal resource for studying common and complex genetic diseases in Caucasians.
A major challenge using DBS samples for genetic studies is the small amount of blood available in a spot. The amount of genomic DNA (gDNA) that can be extracted from a 3.2-mm punch of a DBS sample is approximately 60 ng. In general, only one or two 3.2-mm punches per DBS sample can be reserved for a given project, limiting screening to only a few single nucleotide polymorphisms (SNP). This obstacle may be overcome by whole-genome amplification (WGA) of the DNA. Previous studies have used whole-genome amplified DNA (wgaDNA) for genotyping with some success, but in most cases, only a limited number of polymorphisms could be tested [11–17].
Here we describe genome-wide association studies (GWAS) using DBS samples from the DNSB. Storage time, storage conditions, and type of filter paper used for DBS collection were evaluated to determine their effects on the amount of amplified wgaDNA material obtained from each sample. The effects of these variables on genotype call rates were also examined in three studies, which used three different types of array running on either Illumina or Affymetrix genotyping platforms.
Our 4,641 subjects were obtained from three case-control GWAS studies. The first study, GEMS (Genomic Medicine for Schizophrenia), called "610k" in this manuscript, included 1,808 DBS samples stored from 1981-1996. The purpose was to identify genetic regions associated with schizophrenia (Ethical Approval no.: 20020020; Data Protection Agency no.: 2002-41-2059). The second study, "Omni1", provided 1,283 DBS samples stored from 1982-2006, and was undertaken to examine the role of genetics in Meningococcal and Pneumococcal infections (Ethical Approval nos. 20060008 and HB-2007-085; Data Protection Agency nos. 2005-41-6012 and 2007-41-0229). The third study, "Axiom", aimed to identify genetic variations associated with asthma, and included DBS samples stored from 1982-2006 (Ethical Approval no. HB-2008-103; Data Protection Agency no. 2008-41-2622). All studies were approved by the DNSB Steering Committee. The current study was conducted as an anonymous register study.
DNA extraction, whole-genome amplification, and SNP genotyping
Two 3.2-mm disks were punched from each DBS sample, and protein was removed as previously described. Genomic DNA was thereafter extracted using the Extract-N-Amp kit (Sigma-Aldrich). To attenuate possible unequal amplification of alleles, WGA was carried out in triplicate using the REPLI-g mini kit (Qiagen). The concentration of wgaDNA was estimated using Quant-IT PicoGreen dsDNA Reagent (Invitrogen). The three studies, "610k", "Omni1" and "Axiom", used an Infinium Human610-Quad chip array (Illumina), an Infinium HD HumanOmni1-Quad chip array (Illumina) and an Axiom Genome-Wide CEU Array chip (Affymetrix), respectively. wgaDNA samples were normalized to 60 ng/μL prior to genome-wide scanning (GWS) of SNP genotypes. Samples with genotyping call rates (GCR) below 97% but above 95% were rerun in the Illumina-based studies without reamplifying the gDNA samples, under the assumption that the low call-rates stemmed from a technical issue. Samples with GCRs below 95% were re-amplified before re-genotyping. Both technical replicates (same wgaDNA genotyped twice) and methodological replicates (same sample of WGA used in two separate reactions and genotyped separately) were included in the two Illumina studies. "610k" included six methodological and 11 technical replicates, "Omni1" included 32 methodological and 16 technical replicates, and "Axiom" had no replicates.
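The rerun rule described above for the Illumina-based studies can be sketched as a small decision function. This is an illustrative sketch only: the function name and return labels are hypothetical, and the handling of call rates of exactly 95% or 97% is an assumption, since the text only specifies "above" and "below" these thresholds.

```python
def triage_sample(gcr_percent: float) -> str:
    """Suggest a follow-up action from a genotyping call rate (GCR) in percent.

    Thresholds follow the text: success above 97%, technical rerun between
    95% and 97%, re-amplification below 95%. The labels are hypothetical.
    """
    if not 0.0 <= gcr_percent <= 100.0:
        raise ValueError("GCR must be a percentage between 0 and 100")
    if gcr_percent > 97.0:
        return "pass"
    if gcr_percent >= 95.0:
        # Assumption: exactly 95% or 97% is treated as a technical rerun,
        # i.e. the same wgaDNA is genotyped again without re-amplification.
        return "rerun_genotyping"
    # Below 95%: the gDNA is re-amplified before re-genotyping.
    return "reamplify_and_regenotype"


if __name__ == "__main__":
    for gcr in (99.4, 96.2, 89.3):
        print(gcr, triage_sample(gcr))
```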
To evaluate the sample processing we pooled the DBS samples included in the three GWAS studies. Pooling samples was statistically sound as all samples were treated identically up to the step before choosing the SNP genotyping array platform and technology, but possible sample effects were tested statistically. We used a linear regression model to test for interaction between the included variables: years of storage counting from 1981 (years), storage conditions (condition: 0 (+4°C, 1981-1987), 1 (-20°C, 1988-present)), type of filter paper (filter: 0 (S&S2992, 1981-2000), 1 (S&S903, 2001-present)) and the wgaDNA concentration.
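The coding of the biobank variables can be illustrated with a short sketch. It assumes that each sample is characterized by its calendar year of collection, that `years` is that year minus 1981 (one possible reading of "counting from 1981"), and that storage condition and filter type follow directly from the year ranges given above. The function name is hypothetical.

```python
def code_biobank_variables(collection_year: int) -> dict:
    """Derive the regression covariates from the year a DBS sample was collected.

    Assumptions (not stated explicitly in the text): the covariates are fully
    determined by the collection year, and `years` counts whole years from 1981.
    """
    if collection_year < 1981:
        raise ValueError("the biobank holds samples from 1981 onwards")
    return {
        # Years counted from 1981.
        "years": collection_year - 1981,
        # 0 = stored at +4 degrees C (1981-1987), 1 = stored at -20 degrees C (1988-present).
        "condition": 0 if collection_year <= 1987 else 1,
        # 0 = S&S2992 filter paper (1981-2000), 1 = S&S903 (2001-present).
        "filter": 0 if collection_year <= 2000 else 1,
    }


if __name__ == "__main__":
    for year in (1985, 1995, 2003):
        print(year, code_biobank_variables(year))
```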
The GCR was used for evaluating the array efficiency and sensitivity to biobank variables. To meet the criteria of being normally distributed, the GCR was transformed using a zero-skewness log (resulting in log(1-GCR), the logarithm of the failure rate). The effect of the years of storage, type of filter paper, and storage conditions on the transformed GCR was analysed in the three studies individually using a linear regression interaction model. A bivariate linear regression model was used to evaluate the effect of the wgaDNA concentration on the GCR. STATA MP11 software (StataCorp LP, TX, USA) was used for the statistical analyses.
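As a rough illustration of the transformation and the bivariate model, the sketch below computes log(1-GCR) and fits an ordinary least-squares line in plain Python. It is not the STATA procedure used in the study: the natural logarithm is assumed, the function names are hypothetical, and the example values in the usage section are invented for demonstration only.

```python
import math


def log_failure_rate(gcr_percent: float) -> float:
    """Return log(1 - GCR), with GCR given in percent (natural log assumed)."""
    failure = 1.0 - gcr_percent / 100.0
    if failure <= 0.0:
        raise ValueError("a call rate of 100% has no finite log failure rate")
    return math.log(failure)


def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0.0:
        raise ValueError("x must not be constant")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


if __name__ == "__main__":
    # Invented example values: wgaDNA concentration (ng/uL) and GCR (%).
    concentration = [80.0, 120.0, 160.0, 200.0]
    gcr = [99.1, 99.3, 99.5, 99.6]
    transformed = [log_failure_rate(g) for g in gcr]
    slope, intercept = fit_line(concentration, transformed)
    # A negative slope means the failure rate falls as concentration rises.
    print(slope, intercept)
```

Because the outcome is the logarithm of the failure rate, a negative coefficient corresponds to a higher call rate.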
A total of 4,641 subjects from the three GWAS disease studies were used to evaluate the use of DBS samples for genetic studies. "610k" included 1,808 samples stored for a mean of 23.9 years (range: 14-28 years; standard deviation (SD): 2.8 years; 45.5% female and 54.5% male). "Omni1" was based on 1,283 samples stored for a mean of 15.7 years (range: 4-28 years; SD: 6.2 years; 43.6% female and 56.4% male). "Axiom" included 1,550 samples stored for a mean of 16.5 years (range: 4-28 years; SD: 6.5 years; 31.8% female and 68.2% male).
Table 1. Effect of storage year, storage conditions, and type of filter paper on the wgaDNA concentration.
Table 2. Technical evaluation of the "610k", "Omni1", and "Axiom" studies.
The three GWAS studies were evaluated separately as their performance was significantly different (data not shown), likely because different array types were used on different genotyping platforms (Affymetrix and Illumina).
The mean GCRs in the three studies were: "610k", 99.385% (range: 47.129-99.933%; GCR 5th percentile: 98.609%); "Omni1", 99.722% (range: 55.685-99.974%; GCR 5th percentile: 99.522%); and "Axiom", 99.206% (range: 89.313-99.890%; GCR 5th percentile: 98.140%).
Table 3. The effect of biobank-related variables on log(1-GCR) in three GWS arrays.
Dried blood spot samples are being collected and stored in biobanks for diagnostic and research purposes worldwide. In several countries this has been common practice for several decades. We previously showed that DBS samples from the DNSB can be used to generate reliable genetic results using the Illumina genome-wide scanning technology, but this evaluation was restricted to relatively few samples. In the current study, which combined results from three recent GWS studies, we found that DBS samples are suitable for large-scale genetic studies.
According to our regression model, and as suggested in a previous study, increasing years of storage and storage at +4°C negatively affected the wgaDNA concentration (Table 1). Independent of years of storage, the wgaDNA concentration increased when DBS samples were stored at -20°C shortly after reception, thereby increasing the chance of a successful WGS (Table 1). In contrast to a previous finding, the more absorbent S&S903 filter paper did not significantly affect the amount of amplified material compared to the less absorbent S&S2992 filter paper (Table 1). As only 8.6% (396) of the samples were spotted on S&S903 filter paper, we would like to expand this analysis when new studies have provided additional data from S&S903 samples.
The wgaDNA samples in the three studies performed excellently, with mean GCRs greater than 99.2%, and replication concordance rates greater than 99.9% (Table 2). This indicates that gDNA extracted from DBS samples, amplified under suboptimal conditions (gDNA input below 10 ng), can be used as a reliable DNA resource for high-throughput SNP genotyping. With this in mind, we aimed to detect if any biobank-related variables affected the GCR.
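For readers unfamiliar with these two metrics, the sketch below shows one common way to compute a call rate and a replicate concordance rate from per-SNP genotype calls. The text does not state how concordance was calculated in this study, so the exclusion of no-calls from the comparison is an assumption, and the function names and the genotype encoding (strings such as "AA", with `None` for a no-call) are hypothetical.

```python
def call_rate(calls) -> float:
    """Fraction of SNPs with a genotype call (None marks a no-call)."""
    if not calls:
        raise ValueError("no SNPs supplied")
    called = sum(1 for c in calls if c is not None)
    return called / len(calls)


def concordance(replicate_a, replicate_b) -> float:
    """Fraction of identical genotypes among SNPs called in both replicates.

    Assumption: SNPs with a no-call in either replicate are excluded.
    """
    if len(replicate_a) != len(replicate_b):
        raise ValueError("replicates must cover the same SNPs")
    pairs = [
        (a, b)
        for a, b in zip(replicate_a, replicate_b)
        if a is not None and b is not None
    ]
    if not pairs:
        raise ValueError("no SNP was called in both replicates")
    matches = sum(1 for a, b in pairs if a == b)
    return matches / len(pairs)


if __name__ == "__main__":
    first = ["AA", "AB", None, "BB"]
    second = ["AA", "BB", "AB", "BB"]
    print(call_rate(first))
    print(concordance(first, second))
```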
In "Omni1" and "Axiom", but not "610k", increasing wgaDNA concentrations increased the GCR. We speculated as to whether the lack of association in "610k" was due to the fact that the study only included samples from 1981-1996, whereas the other studies incorporated samples stored from 1982-2006.
In contrast to what we expected, the less absorbent S&S2992 filter paper had significantly higher GCRs in "Omni1", compared with the absorbent S&S903. The statistical model also indicated that the GCRs in the S&S903 samples of the "Omni1" study increased significantly with fewer years of storage, suggesting that the GCRs over time decrease at a higher rate. Overall, relatively few samples were collected on the S&S903 filter paper (196 (~15%) in "Omni1" and 200 (~13%) in "Axiom"), so the significant associations could also be artefacts. Future studies will help us to answer this question.
The storage conditions significantly affected the GCR in the "Axiom" study, with the GCR increasing when samples were stored at -20°C. Unexpectedly, the older samples performed better than the more recent samples. We speculate that the difference between the Affymetrix (ligation) and Illumina (single base extension) SNP genotyping approaches may contribute to this. Overall, of the three arrays tested, the Illumina "Omni1" array performed best. Compared with the other arrays, "Omni1" had the highest mean GCR, and the highest sample success rate. It is important to point out that none of the arrays performed poorly, and that the effects of the different variables on the GCR were minimal, even when statistically significant. All three arrays should be considered usable for GWS of DBS samples.
The robustness of the three GWS studies indicates that filter paper is an excellent way to collect and store whole blood samples for later DNA research purposes. Collecting samples on filter paper has several advantages compared with standard venepuncture, including less discomfort for the patient, especially if several samples need to be collected within a short period of time. The relatively small amount of blood taken limits the number of analyses that can be performed, but techniques such as WGA help mitigate these restrictions with regard to DNA-based methods. To date, DBS samples have been used for multiplex protein analysis, vitamin D estimation, mRNA profiling, cytomegalovirus identification, and epigenetic methylation testing. These studies, combined with the ability to perform a full genetic SNP profile that we describe here, show that DBS biobanks can be considered sources of sample material for future studies of disease. It remains to be seen whether DBS samples can be used for next-generation sequencing, universal epigenetic profiling or detection of copy number variations.
In summary, we found that DNSB DBS samples constitute a good resource for SNP genotyping and GWS array studies. Samples in neonatal screening biobanks worldwide should be considered an important source of genetic material for future genetic studies. Our results also suggest that new samples for GWS studies can be collected on filter paper with minimal discomfort for patients, potentially higher participation rates, and convenience in collection, shipping, costs, and storage as compared with whole blood obtained by venepuncture. Depending on the array chosen, different variables may marginally affect the GCR, but overall our approach using DBS samples stored for up to 28 years performed as well as good quality DNA from whole-blood samples. Though not significantly affecting the GCR, we emphasize the importance of storing DBS samples at -20°C, to increase the number of biomarkers that can be analysed.
We would like to acknowledge laboratory technicians Høgni Kallehauge Petersen and Lis Vestergaard-Hansen for their efforts in extracting and processing the DBS samples used in the three studies.
The "610k" study (authors: JAG, JOG, PBM, PM, OM, ADB, DD, TFO, BNP, DMH, MVH) was supported by the Stanley Medical Research Institute, the Danish Council for Strategic Research and H. Lundbeck A/S. The funding agencies did not have any role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
The "Omni1" study (authors: MN, TLB, ZBH, HTS, DMH, MVH) was funded by the Lundbeck Foundation, the Novo Nordisk Foundation, Kong Christian d. tiendes fond, Dir. Jacob Madsen og Hustrus Fond, TrygVesta, Ebba Celinders Legat, Den Alm. Danske Lægeforening, Fonden til Lægevidenskabens Fremme, Augustinus Fonden, Brøderne Hartmanns Fond, and Dagmar Marshalls Fond. The funding agencies did not have any role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
The "Axiom" study (authors: EKM, KB, HB, DMH, MVH) was funded by the Lundbeck Foundation, the Pharmacy Foundation of 1991, the Augustinus Foundation, the Danish Medical Research Council, and the Danish Pediatric Asthma Centre. In addition, COPSAC is funded by private and public research funds, all listed on http://www.copsac.com. The funding agencies did not have any role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
- Mei JV, Alexander JR, Adam BW, Hannon WH: Use of filter paper for the collection and analysis of human whole blood specimens. J Nutr. 2001, 131 (5): 1631S-1636S.
- Aoki K: Newborn screening in Japan. Southeast Asian J Trop Med Public Health. 2003, 34 (Suppl 3): 80.
- de Carvalho TM, dos Santos HP, dos Santos IC, Vargas PR, Pedrosa J: Newborn screening: a national public health programme in Brazil. J Inherit Metab Dis. 2007, 30 (4): 615. 10.1007/s10545-007-0650-7.
- Olney RS, Moore CA, Ojodu JA, Lindegren ML, Hannon WH: Storage and use of residual dried blood spots from state newborn screening programs. J Pediatr. 2006, 148 (5): 618-622. 10.1016/j.jpeds.2005.12.053.
- Therrell BL, Adams J: Newborn screening in North America. J Inherit Metab Dis. 2007, 30 (4): 447-465. 10.1007/s10545-007-0690-z.
- Therrell BL, Hannon WH, Pass KA, Lorey F, Brokopp C, Eckman J, Glass M, Heidenreich R, Kinney S, Kling S: Guidelines for the retention, storage, and use of residual dried blood spot samples after newborn screening analysis: statement of the Council of Regional Networks for Genetic Services. Biochem Mol Med. 1996, 57 (2): 116-124. 10.1006/bmme.1996.0017.
- Webster D: Newborn screening in Australia and New Zealand. Southeast Asian J Trop Med Public Health. 2003, 34 (Suppl 3): 69-70.
- Wilcken B, Wiley V: Newborn screening. Pathology. 2008, 40 (2): 104-115. 10.1080/00313020701813743.
- Norgaard-Pedersen B, Hougaard DM: Storage policies and use of the Danish Newborn Screening Biobank. J Inherit Metab Dis. 2007, 30 (4): 530-536. 10.1007/s10545-007-0631-x.
- Frank L: Epidemiology. When an entire country is a cohort. Science. 2000, 287 (5462): 2398-2399. 10.1126/science.287.5462.2398.
- Hannelius U, Lindgren CM, Melen E, Malmberg A, von Dobeln U, Kere J: Phenylketonuria screening registry as a resource for population genetic studies. J Med Genet. 2005, 42 (10): e60. 10.1136/jmg.2005.032987.
- Catsburg A, van der Zwet WC, Morre SA, Ouburg S, Vandenbroucke-Grauls CM, Savelkoul PH: Analysis of multiple single nucleotide polymorphisms (SNP) on DNA traces from plasma and dried blood samples. J Immunol Methods. 2007, 321 (1-2): 135-141. 10.1016/j.jim.2007.01.015.
- Lovmar L, Fredriksson M, Liljedahl U, Sigurdsson S, Syvanen AC: Quantitative evaluation by minisequencing and microarrays reveals accurate multiplexed SNP genotyping of whole genome amplified DNA. Nucleic Acids Res. 2003, 31 (21): e129. 10.1093/nar/gng129.
- Park JW, Beaty TH, Boyce P, Scott AF, McIntosh I: Comparing whole-genome amplification methods and sources of biological samples for single-nucleotide polymorphism genotyping. Clin Chem. 2005, 51 (8): 1520-1523. 10.1373/clinchem.2004.047076.
- Sjoholm MI, Dillner J, Carlson J: Assessing quality and functionality of DNA from fresh and archival dried blood spots and recommendations for quality control guidelines. Clin Chem. 2007, 53 (8): 1401-1407. 10.1373/clinchem.2007.087510.
- Hollegaard MV, Sorensen KM, Petersen HK, Arnardottir MB, Norgaard-Pedersen B, Thorsen P, Hougaard DM: Whole genome amplification and genetic analysis after extraction of proteins from dried blood spots. Clin Chem. 2007, 53 (6): 1161-1162. 10.1373/clinchem.2006.082313.
- Hollegaard MV, Grove J, Thorsen P, Norgaard-Pedersen B, Hougaard DM: High-throughput genotyping on archived dried blood spot samples. Genet Test Mol Biomarkers. 2009, 13 (2): 173-179. 10.1089/gtmb.2008.0073.
- Skogstrand K, Thorsen P, Norgaard-Pedersen B, Schendel DE, Sorensen LC, Hougaard DM: Simultaneous measurement of 25 inflammatory markers and neurotrophins in neonatal dried blood spots by immunoassay with xMAP technology. Clin Chem. 2005, 51 (10): 1854-1866. 10.1373/clinchem.2005.052241.
- Hollegaard MV, Grauholm J, Borglum A, Nyegaard M, Norgaard-Pedersen B, Orntoft T, Mortensen PB, Wiuf C, Mors O, Didriksen M: Genome-wide scans using archived neonatal dried blood spot samples. BMC Genomics. 2009, 10: 297. 10.1186/1471-2164-10-297.
- Lasken RS: Genomic DNA amplification by the multiple displacement amplification (MDA) method. Biochem Soc Trans. 2009, 37 (Pt 2): 450-453.
- Vanhorebeek I, Peeters RP, Vander Perre S, Jans I, Wouters PJ, Skogstrand K, Hansen TK, Bouillon R, Van den Berghe G: Cortisol response to critical illness: effect of intensive insulin therapy. J Clin Endocrinol Metab. 2006, 91 (10): 3803-3813. 10.1210/jc.2005-2089.
- Eyles D, Anderson C, Ko P, Jones A, Thomas A, Burne T, Mortensen PB, Norgaard-Pedersen B, Hougaard DM, McGrath J: A sensitive LC/MS/MS assay of 25OH vitamin D3 and 25OH vitamin D2 in dried blood spots. Clin Chim Acta. 2009, 403 (1-2): 145-151. 10.1016/j.cca.2009.02.005.
- Haak PT, Busik JV, Kort EJ, Tikhonenko M, Paneth N, Resau JH: Archived Unfrozen Neonatal Blood Spots Are Amenable to Quantitative Gene Expression Analysis. Neonatology. 2008, 95 (3): 210-216.
- Boppana SB, Ross SA, Novak Z, Shimamura M, Tolan RW, Palmer AL, Ahmed A, Michaels MG, Sanchez PJ, Bernstein DI: Dried blood spot real-time polymerase chain reaction assays to screen newborns for congenital cytomegalovirus infection. JAMA. 2010, 303 (14): 1375-1382. 10.1001/jama.2010.423.
- Wong N, Morley R, Saffery R, Craig J: Archived Guthrie blood spots as a novel source for quantitative DNA methylation analysis. Biotechniques. 2008, 45 (4): 423-424, 426, 428 passim. 10.2144/000112945.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.