Phenotypic, genetic, and genome-wide structure in the metabolic syndrome

Martin, Lisa J; North, Kari E; Dyer, Tom; Blangero, John; Comuzzie, Anthony G; Williams, Jeff

doi:10.1186/1471-2156-4-S1-S95

Volume 4 Supplement 1

Genetic Analysis Workshop 13: Analysis of Longitudinal Family Data for Complex Diseases and Related Risk Factors

Proceedings
Open access
Published: 31 December 2003

Phenotypic, genetic, and genome-wide structure in the metabolic syndrome

Lisa J Martin¹,
Kari E North²,
Tom Dyer³,
John Blangero³,
Anthony G Comuzzie³ &
…
Jeff Williams³

BMC Genetics volume 4, Article number: S95 (2003) Cite this article

1175 Accesses
20 Citations
Metrics details

Abstract

Background

Insulin resistance, obesity, dyslipidemia, and high blood pressure characterize the metabolic syndrome. In an effort to explore the utility of different multivariate methods of data reduction to better understand the genetic influences on the aggregation of metabolic syndrome phenotypes, we calculated phenotypic, genetic, and genome-wide LOD score correlation matrices using five traits (total cholesterol, high density lipoprotein cholesterol, triglycerides, systolic blood pressure, and body mass index) from the Framingham Heart Study data set prepared for the Genetic Analysis Workshop 13, clinic visits 10 and 1 for the original and offspring cohorts, respectively. We next applied factor analysis to summarize the relationship between these phenotypes.

Results

Factors generated from the genetic correlation matrix explained the most variation. Factors extracted using the other matrices followed a different pattern and suggest distinct effects.

Conclusions

Given these results, different methods of multivariate data reduction may provide unique clues on the clustering of this complex syndrome.

Background

The metabolic syndrome (MS) is a cluster of abnormalities including central obesity, abnormal glucose tolerance, elevated insulin and triglycerides, and depressed HDL-C [1–3]. Previous epidemiological studies have implicated common underlying factors influencing the clustering of this syndrome [4]. Yet, the metabolic, physiological, and genetic mechanisms responsible for this clustering have not been elucidated.

Because major genes involved in the etiology of common complex diseases are likely to exert an effect on multiple quantitative traits, statistical techniques that permit the joint analysis of correlated traits, such as factor analysis, may aid in analysis [5]. Using factor analysis, heritable clusters of MS traits have been identified based on phenotypic relationships [6, 7]. To our knowledge, no studies have used the genetic correlation matrix to construct factors for MS traits. Related to this, no studies have explored the use of a 'genome-wide' correlation matrix as an alternative to the phenotypic and genetic correlation matrices. Direct manipulation of the genetic and genomic correlation matrices could represent a powerful method for elucidating the genetic architecture of multiple complex traits. In this study, therefore, we investigated genetic influences on the aggregation of MS phenotypes by applying a uniform factor analytical method to phenotypic, genetic, and genome-wide ('genomic') LOD score correlation matrices using five phenotypic traits (total cholesterol (CHOL), high density lipoprotein cholesterol (HDL-C), triglycerides (TG), systolic blood pressure (SBP), and body mass index (BMI)) from the Framingham data set prepared for the Genetic Analysis Workshop 13 (GAW13).

Methods

Data

The Framingham Heart Study was initiated in 1948 and consisted of 5209 men and women between the ages of 30 and 62 recruited from Framingham, Massachusetts. The subjects returned every 2 years for a detailed medical history, physical examination, and laboratory tests. In 1971, a second-generation group consisting of 5124 of the original participants' adult children and their spouses was enrolled. Longitudinal data were available on SBP, height, weight, CHOL, HDL-C, TG, glucose, hypertensive treatment, hypertensive status, number of cigarettes smoked per day, and grams of alcohol per day. Although glucose was available, we were unable to control for diabetes status, and in the absence of this information the trait was not heritable (data not shown).

The following five phenotypes from the Framingham Heart Study were used to define MS: CHOL, HDL-C, TG, SBP, and BMI. We chose to focus on a single time point for all phenotypic variables. In the original cohort, we used clinic visit 10 because this is the first visit for which data on CHOL and HDL-C were collected. In the offspring cohort, we used clinic visit 1, at which all of the phenotypic data were available and had been collected during a similar timeframe. We also reasoned that by selecting these visits (as early as possible with the data of interest), we could maximize the number of participants included in our analyses. Outliers more than four standard deviations from the mean were dropped; only individuals having complete covariate data (age, sex, cohort, hypertensive treatment, hypertensive status, and smoking) were kept (n = 1648).

Genome-wide LOD correlations

Using the 330 extended families, heritabilities were estimated after adjustment for the above covariates. A variance component model implemented in the program package SOLAR [8], was used to generate multipoint identity-by-descent (IBD) matrices and genome-wide LOD scores. A LOD-score evaluation was performed every 10 centimorgans. Using SAS [9], we calculated a correlation matrix from the genome wide LOD scores.

Phenotypic and genetic correlation matrices

We used bivariate variance-component analysis to estimate the phenotypic, genetic, and environmental correlations between all pair-wise combinations of traits. This method has been described in detail elsewhere [10, 11]; but briefly, the phenotypic covariance is modeled so that the covariation between two individuals for two traits is given by a 2 × 2 covariance matrix with the elements defined by:

Ω_ab= 2Φρ_Gσ_gaσ_gb+ Iρ_Eσ_eaσ_eb, (1)

where a and b take the values of 1 or 2 and ρ_G and ρ_Eare the additive genetic and environmental correlations between the traits. The genetic correlation estimates the proportion of genes shared in common between the traits. This approach has been implemented in SOLAR version 2.0. The phenotypic correlation (ρ_P) is given by:

where

and are the heritabilities of the traits. These correlations were assembled into phenotypic and genotypic matrices for factor analysis.

Factor analysis

The genetic, phenotypic, and genomic correlation matrices were factor analyzed to summarize the relationships between the five phenotypes in the MS using SAS [10]. Orthogonal factors that are linear combinations of the original phenotypes are constructed that explain as much of the total variance in the original variables as possible. Factors were varimax rotated, and factor loadings of 0.40 or greater were used to interpret the factor structures [12, 13].

Results

Heritabilities were determined to be significant for BMI (38.7 ± 3.9), CHOL (41.5 ± 5.6), HDL-C (41.5 ± 5.6), TG (45.6 ± 5.7), and SBP (16.4 ± 3.5). The LOD scores for the genome scans of the traits are shown in Figure 1. Although there are several suggestive linkages, no LOD scores reach significance [14].

Tables 1 and 2 report the genetic, environmental, genomic, and phenotypic correlation matrices. The rotated factor loadings generated from the genetic, phenotypic, and genome-wide LOD score correlation matrices are summarized in Table 3. Factor loadings greater that or equal to 0.40 are indicated in bold type. The genetic correlation factors explained the most variation with factors G1 and G2 explaining, respectively, 31.3% and 24.7% of the total genetic variance. The pattern of loadings differs across the matrices examined, but in general the first factor in each group appears to be a magnitude axis (all loading in the same direction) with high loadings in each of the categories (lipids, fatness, and SBP). For the genetic and phenotypic factors, HDL-C loads in the opposite direction from the other variables, but given that decreased levels are a risk factor, this is to be expected. For the genomic factor, HDL-C loads in the same direction as the other variables because the genomic correlation is concerned only with whether genomic regions account for variability but not with direction of change.

Table 1 Genetic (above diagonal) and environmental (below diagonal) correlation matrices ± standard error.

Full size table

Table 2 Genomic (above diagonal) and phenotypic (below diagonal) correlation matrices.

Full size table

Table 3 Factor loadings from the genetic, phenotypic, and genomic LOD score correlation matrices.

Full size table

Discussion

Previously, factor analysis has been used to identify components underlying the MS through the construction of factors from phenotypic values. Because factor loadings from the genetic and phenotypic correlation matrices are distinct, however, reliance on phenotypic correlation alone may fail to disclose underlying genetic relationships.

In this study we constructed factors not only from phenotypic correlations, but also from the genetic and genome-wide LOD score correlations. Factors extracted from these correlations exhibited variable structure and suggest distinctive effects. With the exception of the second factor from the genome-wide LOD score correlation matrix, SBP loaded strongly on every factor. In other studies, however, SBP has not loaded strongly with other components of MS [6, 7]. Because we were unable to consider glucose or insulin, and because the properties of the variables chosen for analysis can unduly influence the results [15], it is not known whether SBP would remain as pivotal when considered in combination with glucose or insulin.

However, as the genetic correlations are estimated from a polygenic model with no major gene effects estimated, it is possible that the first factor from the genetic correlation matrix is simply summarizing the polygenic effects between the traits. Similarly, the second factor may summarize the QTL effects; indeed, the second factor of the genetic correlation matrix loads similarly to the genome-wide LOD score correlation matrix that summarizes the correlation of QTLs across the genome.

Conclusions

In summary, factors extracted using the phenotypic, genetic, and genome wide LOD score correlation matrices followed different patterns and may suggest distinct effects. Thus, these results imply that different methods of multivariate data reduction provide unique clues on the clustering of this complex syndrome.

References

Ferrannini E, Haffner SM, Mitchell BD, Stern MP: Hyperinsulinaemia: the key feature of a cardiovascular and metabolic syndrome. Diabetologia. 1991, 34: 416-422. 10.1007/BF00403180.
Article CAS PubMed Google Scholar
Despres JP, Lamarche B, Mauriege P, Cantin B, Dagenais GR, Moorjani S, Lupien PJ: Hyperinsulinemia as an independent risk factor for ischemic heart disease. N Engl J Med. 1996, 334: 952-957. 10.1056/NEJM199604113341504.
Article CAS PubMed Google Scholar
Mitchell BD, Kammerer CM, Mahaney MC, Blangero J, Comuzzie AG, Atwood LD, Haffner SM, Stern MP, MacCluer JW: Genetic analysis of the IRS. Pleiotropic effects of genes influencing insulin levels on lipoprotein and obesity measures. Arterioscler Thromb Vasc Biol. 1996, 16: 281-288.
Article CAS PubMed Google Scholar
Edwards KL, Austin MA, Newman B, Mayer E, Krauss RM, Selby JV: Multivariate analysis of the insulin resistance syndrome in women. Arterioscler Thromb. 1994, 14: 1940-1945.
Article CAS PubMed Google Scholar
Blangero J, Konigsberg LW: Multivariate segregation analysis using the mixed model. Genet Epidemiol. 1991, 8: 299-316. 10.1002/gepi.1370080503.
Article CAS PubMed Google Scholar
Edwards KL, Newman B, Mayer E, Selby JV, Krauss RM, Austin MA: Heritability of factors of the insulin resistance syndrome in women twins. Genet Epidemiol. 1997, 14: 241-253. 10.1002/(SICI)1098-2272(1997)14:3<241::AID-GEPI3>3.0.CO;2-8.
Article CAS PubMed Google Scholar
Arya R, Blangero J, Williams K, Almasy L, Dyer TD, Leach RJ, O'Connell P, Stern MP, Duggirala R: Factors of insulin resistance syndrome-related phenotypes are linked to genetic locations on chromosomes 6 and 7 in nondiabetic Mexican Americans. Diabetes. 2002, 51: 841-847. 10.2337/diabetes.51.3.841.
Article CAS PubMed Google Scholar
Almasy L, Blangero J: Multipoint quantitative-trait linkage analysis in general pedigrees. Am J Hum Genet. 1998, 62: 1198-1211. 10.1086/301844.
Article PubMed Central CAS PubMed Google Scholar
SAS Institute Inc.: SAS. 8.0 ed. Cary, NC, SAS Institute. 2001
Google Scholar
Hopper JL, Mathews JD: Extensions to multivariate normal models for pedigree analysis. Ann Hum Genet. 1982, 46: 373-383.
Article CAS PubMed Google Scholar
Blangero J, Williams-Blangero S, Kammerer CM, Towne B, Konigsberg LW: Multivariate genetic analysis of nevus measurements and melanoma. Cytogenet Cell Genet. 1992, 59: 179-181. 10.1159/000098938.
Article CAS PubMed Google Scholar
Kleinbaum D, Kupper L, Muller K: Applied Regression Analysis and Other Multivariate Methods. Boston, MA, Kent Publishing Co. 1988
Google Scholar
Stevens J: Applied Multivariate Statistics for Social Sciences. Mahwah, NJ, Lawrence, Erlbaum Associates. 1996
Google Scholar
Lander E, Kruglyak L: Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results. Nat Genet. 1995, 11: 241-247. 10.1038/ng1195-241.
Article CAS PubMed Google Scholar
Meigs JB: Invited commentary: insulin resistance syndrome? Syndrome X? Multiple metabolic syndrome? A syndrome at all? Factor analysis reveals patterns in the fabric of correlated metabolic risk factors. Am J Epidemiol. 2000, 152: 908-911. 10.1093/aje/152.10.908.
Article CAS PubMed Google Scholar

Download references

Acknowledgments

This contribution to GAW13 was supported by National Institutes of Health grants HL28972, HL45522, GM31575, and MH59490. This analysis was SOLAR powered. SOLAR is available at http://www.sfbr.org/sfbr/public/software/solar/index.html.

Author information

Authors and Affiliations

Center for Epidemiology and Biostatistics, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, 45229, USA
Lisa J Martin
Department of Epidemiology, University of North Carolina, Chapel Hill North, Carolina, 27514, USA
Kari E North
Department of Genetics, Southwest Foundation for Biomedical Research, San Antonio, Texas, 78245, USA
Tom Dyer, John Blangero, Anthony G Comuzzie & Jeff Williams

Authors

Lisa J Martin
View author publications
You can also search for this author in PubMed Google Scholar
Kari E North
View author publications
You can also search for this author in PubMed Google Scholar
Tom Dyer
View author publications
You can also search for this author in PubMed Google Scholar
John Blangero
View author publications
You can also search for this author in PubMed Google Scholar
Anthony G Comuzzie
View author publications
You can also search for this author in PubMed Google Scholar
Jeff Williams
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lisa J Martin.

Additional information

Authors' contributions

LM and KN performed statistical analyses and interpreted results. JW assisted in the interpretation of the results. TD calculated the IBDs. JB and AC participated in the design of the study. All authors read and approved the final manuscript.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Martin, L.J., North, K.E., Dyer, T. et al. Phenotypic, genetic, and genome-wide structure in the metabolic syndrome. BMC Genet 4 (Suppl 1), S95 (2003). https://doi.org/10.1186/1471-2156-4-S1-S95

Download citation

Published: 31 December 2003
DOI: https://doi.org/10.1186/1471-2156-4-S1-S95

Genetic Analysis Workshop 13: Analysis of Longitudinal Family Data for Complex Diseases and Related Risk Factors

Phenotypic, genetic, and genome-wide structure in the metabolic syndrome