- Open Access
Bivariate variance-component analysis, with application to systolic blood pressure and total cholesterol levels in the Framingham Heart Study
BMC Genetics volume 4, Article number: S81 (2003)
The correlations between systolic blood pressure (SBP) and total cholesterol levels (CHOL) might result from genetic or environmental factors that determine variation in the phenotypes and are shared by family members. Based on 330 nuclear families in the Framingham Heart Study, we used a multivariate normal model, implemented in the software FISHER, to estimate genetic and shared environmental components of variation and genetic and shared environmental correlation between the phenotypes. The natural logarithm of the phenotypes measured at the last visit in both Cohort 1 and 2 was used in the analyses. The antihypertensive treatment effect was corrected before adjustment of the systolic blood pressure for age, sex, and cohort.
The univariate correlation coefficient was statistically significant for sibling pairs and parent-offspring pairs, but not significant for spouse pairs. In the bivariate analysis, the cross-trait correlation coefficients were not statistically significant for all relative pairs. The shared environmental correlation was statistically significant, but the genetic correlation was not significant.
There is no significant evidence for a close genetic correlation between systolic blood pressure and total cholesterol levels. However, some shared environmental factors may determine the variation of both phenotypes.
In an early analysis of the Framingham Heart Study data, it was observed that the modest correlation in blood pressure between spouses was reduced when blood pressure was adjusted for weight . This finding suggested that environmental factors shared by spouses might influence blood pressure and related phenotypes. In an analysis based on data from the Victorian Family Heart Study , Cui and colleagues used cross-trait correlation and bivariate variance component analysis techniques to investigate the genetic and shared environmental correlations between systolic blood pressure (SBP) and diastolic blood pressure (DBP) and between SBP and body mass index (BMI) . They found that the same genes and many of the same family living environmental factors determine variation in both SBP and DBP. On the other hand, SBP and BMI share genetic and family environmental factors to a lesser degree.
In this article, we applied similar analysis techniques to investigate the genetic and shared environmental components of variance between SBP and total cholesterol levels (CHOL) based on data from the Framingham Heart Study.
A total of 330 nuclear families, comprising 1776 individuals (879 males and 897 females), were used in this analysis. When there are multiple nuclear families in a pedigree, a nuclear family that has at least one offspring in Cohort 2 was randomly selected for the analysis. We used the phenotypes of SBP and CHOL, which were measured at the last visit, and transformed them into the logarithms. This ensured that their distributions were close to the normal distribution. We also used the corresponding ages at the last visit in the analysis. A total of 72 parents (31 fathers and 42 mothers) did not have any information on age and the phenotypes. This accounted for 4% (72/1776) of all individuals. These parents did not receive any antihypertensive treatment. There are no missing values in age and the phenotypes in the offspring generation. We analyzed the data with and without imputation. To correct for the antihypertensive treatment effect, we added 10 mm Hg to the measured SBP for the treated subjects and used the original measurements for the untreated subjects [3, 4]. We used the software Stata  and S.A.G.E.  to calculate the descriptive statistics and check the pedigree data.
Statistical analyses were carried out under a multivariate normal model for pedigree analysis using the software FISHER [7, 8]. The mean phenotypes were adjusted for age, sex, and their interaction in each cohort. The variances and correlations between relatives were estimated under maximum likelihood theory. Standard errors and 95% confidence intervals were calculated by using the large sample normal approximation . The variance of each phenotype Y was given by σa2 + σse2 + σe2, where σa2, σse2, and σ e 2 are the genetic, shared environmental, and individual-specific environmental variances, respectively.
Univariate variance component analyses were conducted for each phenotype separately. The covariance between a pair of individuals depends on the type of relationship. For non-spouse relationships the covariance is given by 2φσ a 2 + γσ se 2, where φ is the kinship coefficient between the two individuals and γ (-1 ≤ γ ≤ 1) is the coefficient for shared environmental effect. We estimated the coefficients for sibling pairs, γ Sib , and for parent-offspring pairs, γ PO , from the data. The covariance between spouses was modeled separately by ρ SP (σ a 2 + σ se 2 + σ e 2), where ρ SP is the correlation coefficient between spouses for that trait.
As an extension to the univariate analysis, bivariate variance component analysis considers correlation of two phenotypes simultaneously. The covariance of two phenotypes for an individual was given by ρ a σa1σa2 + ρ se σse1σse2 + ρ e σe1σe2, where ρ a , ρ se , and P e are the correlation coefficients for genetic, shared environmental, and individual environmental effect between the phenotypes, respectively. Correspondingly, σ ai , σ sei , and σ ei are the genetic, shared environmental, and individual-specific environmental standard deviations of phenotype Y i (i = 1, 2).
The covariance of the two phenotypes between a pair of individuals was parameterized in terms of the different types of relationships. For non-spouse relationship, the covariance was 2φρ a σa1σa2 + γ1γ2ρ se σse1σse2, where γ i is the coefficient for shared environment of phenotype Yi(i = 1, 2). As for univariate analyses, we estimated γ Sib and γ PO separately from the data for sibling and parent-offspring relationships, respectively. For spouse-spouse relationship, the covariance was given by RSPσ1σ2, where σ i is the total standard deviation of phenotype Y i (i = 1, 2) and R SP is the cross-trait correlation coefficient between spouses.
Based on data before imputing the missing values, the mean age was 72.5 years (SD 9.3) for the parent cohort and 48.9 years (SD 10.8) for the offspring cohort. Within each cohort, there was no significant difference in age between males and females. Systolic blood pressure (SBP) was significantly higher in the parental cohort, who have a mean of 148 mm Hg (SD23.9), than in the offspring cohort, who have a mean of 124 mm Hg (SD 19.9), based on two-sample t-test (P < 0.001). Within each cohort, there was no significant difference in SBP between males and females. Total cholesterol levels (CHOL) were significantly higher (P < 0.001) in parents who have a mean of 219 mg/dl (SD 41.0) than in offspring who have a mean of 119 mg/dl (SD 37.2). Among the parents, total cholesterol levels were significantly higher (P = 0.001) in mothers who have a mean of 229 mg/dl (SD 40.6) than in fathers who have a mean of 209 mg/dl (SD 39.1). There was no statistically significant difference in CHOL between males and females in the offspring cohort.
Table 1 shows univariate correlation coefficients and their 95% confidence intervals for ln(SBP) and ln(CHOL) in different relative pairs. The magnitude of the correlations was greatest for sibling-sibling pairs in both phenotypes, and least for spouse pairs. These differences between different relative pairs reflect the different impacts of genetic and environmental effects on these phenotypes. Siblings share more living environments than their parents; whereas parents and offsprings are more genetically related than the spouses. The correlations between sibling pairs and between parent-offspring pairs were statistically significant because the confidence intervals did not contain 0. However, the correlation between spouse-spouse pairs was not statistically significant for either phenotype. Table 2 shows the univariate genetic and shared environmental components of variance and 95% confidence intervals for ln(SBP) and ln(CHOL). For ln(SBP), the genetic and shared environmental components of variance accounts for 13% and 11% of the total variance, respectively. The remaining 76% was accounted for by individual-specific factors. For ln(CHOL), a greater proportional of total variance (25%) comes from the shared environmental component, and the genetic and individual-specific components accounted for 5% and 70%, respectively. For shared environmental factors, the coefficients were greater among siblings (γ Sib ) than among parent and offsprings (γ PO ) in both phenotypes. This suggests that individuals within the same generation share more environmental factors than those between generations.
Table 3 shows the within-individual correlation between ln(SBP) and ln(CHOL), which was 0.24 (95% CI, 0.20–0.29), and the cross-trait correlation for different relative pairs. The cross-trait correlations were not statistically significant for all relative pairs because the confidence intervals contain 0. The correlation coefficient was not significantly higher in parent-offspring pairs than in spouse-spouse pairs. This suggests that there may not be common genetic factors determining both ln(SBP) and ln(CHOL). Table 3 also shows the genetic and shared environmental correlation coefficients between ln(SBP) and ln(CHOL). There seems to be no genetic correlation between these two phenotypes because of the confidence interval containing 0. However, the shared environmental correction was statistically significant, being 0.325 (95% CI, 0.075–0.576). This suggests that some shared family environmental factors determine variation of both phenotypes.
Imputation of missing values
There are 72 parents with missing values (the two phenotypes and the corresponding age). We imputed these missing values, re-analyzed the data, and compared the results with the above analyses when the missing values were not imputed. The missing value of the parent's phenotypes were generated according to cohort- and sex-specific normal distribution of relevant non-missing phenotypes. The missing age was calculated according to the spouse's non-missing age, which is available for at least one spouse. The husband's age was imputed by adding 2.8 years onto the wife's age, and vice versa. No significant difference in the mean SBP and CHOL in each cohort and gender category was found before and after the imputation. For example, the mean SBP of the parents was 147.3 mm Hg (SD 23.85) after the imputation compared with 147.7 mm Hg (SD 23.87) before the imputation. Correlation and variance component estimates from both univariate and bivariate analyses were very similar to those given in Table 1,2,3. Because only 4% of individuals have missing values, the imputation did not have large impact on the analysis results.
Different statistical analysis methods were used in this paper to investigate the genetic and shared environmental correlations between ln(SBP) and ln(CHOL) based on nuclear families from the Framingham Heart Study. We found significant correlation in sibling-sibling pairs and parent-offspring pairs for ln(SBP) and also for ln(CHOL). No significant correlation was found in spouse-spouse pairs (Table 1). Both univariate and bivariate analyses showed no significant evidence for a close genetic correlation between ln(SBP) and ln(CHOL), but there seems to be some common environmental determinants for both phenotypes. The variation of these two phenotypes may be determined by different genes and by some shared family living environment. This is similar to observations of Cui et al.  with regards to SBP and BMI.
However, the two measurements of blood pressure (SBP and DBP) shared both genetic and environmental factors . The Framingham Heart Study involved families that were relatively older than the population studied by Cui et al. . The mean age of that population was 53.8 years (SD 6.3) for the parent and 24.0 years (SD 3.7) for the offspring generation. The sample size of the previous study (767 families) is more than twice the size of this analysis. Another important difference between these two studies is the availability of zygosity data on twins in the study. There is no information about whether some of the siblings are twins in this study. However, the previous study includes 66 monozygotic twins and 84 dizygotic twins. These differences may contribute significantly to the power of this analysis to investigate the genetic and shared environmental correlations and variance components.
Cui et al.  have discussed different methods for adjusting hypertensive treatment effects. A fixed value, such as 10 mm Hg, can be added to the measured SBP if a person receives antihypertensive treatment. In contrast, a randomly generated SBP value may be substituted for the measured phenotype for medicated individuals, or the medicated subjects may be totally excluded in variance component and linkage analyses [11, 12]. Cui and colleagues found that adding back an appropriate increment of pressure restores familial components and increases the power of genomic linkage analyses to detect quantitative trait loci .
From this investigation, we did not find any significant evidence to suggest that variation in systolic blood pressure and total cholesterol levels is determined by the same genes. However, there seems to be some common family living environment factors determining the variation in both phenotypes. This information is important in considering analysis methods and components of variance for linkage analysis to discover new genes for SBP using the Framingham Heart Study data.
Hanis CL, Sing CF, Clarke WR, Schrott HG: Multivariate models for human genetic analysis: aggregation, coaggregation and tracking of systolic blood pressure and weight. Am J Hum Genet. 1983, 35: 1196-1210.
Harrap SB, Stebbing M, Hopper JL, Hoang HN, Giles GG: Familial patterns of covariation for cardiovascular risk factors in adults: the Victorian Family Heart Study. Am J Epidemiol. 2000, 152: 704-715. 10.1093/aje/152.8.704.
Cui JS, Hopper JL, Harrap SB: Genes and family environment explain correlations between blood pressure and body mass index. Hypertension. 2002, 40: 7-12. 10.1161/01.HYP.0000022693.11752.E9.
Neaton JD, Grimm RH, Prineas RJ, Stamler J, Grandits GA, Elmer PJ, Cutler JA, Flack JM, Schoenberger JA, McDonald R: Treatment of mild hypertension study. Final results. Treatment of mild hypertension study research group. JAMA. 1993, 270: 713-724. 10.1001/jama.270.6.713.
StataCorp: Stata Statistical Software: Release 7.0. College Station, Stata Corporation. 2001
Case Western Reserve University: S.A.G.E.: Statistical Analysis for Genetic Epidemiology, S.A.G.E. 4.2. Cleveland, OH, Case Western Reserve University. 2002
Lange K, Weeks D, Boehnke M: Programs for pedigree analysis: MENDEL, FISHER, and dGENE. Genet Epidemiol. 1988, 5: 471-472. 10.1002/gepi.1370050611.
Hopper JL, Mathews JD: A multivariate normal model for pedigree and longitudinal data and the software "FISHER". Austral J Statist. 1994, 36: 153-176.
Mukhopadhyay N: Probability and Statistical Inference. New York, Marcel Dekker, Inc. 2000, 560-563.
Cui JS, Hopper JL, Harrap SB: Antihypertensive treatments obscure familial contributions to blood pressure variation. Hypertension. 2003, 41: 207-210. 10.1161/01.HYP.0000044938.94050.E3.
Hunt SC, Elison C, Atwood LD, Pankow JS, Province MA, Leppert MF: Genome scans for blood pressure and hypertension. The National Heart, Lung, and Blood Institute Family Heart Study. Hypertension. 2002, 40: 1-6. 10.1161/01.HYP.0000022660.28915.B1.
Lyle JP: Loosening the cuff – important new advances in modeling antihypertensive treatment effects in genetic studies of hypertension. Hypertension. 2003, 2003: 197-198.
This research is supported by the University of Melbourne Early Career Researcher Grant Scheme and the Melbourne Research Grant Scheme, as well as by the Australian National Health and Medical Research Council (NHMRC) project grant 251610. We thank Jiami Zhang and Xia Zhang and for their efficient technical assistance. Some of the results reported were obtained using the program package S.A.G.E., which is supported by a U.S. Public Health Services Resource grant (RR03655) from the National Center for Research Resources.
About this article
Cite this article
Cui, J.S., Sheffield, L.J. Bivariate variance-component analysis, with application to systolic blood pressure and total cholesterol levels in the Framingham Heart Study. BMC Genet 4, S81 (2003) doi:10.1186/1471-2156-4-S1-S81
- Systolic Blood Pressure
- Total Cholesterol Level
- Framingham Heart Study
- Sibling Pair
- Relative Pair