|
|
Overview
Partial correlation still requires meeting all the usual assumptions of Pearsonian correlation: linearity of relationships, the same level of relationship throughout the range of the independent variable ("homoscedasticity"), interval or near-interval data, and data whose range is not truncated. Partial correlation is common when there is only one control variable but is sometimes used when there are two or three. For large models, researchers use path analysis or structural equation modeling when data are near or at interval level, or use log-linear modeling for lower-level data. Newer versions of structural equation modeling software allow variables of any type on either side of the equation.
|
|
For pedagogical purposes, we consider two hypotheses: (H1) that property crimes spread over into violent crimes, even controlling for population; and (H2) that warmer temperatures are associated with more violent crimes, even controlling for population. We ignore problems associated with small sample size, lack of random sampling, lack of time series data, possible model misspecification due to omitted variables, and not testing procedure assumptions such as linearity, normality, and homoscedasticity.
Temperature has a low bivariate correlation with violent crime (.106) but after population is controlled, a moderate correlation appears (.304). This indicates population suppresses the correlation of temperature with violent crime. The population variable operates positively on violent crime (warmer cities do have more crime) but negatively on temperature (the cooler North has more large cities, and larger cities have more crime). In a hypothetical world in which all cities were of equal size, temperature would have a higher bivariate correlation with violent crime than it does in the real world, and in this sense H2 is also supported.
In the slightly more complex example below, new residuals (vc_resid2 and pc_resid2) are computed for violent crime count (vc_cnt) and property crime count (pc_cnt) respectively. Then a partial correlation is run of vc_cnt and pc_cnt controlling for population and temperature. Also, a regression is run predicting vc_resid2 from pc_resid2. Again, the partial correlation is identical to the regression beta. That is, the residual method is equivalent to the partial correlation method.
The upper regression in the figure below predicts violent crime rate from property crime rate. The lower regression predicts violent crime rate from property crime rate and temperature. For these data, the rate method correctly shows a moderate effect of property crime on violent crime and markedly lower effect of temperature on violent crime. However, due to the confounding effect of population being in the denominator on both sides of the equation, the beta weights for the predictor variables are attenuated compared to the partial correlation or residuals methods. This is to be expected since confounding lowers reliability and lower reliability leads to attenuation.
The regression method most clearly reveals the relative sizes of the predictor effects compared to the control variable (population) effect, but why are the predictor effect size measures (beta weights) lower when population is controlled in the regression method than in the residuals or partial correlation methods? This is because partial correlation and regression with manually residualized variables (as in the residual method above) are partial coefficients, removing the control (here, population) from both the independent and dependent variables. Regression coefficients for non-residualized variables are semi-partial (part) coefficients, discussed in the section which follows. Semi-partial coefficients apply the control variable only the the predictor side of the equation. Why partial coefficients are always larger than the corresponding semi-partial coefficients is discussed below.
A second reason why effect sizes appear lower in the regression method than in the partial correlation and residual methods illustrated above is that the regression method uses all other variables on the predictor side of the equation as controls. That is, in the partial correlation and residuals method examples above, only population was treated as a control. In the regression method, both population and property crime were applied as controls for temperature, and both population and temperature were applied as controls for property crime. As property crime was well correlated with violent crime, once property crime was applied as a control on temperature, the residual of temperature had almost zero correlation with violent crime. Its near-zero beta weight might lead the researcher to think temperature was of no effect. Actually as partial correlation showed, temperature had a moderate correlation with violent crime even after controlling population. The researcher must keep clearly in mind that the regression method uses all other predictor variables as controls (here, property crimes as well as population). Temperature has an effect on violent crime when population is the control but not when both population and property crimes are controls.
Partial correlation, at least when there are not too many predictors, can support the use of one, more than one, or all predictors as controls. Regression supports only use of all predictors as controls. Whether the researcher wants partial or semi-partial coefficients depends on whether the researcher's model suggests all predictors are controls or not. A variable is a control variable if it is modeled as an anteceding cause of a predictor or if it is an intervening cause between the predictor and the dependent. Consider the diagram below:
Model A would perhaps be the intuitive initial model of a researcher for the example discussed in this section. Population is seen as an anteceding cause of both property crime and the dependent, violent crime: the larger the city, the more of each type of crime because there are apt to be more criminals and more targets. Temperature is initially seen as an independent predictor unrelated to population, such that higher temperatures are associated with more violent crime, perhaps because the researcher speculates that warmer weather brings people out and creates more opportunities for violent crime and because hot days may increase stress. Model A thus assumes a single control variable and is appropriately tested by partial correlation.
Model B corresponds to the regression model case, where all predictors are modeled as control variables. Property crime still has population as an anteceding cause of it and the dependent, violent crime, but here property crime has temperature as an anteceding cause of it and the dependent as well. For population, both temperature and property crime are intervening causes between it and the dependent, and hence are controls. For temperature, population is an anteceding cause of it and the dependent, and property crime is an intervening variable between it and the dependent, and thus a control. That is, Model B would justify each predictor variable being a control for each other predictor variable. As such, the regression method of controlling is appropriate, though the partial correlation and residual methods could be adapted to support Model B as well.
Where partial correlation is the correlation of the independent and dependent variables after controlling both for control variables, semi-partial or part correlation is the correlation the independent with the dependent, controlling only the independent variable for control variables. Use partial correlation if research interest is in explaining unique variance in the dependent after both it and the given independent variable are controlled for other predictors in the model. Use semi-partial correlation if research interest is in explaining total variance in the dependent after the independent is controlled for other predictors in the model.
In multiple regression, the squared part correlation is the proportion of the total variance in the dependent variable accounted for by adding the given independent variable to those already entered in the multiple regression formula. Put another way, the squared semi-partial (part) correlation represents the percent of total variance in the dependent variable explained by the given predictor variable, over and beyond other predictors in the model.
Semi-partial or part correlation is the basis for multiple regression. Regression coefficients are semi-partial coefficients. Standardized regression coefficients (beta weights) are semi-partial (part) correlations. Let job satisfaction (J) be the dependent. Let education (E) be the independent and let salary (S) be the control variable. The partial correlation of E with J controlling for S is written rJE.S and when squared is interpreted as the percent of unique variance in J uniquely accounted for by E, after both J and E are controlled by S. Partial correlation is thus the correlation of the residual of J with the residual of E. The semi-partial correlation of E with J controlling for S is written rJ(E.S) and when squared is interpreted as the percent of total (unique plus joint) variance in J uniquely accounted for by E and not by S. Thus semi-partial correlation is the correlation of the residual of E with unadjusted J.
In the example above, for 17 selected US cities, violent crime rate is predicted from population size and summer mean temperature. In SPSS, select Analyze, Regression, Linear; then click the Statistics button and check "Estimates" and "Part and partial correlations". The standardized regression coefficients (beta) are equal to the semi-partial (part) correlations. The partial correlations are always higher than the corresponding part correlations. That the partial and part correlations are higher than the zero-order correlation indicates a suppression effect, as discussed earlier in general and for the same dataset above. As with other correlation coefficients, those above are squared prior to interpretation. For instance, population explains .2992 % of the unique variance in violent crime rate and .2932 % of the total variance in violent crime rate, controlling for other variables in the model......provided the model is correctly specified and other assumptions are met.
For a given independent variable (IV), part correlation first removes from that IV all variance which may be accounted for by control IVs (ex., other IVs in a regression model), then correlates the remaining unique component of the IV with the dependent variable (DV). Part correlation will always be less than the partial correlation, except that it will be equal if the control variable is unrelated to the IV. Partial correlation, in contrast, removes from both the given IV and the DV all variance accounted for by the control IVs, then correlates the unique component of the IV with the unique component of the DV. That is, the common variance of the control variables is removed from just the independent variable in part correlation, whereas in partial correlation it is removed from both the independent and dependent variables. Partial correlation is always larger than the corresponding part correlation because in partial correlation, variance is removed from the DV.
| Gender Record |
Male | Female |
|---|---|---|
| Arrest | a | b |
| No arrest | c | d |
Yule's Q = (ad - bc)/(ad + bc). If our hypothesis is that males are more likely to have arrest records, then ad is concordant pairs (pairs consistent with out hypothesis) and bc is discordant pairs (inconsistent with our hypothesis). Thus Yule's Q is the surplus of concordant over discordant pairs, as a percentage of all pairs (not counting tied pairs like ab and cd). That is, Yule's Q represents the probability that, when we draw two units (a pair) from our population excluding ties, that pair will be consistent with our hypothesis.
Partial Q is simply Q for those pairs of i and j that are tied on a dichotomous control variable, k. Let k be high school diploma/no diploma, in the table below:
| Diploma | Gender Record |
Male | Female |
|---|---|---|
| Arrest | a | b |
| No arrest | c | d | No diploma | Gender Record |
Male | Female |
| Arrest | a' | b' |
| No arrest | c' | d' |
Qij.k = [ad-bc)+(a'd'-b'c')]/[(ad+bc)+(a'd'+b'b')]
Partial association may be constructed in an analogous manner for other measures of association, such as gamma for ordinal data (Yule's Q is gamma for the 2x2 case).
Copyright 1998, 2008, 2009 by G. David Garson.
Last update, 5/2/2009.