A variable may be thought to alter the dependent or independent variables, but may not actually be the focus of the experiment. So that the variable will be kept constant or monitored to try to minimize its effect on the experiment. Such variables may be designated as either a "controlled variable", "
control variable", or "fixed variable".
Extraneous variables are candidate independent variables which may be included in a
regression analysis to aid a researcher with accurate response parameter estimation,
prediction, and
goodness of fit, but are not of substantive interest to the
hypothesis under examination. For example, in a study examining the effect of post-secondary education on lifetime earnings, some extraneous variables might be gender, ethnicity, social class, genetics, intelligence, age, and so forth. A variable is extraneous only when it can be assumed (or shown) to influence the
dependent variable. If included in a regression, it can improve the
fit of the model. If it is excluded from the regression and if it has a non-zero
covariance with one or more of the independent variables of interest, its omission will
bias the regression's result for the effect of that independent variable of interest. This effect is called
confounding or
omitted variable bias; in these situations, design changes and/or controlling for a variable are necessary. Extraneous variables are often classified into three types: • Subject variables, which are the characteristics of the individuals being studied that might affect their actions. These variables include age, gender, health status, mood, background, etc. • Blocking variables or experimental variables are characteristics of the persons conducting the experiment which might influence how a person behaves. Gender, the presence of racial discrimination, language, or other factors may qualify as such variables. • Situational variables are features of the environment in which the study or research was conducted, which have a bearing on the outcome of the experiment in a negative way. Included are the air temperature, level of activity, lighting, and time of day. In modelling, variability that is not covered by the independent variable is designated by e_I and is known as the "
residual", "side effect", "
error", "unexplained share", "residual variable", "disturbance", or "tolerance". ==Examples==