Misplaced Pages

Bias (statistics): Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 03:18, 2 March 2010 edit70.80.234.196 (talk) Undid revision 346913975 by VolkovBot (talk) unexplained robot removal. articles exist and pretty much match this one← Previous edit Revision as of 13:35, 23 September 2010 edit undoEmble64 (talk | contribs)269 edits Added other types of bias (with examples) and a reference. Some rewording (hopefully it's clearer now).Next edit →
Line 1: Line 1:
In ], the term '''bias''' is used for describing several different concepts: In ], the term '''bias''' refers to several different concepts:
*], where there is an error in choosing the individuals or groups to take part in a ]. *], where individuals or groups are more likely to take part in a ] project than others, resulting in ]s. This can also be termed ''Berksonian bias''<ref>Rothman, K.J. ''et al.'' (2008) Modern epidemiology. ''Lippincott Williams & Wilkins'' pp.134-137.</ref>.
**] arises from evaluating diagnostic tests on biased patient samples, leading to an overestimate of the sensitivity and specificity of the test.
** A ], sometimes classified as a result of selection bias, is one in which some members of the population are more likely to be included than others.
***] consists of evaluating the ability of a diagnostic test in a biased group of patients, which leads to an overestimate of the sensitivity and specificity of the test.
* The ] is the difference between an estimator's expectation and the true value of the parameter being estimated. * The ] is the difference between an estimator's expectation and the true value of the parameter being estimated.
** ] is the bias that appears in estimates of parameters in a regression analysis when the assumed specification is incorrect, in that it omits an independent variable that should be in the model. ** ] is the bias that appears in estimates of parameters in a regression analysis when the assumed specification is incorrect, in that it omits an independent variable that should be in the model.
* In ], a test is said to be '''unbiased''' when the probability of rejecting the null hypothesis exceeds the significance level when the alternative is true ''and'' is less than or equal to the significance level when the null hypothesis is true. * In ], a test is said to be '''unbiased''' when the probability of rejecting the null hypothesis exceeds the significance level when the alternative is true ''and'' is less than or equal to the significance level when the null hypothesis is true.
* ] or ] are external influences that may affect the accuracy of statistical measurements. * ] or ] are external influences that may affect the accuracy of statistical measurements.
**Detection bias is where a phenomenon is more likely to be observed and/or reported for a particular set of study subjects. For instance, the ] involving ] and ] may mean doctors are more likely to look for diabetes in obese patients than in less overweight patients, leading to an inflation in diabetes among obese patients because of skewed detection efforts.
**Reporting bias involves a skew in the availability of data, such that observations of a certain kind may be more likely to be reported and consequently used in research.
* ] comes from the misuse of data mining techniques. * ] comes from the misuse of data mining techniques.


{{disambig}} {{disambig}}
{{Biases}} {{Biases}}

==References==
{{Reflist}}

] ]
] ]

Revision as of 13:35, 23 September 2010

In statistics, the term bias refers to several different concepts:

  • Selection bias, where individuals or groups are more likely to take part in a research project than others, resulting in biased samples. This can also be termed Berksonian bias.
    • Spectrum bias arises from evaluating diagnostic tests on biased patient samples, leading to an overestimate of the sensitivity and specificity of the test.
  • The bias of an estimator is the difference between an estimator's expectation and the true value of the parameter being estimated.
    • Omitted-variable bias is the bias that appears in estimates of parameters in a regression analysis when the assumed specification is incorrect, in that it omits an independent variable that should be in the model.
  • In statistical hypothesis testing, a test is said to be unbiased when the probability of rejecting the null hypothesis exceeds the significance level when the alternative is true and is less than or equal to the significance level when the null hypothesis is true.
  • Systematic bias or systemic bias are external influences that may affect the accuracy of statistical measurements.
    • Detection bias is where a phenomenon is more likely to be observed and/or reported for a particular set of study subjects. For instance, the syndemic involving obesity and diabetes may mean doctors are more likely to look for diabetes in obese patients than in less overweight patients, leading to an inflation in diabetes among obese patients because of skewed detection efforts.
    • Reporting bias involves a skew in the availability of data, such that observations of a certain kind may be more likely to be reported and consequently used in research.
  • Data-snooping bias comes from the misuse of data mining techniques.
Topics referred to by the same term Disambiguation iconThis disambiguation page lists statistics articles associated with the title Bias.
If an internal link led you here, you may wish to change the link to point directly to the intended article.
Biases
Cognitive biases
Statistical biases
Other biases
Bias reduction

References

  1. Rothman, K.J. et al. (2008) Modern epidemiology. Lippincott Williams & Wilkins pp.134-137.
Categories: