Misplaced Pages

Bias (statistics): Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 08:16, 19 November 2013 editOmnipaedista (talk | contribs)Autopatrolled, Extended confirmed users, Pending changes reviewers242,204 edits edited formatting← Previous edit Revision as of 22:53, 16 March 2014 edit undoVelatrix (talk | contribs)Extended confirmed users4,781 edits Mowing down huge parts of the text that don't say anythingNext edit →
Line 1: Line 1:
{{refimprove|date=June 2012}} {{refimprove|date=June 2012}}
A ] is '''biased''' if it is calculated in such a way that is systematically different from the ] of interest. The following lists some types of, or aspects of, bias which should not be considered mutually exclusive: A ] is '''biased''' if it is calculated in such a way that it is systematically different from the ] of interest. The following lists some types of biases, which can overlap.
*], where individuals or groups are more likely to take part in a ] project than others, resulting in ]s. This can also be termed ''Berksonian bias''.<ref>Rothman, K.J. ''et al.'' (2008) Modern epidemiology. ''Lippincott Williams & Wilkins'' pp.134-137.</ref> *],involves individuals being more likely to be selected for study than others, ]. This can also be termed ''Berksonian bias''.<ref>Rothman, K.J. ''et al.'' (2008) Modern epidemiology. ''Lippincott Williams & Wilkins'' pp.134-137.</ref>
**] arises from evaluating diagnostic tests on biased patient samples, leading to an overestimate of the ] of the test. **] arises from evaluating diagnostic tests on biased patient samples, leading to an overestimate of the ] of the test.
* The ] is the difference between an estimator's expectations and the true value of the parameter being estimated. * The ] is the difference between an estimator's expectations and the true value of the parameter being estimated.
** ] is the bias that appears in estimates of parameters in a regression analysis when the assumed specification is incorrect, in that it omits an independent variable that should be in the model. ** ] is the bias that appears in estimates of parameters in a regression analysis when the assumed specification omits an independent variable that should be in the model.
* In ], a test is said to be '''unbiased''' when the probability of rejecting the null hypothesis is less than or equal to the significance level when the null hypothesis is true, and the probability of rejecting the null hypothesis is greater than or equal to the significance level when the alternative hypothesis is true, * In ], a test is said to be '''unbiased''' when the probability of committing a type I error is less than the significance level, and that of getting a true positive (rejecting the null hypothesis when the alternative hypothesis is true) is at least that of the significance level.
* Detection bias is where a phenomenon is more likely to be observed and/or reported for a particular set of study subjects. For instance, the ] involving ] and ] may mean doctors are more likely to look for diabetes in obese patients than in less overweight patients, leading to an inflation in diabetes among obese patients because of skewed detection efforts. * Detection bias occurs when a phenomenon is more likely to be observed for a particular set of study subjects. For instance, the ] involving ] and ] may mean doctors are more likely to look for diabetes in obese patients than in thinner patients, leading to an inflation in diabetes among obese patients because of skewed detection efforts.
* ] may lead to selection of outcomes, test samples, or test procedures that favor a study's financial sponsor. * ] may lead to selection of outcomes, test samples, or test procedures that favor a study's financial sponsor.
* ] involves a skew in the availability of data, such that observations of a certain kind may be more likely to be reported and consequently used in research. * ] involves a skew in the availability of data, such that observations of a certain kind are more likely to be reported.
* ] comes from the misuse of data mining techniques. * ] comes from the misuse of data mining techniques.
* ] arise due to the way that the results are evaluated. * ] arise due to the way that the results are evaluated.

Revision as of 22:53, 16 March 2014

This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Bias" statistics – news · newspapers · books · scholar · JSTOR (June 2012) (Learn how and when to remove this message)

A statistic is biased if it is calculated in such a way that it is systematically different from the population parameter of interest. The following lists some types of biases, which can overlap.

  • Selection bias,involves individuals being more likely to be selected for study than others, biasing the sample. This can also be termed Berksonian bias.
  • The bias of an estimator is the difference between an estimator's expectations and the true value of the parameter being estimated.
    • Omitted-variable bias is the bias that appears in estimates of parameters in a regression analysis when the assumed specification omits an independent variable that should be in the model.
  • In statistical hypothesis testing, a test is said to be unbiased when the probability of committing a type I error is less than the significance level, and that of getting a true positive (rejecting the null hypothesis when the alternative hypothesis is true) is at least that of the significance level.
  • Detection bias occurs when a phenomenon is more likely to be observed for a particular set of study subjects. For instance, the syndemic involving obesity and diabetes may mean doctors are more likely to look for diabetes in obese patients than in thinner patients, leading to an inflation in diabetes among obese patients because of skewed detection efforts.
  • Funding bias may lead to selection of outcomes, test samples, or test procedures that favor a study's financial sponsor.
  • Reporting bias involves a skew in the availability of data, such that observations of a certain kind are more likely to be reported.
  • Data-snooping bias comes from the misuse of data mining techniques.
  • Analytical bias arise due to the way that the results are evaluated.
  • Exclusion bias arise due to the systematic exclusion of certain individuals from the study.

References

  1. Rothman, K.J. et al. (2008) Modern epidemiology. Lippincott Williams & Wilkins pp.134-137.
Biases
Cognitive biases
Statistical biases
Other biases
Bias reduction
Categories: