Bias (statistics): Difference between revisions

Browse history interactively ← Previous edit Next edit →Content deleted Content addedVisual WikitextInline

Revision as of 08:16, 19 November 2013 editOmnipaedista (talk \| contribs)Autopatrolled, Extended confirmed users, Pending changes reviewers242,204 edits edited formatting← Previous edit		Revision as of 22:53, 16 March 2014 edit undoVelatrix (talk \| contribs)Extended confirmed users4,781 edits Mowing down huge parts of the text that don't say anythingNext edit →
Line 1:		Line 1:
	{{refimprove\|date=June 2012}}		{{refimprove\|date=June 2012}}
	A ] is '''biased''' if it is calculated in such a way that is systematically different from the ] of interest. The following lists some types of, ~~or aspects of~~, ~~bias~~ which ~~should~~ ~~not be considered mutually exclusive:~~		A ] is '''biased''' if it is calculated in such a way that it is systematically different from the ] of interest. The following lists some types of biases, which can overlap.
	*], ~~where~~ individuals ~~or groups are~~ more likely to ~~take~~ ~~part~~ in ~~a ] project~~ than others, ~~resulting~~ in ~~]s. This can also be termed ''Berksonian bias''.<ref>Rothman, K.J. ''et al.'' (2008) Modern epidemiology. ''Lippincott Williams & Wilkins'' pp.134-137.</ref>~~		*],involves individuals being more likely to be selected for study than others, ]. This can also be termed ''Berksonian bias''.<ref>Rothman, K.J. ''et al.'' (2008) Modern epidemiology. ''Lippincott Williams & Wilkins'' pp.134-137.</ref>
	**] arises from evaluating diagnostic tests on biased patient samples, leading to an overestimate of the ] of the test.		**] arises from evaluating diagnostic tests on biased patient samples, leading to an overestimate of the ] of the test.
	* The ] is the difference between an estimator's expectations and the true value of the parameter being estimated.		* The ] is the difference between an estimator's expectations and the true value of the parameter being estimated.
	** ] is the bias that appears in estimates of parameters in a regression analysis when the assumed specification ~~is incorrect, in that it~~ omits an independent variable that should be in the model.		** ] is the bias that appears in estimates of parameters in a regression analysis when the assumed specification omits an independent variable that should be in the model.
	* In ], a test is said to be '''unbiased''' when the probability of ~~rejecting~~ ~~the~~ ~~null~~ ~~hypothesis~~ is less than ~~or equal to~~ the significance level ~~when~~ ~~the~~ ~~null~~ ~~hypothesis~~ is true, ~~and~~ the ~~probability~~ of ~~rejecting~~ the ~~null~~ hypothesis is ~~greater~~ ~~than~~ or ~~equal~~ to the significance level ~~when the alternative hypothesis is true,~~		* In ], a test is said to be '''unbiased''' when the probability of committing a type I error is less than the significance level, and that of getting a true positive (rejecting the null hypothesis when the alternative hypothesis is true) is at least that of the significance level.
	* Detection bias is ~~where~~ a phenomenon is more likely to be observed ~~and/or reported~~ for a particular set of study subjects. For instance, the ] involving ] and ] may mean doctors are more likely to look for diabetes in obese patients than in ~~less overweight~~ patients, leading to an inflation in diabetes among obese patients because of skewed detection efforts.		* Detection bias occurs when a phenomenon is more likely to be observed for a particular set of study subjects. For instance, the ] involving ] and ] may mean doctors are more likely to look for diabetes in obese patients than in thinner patients, leading to an inflation in diabetes among obese patients because of skewed detection efforts.
	* ] may lead to selection of outcomes, test samples, or test procedures that favor a study's financial sponsor.		* ] may lead to selection of outcomes, test samples, or test procedures that favor a study's financial sponsor.
	* ] involves a skew in the availability of data, such that observations of a certain kind ~~may be~~ more likely to be reported ~~and consequently used in research~~.		* ] involves a skew in the availability of data, such that observations of a certain kind are more likely to be reported.
	* ] comes from the misuse of data mining techniques.		* ] comes from the misuse of data mining techniques.
	* ] arise due to the way that the results are evaluated.		* ] arise due to the way that the results are evaluated.

Revision as of 22:53, 16 March 2014

This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Bias" statistics – news · newspapers · books · scholar · JSTOR (June 2012) (Learn how and when to remove this message)

A statistic is biased if it is calculated in such a way that it is systematically different from the population parameter of interest. The following lists some types of biases, which can overlap.

Selection bias,involves individuals being more likely to be selected for study than others, biasing the sample. This can also be termed Berksonian bias.
- Spectrum bias arises from evaluating diagnostic tests on biased patient samples, leading to an overestimate of the sensitivity and specificity of the test.
The bias of an estimator is the difference between an estimator's expectations and the true value of the parameter being estimated.
- Omitted-variable bias is the bias that appears in estimates of parameters in a regression analysis when the assumed specification omits an independent variable that should be in the model.
In statistical hypothesis testing, a test is said to be unbiased when the probability of committing a type I error is less than the significance level, and that of getting a true positive (rejecting the null hypothesis when the alternative hypothesis is true) is at least that of the significance level.
Detection bias occurs when a phenomenon is more likely to be observed for a particular set of study subjects. For instance, the syndemic involving obesity and diabetes may mean doctors are more likely to look for diabetes in obese patients than in thinner patients, leading to an inflation in diabetes among obese patients because of skewed detection efforts.
Funding bias may lead to selection of outcomes, test samples, or test procedures that favor a study's financial sponsor.
Reporting bias involves a skew in the availability of data, such that observations of a certain kind are more likely to be reported.
Data-snooping bias comes from the misuse of data mining techniques.
Analytical bias arise due to the way that the results are evaluated.
Exclusion bias arise due to the systematic exclusion of certain individuals from the study.

References

Rothman, K.J. et al. (2008) Modern epidemiology. Lippincott Williams & Wilkins pp.134-137.

v t e Biases
Cognitive biases	Acquiescence Ambiguity Affinity Anchoring Attentional Attribution Actor–observer Correspondence Authority Automation Availability Mean world Belief Blind spot Choice-supportive Commitment Confirmation Selective perception Compassion fade Congruence Cultural Declinism Distinction Dunning–Kruger Egocentric Curse of knowledge Emotional Extrinsic incentives Fading affect Framing Frequency Frog pond effect Halo effect Hindsight Horn effect Hostile attribution Impact Implicit In-group Intentionality Illusion of transparency Mean world syndrome Mere-exposure effect Narrative Negativity Normalcy Omission Optimism Out-group homogeneity Outcome Overton window Precision Present Pro-innovation Proximity Response Restraint Self-serving Social comparison Social influence bias Spotlight Status quo Substitution Time-saving Trait ascription Turkey illusion von Restorff effect Zero-risk In animals
Statistical biases	Estimator Forecast Healthy user Information Psychological Lead time Length time Non-response Observer Omitted-variable Participation Recall Sampling Selection Self-selection Social desirability Spectrum Survivorship Systematic error Systemic Verification Wet
Other biases	Academic Basking in reflected glory Déformation professionnelle Funding FUTON Inductive Infrastructure Inherent In education Liking gap Media False balance Vietnam War Norway South Asia Sweden United States Arab–Israeli conflict Ukraine Net Political bias Publication Reporting White hat
Bias reduction	Cognitive bias mitigation Debiasing Heuristics in judgment and decision-making
Lists: General Memory

Categories: