Bias: Concept and Classification

Collegedunia Team logo

Collegedunia Team Content Curator

Content Curator

Statistical bias is a feature of a statistical technique in which there is a systematic deviation in the expected value of the result from the actual value. The difference between the expected value and the real value of the parameter is known as bias. The bias may show some serious problems for the researcher in the sampling procedure. In this article, we will discuss Bias, its types and classification.

Keyterms: Statistics, Bias, Statistical analysis, Measurement Bias , Non-measurement Bias, Selection Bias, Spectrum Bias, Cognitive Bias, Data-Snooping Bias, Omitted Variable Bias, Exclusion Bias, Analytical Bias, Reporting Bias, Funding Bias, Self-selection bias, Recall bias, Observer bias, Survivorship bias, Cause-effect bias.

Read more: Empirical Probability Formula


Bias in Statistics

[Click Here for Sample Questions]

The deliberate or involuntary favouring of one class or outcome over other potential groups or outcomes in the chosen set of data is called bias. It generally defines the tendency of the measurement process. This phenomenon occurs when a model or data set is unrepresentative and it highlights some grave issues for the researcher as a simple raise cannot ease it in sample size. 

The actual variation between the expected value and the real value of the parameter considered for the experiment is usually portrayed in a bias. Multiple sources of bias are found that usually result in this anomaly and it needs to be rectified in order to provide accurate data investigation. The bias is considered to be a huge drawback in statistical analysis.

Bias

Bias


Different Types of Bias in Statistics

[Click Here for Previous Year's Questions]

There are many important types of biases found in the field of statistics that can affect the study of a data scientist. Some of the major Bias are given below:

  • Selection Bias
  • Spectrum Bias
  • Cognitive Bias
  • Data-Snooping Bias
  • Omitted Variable Bias
  • Exclusion Bias
  • Analytical Bias
  • Reporting Bias
  • Funding Bias
  • Self-selection bias
  • Recall bias
  • Observer bias
  • Survivorship bias
  • Cause-effect bias

Also Read:


Classification of Bias

[Click Here for Sample Questions]

The bias is mainly categorized into two different categories as per the sampling method in statistics. They are-

  1. Measurement Bias 
  2. Non-representative Bias

1. Measurement Bias (Observation or Information Bias)

Measurement Bias takes place when major information in a survey is either measured, collected, or interpreted inaccurately. As per John’s Hopkins, it is when: “…information is collected differently between two groups, leading to an error in the conclusion of the association.” 

The main three different reasons that cause measurement bias are –

  • Data Collection Error

While recording data, mishandling of data or machinery malfunction may lead to ill-handling of data by the scientist. The ineffective use of tools by researchers concerned with data collection may also result in this error.

  • Leading Questions/ Fault in the Questionnaire

The interviewer may pose the question in such a way that it leads to the responses that are preferred by the researcher as compared to the opposite idea to that of the purpose of the survey. More choices can be provided in the questionnaire, for representing all the conflicting views.

  • Respondents' Record-keeping System/ inadvertent false responses 

In this scenario, when many responders may have misunderstood the question and chose an incorrect option, the error or bias happens. 

For example, if the sample group is composed of numerous older adults, they might land into misunderstanding the questionnaire and fetch incorrect inputs when asked to fill the survey answers by remembering their previous experiences. It happens because of weak record keeping. The deficiency of memory becomes the cause of incorrect input in the survey.

2. Non-representative Bias (Selection Bias)

Non-representative Bias occurs because of implementing random methods during the selection process and when a survey sample fails to represent the population accurately. This inaccuracy is also referred to as selection bias. 

This type of bias happens due to involuntarily working with a specific division of population instead of the whole, leading to the unrepresentativeness of the whole population. It leads to exclusion of a specific section of the population which might skew the findings of the survey. The main reasons that cause this bias are-

  • Under coverage Bias

This type of bias happens when some respondents of the sample population are not represented in the sample i.e. some members are excluded from the survey. It mainly occurs due to convenience sampling like collecting data from an easily accessible source such as a local supermarket.

  • Non-response Bias

When individuals identified to represent a survey are unwilling or unable to participate in the survey, this type of bias happens. In such cases, the conflicting views of non-respondents are completely disregarded or remain unnoticed as respondents have an upper hand in the outcome of the survey.

  • Voluntary Response Bias

When members of a sample are self-selected volunteers, this type of bias happens. The voluntary response may give a faulty representation of the overall population in favour of strong opinions. This also causes a lack of appropriate responses as the volunteers for the trials may not represent the targeted respondents.

  • Survivorship Bias

When a lengthy process is involved for being counted as a complete response, this type of bias happens. It gives rise to biased sampling.

  • Confirmation Bias

When the information pertaining to only one belief is favoured, this type of bias happens.

Also Read:


Things to Remember Based on Bias

  • Bais is the deliberate or involuntary favouring of one class or outcome over other potential groups or outcomes in the chosen set of data.
  • Bais generally defines the tendency of the measurement process.
  • The actual variation between the expected value and the real value of the parameter considered for the experiment is usually portrayed in a bias.
  • Some major types of Bias are- Selection Bias, Spectrum Bias, Cognitive Bias, Data-Snooping Bias, Omitted Variable Bias, Exclusion Bias, Analytical Bias, Reporting Bias, Funding Bias, Self-selection bias, Recall bias, Observer bias, Survivorship bias and Cause-effect bias.
  • Bias can be classified into two different categories- Measurement Bias and Non- representative Bias. 

Previous Year Questions

  1. A die is thrown twice. If getting a number greater than four on the die is considered a success, then the variance of the probability distribution of the number of successes is..[MHT CET 2019]
  2. A bag contains 2n2n coins out of which n−1n−1 are unfair with heads on both sides and the remaining are fair. One coin is picked from the bag at random and tossed. If the probability that head falls in the toss is 41564156, then the number of unfair coins in the bag is….[BITSAT 2019]
  3. A bag contains 33 red and 33 white balls. Two balls are drawn one by one. The probability that they are of different colours is….[BITSAT 2015]
  4. A bag contains (2n+1)(2n+1) coins. It is known that nn of these coins have a head on both sides, whereas the remaining (n+1)(n+1) coins are fair. AA coin is picked up at random from the bag and tossed. If the probability that the toss results in a head is 31/4231/42, then nn is equal to….[KCET 2015]
  5. AA and BB each select one number at random from the distinct numbers 1,2,3,.....,n1,2,3,.....,n and the probability that the number selected by AA is less than the number selected by BB is 1009201910092019 .Now , the probability that the number selected by BB is the number immediately next to the number selected by AA is…...[WBJEE 2019]
  6. Two cards are drawn at random from a pack of 52 cards. The probability of these two being "Aces" is….[KCET 2016]
  7. If two dice are thrown simultaneously, then the probability that the sum of the numbers which come up on the dice to be more than 55 is….[KCET 2015]
  8. Two dice are thrown simultaneously, the probability of obtaining a total score of 55 is….[KCET 2016]
  9. Probability of four sons to a couple is…..[NEET 2001]
  10. A four digit number is formed by the digits 1,2,3,41,2,3,4 with no repetition. The probability that the number is odd, is….[JIPMER 2014]
  11. A man speaks truth 2 out of 3 times. He picks one of the natural numbers in the set S = {1, 2, 3, 4, 5, 6, 7} and reports that it is even. The probability that it is actually even is….[KCET 2019]
  12. A bag contains 17 tickets numbered from 1 to 17. A ticket is draw at random, then another ticket is drawn at random, then another ticket is drawn without replacing the first one . The probability that both the tickets may show even numbers is…...[KCET 2018]
  13. For the probability distribution given by X=xiX=xi 0 1 2 pipi 25362536 518518 136136 the standard deviation (σ)(σ) is….[KCET 2018]
  14. A flashlight has 10 batteries out of which 4 are dead. If 3 batteries are selected without replacement and tested, then the probability that all 3 are dead is…..[KCET 2018]
  15. A box contains 66 red marbles numbers from 11 through 66 and 44 white marbles 1212 through 1515. Find the probability that a marble drawn 'at random' is white and odd numbered….[KCET 2015]
  16. Let AA and BB be two events such that P(A∪B)=P(A)+P(B)−P(A)P(B).P(A∪B)=P(A)+P(B)−P(A)P(B). If 0<P(A)<10<P(A)<1 and 0<P(B)<10<P(B)<1 , then P(A∪B)′=P(A∪B)′=….[KEAM]
  17. The probability of happening of an event A is 0.5 and the of B is 0.3 .If A and B are mutually exclusive events, then the probability of neither A nor B is…..[KCET 2018]
  18. If X has a binomial distribution with parameters n = 6, p and P(X = 2) = 12, P(X = 3) = 5 then P =...[KCET 2019]
  19. If A and B are two events of a sample space S such that P(A) = 0.2, P(B) = 0.6 and P(A | B) = 0.5 then P(A| B)=...[KCET 2019]
  20. A random variable X has the following probability distribution: x 1 2 3 4 5 6 7 P(x) k-1 3k k 3k 3k23k2 k2k2 k2+kk2+k Then the value of k is...[KCET 2019]

Sample Questions

Ques: What is bias? (1 Mark)

Ans: The deliberate or involuntary favouring of one class or outcome over other potential groups or outcomes in the chosen set of data is called bias and it generally defines the tendency of the measurement process.

Ques: What is the main classification of bias in statistics? (1 Mark)

Ans: The bias is mainly categorized into two different types as per the sampling method in statistics. They are: 

1) Measurement Bias (Observation or Information Bias) 

2) Non-representative Bias (Selection Bias).

Ques: How can Information Bias be controlled? (1 Mark)

Ans: To control the biases from happening during a survey, the foremost technique that can be applied is implementing a homogenous method for collecting data across groups/ sample populations.

Ques: What are the different types of Cognitive Bias? (2 Marks)

Ans: The orderly manner in which the framework of information influences the respondent’s decision-making ability is called cognitive bias. The different types of cognitive bias are –

  • Overconfidence bias
  • Self-serving Bias
  • Herd Mentality
  • Loss Aversion
  • Narrative Fallacy
  • Anchoring Bias
  • Hindsight Bias
  • Representative Heuristic
  • Confirmation Bias
  • Framing Cognitive Bias

Ques: What are the common factors for cognitive bias? (2 Marks)

Ans: The major factors that help to identify cognitive bias are- 

1) Interference with human individuality 

2) Leads to a judgement that deviates from coherent impartiality.

For Latest Updates on Upcoming Board Exams, Click Here: https://t.me/class_10_12_board_updates


Check Out:

CBSE CLASS XII Related Questions

  • 1.
    Let \( A \) be a matrix of order \( m \times n \) and \( B \) be a matrix such that \( A^T B \) and \( B A^T \) are defined. Then, the order of \( B \) is:


      • 2.
        If $f : \mathbb{N} \rightarrow \mathbb{W}$ is defined as \[ f(n) = \begin{cases} \frac{n}{2}, & \text{if } n \text{ is even} \\ 0, & \text{if } n \text{ is odd} \end{cases} \] then $f$ is :

          • injective only
          • surjective only
          • a bijection
          • neither surjective nor injective

        • 3.
          Solve the differential equation: \[ x^2y \, dx - (x^3 + y^3) \, dy = 0. \]


            • 4.
              The diagonals of a parallelogram are given by \( \mathbf{a} = 2 \hat{i} - \hat{j} + \hat{k} \) and \( \mathbf{b} = \hat{i} + 3 \hat{j} - \hat{k}\) . Find the area of the parallelogram.


                • 5.
                  Let $f'(x) = 3(x^2 + 2x) - \frac{4}{x^3} + 5$, $f(1) = 0$. Then, $f(x)$ is:

                    • $x^3 + 3x^2 + \frac{2}{x^2} + 5x + 11$
                    • $x^3 + 3x^2 + \frac{2}{x^2} + 5x - 11$
                    • $x^3 + 3x^2 - \frac{2}{x^2} + 5x - 11$
                    • $x^3 - 3x^2 - \frac{2}{x^2} + 5x - 11$

                  • 6.
                    Let $f(x) = |x|$, $x \in \mathbb{R}$. Then, which of the following statements is incorrect?

                      • $f$ has a minimum value at $x = 0$
                      • $f$ has no maximum value in $\mathbb{R}$
                      • $f$ is continuous at $x = 0$
                      • $f$ is differentiable at $x = 0$
                    CBSE CLASS XII Previous Year Papers

                    Comments


                    No Comments To Show