Human Reporting Bias

The frequency with which people write about actions, outcomes, or properties is not a reflection of real-word frequenies or the degree to which a property is characteristic of a class of individuals

Human Biases in Data

| Selection bias | Selection does not reflect a random sample | | —- | —- | | Out-group homogeneity bias | Tendency to see outgroup members as more alike than ingroup member | | Biased Data Representation | some groups are represented less positively than others | | Biased Labels | annotations in dataset reflects the worldviews of your annotators |

Human Biases in Interpretation

| Confirmation bias | The tendency to search for, interpret, favor, recall information in a way that confirms preexisting beliefs | | —- | —- | | Overgeneralization | Coming to conclusion based on information that is too general and/or not specific enough | | Correlation fallacy | Confusing correlation with causation | | Automation bias | Propensity for humans to favor suggestions from automated decision-making systems over contradictory information without automation |

Bias network effect

image.png

  • Human data perpetuates human biases
  • as ML learns from human data, the result is a bias network effect

    Bias can be Good, Bad, Neutral

    | Bias in statistics and ML | bias of an estimtor, bias term | | —- | —- | | Congnitive biases | confimation bias, recency bias, optimism bias | | algorithmic bias | characteristics historically associated with discrimination and marginalization, when and where they manifest in algorithmic systems or algorithmcally aided decsion-making |

Measuring Algorithmic Bias: evaluate for fairness & inclusion

Disaggregated Evaluation

image.png

Intersectional Evaluation

image.png

Confusion Matirx

image.png