Sample Selection Bias

What is 'Sample Selection Bias'

Sample selection bias is a type of bias caused by choosing non-random data for statistical analysis. The bias exists due to a flaw in the sample selection process, where a subset of the data is systematically excluded due to a particular attribute. The exclusion of the subset can influence the statistical significance of the test, or produce distorted results.

BREAKING DOWN 'Sample Selection Bias'

Survivorship bias is a common type of sample selection bias. For example, when back-testing an investment strategy on a large group of stocks, it may be convenient to look for securities that have data for the entire sample period. If we were going to test the strategy against 15 years worth of stock data, we might be inclined to look for stocks that have complete information for the entire 15-year period. However, eliminating a stock that stopped trading, or shortly left the market, would input a bias in our data sample. Since we are only including stocks that lasted the 15-year period, our final results would be flawed, as these performed well enough to survive the market.

RELATED TERMS
  1. Sample

    A subset containing the characteristics of a larger population. ...
  2. Sampling

    A process used in statistical analysis in which a predetermined ...
  3. Systematic Sampling

    A type of probability sampling method in which sample members ...
  4. Representative Sample

    A subset of a statistical population that accurately reflects ...
  5. Simple Random Sample

    A subset of a statistical population in which each member of ...
  6. Sampling Distribution

    A probability distribution of a statistic obtained through a ...
Related Articles
  1. Markets

    What is Systematic Sampling?

    Systematic sampling is similar to random sampling, but it uses a pattern for the selection of the sample.
  2. Markets

    How Does Sampling Work?

    Sampling is a term used in statistics that describes methods of selecting a pre-defined representative number of data from a larger data population.
  3. Markets

    What is a Representative Sample?

    In statistics, a representative sample accurately represents the make-up of various subgroups in an entire data pool.
  4. Markets

    Understanding the Simple Random Sample

    A simple random sample is a subset of a statistical population in which each member of the subset has an equal probability of being chosen.
  5. Investing

    Explaining Standard Error

    Standard error is a statistical term that measures the accuracy with which a sample represents a population.
  6. Managing Wealth

    Behavioral Bias - Cognitive Vs. Emotional Bias In Investing

    We all have biases. The key to better investing is to identify those biases and create rules to minimize their effect.
  7. Markets

    Explaining the Central Limit Theorem

    Central limit theorem is a fundamental concept in probability theory.
  8. Markets

    How to Use Stratified Random Sampling

    Stratified random sampling is a technique best used with a sample population easily broken into distinct subgroups. Samples are then taken from each subgroup based on the ratio of the subgroup’s ...
  9. Investing

    8 Common Biases That Impact Investment Decisions

    Behavioral biases hit us all as investors and can vary depending upon our investor personality type.
  10. Markets

    Top Reasons Stock Indices Could Be Biased

    Do the owners of the large stock indices (McGraw Hill Financial, CME Group, and News Corp) have incentive to pick stocks to put in the index that are "shiny" as a marketing ploy? And if so, wouldn't ...
RELATED FAQS
  1. How can a representative sample lead to sampling bias?

    Learn how using representative samples alone is not enough to make sampling bias negligible and why elements such as randomization ... Read Answer >>
  2. What percentage of the population do you need in a representative sample?

    Learn about representative samples and how they are used in conjunction with other strategies to create useful data with ... Read Answer >>
  3. What is the difference between systematic sampling and cluster sampling?

    Learn about the differences between systematic sampling and cluster sampling, including how the samples are created for each ... Read Answer >>
  4. What are the benefits of financial sampling?

    Learn more about how financial sampling is used to determine whether or not inaccurate or fraudulent information exists in ... Read Answer >>
  5. What's the difference between a representative sample and a convenience sample?

    Learn the difference between convenience sampling and representative sampling and the advantages and disadvantages of each ... Read Answer >>
  6. What's the difference between a representative sample and a random sample?

    Explore the differences between representative samples and random samples, and discover how they are often used in tandem ... Read Answer >>
Hot Definitions
  1. Quantitative Trading

    Trading strategies based on quantitative analysis which rely on mathematical computations and number crunching to identify ...
  2. Bond Ladder

    A portfolio of fixed-income securities in which each security has a significantly different maturity date. The purpose of ...
  3. Duration

    A measure of the sensitivity of the price (the value of principal) of a fixed-income investment to a change in interest rates. ...
  4. Dove

    An economic policy advisor who promotes monetary policies that involve the maintenance of low interest rates, believing that ...
  5. Cyclical Stock

    An equity security whose price is affected by ups and downs in the overall economy. Cyclical stocks typically relate to companies ...
  6. Front Running

    The unethical practice of a broker trading an equity based on information from the analyst department before his or her clients ...
Trading Center