What is 'Sample Selection Bias'

Sample selection bias is a type of bias caused by choosing non-random data for statistical analysis. The bias exists due to a flaw in the sample selection process, where a subset of the data is systematically excluded due to a particular attribute. The exclusion of the subset can influence the statistical significance of the test, or produce distorted results.

BREAKING DOWN 'Sample Selection Bias'

Survivorship bias is a common type of sample selection bias. For example, when back-testing an investment strategy on a large group of stocks, it may be convenient to look for securities that have data for the entire sample period. If we were going to test the strategy against 15 years worth of stock data, we might be inclined to look for stocks that have complete information for the entire 15-year period. However, eliminating a stock that stopped trading, or shortly left the market, would input a bias in our data sample. Since we only include stocks that lasted the 15-year period, our final results would be flawed, as these performed well enough to survive the market.

Hedge fund performance indexes are one example of sample selection bias subject to survivorship bias. Because hedge funds that don’t survive stop reporting their performance to index aggregators, resulting indices are naturally tilted to funds and strategies that remain, hence “survive.” This can be an issue with popular mutual fund reporting service as well.

Analysts can adjust to take account of these biases but may introduce news biases in the process.

RELATED TERMS
  1. Bias

    Biases are human tendencies that affect our behavior and perspective, ...
  2. Representative Sample

    A representative sample is a subset of a statistical population ...
  3. Home Country Bias

    Home country bias refers to the tendency for investors to favor ...
  4. Sampling

    Sampling is a process used in statistical analysis in which a ...
  5. Confirmation Bias

    Confirmation bias suggests that investors seek out information ...
  6. Sampling Distribution

    A sampling distribution is a probability distribution of a statistic ...
Related Articles
  1. Investing

    Behavioral Bias: Cognitive Versus Emotional Bias in Investing

    We all have biases. The key to better investing is to identify those biases and create rules to minimize their effect on investing decisions.
  2. Investing

    Mutual Fund Returns: Not Always What They Appear

    Survivorship bias erases substandard performers, distorting overall mutual fund returns.
  3. Investing

    Top Reasons Stock Indices Could Be Biased

    Do the owners of the large stock indices (McGraw Hill Financial, CME Group, and News Corp) have incentive to pick stocks to put in the index that are "shiny" as a marketing ploy? And if so, wouldn't ...
  4. Investing

    5 Mental Mistakes That Affect Stock Analysts

    They know more about stocks than the average person, but analysts are still affected by biases. Find out what they are.
  5. Investing

    4 Investing Biases You Should Avoid

    Don't let these four behavioral biases interfere with your investment strategy and financial success.
  6. Financial Advisor

    Behavioral Finance Tips for Advising Your Clients

    Here's how advisors can prevent clients from making irrational investment decisions.
  7. Investing

    4 Biases That Can Make You A Bad Investor

    Find out how to spot these four biases, and start making more logical investing decisions.
  8. Trading

    3 Psychological Quirks That Affect Your Trading

    There are human tendencies that can block our financial goals. Here's how to get around them.
  9. Investing

    How to Use Stratified Random Sampling

    Stratified random sampling is a technique best used with a sample population easily broken into distinct subgroups. Samples are then taken from each subgroup based on the ratio of the subgroup’s ...
RELATED FAQS
  1. What is the difference between systematic sampling and cluster sampling?

    Learn about the differences between systematic sampling and cluster sampling, including how the samples are created for each ... Read Answer >>
  2. What is the difference between a simple random sample and a stratified random sample?

    Learn about the differences between simple random sampling and stratified random sampling, and the advantages of each method. Read Answer >>
Hot Definitions
  1. Yield Curve

    A yield curve is a line that plots the interest rates, at a set point in time, of bonds having equal credit quality, but ...
  2. Portfolio

    A portfolio is a grouping of financial assets such as stocks, bonds and cash equivalents, also their mutual, exchange-traded ...
  3. Gross Profit

    Gross profit is the profit a company makes after deducting the costs of making and selling its products, or the costs of ...
  4. Diversification

    Diversification is the strategy of investing in a variety of securities in order to lower the risk involved with putting ...
  5. Intrinsic Value

    Intrinsic value is the perceived or calculated value of a company, including tangible and intangible factors, and may differ ...
  6. Current Assets

    Current assets is a balance sheet item that represents the value of all assets that can reasonably expected to be converted ...
Trading Center