Sample Selection Bias

What is 'Sample Selection Bias'

Sample selection bias is a type of bias caused by choosing non-random data for statistical analysis. The bias arises from a flaw in the sample selection process, in which a subset of the data is systematically excluded because of a particular attribute. Excluding that subset can distort the results or change the apparent statistical significance of a test.

BREAKING DOWN 'Sample Selection Bias'

Survivorship bias is a common type of sample selection bias. For example, when back-testing an investment strategy on a large group of stocks, it may be convenient to look for securities that have data for the entire sample period. If we were going to test the strategy against 15 years' worth of stock data, we might be inclined to look for stocks that have complete information for the entire 15-year period. However, eliminating a stock that stopped trading, or that left the market partway through the period, would introduce a bias into our data sample. Because we would be including only stocks that lasted the full 15 years, and those stocks by definition performed well enough to survive, our final results would overstate how the strategy would have performed on the full set of stocks available at the start.
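
The survivorship effect described above can be illustrated with a small simulation. This is only a sketch on synthetic data, not a real back-test: the function names (simulate_universe, compare_samples), the return distribution, the delisting threshold of 0.3, and the number of stocks are all assumptions chosen for illustration. Each hypothetical stock starts at a value of 1.0, receives one random return per year, and stops trading if its value falls below the threshold.

```python
import math
import random
from statistics import mean


def simulate_universe(n_stocks=2000, n_years=15, delist_below=0.3, seed=0):
    """Return (final_value, survived) pairs for synthetic stocks.

    All parameters are illustrative assumptions. Each stock starts at 1.0
    and gets one random log-return per year. A stock whose value drops
    below `delist_below` stops trading and keeps the value it had then.
    """
    rng = random.Random(seed)
    universe = []
    for _ in range(n_stocks):
        value, survived = 1.0, True
        for _ in range(n_years):
            # Assumed yearly log-return: mean 3%, standard deviation 25%.
            value *= math.exp(rng.gauss(0.03, 0.25))
            if value < delist_below:
                survived = False
                break
        universe.append((value, survived))
    return universe


def compare_samples(universe):
    """Mean final value of every stock versus survivors only."""
    all_values = [value for value, _ in universe]
    survivor_values = [value for value, survived in universe if survived]
    return mean(all_values), mean(survivor_values), len(survivor_values)


universe = simulate_universe()
full_mean, survivor_mean, n_survivors = compare_samples(universe)
print(f"stocks simulated:           {len(universe)}")
print(f"stocks with complete data:  {n_survivors}")
print(f"mean final value, all:      {full_mean:.3f}")
print(f"mean final value, survivors:{survivor_mean:.3f}")
```

In this setup every delisted stock ends below the threshold and every survivor ends at or above it, so the survivors-only average is higher than the average over the full universe. The gap between the two figures is the bias a back-test would pick up by keeping only stocks with complete 15-year histories.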
