Sample Selection Bias

AAA

DEFINITION of 'Sample Selection Bias'

A type of bias caused by choosing non-random data for statistical analysis. The bias exists due to a flaw in the sample selection process, where a subset of the data is systematically excluded due to a particular attribute. The exclusion of the subset can influence the statistical significance of the test, or produce distorted results.

INVESTOPEDIA EXPLAINS 'Sample Selection Bias'

Survivorship bias is a common type of sample selection bias. For example, when back-testing an investment strategy on a large group of stocks, it may be convenient to look for securities that have data for the entire sample period. If we were going to test the strategy against 15 years worth of stock data, we might be inclined to look for stocks that have complete information for the entire 15-year period. However, eliminating a stock that stopped trading, or shortly left the market, would input a bias in our data sample. Since we are only including stocks that lasted the 15-year period, our final results would be flawed, as these performed well enough to survive the market.

RELATED TERMS
  1. Population

    The entire pool from which a statistical sample is drawn. The ...
  2. James J. Heckman

    An American economist who won the 2000 Nobel Memorial Prize in ...
  3. Reverse Survivorship Bias

    The tendency for low performers to remain in the game, while ...
  4. Sampling Distribution

    A probability distribution of a statistic obtained through a ...
  5. Look-Ahead Bias

    Bias created by the use of information or data in a study or ...
  6. Attribute Bias

    The tendency of stocks selected by a quantitative technique or ...
RELATED FAQS
  1. What are the disadvantages of using a simple random sample to approximate a larger ...

    Simple random sampling statistically measures a subset of individuals selected from a larger group or population to approximate ... Read Full Answer >>
  2. What is the difference between a simple random sample and a stratified random sample?

    Simple random samples and stratified random samples differ in how the sample is drawn from the overall population of data. ... Read Full Answer >>
  3. What are the advantages and disadvantages of stratified random sampling?

    Researchers use stratified random sampling to obtain a sample population that best represents the entire population being ... Read Full Answer >>
  4. How does the market share of a few companies affect the Herfindahl-Hirschman Index ...

    In economics and commercial law, the Herfindahl-Hirschman Index (HHI) is a widely used measure that indicates the amount ... Read Full Answer >>
  5. What does the rule of 70 indicate about a country's future economic growth?

    The rule of 70 could be used to indicate the approximate number of years that it would take a company's economic growth to ... Read Full Answer >>
  6. How is the rule of 70 related to the growth rate of a variable?

    The rule of 70 is related to the growth rate of a variable because it uses the growth rate in its approximation of the number ... Read Full Answer >>
Related Articles
  1. Markets

    Using Historical Volatility To Gauge Future Risk

    Use these calculations to uncover the risk involved in your investments.
  2. Bonds & Fixed Income

    Find The Highest Returns With The Sharpe Ratio

    Learn how to follow the efficient frontier to increase your chances of successful investing.
  3. Active Trading Fundamentals

    Bet Smarter With The Monte Carlo Simulation

    This technique can reduce uncertainty in estimating future outcomes.
  4. Active Trading Fundamentals

    How To Convert Value At Risk To Different Time Periods

    Volatility is not the only way to measure risk. Learn about the "new science of risk management".
  5. Options & Futures

    An Introduction To Value at Risk (VAR)

    Volatility is not the only way to measure risk. Learn about the "new science of risk management".
  6. Active Trading

    Modern Portfolio Theory: Why It's Still Hip

    See why investors today still follow this old set of principles that reduce risk and increase returns through diversification.
  7. Fundamental Analysis

    Monte Carlo Simulation With GBM

    Learn to predict future events through a series of random trials.
  8. Fundamental Analysis

    Calculating Future Value

    Future value is the value of an asset or cash at a specified date in the future that is equivalent in value to a specified sum today.
  9. Economics

    What is Deadweight Loss?

    Mainly used in economics, deadweight loss can be applied to any deficiency caused by an inefficient allocation of resources.
  10. Investing

    The Strong Dollar’s (Real) Toll On Tech Stocks

    A large portion of U.S. technology companies’ sales occur overseas, given the strong international business and consumer demand from many U.S. tech firms.

You May Also Like

Hot Definitions
  1. Stop-Loss Order

    An order placed with a broker to sell a security when it reaches a certain price. A stop-loss order is designed to limit ...
  2. Covered Call

    An options strategy whereby an investor holds a long position in an asset and writes (sells) call options on that same asset ...
  3. Butterfly Spread

    A neutral option strategy combining bull and bear spreads. Butterfly spreads use four option contracts with the same expiration ...
  4. Unlevered Beta

    A type of metric that compares the risk of an unlevered company to the risk of the market. The unlevered beta is the beta ...
  5. Moving Average - MA

    A widely used indicator in technical analysis that helps smooth out price action by filtering out the “noise” from random ...
  6. Yield Curve

    A line that plots the interest rates, at a set point in time, of bonds having equal credit quality, but differing maturity ...
Trading Center