The rapid evolution of computer technology in the last few decades has provided investment professionals (and amateurs) with the capability to access and analyze tremendous amounts of financial data. Additionally, the world wide web, email, and bulletin boards make it possible for people around the globe to access this information quickly, as well as providing a means for individuals to voice their opinions and interact. As a result, some of the more intriguing topics of debate in recent years have revolved around the practice and consequences of "data mining."
Data mining involves searching through databases for correlations and patterns that differ from results that would be anticipated to occur by chance or in random conditions. The practice of data mining in and of itself is neither good nor bad and the use of data mining has become common in many industries. For instance, in an attempt to improve life expectancy researchers might use data mining to analyze causes and correlations with death rates. Data Mining is also used by advertisers and marketing firms to target consumers. But possibly the most notorious group of data miners are stock market researchers that seek to predict future stock price movement. Most if not all Stock Market Anomalies have been discovered (or at least documented) via data mining of past prices and related (or sometimes unrelated) variables.
When market beating strategies are discovered via data mining, there are a number of potential problems in making the leap from a back-tested strategy to successfully investing in future real world conditions. The first problem is determining the probability that the relationships occurred at random or whether the anomaly may be unique to the specific sample that was tested. Statisticians are fond of pointing out that if you torture the data long enough, it will confess to anything.
In what is becoming an infamous example David Leinweber went searching for random correlations to the S&P 500. Peter Coy described Leinweber's findings in a Business Week article titled "He who mines data may strike fool's gold" (6/16/97). The article discussed data mining, Michael Drosnin’s book The Bible Code (much more on this topic later), and the fact that patterns will occur in data by pure chance, particularly if you consider many factors. Many cases of data mining are immune to statistical verification or rebuttal. In describing the pitfalls of data mining, Leinweber "sifted through a United Nations CD-ROM and discovered that historically, the single best predictor of the Standard & Poor's 500-stock index was butter production in Bangladesh." The lesson to learn according to Coy is a "formula that happens to fit the data of the past won't necessarily have any predictive value."
Back testing has always been a suspect class of information . . . When you look backwards, you're only going to show what's good.
Barry Miller (SEC) in What's the Stock Market Got to Do with the Production of Butter in Bangladesh? from Money (March 1998)
Anomalies discovered through data mining are considered to be more significant as the period of time increases and if the anomaly can be confirmed in out of sample tests over different time periods and comparable markets (for instance on foreign exchanges). If an anomaly is discovered in back tests, its also important to determine how costs (transactions costs, the bid-ask spread, & impact costs for institutional traders) would reduce the returns. Some anomalies are simply not realizable. See the value line anomaly and implementation shortfall for more on this topic. Additionally, strategies that have worked in the past may simply stop working as more investors begin investing according to the strategy. See the Efficient Market Hypothesis for more on this topic.
The Motley Fool has been praised by many for offering educational advice to individual investors (for instance, the Motley Fool offers sound recommendations in advising investors to buy and hold stocks, to be wary of stock brokers and analysts conflicts of interest, and to be wary of unrealistic performance claims). But the Motley Fool’s "Foolish Four" stock strategy and its underlying rationale have drawn criticism.
In 1997, BYU Professors Grant McQueen and Steven Thorley coathored a paper in the Financial Analysts Journal (FAJ) that questioned the immensely popular Dogs of the Dow Strategy (Abstract). Having already gathered the data to analyze the Dow Dogs, the Professors followed up by making a case study in data mining out of the Motley Fool’s Foolish Four. McQueen and Thorley analyzed the Foolish Four as described in The Motley Fool Investment Guide (MFIG), but the Fools actually have multiple variations of the Foolish Four (See also the Foolish Four explained and Foolish Four History). That research resulted in another article published in the March/April 1999 issue of the Financial Analysts Journal titled "Mining Fool's Gold." In the spirit of the Fool's entertaining and creative writing style, the professors have posted a "lighthearted" version of the paper (in Wordperfect) on the BYU server. The data used in the study can be downloaded here.
McQueen and Thorley include a full explanation of the potential pitfalls of data mining and they conducted out of sample tests on the Foolish Four. The Professors reason that data mining can be detected by the complexity of the trading rule, the lack of a coherent story or theory, the performance of out-of-sample tests, and the adjustment of returns for risk, transaction costs, and taxes. Additionally, they argue that the Foolish Four and Dow Ten trading rules have become popular enough to impact stock prices at the turn of the year.
The Motley Fool has posted a spirited response to the FAJ paper in their Foolish Four portfolio reports which are accessible in their 1999 archives. See reports dated 5/10, 5/11, 5/12, 5/13, 5/14, 5/17, 5/18, 5/19, 5/20, and 5/21. Included in these responses are several counter arguments to the FAJ paper and as well acknowledgements of valid issues discussed in the paper.
While many of the issues are debatable, the real acid test and critical finding of the FAJ paper was an out of sample test for the Foolish Four returns from 1949 to 1972. For that period the Foolish Four barely beat the Dow 30 by an average of 0.32% per year with substantially more risk. Not only did the strategy underperform the Dow Dogs for the period, but after transactions costs and accounting for risk it clearly would have lagged the DJIA for the period. This critical issue was discussed briefly in the report dated 5/14.
To put this issue in perspective, consider an investor at the start of 1973 looking back at the DJIA performance over the preceding 24 years. It's difficult to rationalize how an investor could have known at that time that the Foolish Four would produce market beating returns going forward.
In another out of sample test, McQueen and Thorley used the base 1973 -1996 period discussed in MFIG, but used July for rebalancing rather than January. Under those conditions the Foolish Four returns beat the DJIA by only 2.95% per year on average, substantially lower than the 12.23% advantage over the DJIA with January rebalancing.
In defense of the Fools, several disclosures were at least made in MFIG and on the web site. In the Foolish Four report dated 8/7/98, they disclose that returns were lower when rebalancing occurred in months other than January. Additionally, in MFIG a 25.5% return figure from a twenty year period is used many times, but they do at least mention that they researched the numbers back to 1961 and for the longer time period, the returns dropped to 18.35%. On the other hand, once it is disclosed that a longer period of time was studied, continuing to cite the stronger shorter term numbers and basing arguments on that data certainly can be viewed as suspect. Disclosing and focusing on longer term results tends to increase the credibility of a data miner's argument.
Jason Zweig voices his opinion of the Foolish Four and shares his own data mined "Very Stupid" and "Extra Dumb" portfolios in False Profits from Money magazine (August, 1999). On the Morningstar web site you can also read John Rekenthaler's opinion in Just foolin’ around as well as Investment Advisor William Bernstein's opinions in an article titled Mined: All Mined (see also James O'Shaughnessy's response and the ensuing debate). Of course, predicting the future is a perilous business and we'll just have to wait and see whether the Foolish Four outperforms the DJIA and/or the Dogs of the Dow in the future.
Moving on to another data mining debate, William Brock, Josef Lakonishok, and Blake LeBaron (BLL) published an article titled "Simple Technical Trading Rules and the Stochastic Properties of Stock Returns," in the December 1992 edition of the Journal of Finance. The study is one of the few academic papers to document a successful trading strategy based on technical analysis (See Technical Anomalies for a complete discussion of the article). The Professors demonstrated that both moving averages and support and resistance tools had predictive value relative to the Dow Jones Industrial Average for the period from 1897-1986.
Data-Snooping, Technical Trading Rule Performance, and the Bootstrap is an article that revisits the BLL paper and will appear in the October 1999 Edition of Journal of Finance. In the article, Ryan Sullivan, Allan Timmermann, and Halbert White (STW) attempt to determine the effect of Data-Snooping on the BLL results. They also use data collected from the period following the original study (BLL data ran through 1986) in order to provide an out of sample test. Adding the recent years provided a full 100 years of data. STW calculated a break even transaction cost level of 0.27% percent per trade for the best performing trading rule for the full period.
Since the original BLL data covered an extremely long period of almost 90 years, one might expect the strategies to perform well in the out of sample tests. But the study's conclusions may end up being used as another potential example of the Efficient Market Hypothesis. STW found "that the results of BLL appear to be robust to data-snooping . . . However, we also find that the superior performance of the best trading rule is not repeated in the out-of-sample experiment covering the period 1987-1996" and "there is scant evidence that technical trading rules were of any economic value during the period 1987-1996." This may offer another caveat for stock market data miners and active investors. Even if an anomaly worked in the past over very long periods of time, and even if results do not appear to suffer from the pitfalls of data snooping, once the anomaly is discovered it may cease to work going forward.
Reasonable people can have a reasonable difference of opinion without it becoming an issue of ethics or faith.
Leigh Steinberg in Winning With Integrity
As interesting as the Foolish Four and Technical Analysis debates are they take a back seat to the international debate that was ignited in 1997 with the release of Michael Drosnin’s controversial best seller "The Bible Code." The book itself has been criticized and dismissed by many mathematicians and professionals including Ronald Kahn of BARRA (see Bible Code Review and Data Mining is Easy). Kahn offered four guidelines for back-testing integrity (Intuition, Restraint, Sensibility, and Out-of-sample testing) and argued that the book ignores all four of those guidelines.
The serious debate quickly moved from the book to a study that was included in the book and is cited as proof that the codes exist. The study titled Equidistant Letter Sequences in the Book of Genesis (HTML version) by Doron Witztum, Eliyahu Rips and Yoav Rosenberg was originally published in the August 1994 issue of the respected journal Statistical Science. At the time The Bible Code was released, the study had not been challenged. Interestingly, the authors of the study quickly distanced themselves from the book and rejected some of Drosnin's claims. For instance, see Doron Witztum on Drosnin's Book and Eliyahu Rips' Statement. The codes research had been publicized to attempt to bridge apparent contradictions between the Bible and science (for instance, another interesting article in this context is The Age of the Universe). And codes research was the subject of much media attention including a Wall Street Journal article. But before long, the spotlight resulted in serious challenges to the original study.
When the study was originally published, Statistical Science Editor Robert Kass wrote that the "referees were baffled" by the results of the study and the fact that the effects persisted through additional analyses and checks. The article was thus included in the journal as a "challenging puzzle." Brendan McKay, Dror Bar-Natan, Maya Bar-Hillel, and Gil Kalai think they have refuted the original study and their article Solving the Bible Code Puzzle (in Adobe Acrobat or here) appears in the May 1999 issue of Statistical Science.
The debate is emotionally charged and involves many complicated statistical factors, translation issues, and other complex considerations. The original Statistical Science study authors and the critics continue to produce commentary, criticisms, responses, as well as new research, and the debates continue. The critics are even circulating a Mathematicians' Statement on the Bible Codes (maintained by Barry Simon) that includes signatures of mathematicians who are not convinced that the Codes theory is valid. The statement is careful to note that among the signatories "are some who believe that the Torah was divinely written." Additional links are included at the bottom of this article for those interested in seeing more on the codes debate, including articles critical of the original study as well as responses from the original study authors.
Returning to the financial world, Alarming Efficiency (RR) from Dow Jones Asset Management (5-6/99) is an interesting article that discusses data mining and the problem of "overfitting." Included are comments from investment industry veterans David Shaw, Ted Aronson, and Robert Arnott. The article argues that given a finite amount of historical data and an infinite number of complex models, uninformed investors might be lured into "overfitting" the data. Patterns that are assumed to be systematic may actually be sample-specific and therefore of no value.
People are coming to us all the time with trading strategies that reportedly make very large excess returns . . . But the vast majority of the things that people discover by taking standard mathematical tools and sifting through a vast amount of data are statistical artifacts.
David Shaw in Alarming Efficiency (RR) from Dow Jones Asset Management (5-6/99)
Aronson argues that the market is "nearly totally efficient" and that "You're fooling yourself if you think you'll outguess the other guy by more than about 51% or 52% of the time." Aronson believes that investors searching for market inefficiencies have reduced the potential to profit from those anomalies to the equivalent of transactions costs. If that is the case, minimizing transactions costs is critical in attempting to beat the market.
So are there any anomalies that have been confirmed in out of sample tests? In another forthcoming Journal of Finance article, James L. Davis, Eugene F. Fama, and Kenneth R. French argue that the answer is a definite yes. Companies with low price to book value ratios outperform and the pattern has been documented in both US and foreign markets. In Characteristics, Covariances, and Average Returns: 1929 to 1997 the authors go a big step further in documenting returns of low price to book value stocks from 1929 to 1963. For the earlier period, the value premium was even larger (.50% per month) than the more recent July 1963 to June 1997 period (.43% per month).
In the end, do we ever really know for sure what strategies will outperform in the future? Opinions on that question definitely vary, but the standard disclaimer applies as always. Past performance is no guarantee of future performance.
- Statistics Resources
- An Introduction to Data Mining
- An Introduction to Data Mining
- What data mining is - and isn't
- Data Mining
- Data Mining Glossary from Two Crows Corp
- Sixteen Tons of Information Overload from Fortune (8/2/99)
- Looking for Patterns ($$) from the Wall Street Journal (6/21/99)
- Witztum's Torah Codes site including responses to criticism.
- History of the codes research
- "Official" Torah Codes Site
- Brendan McKay's "Scientific Refutation of the Bible Codes"
- The Torah Codes: Puzzle and Solution - May 1998 published in Chance.
- A Skeptical Look at the Torah Codes by Barry Simon
(Witztum and Rips response)- John Allen Paulos' parody of the Bible Codes from Once Upon a Number
- Gallery Of Statistics Jokes
Additional mathematical discussions are included in the Cherry Picking, Stock Market Scam, and Coin-Flipping pages.
Last update 7/19/99. Copyright © 1999 Investor Home. All rights reserved. Disclaimer