Model Selection and the Multiplicity of Patterns in Empirical Data

Published online by Cambridge University Press: 01 January 2022

Abstract

Several quantitative techniques for choosing among data models are available. Among these are techniques based on algorithmic information theory, minimum description length theory, and the Akaike information criterion. All these techniques are designed to identify a single model of a data set as being the closest to the truth. I argue, using examples, that many data sets in science show multiple patterns, providing evidence for multiple phenomena. For any such data set, there is more than one data model that must be considered close to the truth. I conclude that, since the established techniques for choosing among data models are unequipped to handle these cases, they cannot be regarded as adequate.

Type
Philosophy of Science: Models
Copyright
Copyright © The Philosophy of Science Association

Footnotes

I presented a previous version of this paper at the 20th Biennial Meeting of the Philosophy of Science Association, Vancouver, November 2006. I am grateful to the audience for constructive discussion. I thank Leiden University students Marjolein Eysink Smeets and Lenneke Schrier for suggesting the cortisol example, and Remko van der Geest for comments on a draft.
