Katana VentraIP

Theory of conjoint measurement

The theory of conjoint measurement (also known as conjoint measurement or additive conjoint measurement) is a general, formal theory of continuous quantity. It was independently discovered by the French economist Gérard Debreu (1960) and by the American mathematical psychologist R. Duncan Luce and statistician John Tukey (Luce & Tukey 1964).

The theory concerns the situation where at least two natural attributes, A and X, non-interactively relate to a third attribute, P. It is not required that A, X or P are known to be quantities. Via specific relations between the levels of P, it can be established that P, A and X are continuous quantities. Hence the theory of conjoint measurement can be used to quantify attributes in empirical circumstances where it is not possible to combine the levels of the attributes using a side-by-side operation or concatenation. The quantification of psychological attributes such as attitudes, cognitive abilities and utility is therefore logically plausible. This means that the scientific measurement of psychological attributes is possible. That is, like physical quantities, a magnitude of a psychological quantity may possibly be expressed as the product of a real number and a unit magnitude.


Application of the theory of conjoint measurement in psychology, however, has been limited. It has been argued that this is due to the high level of formal mathematics involved (e.g., Cliff 1992) and that the theory cannot account for the "noisy" data typically discovered in psychological research (e.g., Perline, Wright & Wainer 1979). It has been argued that the Rasch model is a stochastic variant of the theory of conjoint measurement (e.g., Brogden 1977; Embretson & Reise 2000; Fischer 1995; Keats 1967; Kline 1998; Scheiblechner 1999), however, this has been disputed (e.g., Karabatsos, 2001; Kyngdon, 2008). Order restricted methods for conducting probabilistic tests of the cancellation axioms of conjoint measurement have been developed in the past decade (e.g., Karabatsos, 2001; Davis-Stober, 2009).


The theory of conjoint measurement is (different but) related to conjoint analysis, which is a statistical-experiments methodology employed in marketing to estimate the parameters of additive utility functions. Different multi-attribute stimuli are presented to respondents, and different methods are used to measure their preferences about the presented stimuli. The coefficients of the utility function are estimated using alternative regression-based tools.

Historical overview[edit]

In the 1930s, the British Association for the Advancement of Science established the Ferguson Committee to investigate the possibility of psychological attributes being measured scientifically. The British physicist and measurement theorist Norman Robert Campbell was an influential member of the committee. In its Final Report (Ferguson, et al., 1940), Campbell and the Committee concluded that because psychological attributes were not capable of sustaining concatenation operations, such attributes could not be continuous quantities. Therefore, they could not be measured scientifically. This had important ramifications for psychology, the most significant of these being the creation in 1946 of the operational theory of measurement by Harvard psychologist Stanley Smith Stevens. Stevens' non-scientific theory of measurement is widely held as definitive in psychology and the behavioural sciences generally (Michell 1999).


Whilst the German mathematician Otto Hölder (1901) anticipated features of the theory of conjoint measurement, it was not until the publication of Luce & Tukey's seminal 1964 paper that the theory received its first complete exposition. Luce & Tukey's presentation was algebraic and is therefore considered more general than Debreu's (1960) topological work, the latter being a special case of the former (Luce & Suppes 2002). In the first article of the inaugural issue of the Journal of Mathematical Psychology, Luce & Tukey 1964 proved that via the theory of conjoint measurement, attributes not capable of concatenation could be quantified. N.R. Campbell and the Ferguson Committee were thus proven wrong. That a given psychological attribute is a continuous quantity is a logically coherent and empirically testable hypothesis.


Appearing in the next issue of the same journal were important papers by Dana Scott (1964), who proposed a hierarchy of cancellation conditions for the indirect testing of the solvability and Archimedean axioms, and David Krantz (1964) who connected the Luce & Tukey work to that of Hölder (1901).


Work soon focused on extending the theory of conjoint measurement to involve more than just two attributes. Krantz 1968 and Amos Tversky (1967) developed what became known as polynomial conjoint measurement, with Krantz 1968 providing a schema with which to construct conjoint measurement structures of three or more attributes. Later, the theory of conjoint measurement (in its two variable, polynomial and n-component forms) received a thorough and highly technical treatment with the publication of the first volume of Foundations of Measurement, which Krantz, Luce, Tversky and philosopher Patrick Suppes cowrote (Krantz et al. 1971).


Shortly after the publication of Krantz, et al., (1971), work focused upon developing an "error theory" for the theory of conjoint measurement. Studies were conducted into the number of conjoint arrays that supported only single cancellation and both single and double cancellation (Arbuckle & Larimer 1976; McClelland 1977). Later enumeration studies focused on polynomial conjoint measurement (Karabatsos & Ullrich 2002; Ullrich & Wilson 1993). These studies found that it is highly unlikely that the axioms of the theory of conjoint measurement are satisfied at random, provided that more than three levels of at least one of the component attributes has been identified.


Joel Michell (1988) later identified that the "no test" class of tests of the double cancellation axiom was empty. Any instance of double cancellation is thus either an acceptance or a rejection of the axiom. Michell also wrote at this time a non-technical introduction to the theory of conjoint measurement (Michell 1990) which also contained a schema for deriving higher order cancellation conditions based upon Scott's (1964) work. Using Michell's schema, Ben Richards (Kyngdon & Richards, 2007) discovered that some instances of the triple cancellation axiom are "incoherent" as they contradict the single cancellation axiom. Moreover, he identified many instances of the triple cancellation which are trivially true if double cancellation is supported.


The axioms of the theory of conjoint measurement are not stochastic; and given the ordinal constraints placed on data by the cancellation axioms, order restricted inference methodology must be used (Iverson & Falmagne 1985). George Karabatsos and his associates (Karabatsos, 2001; Karabatsos & Sheu 2004) developed a Bayesian Markov chain Monte Carlo methodology for psychometric applications. Karabatsos & Ullrich 2002 demonstrated how this framework could be extended to polynomial conjoint structures. Karabatsos (2005) generalised this work with his multinomial Dirichlet framework, which enabled the probabilistic testing of many non-stochastic theories of mathematical psychology. More recently, Clintin Davis-Stober (2009) developed a frequentist framework for order restricted inference that can also be used to test the cancellation axioms.


Perhaps the most notable (Kyngdon, 2011) use of the theory of conjoint measurement was in the prospect theory proposed by the Israeli – American psychologists Daniel Kahneman and Amos Tversky (Kahneman & Tversky, 1979). Prospect theory was a theory of decision making under risk and uncertainty which accounted for choice behaviour such as the Allais Paradox. David Krantz wrote the formal proof to prospect theory using the theory of conjoint measurement. In 2002, Kahneman received the Nobel Memorial Prize in Economics for prospect theory (Birnbaum, 2008).

Measurement and quantification[edit]

The classical / standard definition of measurement[edit]

In physics and metrology, the standard definition of measurement is the estimation of the ratio between a magnitude of a continuous quantity and a unit magnitude of the same kind (de Boer, 1994/95; Emerson, 2008). For example, the statement "Peter's hallway is 4 m long" expresses a measurement of an hitherto unknown length magnitude (the hallway's length) as the ratio of the unit (the metre in this case) to the length of the hallway. The number 4 is a real number in the strict mathematical sense of this term.


For some other quantities, invariant are ratios between attribute differences. Consider temperature, for example. In the familiar everyday instances, temperature is measured using instruments calibrated in either the Fahrenheit or Celsius scales. What are really being measured with such instruments are the magnitudes of temperature differences. For example, Anders Celsius defined the unit of the Celsius scale to be 1/100th of the difference in temperature between the freezing and boiling points of water at sea level. A midday temperature measurement of 20 degrees Celsius is simply the difference of the midday temperature and the temperature of the freezing water divided by the difference of the Celsius unit and the temperature of the freezing water.


Formally expressed, a scientific measurement is:

Applications of conjoint measurement[edit]

Empirical applications of the theory of conjoint measurement have been sparse (Cliff 1992; Michell 2009).


Several empirical evaluations of the double cancellation have been conducted. Among these, Levelt, Riemersma & Bunt 1972 evaluated the axiom to the psychophysics of binaural loudness. They found the double cancellation axiom was rejected. Gigerenzer & Strube 1983 conducted a similar investigation and replicated Levelt, et al.' (1972) findings. Gigerenzer & Strube 1983 observed that the evaluation of double cancellation involves considerable redundancy that complicates its empirical testing. Therefore, Steingrimsson & Luce 2005 evaluated instead the equivalent Thomsen condition axiom, which avoids this redundancy, and found the property supported in binaural loudness. Luce & Steingrimsson 2011, summarized the literature to that date, including the observation that the evaluation of the Thomsen Condition also involves an empirical challenge that they find remedied by the conjoint commutativity axiom, which they show to be equivalent to the Thomsen Condition. Luce & Steingrimsson 2011 found conjoint commutativity supported for binaural loudness and brightness.


Michell 1990 applied the theory to L. L. Thurstone's (1927) theory of paired comparisons, multidimensional scaling and Coombs' (1964) theory of unidimensional unfolding. He found support of the cancellation axioms only with Coombs' (1964) theory. However, the statistical techniques employed by Michell (1990) in testing Thurstone's theory and multidimensional scaling did not take into consideration the ordinal constraints imposed by the cancellation axioms (van der Linden 1994).


(Johnson 2001), Kyngdon (2006), Michell (1994) and (Sherman 1993) tested the cancellation axioms of upon the interstimulus midpoint orders obtained by the use of Coombs' (1964) theory of unidimensional unfolding. Coombs' theory in all three studies was applied to a set of six statements. These authors found that the axioms were satisfied, however, these were applications biased towards a positive result. With six stimuli, the probability of an interstimulus midpoint order satisfying the double cancellation axioms at random is .5874 (Michell, 1994). This is not an unlikely event. Kyngdon & Richards (2007) employed eight statements and found the interstimulus midpoint orders rejected the double cancellation condition.


Perline, Wright & Wainer 1979 applied conjoint measurement to item response data to a convict parole questionnaire and to intelligence test data gathered from Danish troops. They found considerable violation of the cancellation axioms in the parole questionnaire data, but not in the intelligence test data. Moreover, they recorded the supposed "no test" instances of double cancellation. Interpreting these correctly as instances in support of double cancellation (Michell, 1988), the results of Perline, Wright & Wainer 1979 are better than what they believed.


Stankov & Cregan 1993 applied conjoint measurement to performance on sequence completion tasks. The columns of their conjoint arrays (X) were defined by the demand placed upon working memory capacity through increasing numbers of working memory place keepers in letter series completion tasks. The rows were defined by levels of motivation (A), which consisted in different number of times available for completing the test. Their data (P) consisted of completion times and average number of series correct. They found support for the cancellation axioms, however, their study was biased by the small size of the conjoint arrays (3 × 3 is size) and by statistical techniques that did not take into consideration the ordinal restrictions imposed by the cancellation axioms.


Kyngdon (2011) used Karabatsos's (2001) order-restricted inference framework to test a conjoint matrix of reading item response proportions (P) where the examinee reading ability comprised the rows of the conjoint array (A) and the difficulty of the reading items formed the columns of the array (X). The levels of reading ability were identified via raw total test score and the levels of reading item difficulty were identified by the Lexile Framework for Reading (Stenner et al. 2006). Kyngdon found that satisfaction of the cancellation axioms was obtained only through permutation of the matrix in a manner inconsistent with the putative Lexile measures of item difficulty. Kyngdon also tested simulated ability test response data using polynomial conjoint measurement. The data were generated using Humphry's extended frame of reference Rasch model (Humphry & Andrich 2008). He found support of distributive, single and double cancellation consistent with a distributive polynomial conjoint structure in three variables (Krantz & Tversky 1971).

 – Paradigm for the design, analysis, and scoring of tests

Item response theory

Arbuckle, J.; Larimer, J. (1976). "The number of two-way tables satisfying certain additivity axioms". Journal of Mathematical Psychology. 12: 89–100. :10.1016/0022-2496(76)90036-5.

doi

Birnbaum, M. H. (2008). "New paradoxes of risky decision making". Psychological Review. 115 (2): 463–501.  10.1.1.144.5661. doi:10.1037/0033-295X.115.2.463. PMID 18426300.

CiteSeerX

Brogden, H. E. (December 1977). "The Rasch model, the law of comparative judgement and additive conjoint measurement". Psychometrika. 42 (4): 631–4. :10.1007/BF02295985. S2CID 123583660.

doi

Cliff, N. (1992). "Abstract measurement theory and the revolution that never happened". Psychological Science. 3 (3): 186–190. :10.1111/j.1467-9280.1992.tb00024.x. S2CID 144507788.

doi

Coombs, C. H. (1964). A Theory of Data. New York: Wiley.

Davis-Stober, C. P. (February 2009). "Analysis of multinomial models under inequality constraints: applications to measurement theory". Journal of Mathematical Psychology. 53 (1): 1–13. :10.1016/j.jmp.2008.08.003.

doi

Debreu, G. (1960). "Topological methods in cardinal utility theory". In Arrow, K.J.; Karlin, S.; Suppes, P. (eds.). Mathematical Methods in the Social Sciences. Stanford University Press. pp. 16–26.

Embretson, S. E.; Reise, S. P. (2000). Item response theory for psychologists. Erlbaum.

Emerson, W. H. (2008). "On quantity calculus and units of measurement". Metrologia. 45 (2): 134–138. :2008Metro..45..134E. doi:10.1088/0026-1394/45/2/002. S2CID 121451085.

Bibcode

Fischer, G. (1995). "Derivations of the Rasch model". In Fischer, G.; Molenaar, I.W. (eds.). Rasch models: Foundations, recent developments, and applications. New York: Springer. pp. 15–38.

Gigerenzer, G.; Strube, G. (1983). "Are there limits to binaural additivity of loudness?". Journal of Experimental Psychology: Human Perception and Performance. 9 (1): 126–136. :10.1037/0096-1523.9.1.126. hdl:21.11116/0000-0000-BC9A-F. PMID 6220118.

doi

Grayson, D. A. (September 1988). "Two-group classification and latent trait theory: scores with monotone likelihood ratio". Psychometrika. 53 (3): 383–392. :10.1007/BF02294219. S2CID 121684695.

doi

Hölder, O. (1901). "Die Axiome der Quantität und die Lehre vom Mass". Berichte Uber die Verhandlungen der Koeniglich Sachsischen Gesellschaft der Wissenschaften zu Leipzig, Mathematisch-Physikaliche Klasse. 53: 1–46. (Part 1 translated by Michell, J.; Ernst, C. (September 1996). "The axioms of quantity and the theory of measurement". Journal of Mathematical Psychology. 40 (3): 235–252. :10.1006/jmps.1996.0023. PMID 8979975.

doi

Humphry, S. M.; Andrich, D. (2008). "Understanding the unit in the Rasch model". Journal of Applied Measurement. 9 (3): 249–264.  18753694.

PMID

Iverson, G.; (1985). "Statistical issues in measurement". Mathematical Social Sciences. 10 (2): 131–153. doi:10.1016/0165-4896(85)90031-9.

Falmagne, J. C.

Johnson, T. (2001). . Australian Journal of Psychology. 53: 23–28. doi:10.1080/00049530108255118.

"Controlling the effect of stimulus context change on attitude statements using Michell's binary tree procedure"

Kahneman, D.; Tversky, A. (1979). "Prospect theory: an analysis of decision under risk". Econometrica. 47 (2): 263–291.  10.1.1.407.1910. doi:10.2307/1914185. JSTOR 1914185.

CiteSeerX

Karabatsos, G. (2001). "The Rasch model, additive conjoint measurement, and new models of probabilistic measurement theory". Journal of Applied Measurement. 2 (4): 389–423.  12011506.

PMID

Karabatsos, G. (February 2005). (PDF). Journal of Mathematical Psychology. 49 (1): 51–69. doi:10.1016/j.jmp.2004.11.001. Archived from the original (PDF) on 2006-02-06.

"The exchangeable multinomial model as an approach for testing axioms of choice and measurement"

Karabatsos, G.; Sheu, C. F. (2004). "Bayesian order constrained inference for dichotomous models of unidimensional non-parametric item response theory". Applied Psychological Measurement. 28 (2): 110–125. :10.1177/0146621603260678. S2CID 122303701.

doi

Karabatsos, G.; Ullrich, J. R. (2002). "Enumerating and testing conjoint measurement models". Mathematical Social Sciences. 43 (3): 485–504. :10.1016/S0165-4896(02)00024-0.

doi

Krantz, D. H. (July 1964). "Conjoint measurement: the Luce–Tukey axiomatisation and some extensions". Journal of Mathematical Psychology. 1 (2): 248–277. :10.1016/0022-2496(64)90003-3.

doi

Krantz, D. H. (1968). "A survey of measurement theory". In Danzig, G.B.; Veinott, A.F. (eds.). Mathematics of the Decision Sciences: Part 2. Providence, Rhode Island: American Mathematical Society. pp. 314–350.

Keats, J. A. (1967). "Test theory". Annual Review of Psychology. 18: 217–238. :10.1146/annurev.ps.18.020167.001245. PMID 5333423.

doi

Kline, P. (1998). The New Psychometrics: Science, psychology and measurement. London: Routledge.

Krantz, D. H.; Luce, R.D; Suppes, P.; Tversky, A. (1971). Foundations of Measurement, Vol. I: Additive and polynomial representations. New York: Academic Press.

Krantz, D. H.; Tversky, A. (1971). "Conjoint measurement analysis of composition rules in psychology". Psychological Review. 78 (2): 151–169. :10.1037/h0030637.

doi

Kyngdon, A. (2006). "An empirical study into the theory of unidimensional unfolding". Journal of Applied Measurement. 7 (4): 369–393.  17068378.

PMID

Kyngdon, A. (2008). "The Rasch model from the perspective of the representational theory of measurement". Theory & Psychology. 18: 89–109. :10.1177/0959354307086924. S2CID 143679173.

doi

Kyngdon, A. (2011). "Plausible measurement analogies to some psychometric models of test performance". British Journal of Mathematical and Statistical Psychology. 64 (3): 478–497. :10.1348/2044-8317.002004. PMID 21973097.

doi

Kyngdon, A.; Richards, B. (2007). "Attitudes, order and quantity: deterministic and direct probabilistic tests of unidimensional unfolding". Journal of Applied Measurement. 8 (1): 1–34.  17215563.

PMID

Levelt, W. J. M.; Riemersma, J. B.; Bunt, A. A. (May 1972). (PDF). British Journal of Mathematical and Statistical Psychology. 25 (1): 51–68. doi:10.1111/j.2044-8317.1972.tb00477.x. hdl:11858/00-001M-0000-0013-2CBF-1. PMID 5031649.

"Binaural additivity of loudness"

Luce, R. D.; Steingrimsson, R. (2011). (PDF). Journal of Mathematical Psychology. 55 (5): 379–389. doi:10.1016/j.jmp.2011.05.004.

"Theory and tests of the conjoint commutativity axiom for additive conjoint measurement"

Luce, R. D.; Suppes, P. (2002). "Representational measurement theory". In Pashler, H.; Wixted, J. (eds.). Stevens' handbook of experimental psychology: Vol. 4. Methodology in experimental psychology (3rd ed.). New York: Wiley. pp. 1–41.

Luce, R. D.; Tukey, J. W. (January 1964). "Simultaneous conjoint measurement: a new scale type of fundamental measurement". Journal of Mathematical Psychology. 1 (1): 1–27.  10.1.1.334.5018. doi:10.1016/0022-2496(64)90015-X.

CiteSeerX

McClelland, G. (June 1977). "A note on Arbuckle and Larimer: the number of two way tables satisfying certain additivity axioms". Journal of Mathematical Psychology. 15 (3): 292–5. :10.1016/0022-2496(77)90035-9.

doi

Michell, J. (June 1994). "Measuring dimensions of belief by unidimensional unfolding". Journal of Mathematical Psychology. 38 (2): 224–273. :10.1006/jmps.1994.1016.

doi

Michell, J. (December 1988). "Some problems in testing the double cancellation condition in conjoint measurement". Journal of Mathematical Psychology. 32 (4): 466–473. :10.1016/0022-2496(88)90024-7.

doi

Michell, J. (1990). An Introduction to the Logic of Psychological Measurement. Hillsdale NJ: Erlbaum.

Michell, J. (February 2009). "The psychometricians' fallacy: Too clever by half?". British Journal of Mathematical and Statistical Psychology. 62 (1): 41–55. :10.1348/000711007X243582. PMID 17908369.

doi

Perline, R.; Wright, B. D.; Wainer, H. (1979). . Applied Psychological Measurement. 3 (2): 237–255. doi:10.1177/014662167900300213. S2CID 53706504.

"The Rasch model as additive conjoint measurement"

Scheiblechner, H. (September 1999). "Additive conjoint isotonic probabilistic models (ADISOP)". Psychometrika. 64 (3): 295–316. :10.1007/BF02294297. S2CID 120080826.

doi

Scott, D. (July 1964). "Measurement models and linear inequalities". Journal of Mathematical Psychology. 1 (2): 233–247. :10.1016/0022-2496(64)90002-1.

doi

Sherman, K. (April 1994). "The effect of change in context in Coombs's unfolding theory". Australian Journal of Psychology. 46 (1): 41–47. :10.1080/00049539408259468.

doi

Stankov, L.; Cregan, A. (1993). "Quantitative and qualitative properties of an intelligence test: series completion". Learning and Individual Differences. 5 (2): 137–169. :10.1016/1041-6080(93)90009-H.

doi

Steingrimsson, R; Luce, R. D. (2005). (PDF). Journal of Mathematical Psychology. 49 (4): 290–306. doi:10.1016/j.jmp.2005.03.003.

"Evaluating a model of global psychophysical judgments I: Behavioral properties of summations and productions"

Stenner, A. J.; Burdick, H.; Sanford, E. E.; Burdick, D. S. (2006). "How accurate are Lexile text measures?". Journal of Applied Measurement. 7 (3): 307–322.  16807496.

PMID

Stevens, S. S. (1946). "On the theory of scales of measurement". Science. 103 (2684): 667–680. :1946Sci...103..677S. doi:10.1126/science.103.2684.677. PMID 17750512.

Bibcode

Stober, C. P. (2009). Luce's challenge: Quantitative models and statistical methodology.

Thurstone, L. L. (1927). "A law of comparative judgement". Psychological Review. 34 (4): 278–286. :10.1037/h0070288. S2CID 144782881.

doi

Tversky, A. (1967). (PDF). Journal of Mathematical Psychology. 4: 1–20. doi:10.1016/0022-2496(67)90039-9. hdl:2027.42/33362.

"A general theory of polynomial conjoint measurement"

Ullrich, J. R.; Wilson, R. E. (December 1993). "A note on the exact number of two and three way tables satisfying conjoint measurement and additivity axioms". Journal of Mathematical Psychology. 37 (4): 624–8. :10.1006/jmps.1993.1037.

doi

van der Linden, W. (March 1994). . Psychometrika. 59 (1): 139–142. doi:10.1007/BF02294273.

"Review of Michell (1990)"

van der Ven, A. H. G. S. (1980). Introduction to Scaling. New York: Wiley.

Karabatsos' S-Plus programs for testing conjoint axioms

Birnbaum's FORTRAN MONANOVA program for testing additivity

Kyngdon's R programs for enumerating cancellation tests, testing axioms and prospect theory

R statistical computing software