Conjoint analysis

From DDL Wiki

(Difference between revisions)
Jump to: navigation, search
(Basic Process)
Current revision (11:09, 30 January 2013) (view source)
(Data Collection)
 
(46 intermediate revisions not shown.)
Line 1: Line 1:
-
Conjoint analysis, also known as multiattribute compositional models, is a statistical technique that was introduced into the marketing research community by University of Pennsylvania, Wharton professor Paul Green in the late 1960s.  Since then, conjoint analysis has become the most popular multiattribute choice model in marketing.  It can be used as a trade-off measurement technique to determine consumer preferences and buying intentions, and to also predict how consumers might react to changes in a current product or new product introduction.
+
'''Conjoint analysis''', also known as [[multiattribute compositional models]], is a statistical technique that was introduced into the marketing research community by University of Pennsylvania, Wharton professor [[Paul Green]] in the late 1960s.  Since then, conjoint analysis has become the most popular multiattribute choice model in marketing.  It can be used as a trade-off measurement technique to determine consumer preferences and buying intentions, and to also predict how consumers might react to changes in a current product or new product introduction. Conjoint analysis is often used today in [[new product design]] and advertising.
 +
 
 +
Conjoint analysis generally implies the use of [[design of experiments]] statistical techniques to construct efficient survey designs and assess the relative importance of each attribute (and it's possible values, or "levels") that compose a product. The independent variables (factors) are the attributes set to discrete levels, and the dependent variable (output) is the consumer response: either rating, ranking, or choice among the presented set of alternatives.
 +
==Basic Process==
==Basic Process==
The basic process for a conjoint analysis is as follows:
The basic process for a conjoint analysis is as follows:
-
*  The product features that will be analyzed are determined
+
*  The product features that will be analyzed are determined.
*  Potential customers are shown numerous different sets of products.  Each set includes several product profiles that have multiple conjoined product features.
*  Potential customers are shown numerous different sets of products.  Each set includes several product profiles that have multiple conjoined product features.
*  For each set of products, they select (by ranking, rating, or choosing) the profiles according to some overall criterion, such as preference, acceptability, or likelihood of purchase.  
*  For each set of products, they select (by ranking, rating, or choosing) the profiles according to some overall criterion, such as preference, acceptability, or likelihood of purchase.  
-
*  In the analysis of the data, part-worths are identified for the factor levels such that each specific combination of part-worths equals the total utility of any given profile. A set of part-worths are derived for each respondent.
+
*  Choice simulator models such as the first choice model, average probability model, and the [[logit model]] are used to estimate the [[utility]] of each product feature.
-
*  The goodness-of-fit criterion relates the derived ranking or rating of stimulus profiles to the original ranking or rating data.
+
*  This information is then incorporated into a new or existing product.
-
*  A set of objects are defined for the choice simulator. Based on previously determined part-worths for each respondent, each simulator computes an utility value for each of the objects defined as part of the simulation.
+
-
*  Choice simulator models such as the first choice model, average probability model, and the [[logit model]] are used to estimate the utility of each product feature.
+
-
==Data Collection Methods==
+
==Data Collection==
-
A [[full-factorial]] survey could be generated that tests every possible combination of attributes.  However, in most cases this is not feasible due to the great amount of questions needed.  It is also not even necessary, since [[orthogonal arrays]] can be used to reduce the number to a much smaller [[fractional-factorial]] design. This design can be generated using a program such as SAS Intstitute's [[SAS System]].
+
Instead of asking respondents what they prefer in a product directly, conjoint analysis presents to them potential product profiles that have specific combinations of attributes.  These profiles can be evaluated in several ways, including:
-
 
+
-
The best combinations of attributes can be determined in several ways, including:
+
*Rating (e.g. rating with a scale from 1-10)
*Rating (e.g. rating with a scale from 1-10)
*Ranking (e.g. rank best as 1st, 2nd, 3rd, etc.)  
*Ranking (e.g. rank best as 1st, 2nd, 3rd, etc.)  
*Choice (e.g. four options, which do you choose?)
*Choice (e.g. four options, which do you choose?)
-
The conjoint analysis can be done with different types of data, which include stated choice and revealed preference.  The differences between the two are outlined below:
+
An example of a product profile using rating is shown below:
-
*Stated choice - Using data from a survey
+
-
Pros
+
-
- controlled experiment
+
-
Cons
+
-
- survey item choices are not always the same as what is desired or already in the marketplace
+
-
*Revealed Preference - Using data collected from actual results in the marketplace
+
[[Image:rating.png]]
-
  Pros:
+
 
 +
An example of a product profile using choice is shown below:
 +
 
 +
[[Image:choice.png]]
 +
 
 +
The conjoint analysis is usually done using [[stated choice]], which is data collected using a survey that is typically developed with [[experimental design]] (aka [[design of experiments]]) techniques.  A [[full factorial]] survey could be generated that tests every possible combination of attributes.  However, in most cases this is not feasible due to the great amount of questions needed.  It is also not even necessary, since balanced and orthogonal arrays can be used to reduce the number to a much smaller [[fractional factorial]] design. This design can be generated using a program such as SAS Institute's [[SAS]] system. The advantage of stated choice data is that it is possible to conduct a controlled experiment, which provides high-quality data. However, the task of evaluating product profiles in the survey may differ from the consumer experience in the marketplace, and survey data measures what people say and not necessarily what they do. Designing such a survey is an iterative process, and while there are no fixed rules on how to create a "good" conjoint survey, there are some best practices for [[designing a conjoint survey]].
 +
 
 +
An alternative to stated choice data is [[revealed preference]] data.  This consists of using data that is collected from actual performance in the marketplace.  The advantage of this is that the data is a real measure of how consumers act.  However, there are problems with [[multicollinearity]] - it is difficult to tell exactly why someone is choosing one product over another.
==Simulation Analysis==
==Simulation Analysis==
-
The most common simulator models include the first choice model, the average choice (Bradley-Terry-Luce) model, and the Logit model. The First choice model identifies the product with the highest utility as the product of choice. This product is selected and receives a value of 1. Ties receive a .5 value. After the process is repeated for each respondent's utility set, the cumulative "votes" for each product are evaluated as a proportion of the votes or respondents in the sample. The Bradley-Terry-Luce model estimates choice probability in a different fashion. The choice probability for a given product is based on the utility for that product divided by the sum of all products in the simulated market. The logit model uses an assigned choice probability that is proportional to an increasing monotonic function of the alternative's utility. The choice probabilities are computed by dividing the logit value for one product by the sum for all other products in the simulation. These individual choice probabilities are averaged across respondents. In summary, while the literature shows the maximum utility (first choice model) to provide the best overall validation, choice behavior has a strong probabilistic component. We have not measured this component adequately, but instead attributed lack of validity to "noise", our inability to model information search and overload effects, and measurement error.
+
There are many different simulator models available for use in estimating the utility function of the product attributes. These include the [[first choice model]], the [[average choice model]], and the [[logit model]].
-
==Example==
+
===[[First choice model]]===
 +
For each respondent, the product with the highest utility is the product of choice and receives a value of 1.  If there is a tie for the highest utility, both products get a value of 0.5.  In the end the most votes for a product evaluated as a proportion of the number of respondents is the most desirable product.
 +
===[[Average choice model]]===
 +
Also known as the [[Bradley-Terry-Luce model]].  The choice probability for a product is based on the utility of that product divided by the sum of all products in the simulated market.
-
==External links==
+
===[[Random Utility Discrete Choice Models]]===
-
*[http://www.sawtoothsoftware.com/qs-whatisconjoint.shtml What is conjoint analysis?] A useful conjoint software site that explains conjoint analysis in detail.
+
Introduces a random error term to the measurement of utility and calculates the probability of choice as equal to the probability of the utility of one product being greater than the utility of all other products. Models vary based on the assumed distribution of the random error component and the specification of the deterministic component of utility. The most popular models are the [[logit model]] and the [[probit model]].
 +
==External links==
*[http://www.sas.com SAS Software] Software for designing full-factorial and fractional factorial surveys.
*[http://www.sas.com SAS Software] Software for designing full-factorial and fractional factorial surveys.
 +
*[http://cran.r-project.org/web/packages/AlgDesign/index.html AlgDesign] package in [[R]] for full-factorial and fractional factorial conjoint design.
 +
==References==
 +
*[http://www.sawtoothsoftware.com/qs-whatisconjoint.shtml What is conjoint analysis?] By Sawtooth Software.
-
==Reference==
+
*[http://marketing.byu.edu/htmlpages/tutorials/conjoint.htm Conjoint Analysis Tutorial] From Bringham Young University's Institute of Marketing.
-
*Green, P. E., V. Rao, and J. Wind, Conjoint Analysis: Methods and Applications, 1999, Knowledge@Wharton.
+
-
*Green, P. E. and V. Srinivasan (1978), "Conjoint Analysis in Consumer Research" Issues and Outlook", Journal of Consumer Research, Vol. 5, (September), pp 103-123.  
+
*Green, P. E., V. Rao, and J. Wind, (1999) Conjoint Analysis: Methods and Applications. Knowledge@Wharton.
 +
 
 +
*Green, P. E. and V. Srinivasan (1978) "Conjoint Analysis in Consumer Research" Issues and Outlook", Journal of Consumer Research, Vol. 5, (September), pp 103-123.  
*Luce, R. D. and J. W. Tukey. "Simultaneous Conjoint Measurement: A New Type of Fundamental Measurement," Journal of Mathematical Psychology, 1 (February 1964), pp 1-27.
*Luce, R. D. and J. W. Tukey. "Simultaneous Conjoint Measurement: A New Type of Fundamental Measurement," Journal of Mathematical Psychology, 1 (February 1964), pp 1-27.
 +
 +
*Nishimura, K., and Aizaki, H., (2008) Design and Analysis of Choice Experiments Using R: A Brief Introduction, Agricultural Information Research, 17(2), pp.86-94. http://www.jstage.jst.go.jp/article/air/17/2/17_86/_article
 +
 +
[[Category:market research methods]]

Current revision

Conjoint analysis, also known as multiattribute compositional models, is a statistical technique that was introduced into the marketing research community by University of Pennsylvania, Wharton professor Paul Green in the late 1960s. Since then, conjoint analysis has become the most popular multiattribute choice model in marketing. It can be used as a trade-off measurement technique to determine consumer preferences and buying intentions, and to also predict how consumers might react to changes in a current product or new product introduction. Conjoint analysis is often used today in new product design and advertising.

Conjoint analysis generally implies the use of design of experiments statistical techniques to construct efficient survey designs and assess the relative importance of each attribute (and it's possible values, or "levels") that compose a product. The independent variables (factors) are the attributes set to discrete levels, and the dependent variable (output) is the consumer response: either rating, ranking, or choice among the presented set of alternatives.


Contents

Basic Process

The basic process for a conjoint analysis is as follows:

  • The product features that will be analyzed are determined.
  • Potential customers are shown numerous different sets of products. Each set includes several product profiles that have multiple conjoined product features.
  • For each set of products, they select (by ranking, rating, or choosing) the profiles according to some overall criterion, such as preference, acceptability, or likelihood of purchase.
  • Choice simulator models such as the first choice model, average probability model, and the logit model are used to estimate the utility of each product feature.
  • This information is then incorporated into a new or existing product.

Data Collection

Instead of asking respondents what they prefer in a product directly, conjoint analysis presents to them potential product profiles that have specific combinations of attributes. These profiles can be evaluated in several ways, including:

  • Rating (e.g. rating with a scale from 1-10)
  • Ranking (e.g. rank best as 1st, 2nd, 3rd, etc.)
  • Choice (e.g. four options, which do you choose?)

An example of a product profile using rating is shown below:

Image:rating.png

An example of a product profile using choice is shown below:

Image:choice.png

The conjoint analysis is usually done using stated choice, which is data collected using a survey that is typically developed with experimental design (aka design of experiments) techniques. A full factorial survey could be generated that tests every possible combination of attributes. However, in most cases this is not feasible due to the great amount of questions needed. It is also not even necessary, since balanced and orthogonal arrays can be used to reduce the number to a much smaller fractional factorial design. This design can be generated using a program such as SAS Institute's SAS system. The advantage of stated choice data is that it is possible to conduct a controlled experiment, which provides high-quality data. However, the task of evaluating product profiles in the survey may differ from the consumer experience in the marketplace, and survey data measures what people say and not necessarily what they do. Designing such a survey is an iterative process, and while there are no fixed rules on how to create a "good" conjoint survey, there are some best practices for designing a conjoint survey.

An alternative to stated choice data is revealed preference data. This consists of using data that is collected from actual performance in the marketplace. The advantage of this is that the data is a real measure of how consumers act. However, there are problems with multicollinearity - it is difficult to tell exactly why someone is choosing one product over another.

Simulation Analysis

There are many different simulator models available for use in estimating the utility function of the product attributes. These include the first choice model, the average choice model, and the logit model.

First choice model

For each respondent, the product with the highest utility is the product of choice and receives a value of 1. If there is a tie for the highest utility, both products get a value of 0.5. In the end the most votes for a product evaluated as a proportion of the number of respondents is the most desirable product.

Average choice model

Also known as the Bradley-Terry-Luce model. The choice probability for a product is based on the utility of that product divided by the sum of all products in the simulated market.

Random Utility Discrete Choice Models

Introduces a random error term to the measurement of utility and calculates the probability of choice as equal to the probability of the utility of one product being greater than the utility of all other products. Models vary based on the assumed distribution of the random error component and the specification of the deterministic component of utility. The most popular models are the logit model and the probit model.

External links

  • SAS Software Software for designing full-factorial and fractional factorial surveys.
  • AlgDesign package in R for full-factorial and fractional factorial conjoint design.

References

  • Green, P. E., V. Rao, and J. Wind, (1999) Conjoint Analysis: Methods and Applications. Knowledge@Wharton.
  • Green, P. E. and V. Srinivasan (1978) "Conjoint Analysis in Consumer Research" Issues and Outlook", Journal of Consumer Research, Vol. 5, (September), pp 103-123.
  • Luce, R. D. and J. W. Tukey. "Simultaneous Conjoint Measurement: A New Type of Fundamental Measurement," Journal of Mathematical Psychology, 1 (February 1964), pp 1-27.
Personal tools