Conjoint analysis

From DDL Wiki

(Difference between revisions)
Jump to: navigation, search
(Data Collection)
Current revision (11:09, 30 January 2013) (view source)
(Data Collection)
 
(27 intermediate revisions not shown.)
Line 1: Line 1:
-
Conjoint analysis, also known as multiattribute compositional models, is a statistical technique that was introduced into the marketing research community by University of Pennsylvania, Wharton professor Paul Green in the late 1960s.  Since then, conjoint analysis has become the most popular multiattribute choice model in marketing.  It can be used as a trade-off measurement technique to determine consumer preferences and buying intentions, and to also predict how consumers might react to changes in a current product or new product introduction.
+
'''Conjoint analysis''', also known as [[multiattribute compositional models]], is a statistical technique that was introduced into the marketing research community by University of Pennsylvania, Wharton professor [[Paul Green]] in the late 1960s.  Since then, conjoint analysis has become the most popular multiattribute choice model in marketing.  It can be used as a trade-off measurement technique to determine consumer preferences and buying intentions, and to also predict how consumers might react to changes in a current product or new product introduction. Conjoint analysis is often used today in [[new product design]] and advertising.
 +
 
 +
Conjoint analysis generally implies the use of [[design of experiments]] statistical techniques to construct efficient survey designs and assess the relative importance of each attribute (and it's possible values, or "levels") that compose a product. The independent variables (factors) are the attributes set to discrete levels, and the dependent variable (output) is the consumer response: either rating, ranking, or choice among the presented set of alternatives.
 +
==Basic Process==
==Basic Process==
Line 6: Line 9:
*  Potential customers are shown numerous different sets of products.  Each set includes several product profiles that have multiple conjoined product features.
*  Potential customers are shown numerous different sets of products.  Each set includes several product profiles that have multiple conjoined product features.
*  For each set of products, they select (by ranking, rating, or choosing) the profiles according to some overall criterion, such as preference, acceptability, or likelihood of purchase.  
*  For each set of products, they select (by ranking, rating, or choosing) the profiles according to some overall criterion, such as preference, acceptability, or likelihood of purchase.  
-
*  Choice simulator models such as the first choice model, average probability model, and the [[logit model]] are used to estimate the utility of each product feature.
+
*  Choice simulator models such as the first choice model, average probability model, and the [[logit model]] are used to estimate the [[utility]] of each product feature.
*  This information is then incorporated into a new or existing product.
*  This information is then incorporated into a new or existing product.
Line 17: Line 20:
An example of a product profile using rating is shown below:
An example of a product profile using rating is shown below:
-
{| class="wikitable" border="1"
+
[[Image:rating.png]]
-
|-
+
-
! How likely are you to purchase this computer? <br />(Use a scale from 1=10, with 1 being "not at all likely" and 10 being "would definitely purchase")
+
-
|-
+
-
!  <br />3 GHz Processor<br />100GB Hard Drive<br />512MB RAM<br />$1,000 <br /><br /> Your Score: __________ <br />
+
-
|}
+
-
 
+
An example of a product profile using choice is shown below:
An example of a product profile using choice is shown below:
-
{| class="wikitable" border="1"
+
[[Image:choice.png]]
-
|-
+
-
! If you were in the market to purchase a PC today... 
+
-
|-
+
-
! 3GHz !! 4GHz !! 2GHz !! None
+
-
|}
+
 +
The conjoint analysis is usually done using [[stated choice]], which is data collected using a survey that is typically developed with [[experimental design]] (aka [[design of experiments]]) techniques.  A [[full factorial]] survey could be generated that tests every possible combination of attributes.  However, in most cases this is not feasible due to the great amount of questions needed.  It is also not even necessary, since balanced and orthogonal arrays can be used to reduce the number to a much smaller [[fractional factorial]] design. This design can be generated using a program such as SAS Institute's [[SAS]] system. The advantage of stated choice data is that it is possible to conduct a controlled experiment, which provides high-quality data. However, the task of evaluating product profiles in the survey may differ from the consumer experience in the marketplace, and survey data measures what people say and not necessarily what they do. Designing such a survey is an iterative process, and while there are no fixed rules on how to create a "good" conjoint survey, there are some best practices for [[designing a conjoint survey]].
-
The conjoint analysis is usually done using [[stated choice]], which is data collected using a survey.  The advantage of this method is that it is a controlled experiment.  However, the product profiles in the survey are not always the same as products in the marketplace and are also not always desired.  Another disadvantage is that it measures what people say and not what they do; hypothetical results are not always true in real life.  An alternative to stated choice data is [[revealed preference]] data.  This consists of using data that is collected from actual performance in the marketplace.  The advantage of this is that the data is a real measure of how consumers act.  However, there are problems with multi-collinearity - it is difficult to tell exactly why someone is choosing one product over another.
+
An alternative to stated choice data is [[revealed preference]] data.  This consists of using data that is collected from actual performance in the marketplace.  The advantage of this is that the data is a real measure of how consumers act.  However, there are problems with [[multicollinearity]] - it is difficult to tell exactly why someone is choosing one product over another.
-
A [[full-factorial]] survey could be generated that tests every possible combination of attributes.  However, in most cases this is not feasible due to the great amount of questions needed.  It is also not even necessary, since balanced and orthogonal arrays can be used to reduce the number to a much smaller [[fractional-factorial]] design. This design can be generated using a program such as SAS Intstitute's [[SAS System]].
+
==Simulation Analysis==
 +
There are many different simulator models available for use in estimating the utility function of the product attributes.  These include the [[first choice model]], the [[average choice model]], and the [[logit model]].
-
{| class="wikitable" border="1"
+
===[[First choice model]]===
-
|-
+
For each respondent, the product with the highest utility is the product of choice and receives a value of 1.  If there is a tie for the highest utility, both products get a value of 0.5.  In the end the most votes for a product evaluated as a proportion of the number of respondents is the most desirable product.
-
! header 1 !! header 2 !! header 3
+
-
|-
+
-
! Row header 1
+
-
| cell 1,1 || cell 1,2
+
-
|-
+
-
! Row header 2
+
-
| cell 2,1 || cell 2,2
+
-
|}
+
-
==Simulation Analysis==
+
===[[Average choice model]]===
-
The most common simulator models include the first choice model, the average choice (Bradley-Terry-Luce) model, and the Logit model. The First choice model identifies the product with the highest utility as the product of choice. This product is selected and receives a value of 1. Ties receive a .5 value. After the process is repeated for each respondent's utility set, the cumulative "votes" for each product are evaluated as a proportion of the votes or respondents in the sample. The Bradley-Terry-Luce model estimates choice probability in a different fashion. The choice probability for a given product is based on the utility for that product divided by the sum of all products in the simulated market. The logit model uses an assigned choice probability that is proportional to an increasing monotonic function of the alternative's utility. The choice probabilities are computed by dividing the logit value for one product by the sum for all other products in the simulation. These individual choice probabilities are averaged across respondents. In summary, while the literature shows the maximum utility (first choice model) to provide the best overall validation, choice behavior has a strong probabilistic component. We have not measured this component adequately, but instead attributed lack of validity to "noise", our inability to model information search and overload effects, and measurement error.
+
Also known as the [[Bradley-Terry-Luce model]]. The choice probability for a product is based on the utility of that product divided by the sum of all products in the simulated market.
-
==Example==
+
===[[Random Utility Discrete Choice Models]]===
-
Here is an example of a question on a survey
+
Introduces a random error term to the measurement of utility and calculates the probability of choice as equal to the probability of the utility of one product being greater than the utility of all other products. Models vary based on the assumed distribution of the random error component and the specification of the deterministic component of utility. The most popular models are the [[logit model]] and the [[probit model]].
==External links==
==External links==
-
*[http://www.sawtoothsoftware.com/qs-whatisconjoint.shtml What is conjoint analysis?] A useful conjoint software site that explains conjoint analysis in detail.
 
-
 
*[http://www.sas.com SAS Software] Software for designing full-factorial and fractional factorial surveys.
*[http://www.sas.com SAS Software] Software for designing full-factorial and fractional factorial surveys.
 +
*[http://cran.r-project.org/web/packages/AlgDesign/index.html AlgDesign] package in [[R]] for full-factorial and fractional factorial conjoint design.
 +
==References==
 +
*[http://www.sawtoothsoftware.com/qs-whatisconjoint.shtml What is conjoint analysis?] By Sawtooth Software.
-
==Reference==
+
*[http://marketing.byu.edu/htmlpages/tutorials/conjoint.htm Conjoint Analysis Tutorial] From Bringham Young University's Institute of Marketing.
-
*Green, P. E., V. Rao, and J. Wind, Conjoint Analysis: Methods and Applications, 1999, Knowledge@Wharton.
+
-
*Green, P. E. and V. Srinivasan (1978), "Conjoint Analysis in Consumer Research" Issues and Outlook", Journal of Consumer Research, Vol. 5, (September), pp 103-123.  
+
*Green, P. E., V. Rao, and J. Wind, (1999) Conjoint Analysis: Methods and Applications. Knowledge@Wharton.
 +
 
 +
*Green, P. E. and V. Srinivasan (1978) "Conjoint Analysis in Consumer Research" Issues and Outlook", Journal of Consumer Research, Vol. 5, (September), pp 103-123.  
*Luce, R. D. and J. W. Tukey. "Simultaneous Conjoint Measurement: A New Type of Fundamental Measurement," Journal of Mathematical Psychology, 1 (February 1964), pp 1-27.
*Luce, R. D. and J. W. Tukey. "Simultaneous Conjoint Measurement: A New Type of Fundamental Measurement," Journal of Mathematical Psychology, 1 (February 1964), pp 1-27.
 +
 +
*Nishimura, K., and Aizaki, H., (2008) Design and Analysis of Choice Experiments Using R: A Brief Introduction, Agricultural Information Research, 17(2), pp.86-94. http://www.jstage.jst.go.jp/article/air/17/2/17_86/_article
 +
 +
[[Category:market research methods]]

Current revision

Conjoint analysis, also known as multiattribute compositional models, is a statistical technique that was introduced into the marketing research community by University of Pennsylvania, Wharton professor Paul Green in the late 1960s. Since then, conjoint analysis has become the most popular multiattribute choice model in marketing. It can be used as a trade-off measurement technique to determine consumer preferences and buying intentions, and to also predict how consumers might react to changes in a current product or new product introduction. Conjoint analysis is often used today in new product design and advertising.

Conjoint analysis generally implies the use of design of experiments statistical techniques to construct efficient survey designs and assess the relative importance of each attribute (and it's possible values, or "levels") that compose a product. The independent variables (factors) are the attributes set to discrete levels, and the dependent variable (output) is the consumer response: either rating, ranking, or choice among the presented set of alternatives.


Contents

Basic Process

The basic process for a conjoint analysis is as follows:

  • The product features that will be analyzed are determined.
  • Potential customers are shown numerous different sets of products. Each set includes several product profiles that have multiple conjoined product features.
  • For each set of products, they select (by ranking, rating, or choosing) the profiles according to some overall criterion, such as preference, acceptability, or likelihood of purchase.
  • Choice simulator models such as the first choice model, average probability model, and the logit model are used to estimate the utility of each product feature.
  • This information is then incorporated into a new or existing product.

Data Collection

Instead of asking respondents what they prefer in a product directly, conjoint analysis presents to them potential product profiles that have specific combinations of attributes. These profiles can be evaluated in several ways, including:

  • Rating (e.g. rating with a scale from 1-10)
  • Ranking (e.g. rank best as 1st, 2nd, 3rd, etc.)
  • Choice (e.g. four options, which do you choose?)

An example of a product profile using rating is shown below:

Image:rating.png

An example of a product profile using choice is shown below:

Image:choice.png

The conjoint analysis is usually done using stated choice, which is data collected using a survey that is typically developed with experimental design (aka design of experiments) techniques. A full factorial survey could be generated that tests every possible combination of attributes. However, in most cases this is not feasible due to the great amount of questions needed. It is also not even necessary, since balanced and orthogonal arrays can be used to reduce the number to a much smaller fractional factorial design. This design can be generated using a program such as SAS Institute's SAS system. The advantage of stated choice data is that it is possible to conduct a controlled experiment, which provides high-quality data. However, the task of evaluating product profiles in the survey may differ from the consumer experience in the marketplace, and survey data measures what people say and not necessarily what they do. Designing such a survey is an iterative process, and while there are no fixed rules on how to create a "good" conjoint survey, there are some best practices for designing a conjoint survey.

An alternative to stated choice data is revealed preference data. This consists of using data that is collected from actual performance in the marketplace. The advantage of this is that the data is a real measure of how consumers act. However, there are problems with multicollinearity - it is difficult to tell exactly why someone is choosing one product over another.

Simulation Analysis

There are many different simulator models available for use in estimating the utility function of the product attributes. These include the first choice model, the average choice model, and the logit model.

First choice model

For each respondent, the product with the highest utility is the product of choice and receives a value of 1. If there is a tie for the highest utility, both products get a value of 0.5. In the end the most votes for a product evaluated as a proportion of the number of respondents is the most desirable product.

Average choice model

Also known as the Bradley-Terry-Luce model. The choice probability for a product is based on the utility of that product divided by the sum of all products in the simulated market.

Random Utility Discrete Choice Models

Introduces a random error term to the measurement of utility and calculates the probability of choice as equal to the probability of the utility of one product being greater than the utility of all other products. Models vary based on the assumed distribution of the random error component and the specification of the deterministic component of utility. The most popular models are the logit model and the probit model.

External links

  • SAS Software Software for designing full-factorial and fractional factorial surveys.
  • AlgDesign package in R for full-factorial and fractional factorial conjoint design.

References

  • Green, P. E., V. Rao, and J. Wind, (1999) Conjoint Analysis: Methods and Applications. Knowledge@Wharton.
  • Green, P. E. and V. Srinivasan (1978) "Conjoint Analysis in Consumer Research" Issues and Outlook", Journal of Consumer Research, Vol. 5, (September), pp 103-123.
  • Luce, R. D. and J. W. Tukey. "Simultaneous Conjoint Measurement: A New Type of Fundamental Measurement," Journal of Mathematical Psychology, 1 (February 1964), pp 1-27.
Personal tools