<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Conceptual question about the profit matrix in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/Conceptual-question-about-the-profit-matrix/m-p/47042#M26805</link>
    <description>&lt;P&gt;This issue has been discussed previously, but I keep finding myself confused about how to use the profit matrix.&amp;nbsp; My question is conceptual in nature, so let's just use some simple numbers.&amp;nbsp; Suppose I have data on customer churn and I build a predictive model for churn (the particular technique does not matter for this question).&amp;nbsp; My decision errors are not symmetric, however, so I want to modify my probability cutoff to reduce the more costly misclassification error.&amp;nbsp; Suppose the relevant data is:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;I will send every predicted lost customer a $50 gift.&lt;/LI&gt;&lt;LI&gt;I expect 10% of these customers to be retained as a result of the gift.&lt;/LI&gt;&lt;LI&gt;Every retained customer has an expected lifetime present value of profits of $1000.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;From this, I know that falsely predicting a customer will be retained (false positive) costs me more than falsely predicting I will lose a customer (false negative).&amp;nbsp; However, I can't decide how to operationalize this.&amp;nbsp; The profit matrix would seem a natural place to incorporate this information, but the example in the JMP documentation does not easily translate to this problem.&amp;nbsp; My inclination is to use a profit matrix that look like&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; predicted churn=1 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; predicted churn=0&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; actual churn = 1 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; -50&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; -100&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; actual churn = 0 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; +50&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; +1000&lt;/P&gt;&lt;P&gt;My reasoning would be that I will send a $50 gift to all predicted churners, but that 10% of these will actually be retained (expected profits = 10%(1000)=$100).&amp;nbsp; However, the model prediction concerns the classifications of the initial model, not the results of my marketing efforts.&amp;nbsp; In other words, the actual churn=0 is not the result of my gift, it is the misclassifications of the initial model.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Similarly, the -100 in the matrix results from the fact that if I predict the customer to be retained and they are not, I don't send the gift (saving $50) but I miss the opportunity to have retained 10% of these customers (expected profits of $100).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The fact that the misclassification errors are not symmetric is clear - but the way to implement this is not.&amp;nbsp; Even if I don't use the profit matrix, it isn't clear to me how to choose a probability cutoff in my classification model that incorporates the simplified data I assumed above.&amp;nbsp; I realize that the classification model would really be the first step in a process whereby I would want to experiment with ways to increase retention.&amp;nbsp; But the assumptions I make about the gifts, probability of success, and expected profits should provide enough information to use in applying the classification model to this problem.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone help sort out how to incorporate the assumed information into choosing a probability cutoff for a classification model?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
    <pubDate>Fri, 10 Nov 2017 14:02:47 GMT</pubDate>
    <dc:creator>dale_lehman</dc:creator>
    <dc:date>2017-11-10T14:02:47Z</dc:date>
    <item>
      <title>Conceptual question about the profit matrix</title>
      <link>https://community.jmp.com/t5/Discussions/Conceptual-question-about-the-profit-matrix/m-p/47042#M26805</link>
      <description>&lt;P&gt;This issue has been discussed previously, but I keep finding myself confused about how to use the profit matrix.&amp;nbsp; My question is conceptual in nature, so let's just use some simple numbers.&amp;nbsp; Suppose I have data on customer churn and I build a predictive model for churn (the particular technique does not matter for this question).&amp;nbsp; My decision errors are not symmetric, however, so I want to modify my probability cutoff to reduce the more costly misclassification error.&amp;nbsp; Suppose the relevant data is:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;I will send every predicted lost customer a $50 gift.&lt;/LI&gt;&lt;LI&gt;I expect 10% of these customers to be retained as a result of the gift.&lt;/LI&gt;&lt;LI&gt;Every retained customer has an expected lifetime present value of profits of $1000.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;From this, I know that falsely predicting a customer will be retained (false positive) costs me more than falsely predicting I will lose a customer (false negative).&amp;nbsp; However, I can't decide how to operationalize this.&amp;nbsp; The profit matrix would seem a natural place to incorporate this information, but the example in the JMP documentation does not easily translate to this problem.&amp;nbsp; My inclination is to use a profit matrix that look like&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; predicted churn=1 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; predicted churn=0&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; actual churn = 1 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; -50&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; -100&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; actual churn = 0 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; +50&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; +1000&lt;/P&gt;&lt;P&gt;My reasoning would be that I will send a $50 gift to all predicted churners, but that 10% of these will actually be retained (expected profits = 10%(1000)=$100).&amp;nbsp; However, the model prediction concerns the classifications of the initial model, not the results of my marketing efforts.&amp;nbsp; In other words, the actual churn=0 is not the result of my gift, it is the misclassifications of the initial model.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Similarly, the -100 in the matrix results from the fact that if I predict the customer to be retained and they are not, I don't send the gift (saving $50) but I miss the opportunity to have retained 10% of these customers (expected profits of $100).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The fact that the misclassification errors are not symmetric is clear - but the way to implement this is not.&amp;nbsp; Even if I don't use the profit matrix, it isn't clear to me how to choose a probability cutoff in my classification model that incorporates the simplified data I assumed above.&amp;nbsp; I realize that the classification model would really be the first step in a process whereby I would want to experiment with ways to increase retention.&amp;nbsp; But the assumptions I make about the gifts, probability of success, and expected profits should provide enough information to use in applying the classification model to this problem.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone help sort out how to incorporate the assumed information into choosing a probability cutoff for a classification model?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Fri, 10 Nov 2017 14:02:47 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Conceptual-question-about-the-profit-matrix/m-p/47042#M26805</guid>
      <dc:creator>dale_lehman</dc:creator>
      <dc:date>2017-11-10T14:02:47Z</dc:date>
    </item>
  </channel>
</rss>

