<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: When to use data transformations in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/When-to-use-data-transformations/m-p/868102#M103103</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.jmp.com/t5/user/viewprofilepage/user-id/67399"&gt;@blip555555&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It's very difficult (if not impossible) to help you without an (anonymized) dataset with the situation you're facing. Please read the post&amp;nbsp;&lt;LI-MESSAGE title="Getting correct answers to correct questions quickly" uid="550097" url="https://community.jmp.com/t5/Discussions/Getting-correct-answers-to-correct-questions-quickly/m-p/550097#U550097" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-forum-thread lia-fa-icon lia-fa-forum lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To assess if a transformation would be needed, it's important to look at residuals plot, to check if there is still a pattern in residuals that is not handled by the assumed model. Are you experiencing heteroscedasticity ? Or strange patterns in your residuals ? You can look at&amp;nbsp;&lt;A href="https://www.jmp.com/en/statistics-knowledge-portal/what-is-regression/simple-linear-regression-assumptions#:~:text=Because%20we%20are%20fitting%20a,value%20of%20the%20predictor%20increases." target="_blank" rel="noopener"&gt;Regression Model Assumptions | Introduction to Statistics | JMP&lt;/A&gt;&amp;nbsp;for more information.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I would also distinguish data transformation from &lt;A href="https://www.jmp.com/support/help/en/18.1/#page/jmp/overview-of-the-generalized-linear-mixed-models-personality.shtml#" target="_blank" rel="noopener"&gt;Generalized Linear Mixed Models&lt;/A&gt;&amp;nbsp;(GLM) in JMP Pro, where the response distribution can be specified (and enable to fit model with different response distributions: normal, exponential, gamma, ...).&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN&gt;Applying a non-linear (e.g., log, inverse) transformation to the dependent variables not only normalizes the residuals, but also distorts the ratio scale properties of measured variables. Transformation affects both the average response and its variance, so the error term can be greatly inflated.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;Applying GLM and setting up the model with link function enable to stay in the original scale of the data, using a link function to transform the mean into a linear function of the predictor variables and a variance function to allow for variance heterogeneity in the analysis rather than trying to transform it away (for example through log transform). So the link function affects the mean response but not the response variance, enabling to have an error in the original scale.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;You can read&amp;nbsp;&lt;LI-MESSAGE title="Difference between &amp;amp;quot;least square&amp;amp;quot; and &amp;amp;quot;generelized linear method&amp;amp;quot; in the fit model" uid="638087" url="https://community.jmp.com/t5/Discussions/Difference-between-quot-least-square-quot-and-quot-generelized/m-p/638087#U638087" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-forum-thread lia-fa-icon lia-fa-forum lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;&amp;nbsp;for more information.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope this conversation starter may help you,&lt;/P&gt;</description>
    <pubDate>Wed, 16 Apr 2025 08:19:43 GMT</pubDate>
    <dc:creator>Victor_G</dc:creator>
    <dc:date>2025-04-16T08:19:43Z</dc:date>
    <item>
      <title>When to use data transformations</title>
      <link>https://community.jmp.com/t5/Discussions/When-to-use-data-transformations/m-p/867940#M103080</link>
      <description>&lt;P&gt;Good afternoon,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am constructing a LMM and have created a QQ plot attached in image 1. The conditional residuals seem to deviate from normality a fair bit so I transformed the response variable data using log base 10. This improved my R^2 by about 2% and the QQ plot seems a bit better (image 2). However, I'm not sure if these slight improvements are worth the transformation? As I would then have to transform the data back to non-log to report it in my thesis and that would be a fold-change instead of an actual arithmetic difference.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Sat, 12 Apr 2025 14:48:06 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/When-to-use-data-transformations/m-p/867940#M103080</guid>
      <dc:creator>blip555555</dc:creator>
      <dc:date>2025-04-12T14:48:06Z</dc:date>
    </item>
    <item>
      <title>Re: When to use data transformations</title>
      <link>https://community.jmp.com/t5/Discussions/When-to-use-data-transformations/m-p/868032#M103091</link>
      <description>&lt;P&gt;I remember when one of my instructors said "The only reason to do data transformation is to simplify the model". &amp;nbsp;BTW, that was G.E.P. Box. &amp;nbsp;I suggest you read his papers on the subject.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="font-weight: 400;"&gt;Box, G.E.P., Paul Tidwell, (1962) “&lt;EM&gt;Transformation of the Independent Variables&lt;/EM&gt;”, &lt;U&gt;Technometrics&lt;/U&gt;, Vol. 4, No. 4, November&lt;/P&gt;
&lt;P style="font-weight: 400;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="font-weight: 400;"&gt;also the paper and attached discussions:&lt;/P&gt;
&lt;P style="font-weight: 400;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="font-weight: 400;"&gt;Draper, Norman, William Hunter, (1969), "Transformations: Some Examples Revisited", &lt;U&gt;Technometrics&lt;/U&gt;, Vol. 11, No. 1, February&lt;/P&gt;</description>
      <pubDate>Sun, 13 Apr 2025 14:40:50 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/When-to-use-data-transformations/m-p/868032#M103091</guid>
      <dc:creator>statman</dc:creator>
      <dc:date>2025-04-13T14:40:50Z</dc:date>
    </item>
    <item>
      <title>Re: When to use data transformations</title>
      <link>https://community.jmp.com/t5/Discussions/When-to-use-data-transformations/m-p/868102#M103103</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.jmp.com/t5/user/viewprofilepage/user-id/67399"&gt;@blip555555&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It's very difficult (if not impossible) to help you without an (anonymized) dataset with the situation you're facing. Please read the post&amp;nbsp;&lt;LI-MESSAGE title="Getting correct answers to correct questions quickly" uid="550097" url="https://community.jmp.com/t5/Discussions/Getting-correct-answers-to-correct-questions-quickly/m-p/550097#U550097" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-forum-thread lia-fa-icon lia-fa-forum lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To assess if a transformation would be needed, it's important to look at residuals plot, to check if there is still a pattern in residuals that is not handled by the assumed model. Are you experiencing heteroscedasticity ? Or strange patterns in your residuals ? You can look at&amp;nbsp;&lt;A href="https://www.jmp.com/en/statistics-knowledge-portal/what-is-regression/simple-linear-regression-assumptions#:~:text=Because%20we%20are%20fitting%20a,value%20of%20the%20predictor%20increases." target="_blank" rel="noopener"&gt;Regression Model Assumptions | Introduction to Statistics | JMP&lt;/A&gt;&amp;nbsp;for more information.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I would also distinguish data transformation from &lt;A href="https://www.jmp.com/support/help/en/18.1/#page/jmp/overview-of-the-generalized-linear-mixed-models-personality.shtml#" target="_blank" rel="noopener"&gt;Generalized Linear Mixed Models&lt;/A&gt;&amp;nbsp;(GLM) in JMP Pro, where the response distribution can be specified (and enable to fit model with different response distributions: normal, exponential, gamma, ...).&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN&gt;Applying a non-linear (e.g., log, inverse) transformation to the dependent variables not only normalizes the residuals, but also distorts the ratio scale properties of measured variables. Transformation affects both the average response and its variance, so the error term can be greatly inflated.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;Applying GLM and setting up the model with link function enable to stay in the original scale of the data, using a link function to transform the mean into a linear function of the predictor variables and a variance function to allow for variance heterogeneity in the analysis rather than trying to transform it away (for example through log transform). So the link function affects the mean response but not the response variance, enabling to have an error in the original scale.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;You can read&amp;nbsp;&lt;LI-MESSAGE title="Difference between &amp;amp;quot;least square&amp;amp;quot; and &amp;amp;quot;generelized linear method&amp;amp;quot; in the fit model" uid="638087" url="https://community.jmp.com/t5/Discussions/Difference-between-quot-least-square-quot-and-quot-generelized/m-p/638087#U638087" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-forum-thread lia-fa-icon lia-fa-forum lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;&amp;nbsp;for more information.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope this conversation starter may help you,&lt;/P&gt;</description>
      <pubDate>Wed, 16 Apr 2025 08:19:43 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/When-to-use-data-transformations/m-p/868102#M103103</guid>
      <dc:creator>Victor_G</dc:creator>
      <dc:date>2025-04-16T08:19:43Z</dc:date>
    </item>
  </channel>
</rss>

