<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Genetic Algorithms, Simulated Annealing, etc. for Optimizing QSAR Regression Equations in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/356130#M60602</link>
    <description>&lt;P&gt;The penalized methods are one way to go. But have you considered partial least squares? You data set is wide and shallow. Tailor made for PLS. Still a linear equation. And if your predictors are correlated...even better.&lt;/P&gt;</description>
    <pubDate>Thu, 04 Feb 2021 18:43:16 GMT</pubDate>
    <dc:creator>P_Bartell</dc:creator>
    <dc:date>2021-02-04T18:43:16Z</dc:date>
    <item>
      <title>Genetic Algorithms, Simulated Annealing, etc. for Optimizing QSAR Regression Equations</title>
      <link>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/356030#M60590</link>
      <description>&lt;P&gt;I am interested in creating regression equations for QSAR-related problems.&amp;nbsp; The general idea is to find a linear regression equation from a pool of many (&amp;gt;200) independent variables or descriptors to predict one dependent variable.&amp;nbsp; Typically, the number of rows/observations is small (n=50 at a minimum).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There are ways to do this using Generalized Regression in JMP.&amp;nbsp; While the penalized regression methods or the simple forward or backward elimination can find a "good" linear regression equation, my concern is that these methods may not find the globally optimum solution and that they get stuck in a local minimum.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There are other techniques outside of JMP that can sample many more possible choices such as: genetic algorithms, simulated annealing, particle swarm optimization, ant/bee colony, etc. by taking subsets of descriptors, using a function to evaluate the resulting model, splitting up the subset to make and evaluate newer subsets.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I was wondering if anyone has created a way to do this within JMP or even has a way to do this outside of JMP (e.g. in R, Python, Matlab, SAS, etc.) and can perhaps think of a way to add-in this functionality within JMP?&amp;nbsp; My specific need is to make linear regression equations and not neural networks, random forests, or other types of models that JMP can also make.&lt;/P&gt;</description>
      <pubDate>Sat, 10 Jun 2023 20:43:05 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/356030#M60590</guid>
      <dc:creator>PrakashMadhav</dc:creator>
      <dc:date>2023-06-10T20:43:05Z</dc:date>
    </item>
    <item>
      <title>Re: Genetic Algorithms, Simulated Annealing, etc. for Optimizing QSAR Regression Equations</title>
      <link>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/356130#M60602</link>
      <description>&lt;P&gt;The penalized methods are one way to go. But have you considered partial least squares? You data set is wide and shallow. Tailor made for PLS. Still a linear equation. And if your predictors are correlated...even better.&lt;/P&gt;</description>
      <pubDate>Thu, 04 Feb 2021 18:43:16 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/356130#M60602</guid>
      <dc:creator>P_Bartell</dc:creator>
      <dc:date>2021-02-04T18:43:16Z</dc:date>
    </item>
    <item>
      <title>Re: Genetic Algorithms, Simulated Annealing, etc. for Optimizing QSAR Regression Equations</title>
      <link>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/356214#M60612</link>
      <description>&lt;P&gt;An issue with PLS is that it won't select subsets of the independent variables and will just include every independent variable.&amp;nbsp; This may result in overfitting issues.&lt;/P&gt;</description>
      <pubDate>Thu, 04 Feb 2021 20:55:49 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/356214#M60612</guid>
      <dc:creator>PrakashMadhav</dc:creator>
      <dc:date>2021-02-04T20:55:49Z</dc:date>
    </item>
    <item>
      <title>Re: Genetic Algorithms, Simulated Annealing, etc. for Optimizing QSAR Regression Equations</title>
      <link>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/356222#M60614</link>
      <description>&lt;P&gt;It's not clear to me what you mean by 'subsets of variables'...if variable selection is part of your problem solving goals, PLS can be used quite nicely. You have the flexibility in JMP Pro to select the number of latent factors to work with. Then find the most influential variables within that construct...JMP Pro also has a flexible set of cross validation methods...KFold might work best for your shallow data?&lt;/P&gt;</description>
      <pubDate>Thu, 04 Feb 2021 21:11:47 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/356222#M60614</guid>
      <dc:creator>P_Bartell</dc:creator>
      <dc:date>2021-02-04T21:11:47Z</dc:date>
    </item>
    <item>
      <title>Re: Genetic Algorithms, Simulated Annealing, etc. for Optimizing QSAR Regression Equations</title>
      <link>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/357119#M60713</link>
      <description>&lt;P&gt;I know&amp;nbsp;&lt;a href="https://community.jmp.com/t5/user/viewprofilepage/user-id/14122"&gt;@P_Bartell&lt;/a&gt;&amp;nbsp; loves PLS, and its a good solution in a lot of places. I'm a huge fan of the Tree Methods, and recently an add-in for XGBoost was published:&amp;nbsp;&lt;LI-MESSAGE title="XGBoost Add-In for JMP Pro" uid="319383" url="https://community.jmp.com/t5/JMP-Add-Ins/XGBoost-Add-In-for-JMP-Pro/m-p/319383#U319383" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-tkb-thread lia-fa-icon lia-fa-tkb lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Since it sounds like you have a JMP Pro license, it might be worth your time to take a look at it.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Feb 2021 15:21:09 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/357119#M60713</guid>
      <dc:creator>Byron_JMP</dc:creator>
      <dc:date>2021-02-08T15:21:09Z</dc:date>
    </item>
    <item>
      <title>Re: Genetic Algorithms, Simulated Annealing, etc. for Optimizing QSAR Regression Equations</title>
      <link>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/357176#M60721</link>
      <description>&lt;P&gt;None of these will help, except to suggest that JSL can be written to do simulated annealing and simulations of systems:&lt;/P&gt;&lt;P&gt;&lt;LI-MESSAGE title="How to use Define Class" uid="274410" url="https://community.jmp.com/t5/Uncharted/How-to-use-Define-Class/m-p/274410#U274410" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-blog-thread lia-fa-icon lia-fa-blog lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;&amp;nbsp; annealing/spring-force&lt;/P&gt;&lt;P&gt;&lt;LI-MESSAGE title="Halloween Trilogy: all in one" uid="28574" url="https://community.jmp.com/t5/Uncharted/Halloween-Trilogy-all-in-one/m-p/28574#U28574" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-blog-thread lia-fa-icon lia-fa-blog lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;&amp;nbsp; annealing/spring-force&lt;/P&gt;&lt;P&gt;&lt;LI-MESSAGE title="Wind Visualization" uid="82476" url="https://community.jmp.com/t5/Uncharted/Wind-Visualization/m-p/82476#U82476" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-blog-thread lia-fa-icon lia-fa-blog lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;&amp;nbsp; simulation/visualization of a system&lt;/P&gt;&lt;P&gt;&lt;LI-MESSAGE title="Video flock/swarm/school" uid="21078" url="https://community.jmp.com/t5/Uncharted/Video-flock-swarm-school/m-p/21078#U21078" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-blog-thread lia-fa-icon lia-fa-blog lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;&amp;nbsp; swarming/flocking&lt;/P&gt;&lt;P&gt;These are JSL intensive projects; you'll know a lot about JSL after you go down this path.&lt;/P&gt;</description>
      <pubDate>Mon, 08 Feb 2021 16:19:18 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Genetic-Algorithms-Simulated-Annealing-etc-for-Optimizing-QSAR/m-p/357176#M60721</guid>
      <dc:creator>Craige_Hales</dc:creator>
      <dc:date>2021-02-08T16:19:18Z</dc:date>
    </item>
  </channel>
</rss>

