Stan Siranovich, Crucial Connection, LLC

 

Much has been written in the popular press about “Big Data” and its uses, both good and bad.  Less well reported, but just as revolutionary, has been the development of statistical discovery software and analytical techniques used to unearth relationships and to make predictions using this data.  One such statistical technique is that of Partial Least Squares.

 

In this poster session, we will use JMP Statistical Discovery Software and the Partial Least Squares platform to explore protein tertiary structure, downloaded from a large public data set of 45,730 rows by 10 columns.  In particular, we will use Partial Least Squares analysis to predict the Root Mean Square Deviation (RMSD) between two proteins from nine very highly correlated variables.  We will delve into an explanation of the output data and what it means, then look “under the hood” at what calculations or algorithms the software performed to give us our result.

 

Slide1.JPGSlide2.JPGSlide3.JPGSlide4.JPGSlide5.JPGSlide6.JPGSlide7.JPGSlide8.JPG

Presented At Discovery Summit 2018

Presenter

Files

Published on ‎03-24-2025 08:51 AM by Community Manager Community Manager | Updated on ‎03-27-2025 09:04 AM

Stan Siranovich, Crucial Connection, LLC

 

Much has been written in the popular press about “Big Data” and its uses, both good and bad.  Less well reported, but just as revolutionary, has been the development of statistical discovery software and analytical techniques used to unearth relationships and to make predictions using this data.  One such statistical technique is that of Partial Least Squares.

 

In this poster session, we will use JMP Statistical Discovery Software and the Partial Least Squares platform to explore protein tertiary structure, downloaded from a large public data set of 45,730 rows by 10 columns.  In particular, we will use Partial Least Squares analysis to predict the Root Mean Square Deviation (RMSD) between two proteins from nine very highly correlated variables.  We will delve into an explanation of the output data and what it means, then look “under the hood” at what calculations or algorithms the software performed to give us our result.

 

Slide1.JPGSlide2.JPGSlide3.JPGSlide4.JPGSlide5.JPGSlide6.JPGSlide7.JPGSlide8.JPG



Start:
Mon, Oct 8, 2018 09:00 AM EDT
End:
Fri, Oct 12, 2018 05:00 PM EDT
Attachments
0 Kudos