I am not sure if this is what you are seeking, but here are a few comments.
When I teach an introductory JMP class some of the key ponts I make include:
Then another key point is to introduce stacked tables great for By analyses (Anova) vs. unstacked used for modeling and correlation. This fits the rubric of organizing or data prep for analyses.
I agree that JMP is not a spreadsheet, however, in many companies, including the Pharma industry, spreadsheets are still playing as the main character in data collection and data analysis. The industry learned how to validate these spreadsheets so it will not need 'expensive' software for data collection,l trending, and analysis. Statistical analysis must come with built-in validated fixed options as part of the analysis package, compliant with relevant Data integrity regulations to compete with standard spreadsheets.
Hi @Sagi_Nahum I agree with you -- spreadsheets are still ubiquitous in the medical device industry too I can tell you that much.
Consider this perspective however. Spreadsheets would require software validation or 100% verification since data can be entered and formulas calculated in any cell configuration. All of this layout and formula creation makes for an end-product that is highly customize-able and therefore subject to calculation error. Hence we need robust verification or validation activities to ensure data integrity.
Consider the same implementation in JMP. I can construct calculations using 'Instant Column Formulas' and generate these formulas with a few right-clicks using my mouse and the menu-driven capabilities in JMP. I can create summary tables of complex calculations using Right-Click > "make combined data table" or Right-Click > Edit > "Make a table of graphs like this". I can save my analysis scripts as JSL code embedded in my data table(s). By doing this, I take advantage of menu-driven capabilities in JMP and I also generate source code which, if clicked on, ensures that my analysis is fully reproducible. So from this aspect, JMP can offer a unique advantage over spreadsheet-based software in that the "verification" pathway can be much easier in many cases, because it is less subject to human calculation-error. Also, the more rigid-structure of column-based formulas in JMP makes it such that the user must enter one formula for one column and cannot manipulate cells within a column independently.
Finally, I think JMP is still a spreadsheet in many ways. Why not use it this way if it resonates with your brain better? I'll attach an example for you with some column formulas. You can ignore replicated values down the rows of a Column. In essence, your calculation can be in the service of "row 1" (just "fill the cells below with a dark color to encourage the viewer to ignore them). The redundant row-wise calculations are computationally very cheap anyway if you have a table of only a few hundred to a few (hundred) thousand rows.
I use JMP as a database that can be computed directly.
Its sql-like statements are too powerful.
JSL is very powerful and very flexible.I have always insisted on using JSL to operate JMP.
Thanks for the post Mark. When I teach an Intro to JMP class I refer students to the Discovering JMP book that comes with JMP Help>Books>Discovering JMP and I point them to page 63, How is JMP Different from Excel? That seems to be very helpful to students.
I do not know if I can add much more to what has already been stated here, but here goes.
At my insistence, my employer purchased JMP so that I could perform the type(s) of analysis required for and by our most demanding client. While most government contracts are gray, this particular client wanted to go deeper than “what would happen if we implemented this?”¹ Or, as he put it, he wanted his study “to have a methodology and results that were defensible.” That meant following a statistical process: design of experiments, a data collection plan based on the DoE, modeling and analysis of the data, and inferencing the results from statistical analysis.
Prior to our acquisition of JMP, everything was performed in Excel. Depending on who was processing the data the workbooks that were created ran anywhere from mediocre to why. Despite the large user base, Excel is still at its core a powerful accounting ledger, so what our management incorrectly deemed to be statistical analysis was nothing more than summary analysis.² Proving this particular client’s need was not going to happen in Excel.
When we acquired a JMP license, my first thought was that a data table looks very similar to a spreadsheet, but I quickly found and realized that where Excel and other spreadsheet software typically have the cell as the base object; that is, each of the 6,871,947,674 cells is independent having no relation to any other cell unless otherwise defined by the user. As I had prior experience in SAS, it became rapidly apparent that a data table is a data model in waiting. As others have noted, columns serve as either independent variables (predictors) or dependent variables (responses), while the rows in a JMP data table are observations.
JMP’s paradigm prevents users from engaging in some of the more atrocious way in which people use Excel; such as putting multiple “tables” on a single worksheet instead of being treated as independent objects that should occupy their own worksheet. Excel while seemingly having a broad range of graphs, pales in comparison to the Graph Builder platform.
The various platforms offered in JMP have no equivalent in Excel outside of perhaps third-party add-ins. For people inexperienced with statistics, the Analysis ToolPak add-in that included, but not pre-installed in Excel reinforces the misconception that there is not much to performing statistics. Want to design an experiment? Can you employ various techniques to determine if a data set is from a normally distributed population? Need to perform analysis on non-parametric data? Do you want to perform pairwise analysis to determine the factors in your model that are significantly different? If you are using Excel, forget about addressing such needs.
Recently, a co-worker an I showed the president of the company how he could quickly explore data in JMP. The Distribution platform alone sold him, as he saw that with little effort he could rapidly see how data was distributed with histograms and box plots, as well as get quantiles and summary statistics. The capabilities of Graph Builder blew his mind. Our president later asked me if JMP can perform SQL queries, t which I answered, “Yes.” I provided him with the built-in PDF books included with JMP where he how to perform such queries.
Simply put, true data discovery and statistical analtysis software, such as JMP, are far better suited to analytics than the kludges that are applied in Excel. Excel is not designed to perform much beyond reporting summary statistics and basic graphs.
¹ It is often the case that government contracts are posed by non-technical persons at the management level. More often than not, the contractor needs to guide their client to a solid, and achievable, objective given the biding price and contract period of performance.
² One of the major issues data analysts and data scientists face is getting their supervisors and clients to realize that the generation of descriptive (summary) statistics is not the application of statistics, or more properly, statical science and data analytics that leads to statistical inference.
I also stumbled on to this thread and thank the previous responders for some great ideas. I offer my humble experiences. I do two things in my professional life:
1. I teach experimental design and sampling as means to gather data (and regression when there is no data collection plan) and a host of procedures to analyze the data. I follow my own guidance in the analysis of data: Practical, Graphical and lastly quantitative.
2. I analyze data sets form clients (these can take many forms).
I make a point that ~95% of my data analysis time is spent organizing the data into a structure/order so the data can be analyzed correctly. All questions that can be answered, conclusions that can be made, confidence in extrapolating the results, etc. are DEPENDENT on how the data was acquired (context).
All of my clients use Excel. I gave up on some emphatic DON"T USE THAT POS (piece of software). Software is a language. Many individuals speak Excel fluently and they are comfortable with it. JMP is a different language. Learning a new language can be terrifying and frustrating. Excel has its good points and its issues. Its good points include how extremely flexible it is, that is, of course, one of its bad points. I state ALL statistical analysis software requires discipline in its use (yes this includes dare I say Minitab...yikes). Over the course of working with my clients, it becomes clear the incredible capability of JMP. I demonstrate how organizing the data wrong leads to erroneous analysis. They naturally become convinced it is in their best interest to recognize how JMP performs analysis. I have not found one "event" to be the convincing argument, but repetition. Once they see this (admittedly it takes many reps), they now can use Excel with the knowledge of how it will need to be organized in JMP to perform the correct analysis.
A recent JMP Discussions reply from @markbailey regarding the KS test interpretation illustrates another differentiating global feature in JMP compared to your garden variety spreadsheet application. Interactive/online Help. If one is curious about a specific analysis platform report element and want to learn more, you don't have to hunt/peck/guess by keyword or table of contents search to learn more. All you need to do is go to the JMP main menu bar, select Tools -> Help, then hover your cursor over the element you want to learn more about and click on the element. You'll go directly to the most relevant content in the JMP documentation. Easy peasy.
There are no labels assigned to this post.