World Statistics Day was yesterday, but we’re celebrating all week long! This celebration means acknowledging the impact statistics has on our world. Who is your favorite statistician? Share with us who they are and why they top your favorites list.
Choose Language Hide Translation Bar
0 Kudos

Graphical Depiction of the Structure of Data Collection - FRD / Sampling Trees

Our students/coworkers typically find themselves with specific needs related to the graphical depiction of the STRUCTURE of data.  We would like JMP to either improve or modify the existing capability for Ishikawa / Fishbone diagrams to enable the creation of "trees" depicting a data collection, or to ignore the Diagram platform to provide a new way to take a list of variables in a data collection, and display the structure of the data easily in tree format.


A) Depiction of the data present in existing files/DBs, including the presence of confounding in that data - independent variables that are NOT tracked or present in the existing data, but are confounded with fields that are (Example:  Age and Height might not be tracked in a database that DOES have a field for WEIGHT) - so the tree might show that we have five patients, and each has a WEIGHT in the database, but we each also has an Age and Height NOT present in the data.  We show this because in the analysis we want to indicate where an effect might be INCORRECTLY attributed to a variable present, when in fact it could be due in part or total to some other variable we did not collect.


B) Depiction of NEW data collected passively (sampling plans).  The need here is essentially the same as A, but is listed separately as this data would not exist at the time the plan is created.


C) Factor Relationship Diagrams (FRDs).  Essentially the depiction of a DOE created by JMP, with design structure based on the chosen design parameters, but then also the capability of placing within each combination a tree for the unmanipulated variables (unit structure)..For example, for the combination - + + - we would also want to know / show that within this combination, there were two machines examined, and within each we took two material samples.  Ideally Design Structure would be BLACK, and Unit Structure RED.


To attempt to do this via the Diagram platform is awkward, and we are in particular interested in C, as JMP is typically where we choose our design, so it would be nice to have it just translate that into a diagram.  Later enhancements would increase flexibility of unit structure capability, including the ability to add CONDITIONS at the top of any diagram (noises/variables constant throughout).

Tracking Number:

Defect ID:

Super User

This comment is not meant to diminish your wish list item. It is meant as an fyi about something you likely can do today. The JMP variability chart is quite useful to look at data structure and relationships, if there are not too many factors and levels. In fact, it is my "goto" graph for the raw data to look for anomalies or to see if there are any results quite different from each other.


The Variability plot allows the user to drag the order of the "factor" variables. Below is a simple concept script and the resulting "picture".  The 3rd picture I made horizontal, th eusual view, once there are response values to plot.  Note for your example with two entities/tools and 2 samples, I'd use a legend color and/or symbol to show that there were 2 entities, and each value has its own point on the graph.


You might aready know all of this, sorry if I am preaching to the choir. I am just adding this note as an FYI.


Names Default to Here(1);

dt = current data table();  //the DOE table
cnames = dt << get column names("string");

clst = {};

For(i=1, i<=nitems(cnames), i++, 
	If(cnames[i]=="Pattern", Continue());
	prop = (Column(cnames[i])<< get properties list());
	if(Contains(prop, Expr(Design Role)), insert into(clst, cnames[i])  )

If( nitems(clst) > 0,
__dt = dt << Subset(Columns(EvalList(clst)), Invisible, OutputTableName( (dt << get name) || "- Layout") );

__dt <<select rows(1::nrow(__dt));
__dt << Hide(1);
__dt << clear select;
__dt << new Column("Y", continuous, <<set each value(0));

vc = __dt << Variability Chart(
	Y( :Y ),
	X( Eval List(clst) ),
	Vertical Charts( 1 ),
	Std Dev Chart( 0 ),
			{"Variability Chart for Y"},
			{Min( 0.123391170101054 ), Max( 0.940740039529616 ), Inc( 1 ),
			Minor Ticks( 0 ), Label Row(
				{Show Major Labels( 0 ), Show Major Ticks( 0 ),
				Show Minor Ticks( 0 )}
			{"Variability Chart for Y"},
			"Variability Chart",
			{Frame Size( 121, 338 )}
			{"Variability Chart for Y"},
			DropBox( 5 ),
			{Horizontal Alignment( "Center" )}



You can also get a similar view, without the direct labeling, with nested axes in Graph Builder.