cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Choose Language Hide Translation Bar
lala
Level VIII

Can this simple two-step operation be done in one step?

Can it be done with memory arrays?

Thanks!

dt = Open( "$SAMPLE_DATA/Big Class.jmp" );
ca = "sex";
dt << Select Where( Row() < 25 );
dt << Select Columns( ca );
d3 = dt << Subset( Output Table( "b3" ), Selected Rows( 1 ), columns( ca ) );
d2 = d3 << Summary(
	Group( 1 ),
	Freq( 0 ),
	Weight( 0 ),
	Link to original data table( 0 ),
	Output Table( "b2" )
);
  • Is there no need to generate d3?

1 ACCEPTED SOLUTION

Accepted Solutions
jthi
Super User

Re: Can this simple two-step operation be done in one step?

Here are few options, but it is very easy to come up with even more (also I'm not sure what would be the issue of using subset as you can create it as private and close it immediately after you have the subset)

Names Default To Here(1);

dt = Open("$SAMPLE_DATA/Big Class.jmp");
groupcol = "sex";

// Filter
dt_filter = dt << Summary(
	Local Data Filter(
		Add Filter(
			columns(Transform Column("Row", Formula(Row()))),
			Where(Transform Column("Row", Formula(Row())) < 25)
		)
	),
	Group(Eval(groupcol)),
	Freq("None"),
	Weight("None"),
	Link to original data table(0),
	Output table("1")
);

// Exclude
dt << Select Where(Row() >= 25) << Hide and Exclude(1);
dt_exclude = dt << Summary(
	Group(Eval(groupcol)),
	Freq("None"),
	Weight("None"),
	Link to original data table(0),
	Output table("2")
);
dt << Clear Row States;

// SQL
dt_sql = Query(
	Table(dt, "bc"),
	Eval Insert("\[
	SELECT ^groupcol^, COUNT(rowid) "N Rows"
	FROM bc
	WHERE ROWID < 25
	GROUP BY ^groupcol^
	]\")
);
dt_sql << Set Name("3");


// Summarize and transform (could also use exclude)
dt << Transform Column("A", Formula(If(Row() < 25, 1, .)));
Summarize(dt, groups = By(Eval(groupcol)), counts = Sum(:A)); // Summarize requires numeric columns
dt_summarize = New Table("4",
	New Column(groupcol, Character, Nominal, Values(groups)),
	New Column("N Rows", Numeric, Continuous, Values(counts))	
);
dt << Delete Columns(:A);

Write();
-Jarmo

View solution in original post

5 REPLIES 5
jthi
Super User

Re: Can this simple two-step operation be done in one step?

What do you want to do?

 

Edit: create summary table?

-Jarmo
lala
Level VIII

Re: Can this simple two-step operation be done in one step?

I want to extract a subset of the data in this column, and then aggregate the respective numbers of these subsets.Note that the format of this column is text.

 

Thanks!

jthi
Super User

Re: Can this simple two-step operation be done in one step?

Do you want them in table? Do you need the subset (you mentioned you want to "extract a subset")?

-Jarmo
lala
Level VIII

Re: Can this simple two-step operation be done in one step?

Subsets may not appear.However, the final breakdown needs to be presented as a table of data.

Thanks!

jthi
Super User

Re: Can this simple two-step operation be done in one step?

Here are few options, but it is very easy to come up with even more (also I'm not sure what would be the issue of using subset as you can create it as private and close it immediately after you have the subset)

Names Default To Here(1);

dt = Open("$SAMPLE_DATA/Big Class.jmp");
groupcol = "sex";

// Filter
dt_filter = dt << Summary(
	Local Data Filter(
		Add Filter(
			columns(Transform Column("Row", Formula(Row()))),
			Where(Transform Column("Row", Formula(Row())) < 25)
		)
	),
	Group(Eval(groupcol)),
	Freq("None"),
	Weight("None"),
	Link to original data table(0),
	Output table("1")
);

// Exclude
dt << Select Where(Row() >= 25) << Hide and Exclude(1);
dt_exclude = dt << Summary(
	Group(Eval(groupcol)),
	Freq("None"),
	Weight("None"),
	Link to original data table(0),
	Output table("2")
);
dt << Clear Row States;

// SQL
dt_sql = Query(
	Table(dt, "bc"),
	Eval Insert("\[
	SELECT ^groupcol^, COUNT(rowid) "N Rows"
	FROM bc
	WHERE ROWID < 25
	GROUP BY ^groupcol^
	]\")
);
dt_sql << Set Name("3");


// Summarize and transform (could also use exclude)
dt << Transform Column("A", Formula(If(Row() < 25, 1, .)));
Summarize(dt, groups = By(Eval(groupcol)), counts = Sum(:A)); // Summarize requires numeric columns
dt_summarize = New Table("4",
	New Column(groupcol, Character, Nominal, Values(groups)),
	New Column("N Rows", Numeric, Continuous, Values(counts))	
);
dt << Delete Columns(:A);

Write();
-Jarmo