cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Fill in Missing Cells
Byron_JMP
Staff

Report tables often end up having incomplete columns where labels need to be copied to the sequential cells below until the next label where the process is repeated again and again. The Excel Import Wizard has an option to deal with spans of missing cells that are merged. This script can be used after the data is imported and it doesn't require the spans of missing cells to be merged.

I've been using this script, or versions of it, for so long that I'm not sure where the bones of it came from.

Comments

I wonder if one could do this without Expressions. For one single column "MyCol1" you could do sth. like this:

For Each Row( If( Is Missing( MyCol1 ), :MyCol1= Lag( MyCol1, 1 ) ) );

Using an automated approach to select the columns like you do combined with this should be able to do. 

Byron_JMP

Making the script into an add-in or putting it into a menu option is my crutch for forgetting the formulas to make it happen,

This is another easy formula to do this. 

:Complete is the new column where the formula lives.

:Missing is the source column with missing rows.

//Rows with Character

If( :Missing == "",

  :Complete[Row() - 1, Empty()],

  :Missing)

//Rows with Numbers

If( Is Missing( :Missing ) == 1,

  :Complete [Row() - 1, Empty()],

  :Missing)

Byron_JMP

I'm updateing this add in with this script. Thanks to Brady Brady for some tweaks that make this run faster on really big tables. On a 5MM row table with something to fill in every 10K rows it takes 10-15 sec.

 

Names Default To Here( 1 );
dt = Current Data Table();

dtcollist = dt << get column names;

dlg = Dialog( "Select Column", vlist( selection = List Box( dtcollist ), button( "OK" ) ) );
Remove From( dlg, 2 );
Eval List( dlg );

dtMat = (dt << Get All Columns As Matrix);

For( i = 1, i <= N Items( selection ), i++,
colNum = Contains( dtColList, Name Expr( selection[i] ) );
vMat = dtMat[0, colNum];
mis = Loc( Is Missing( vMat ) );
If( N Row( mis ),
col = column(selection[i]);
If( mis[1] == 1,
col[1] = col[(Loc( vMat ))[1]]
);

For( j = if(mis[1]==1, 2, 1), j <= N Row( mis ), j++,
col[mis[j]] = col[mis[j] - 1]
);
);
);

A similar issue, but a bit different is the case where you want a certain number of cells before the actual entry to have the same value. There you need to make adjustments to not overwrite the previous e.g. 4 cells in case there is a nonmissing entry. Attached a script and a screenshot of the datatable before and after the script has been run. In this script I only covered the numeric case.

Names Default To Here( 1 );
// Get Current Data Table
dt = Current Data Table();

// Get Column Names from selection box (where you want to fill)
dtcollist = dt << get column names;
dlg = Dialog(
	"Select Column",
	vlist( selection = List Box( dtcollist ), button( "OK" ) )
);
Remove From( dlg, 2 );
Eval List( dlg );

// Get matrix of values in selected columns 
dtMat = (dt << Get All Columns As Matrix);

// For loop to loop through all selected columns 
For( i = 1, i <= N Items( selection ), i++,
	// get i-th selected column
	colNum = Contains( dtColList, Name Expr( selection[i] ) );
	// get values as matrix
	vMat = dtMat[0, colNum];
	// get rows where matrix is non-empty
	mis = Loc( vMat );
	// If there is non-missing value loop through previous rows
	If( N Row( mis ),
		// Get i-th Column name
		col = Column( selection[i] );
		/*
		If( mis[1] == 1,
			col[1] = col[(Loc( vMat ))[1]]
		);
		*/
		// Loop to fill previous 4 rows from the row with the k-th non-missing value
		For( k = 1, k <= N Rows(mis), k++,
		For( j = If(mis[k]-4 < 1, 1,mis[k]-4), j < mis[k], j++,
			If(Is Missing(col[j]), col[j] = col[mis[k]],col[j])
		);
		);
	);
);

Data Table before and after applying the scriptData Table before and after applying the script

 

Byron_JMP

Another update:

Instead of using this script, take a look at this collection of tools: https://community.jmp.com/t5/JMP-Add-Ins/Data-Table-Tools-Add-in/ta-p/28582

 

This add-in has a better version of the script above plus a whole lot more.

 

Cheers,

Byron

ENTHU

I have a data table with 2 columns. Need to check if Value is blank for def,jkl,mno and stu and fill in "NA" for these entries.

Any help is appreciated.

ParameterValue
abc5
def 
ghi22
jkl 
mno 
pqr3
stu 
vwx73
yz99

Do you want a script to do this? Interactive you could use recode and save a formula column, this formula script can be used to do what you want 

another way is to use a new formula column and in the formula editor you can use the conditional if statement and check for missing ness and set the value to na

then you could use the ctrl+f to search and replace. 

Sonthere are many options you can choose from. I like the recode quite much. In scripting I’d probably use another way: save the column in a matrix or list as you want to use nonnumeric values “na” which will change the values to characters, change the missing values to na and then write Back the values to the column values. Hope one of these ideas will help you further

ENTHU

Thanks for the reply!

I am trying to write a script for this.The issue here is I might have many more blank columns.But just need to check values for these 4 columns and fill in with "NA".

 

Thanks

ENTHU

Still stuck at this point not knowing how to proceed further.

Any inputs appreciated!

 

Thanks

Hi,

 

thow there might be a more elegant version these two options should work.

Attached is a data table including a tavble script doing what you want (I think). In addition there is a Column with a formula using the Match statement doing the same. The script I used is

Names Default to Here(1);
dt = Current Data Table();
For(i = 1, i <= N Rows(dt), i++,
    n = dt << get rows where(:Parameter == "def");
    Column("Value")[n] = "NA";
    n = dt << get rows where(:Parameter == "jkl");
    Column("Value")[n] = "NA";
    n = dt << get rows where(:Parameter == "mno");
    Column("Value")[n] = "NA";
    n = dt << get rows where(:Parameter == "d");
    Column("Value")[n] = "NA";
);

Best,

 

Martin

I have a case where I want to fill values from multiple columns forwards and backwards within chunks of the table based on a label column. After all this discussion, I don't think there is an elegant way to do that -- please correct if I'm wrong.

 

I must say, these are some horribly convoluted solutions for what should be a 1-liner:

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.core.groupby.DataFrameGroupBy.fill...

 

 

Byron_JMP

get this add-in (free)

 

Data Table Tools Add-in 

 

Screen Shot 2020-04-10 at 11.52.24 AM.png

nearly sure that the span empty cells tool will do what you're looking for.

 

 

@bayesfactor wrote:
I must say, these are some horribly convoluted solutions for what should be a 1-liner:

I agree there is just one line of code the user must write in the link you provided, however the underlying function is not a oneliner. This is true for most of the easy to use functions in almost any programming language, if it is R, Python, MATLAB, even in C/C++, ....

In the data table tools add-in (as @Byron_JMP mentioned) you will find a functionality Span Empty Cells, where @brady_brady has created such a function you could use to do what you want with one command. Though the tool put the selection of the table and the columns and all settings into an interface. So if you would like to do it differently you would have to modify the function source code so it takes the selected columns and accepts a seperate inpout specifying the way to span the cells.  

 

In case you want to have a JSL function to do this to be able to incorporate it in other scripts, it would be a good idea to put this request both to support@jmp.com and the JMP Wish List.

The other way is to build on Brady's scripted function and modifying the routine to be used as a JSL function in a JSL script.

Recommended Articles