cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
The Discovery Summit 2025 Call for Content is open! Submit an abstract today to present at our premier analytics conference.
Get the free JMP Student Edition for qualified students and instructors at degree granting institutions.
Choose Language Hide Translation Bar
View Original Published Thread

WHY Support Vector Regression (SVR) considers EXCLUDED ROWS during analysis?

iXmkXm
Level II

I am trying to predict continuous response on the basis of continuous inputs. However, during analysis, I've observed that SVR is considering excluded rows, which I want to use for model validation. Furthermore, the regression results couldn't be duplicated in Scikit-Learn library.

Please help on this issue!

This post originally written in German and has been translated for your convenience. When you reply, it will also be translated back to German.

1 ACCEPTED SOLUTION

Accepted Solutions
Byron_JMP
Staff


Re: WHY Support Vector Regression (SVR) considers EXCLUDED ROWS during analysis?

Hello,

I just ran a default SVM model on the same data set, Tablet Production.jmp

I'm using JMP16.2 on a Mac

I randomly selected 50% of the rows and set their state to Hidden/Excluded

I ran the SVM platform from the Analyze menu with and without the hidden/excluded state.

 

It looks like the platform is obeying the row state.

 

Byron_JMP_0-1658162052494.png

 

Why isn't SciKit Matching JMP? It likely has to do with how The model was parameterized.

For example, the default Fit might not match the default in SciKit.

Support Vector Machines(
	Y( :Dissolution ),
	X(
		:API Particle Size, :Mill Time, :Screen Size, :Mag. Stearate Supplier,
		:Lactose Supplier, :Sugar Supplier, :Talc Supplier, :Blend Time,
		:Blend Speed, :Compressor, :Force, :Coating Supplier, :Coating Viscosity,
		:Inlet Temp, :Exhaust Temp, :Spray Rate, :Atomizer Pressure
	),
	Fit(
		Kernel Function( "Radial Basis Function" ),
		Gamma( 0.0588235294117647 ),
		Cost( 1 ),
		Validation Method( "None" )
	)
);

 

JMP Systems Engineer, Health and Life Sciences (Pharma)

View solution in original post

2 REPLIES 2
Byron_JMP
Staff


Re: WHY Support Vector Regression (SVR) considers EXCLUDED ROWS during analysis?

Hello,

I just ran a default SVM model on the same data set, Tablet Production.jmp

I'm using JMP16.2 on a Mac

I randomly selected 50% of the rows and set their state to Hidden/Excluded

I ran the SVM platform from the Analyze menu with and without the hidden/excluded state.

 

It looks like the platform is obeying the row state.

 

Byron_JMP_0-1658162052494.png

 

Why isn't SciKit Matching JMP? It likely has to do with how The model was parameterized.

For example, the default Fit might not match the default in SciKit.

Support Vector Machines(
	Y( :Dissolution ),
	X(
		:API Particle Size, :Mill Time, :Screen Size, :Mag. Stearate Supplier,
		:Lactose Supplier, :Sugar Supplier, :Talc Supplier, :Blend Time,
		:Blend Speed, :Compressor, :Force, :Coating Supplier, :Coating Viscosity,
		:Inlet Temp, :Exhaust Temp, :Spray Rate, :Atomizer Pressure
	),
	Fit(
		Kernel Function( "Radial Basis Function" ),
		Gamma( 0.0588235294117647 ),
		Cost( 1 ),
		Validation Method( "None" )
	)
);

 

JMP Systems Engineer, Health and Life Sciences (Pharma)
iXmkXm
Level II


Re: WHY Support Vector Regression (SVR) considers EXCLUDED ROWS during analysis?

Thanks for your detailed response, I've also upgraded to 16.2, reanalysed the problem and now the issue is not duplicating (excluded rows are also not appearing in predicted/residual plots). I am also able to replicate the analysis on scikit-Learn by using gamma='auto' or (1/features). Thanks again for your detailed answer, obliged!