cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
%3CLINGO-SUB%20id%3D%22lingo-sub-225496%22%20slang%3D%22en-US%22%20mode%3D%22UPDATE%22%3EKonsistente%20Behandlung%20ausgeschlossener%20Zeilen%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-225496%22%20slang%3D%22en-US%22%20mode%3D%22UPDATE%22%3E%3CP%3EHallo%2C%3C%2FP%3E%0A%3CP%3E%20%3C%2FP%3E%0A%3CP%3EIch%20habe%20mit%20dem%20JMP-Support%20gesprochen%2C%20da%20ich%20einen%20Fehler%20in%20JMP%20vermutete.%20Dabei%20habe%20ich%20Folgendes%20erfahren%3A%3C%2FP%3E%0A%3CP%3E-%20Wenn%20man%20bestimmte%20Zeilen%20als%20%E2%80%9Eausgeschlossen%E2%80%9C%20markiert%20(egal%20ob%20sie%20als%20sichtbar%20ausgew%C3%A4hlt%20sind%20oder%20nicht)%2C%20werden%20sie%20aus%20vielen%20Berechnungen%20entfernt%20(z.%20B.%20werden%20bei%20der%20Erstellung%20eines%20Histogramms%20ausgeschlossene%20Zeilen%20nicht%20ber%C3%BCcksichtigt).%3C%2FP%3E%0A%3CP%3E-%20Wenn%20bestimmte%20Zeilen%20als%20%E2%80%9Eausgeschlossen%E2%80%9C%20markiert%20werden%20(unabh%C3%A4ngig%20davon%2C%20ob%20sie%20sichtbar%20sind%20oder%20nicht)%2C%20werden%20sie%20als%20Validierungssatz%20f%C3%BCr%20Plattformen%20wie%20Boosted%20Tree%20oder%20Bootstrap%20Forest%20%3CSTRONG%3Eeinbezogen%3C%2FSTRONG%3E%20.%3C%2FP%3E%0A%3CP%3EOffensichtlich%20ist%20dies%20beabsichtigtes%20Verhalten.%20Mich%20pers%C3%B6nlich%20hat%20es%20verwirrt%20und%20gl%C3%BCcklicherweise%20habe%20ich%20mich%20mit%20dem%20Support%20in%20Verbindung%20gesetzt%2C%20um%20mehr%20dar%C3%BCber%20zu%20erfahren%2C%20bevor%20ich%20die%20Ergebnisse%20ver%C3%B6ffentlicht%20habe%2C%20da%20in%20meinem%20Fall%20die%20ausgeschlossenen%20Zeilen%20ung%C3%BCltige%20Daten%20sind.%20Ich%20behalte%20diese%20nur%20zu%20Nachverfolgungszwecken.%3C%2FP%3E%0A%3CP%3E%20%3C%2FP%3E%0A%3CP%3E%3CU%3EUnd%20abschlie%C3%9Fend%20mein%20Wunsch%3A%20K%C3%B6nnte%20JMP%20bei%20der%20Verwendung%20ausgeschlossener%20Zeilen%20v%C3%B6llig%20konsistent%20sein%3F%3C%2FU%3E%3C%2FP%3E%0A%3CP%3E%20%3C%2FP%3E%0A%3CP%3EHier%20ist%20ein%20Zitat%2C%20das%20mir%20der%20JMP-%20%3CFONT%20size%3D%223%22%3ESupport%3C%2FFONT%3E%20gesendet%20hat%3A%3C%2FP%3E%0A%3CP%20class%3D%22cs95e872d0%22%3E%3CFONT%20size%3D%222%22%3E%3CSPAN%20class%3D%22cs78ab32121%22%3E%E2%80%9EDer%20Bootstrap%20Forest%20und%20mehrere%20andere%20Plattformen%20in%20JMP%20Pro%20verf%C3%BCgen%20%C3%BCber%20eine%20Funktion%2C%20die%20daf%C3%BCr%20sorgt%2C%20dass%2C%20wenn%20einige%20Zeilen%20ausgeschlossen%20sind%20und%20Sie%20ansonsten%20keinen%20Validierungssatz%20angeben%2C%20diese%20Zeilen%20als%20Validierungssatz%20verwendet%20werden.%3C%2FSPAN%3E%3C%2FFONT%3E%3CFONT%20size%3D%222%22%3E%3CSPAN%20class%3D%22cs78ab32121%22%3EUm%20zu%20verhindern%2C%20dass%20diese%20Zeilen%20%C3%BCberhaupt%20einbezogen%20werden%2C%20k%C3%B6nnen%20Sie%3A%3C%2FSPAN%3E%3C%2FFONT%3E%3C%2FP%3E%0A%3CP%20class%3D%22cs95e872d0%22%3E%3CFONT%20size%3D%222%22%3E%3CSPAN%20class%3D%22cs78ab32121%22%3E1)%20Unterteilen%20Sie%20die%20Datentabelle%20so%2C%20dass%20diese%20Zeilen%20nicht%20enthalten%20sind%2C%20und%20f%C3%BChren%20Sie%20dann%20den%20Bootstrap%20Forest%20erneut%20aus.%3C%2FSPAN%3E%3C%2FFONT%3E%3C%2FP%3E%0A%3CP%20class%3D%22cs95e872d0%22%3E%3CFONT%20size%3D%222%22%3E%3CSPAN%20class%3D%22cs78ab32121%22%3E2)%20Verwenden%20Sie%20eine%20andere%20Validierungsmethode%20(Holdback%20oder%20Validierungsspalte).%3C%2FSPAN%3E%3C%2FFONT%3E%3C%2FP%3E%0A%3CP%20class%3D%22cs95e872d0%22%3E%3CFONT%20size%3D%222%22%3E%3CSPAN%20class%3D%22cs78ab32121%22%3EIm%20Allgemeinen%20ist%20es%20oft%20eine%20gute%20Idee%2C%20einige%20Zeilen%20einem%20Validierungssatz%20zu%20widmen.%20Dadurch%20h%C3%A4tten%20Sie%20die%20M%C3%B6glichkeit%2C%20die%20Option%20%E2%80%9EFr%C3%BChzeitiges%20Stoppen%E2%80%9C%20zu%20verwenden%2C%20die%20dabei%20helfen%20kann%2C%20eine%20%C3%9Cberanpassung%20zu%20vermeiden.%E2%80%9C%3C%2FSPAN%3E%3C%2FFONT%3E%3C%2FP%3E%0A%3CP%20class%3D%22cs95e872d0%22%3E%20%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-LABS%20id%3D%22lingo-labs-225496%22%20slang%3D%22en-US%22%20mode%3D%22UPDATE%22%3E%3CLINGO-LABEL%3EPr%C3%A4diktive%20Modellierung%20und%20maschinelles%20Lernen%3C%2FLINGO-LABEL%3E%3C%2FLINGO-LABS%3E%3CLINGO-SUB%20id%3D%22lingo-sub-245755%22%20slang%3D%22en-US%22%20mode%3D%22NONE%22%3EBetreff%3A%20Konsistente%20Behandlung%20ausgeschlossener%20Zeilen%20-%20Status%20ge%C3%A4ndert%20auf%3A%20Geliefert%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-245755%22%20slang%3D%22en-US%22%20mode%3D%22NONE%22%3E%3CP%3EAb%20JMP%2015%20werden%20ausgeschlossene%20Zeilen%20nicht%20f%C3%BCr%20die%20Validierung%20verwendet%2C%20es%20sei%20denn%2C%20Sie%20%C3%BCbermitteln%20die%20JSL%3A%3C%2FP%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%0A%3CPRE%3EUse%20Excluded%20Rows%20for%20Validation(1)%3C%2FPRE%3E%0A%3CP%3E%26nbsp%3B%3C%2FP%3E%3C%2FLINGO-BODY%3E
Choose Language Hide Translation Bar
0 Kudos

Consistent handling of excluded rows

Hi,

 

I had a conversation with JMP support since I assumed a bug in JMP. However, I learned the following:

- If one marks certain rows as "excluded" (no matter if chosen visible or nor) they will be removed from many computations (e.g. doing a histogram does not take excluded rows into account)

- If one marks certain rows as "excluded" (no matter if chosen visible or nor) they will be included as a validation set for platforms like boosted tree or bootstrap forest.

Apparently this is intended behavior. Personally, it confused me and fortunately I got in contact with support to learn about it before I published the results, since in my case the excluded rows are invalid data. I just keep those for tracking purposes.

 

So, finally my wish: could JMP be fully consistent in the use of excluded rows?

 

Here's a quote that JMP support sent me:

"The Bootstrap Forest and several other platforms in JMP Pro have a feature that if some rows are excluded, and you do not otherwise specify a Validation set, those rows are used as the Validation set. To avoid those rows from being included at all, you could:

1) Subset the data table so it doesn't include those rows, then re-run the Bootstrap Forest

2) Use a different Validation method (Holdback or Validation Column)

In general, it is often a good idea to devote some rows to a Validation set. This would give you the ability to use the Early Stopping option, which can help avoid overfitting."

 

1 Comment
Jeff_Perkinson
Community Manager
Status changed to: Delivered

Starting in JMP 15 excluded rows are not used for Validation unless you submit the JSL:

 

Use Excluded Rows for Validation(1)