cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
%3CLINGO-SUB%20id%3D%22lingo-sub-782064%22%20slang%3D%22en-US%22%20mode%3D%22CREATE%22%3E%C2%BFHay%20alguna%20manera%20de%20decirle%20a%20LCA%20que%20las%20filas%20est%C3%A1n%20ordenadas%20por%20clase%3F%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-782064%22%20slang%3D%22en-US%22%20mode%3D%22CREATE%22%3E%3CP%3EConjunto%20de%20datos%20t%C3%ADpico%3A%20120-200%20filas%2C%2015%20clases.%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CP%3ESe%20sabe%20que%20las%20filas%20est%C3%A1n%20ordenadas%20por%20clase.%20Lo%20que%20no%20se%20sabe%20es%20d%C3%B3nde%20est%C3%A1n%20las%20%22vallas%22%20entre%20las%20clases.%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CP%3ETal%20como%20est%C3%A1%20(es%20decir%2C%20sin%20tener%20en%20cuenta%20el%20orden%20de%20las%20filas)%2C%20el%20an%C3%A1lisis%20de%20ciclo%20de%20vida%20clasifica%20correctamente%20aproximadamente%20el%2094%20%25%20de%20las%20filas.%20Mi%20intuici%C3%B3n%20me%20dice%20que%20si%20supiera%20c%C3%B3mo%20indicarle%20al%20algoritmo%20que%20las%20filas%20est%C3%A1n%20agrupadas%20por%20clase%20en%20la%20entrada%2C%20estar%C3%ADamos%20en%20el%20100%20%25.%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CP%3EVisual%2C%20en%20caso%20de%20que%20mi%20uso%20de%20'agrupado%20por'%20no%20est%C3%A9%20claro%3A%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CP%3ELo%20correcto%20ser%C3%ADa%20as%C3%AD%3A%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CBLOCKQUOTE%3E%3CP%3E1a%3C%2FP%3E%3CP%3E1a%3C%2FP%3E%3CP%3E1a%3C%2FP%3E%3CP%3E1a%3C%2FP%3E%3CP%3E1a%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3C%2FBLOCKQUOTE%3E%3CP%3ETodas%20las%20filas%201a%20siempre%20ser%C3%A1n%20vecinas%20en%20t%C3%A9rminos%20de%20orden%20de%20filas%2C%20y%20todas%20las%201b%2C%20etc.%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CP%3EActualmente%2C%20a%20veces%20me%20pasa%3A%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CBLOCKQUOTE%3E%3CP%3E1a%3C%2FP%3E%3CP%3E1a%3C%2FP%3E%3CP%3E1a%3C%2FP%3E%3CP%3E%3CSTRONG%3E1c%3C%2FSTRONG%3E%3C%2FP%3E%3CP%3E%3CSTRONG%3E1c%3C%2FSTRONG%3E%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1b%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3CP%3E1c%3C%2FP%3E%3C%2FBLOCKQUOTE%3E%3CP%3E%C2%BFIdeas%3F%3C%2FP%3E%3CP%3E%20%3C%2FP%3E%3CP%3EPotencialmente%2C%20podr%C3%ADa%20crear%20una%20columna%20con%20Row()%20como%20valor%2C%20pero%20me%20preocupa%20que%20la%20mayor%C3%ADa%20de%20las%20columnas%20de%20agrupamiento%20tengan%20valores%20de%20tipo%20%22clase%22.%20Para%2015%20clases%2C%20cada%20columna%20en%20uso%20podr%C3%ADa%20tener%20entre%202%20y%208%20valores%20%C3%BAnicos.%20Row()%20tendr%C3%A1%20120%20valores%20%C3%BAnicos%2C%20lo%20que%20parece%20como%20si%20se%20hubiera%20agregado%20un%20comod%C3%ADn%20a%20la%20mezcla.%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-LABS%20id%3D%22lingo-labs-782064%22%20slang%3D%22en-US%22%20mode%3D%22CREATE%22%3E%3CLINGO-LABEL%3EAn%C3%A1lisis%20y%20modelado%20de%20datos%20b%C3%A1sicos%3C%2FLINGO-LABEL%3E%3CLINGO-LABEL%3ECombinaci%C3%B3n%20y%20limpieza%20de%20datos%3C%2FLINGO-LABEL%3E%3C%2FLINGO-LABS%3E
Choose Language Hide Translation Bar
mtowle419
Level II

Is there a way to tell LCA that the rows are ordered by class?

Typical dataset: 120-200 rows, 15 classes.

 

Rows are known to be ordered by class. What is unknown is where the 'fences' between classes are.

 

As-is -- i.e., without taking row order into account -- LCA correctly classifies about 94% of rows. My intuition is that if I knew how to tell the algorithm that the rows are grouped by class on input, we'd be at 100%.

 

Visual, in case my use of 'grouped by' is unclear:

 

Correct would look like this:

 

1a

1a

1a

1a

1a

1b

1b

1b

1b

1b

1b

1b

1b

1c

1c

1c

1c

1c

1c

1c

All 1a rows will always be neighbors in terms of row order, and all 1b, etc.

 

 

Currently, I sometimes get:

 

 

1a

1a

1a

1c

1c

1b

1b

1b

1b

1b

1b

1b

1b

1c

1c

1c

1c

1c

1c

1c

Ideas?

 

Potentially, I could make a column with Row() as the value, but my worry is that most of the clustering cols have 'class'-type values. For 15 classes, each col in use might have between 2-8 unique values. Row() will have 120 unique values, which feels like throwing a wildcard into the mix. 

0 REPLIES 0