Share your ideas for the JMP Scripting Unsession at Discovery Summit by September 17th. We hope to see you there!
Choose Language Hide Translation Bar
gail_massari
Community Manager Community Manager

Retrieving, Organizing and Analyzing Text

 

See how to:

  • Understand Text Explorer conventions and definitions related to analyzing unstructured text, including corpus, document, term, phrase, DTM (Document Term Matrix), tokenizing, stemming and stop word 

 

See how to:

  • Find the most common terms and phrases and determine the context in which terms or phrases are used
  • Use term reports, phrase reports and word clouds
  • Apply built-in and user-defined phrases
  • Interactively customize, add or remove stop words

 

See how to:

  • Find terms that tend to appear together
  • Group and explore similar documents
  • Uncover recurring themes (topics) within the collection of documents
  • Cull important information from the text so it can be used in predictive models
  • Use latent semantic analysis (SVD), scatterplot tendrils, topic analysis (Rotated SVD), Top Terms per Cluster Report and  Term Probabilities by Cluster Report
  • Save clusters or latent classes to get categorical predictors and save singular vectors to get continuous predictors