Do you have wide data, where each observation occupies a single row, with individual columns or groups of columns that are useful for making predictions? Do you need a direct way to ensure that your data is set up to assess relationships accurately and completely? Would you like to establish repeatable routines to ensure the quality of new records as they are added?
Wide format is great for regression modeling and data mining, but cleaning wide data so that it is useful, easily visualized, and accurately analyzed can be overwhelming.
In this session, we will follow these steps to explore JMP tools for organizing and providing an overview of large datasets, particularly those with many columns:
- Organize columns and column properties using the column management tools (Column Manager, Group Columns, and Standardize Attributes) to efficiently add, edit, and delete columns.
- Rename columns using Recode Column Names.
- Identify, explore, visually inspect, and manage missing values and outliers that might distort estimates and bias results.
- Visualize relationships using Column Switcher and Screening tools.
- Provide interactive management reports on the wide data.
Suggested Prerequisites
- Some experience managing and analyzing wide data.