Hello Everyone,
I have two separate data files (one for eye movements and one for performance on a task). I want to combine them into one file so that I can analyze them together, but I am having trouble getting started.
The two data files have nothing in common except the participant file name, and they have different numbers of responses per participant (i.e., the eye movement data has more responses per participant than the task performance file).
I have uploaded the files to this post.
Guidance would be very helpful!!
I'm still having trouble understanding what you want to do. So, I pulled out a single subject's data from both files (Session label ecem_10c). One file has 337 rows and the other has 2,286 rows. If you join these without dropping multiples you will get 337 x 2,286 = 770,382 rows for that one subject, and for the full dataset that is probably too much information. But without any other field to join on, what columns would you want to view together in order to do your analysis? If you can tell me what those columns are, I think I might be able to help. But I don't understand your data and experimental conditions well enough to know which of these columns (30 in one dataset and 27 in the other) make sense to look at together and which do not.
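To make the row-count concern concrete, here is a minimal Python sketch (not JMP). The tiny tables and the column names session, trial and fixation are made up for illustration. It shows that joining on the participant label alone pairs every row of one table with every row of the other for that participant.

```python
from itertools import product

# Hypothetical miniature tables: one participant, 3 task rows and 4 eye-movement rows.
task_rows = [{"session": "ecem_10c", "trial": i} for i in range(3)]
eye_rows = [{"session": "ecem_10c", "fixation": j} for j in range(4)]

# Joining only on the participant label keeps every (task row, eye row) pair.
joined = [
    {**t, **e}
    for t, e in product(task_rows, eye_rows)
    if t["session"] == e["session"]
]

print(len(joined))   # 3 * 4 pairs
print(337 * 2286)    # the same arithmetic for the real subject's row counts
```

The row count grows multiplicatively per participant, which is why a second matching column, or summarizing first, matters.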
I would suggest subsetting a single participant from both files in order to determine exactly how you want to join the two files - if you can get what you want for a single participant, then you can easily do it for all of them. It sounds like you may be able to join on one of those other common columns. When joining, you can always click on Select columns and only select the ones you want. It also sounds like it may be easier to generate the data for a particular question you wish to analyze rather than trying to create one data set for any and all questions (you can always join the files a bit differently for a different analysis). The key question is: can you put the data together for one participant? You may have to summarize the data a bit before trying to join, since you may not need all of those rows for the analysis you wish to conduct. So you may want to Tabulate the data from each dataset first, and then try to join the tabulated files.
In your larger file, create a new column using a formula such as Col Mean(whatever measure you want to average, by Session Label, by video). In other words, get the average (or sum or standard deviation or whatever statistic you want across the videos) by including two By conditions in that Col function - the first identifies the participant and the second identifies the video. Then tabulate the data by participant using the newly created column as the data you want. You can take whatever other columns you want, and you can do something similar in the other data set if you want to join some of the data from there. From Tabulate, make the result into a data table (and do the same if you want data from the other table to join to it).
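For readers who think better in code, the grouped-average step can be sketched in plain Python. This is an analogue of a column mean with two By variables, not JMP's implementation, and the column names session, video and fix_dur and the values are invented for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical long-format rows: several measurements per participant and video.
rows = [
    {"session": "p1", "video": "A", "fix_dur": 200.0},
    {"session": "p1", "video": "A", "fix_dur": 300.0},
    {"session": "p1", "video": "B", "fix_dur": 100.0},
    {"session": "p2", "video": "A", "fix_dur": 400.0},
]

def add_group_mean(rows, value, by, new_col):
    """Return copies of rows with the mean of `value` within each `by` group attached."""
    groups = defaultdict(list)
    for r in rows:
        groups[tuple(r[k] for k in by)].append(r[value])
    means = {key: mean(vals) for key, vals in groups.items()}
    return [{**r, new_col: means[tuple(r[k] for k in by)]} for r in rows]

with_mean = add_group_mean(rows, "fix_dur", ["session", "video"], "mean_fix_dur")

# Collapse to one entry per (participant, video), which is what tabulating would leave.
summary = {(r["session"], r["video"]): r["mean_fix_dur"] for r in with_mean}
print(summary)
```

Every row in a group carries the same mean, so collapsing to one row per participant and video loses nothing that the later join needs.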
So I have made my two smaller tabulated files, but I am now having trouble joining them into one file.
I basically just need to get the average per video into the fixrep file and I will be good, but I have tried it multiple ways and am not getting the desired output.
See if this is what you want. I joined by matching 2 columns: session id and video.
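As a rough illustration of matching on two columns, here is a small Python sketch of a left join keyed on both session and video. It is not the JMP Join dialog, and the tables, column names and values are hypothetical. It also assumes the averages table has at most one row per session and video.

```python
# Hypothetical per-video averages (one row per session and video).
averages = [
    {"session": "p1", "video": "A", "mean_score": 0.5},
    {"session": "p1", "video": "B", "mean_score": 0.8},
]

# Hypothetical stand-in for the fixrep table.
fixrep = [
    {"session": "p1", "video": "A", "fix_count": 12},
    {"session": "p1", "video": "B", "fix_count": 7},
    {"session": "p2", "video": "A", "fix_count": 9},
]

def left_join(left, right, keys):
    """Attach right-hand columns to each left row whose key columns all match."""
    index = {tuple(r[k] for k in keys): r for r in right}
    extra_cols = [c for c in right[0] if c not in keys] if right else []
    out = []
    for row in left:
        match = index.get(tuple(row[k] for k in keys))
        # Rows with no match keep their own data and get None for the new columns.
        extras = {c: (match[c] if match else None) for c in extra_cols}
        out.append({**row, **extras})
    return out

joined = left_join(fixrep, averages, ["session", "video"])
for r in joined:
    print(r)
```

Because both keys must match, each fixrep row picks up only the average for its own participant and video, and the row count of fixrep is unchanged.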