Scott,  check my understanding of what you are trying to do:  You have a population of almost 5000 parts.  You want to create two sub-populations from the parent population having medians of 3.52 and 3.62, and having similar sigma values (0.28).  Then you will evaluate the two sub-populations to determine if the difference in medians has an effect on a succeeding test.  If this is what you are trying to do, then I would question the statistical validity of your approach.  If this is the case, then I suggest you build a DOE with length being an independent variable.  I have been using Definitive Screening Designs and have found them to be very efficient. 
Another point regarding the validity of your approach:  If your proces for producing the 5000 parts shows a "reasonable degree of statistical control" for length, then your approach is certainly not valid.  However, if the process is not stable (and the good-looking histogram does NOT confirm whether or not your process is stable because the histogram loses time-orderliness), then you can select samples from periods of high and/or low length values and compare them for evaluation.  However, I would still want to do a properly coonducted DOE.
					
				
			
			
				
	Steve