Scott, check my understanding of what you are trying to do: You have a population of almost 5000 parts. You want to create two sub-populations from the parent population having medians of 3.52 and 3.62, and having similar sigma values (0.28). Then you will evaluate the two sub-populations to determine if the difference in medians has an effect on a succeeding test. If this is what you are trying to do, then I would question the statistical validity of your approach. If this is the case, then I suggest you build a DOE with length being an independent variable. I have been using Definitive Screening Designs and have found them to be very efficient.
Another point regarding the validity of your approach: If your proces for producing the 5000 parts shows a "reasonable degree of statistical control" for length, then your approach is certainly not valid. However, if the process is not stable (and the good-looking histogram does NOT confirm whether or not your process is stable because the histogram loses time-orderliness), then you can select samples from periods of high and/or low length values and compare them for evaluation. However, I would still want to do a properly coonducted DOE.
Steve