Each comparability scenario calls for different questions, the most important being whether you will use an equivalence test or a quality range to establish comparability. The answer depends on the criticality of the evaluation and on the sample sizes. For quality ranges, it is typical to use a reference interval (for example, mean ± 3 standard deviations). There are lots of references for this. For equivalence testing, determining your equivalence acceptance criteria (EACs) is more involved. Here is a good paper that explains:
Feel free to reach out for clarification. Thanks, Heath
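
P.S. In case a concrete example helps, here is a rough Python sketch of both approaches. The data, the function names, and the EAC values are all made up for illustration. The equivalence check uses a normal-approximation 90% confidence interval for the difference in means (a t-based interval is the more usual choice with small sample sizes), so treat it as a starting point under those assumptions rather than a validated method.

```python
from statistics import NormalDist, mean, stdev


def quality_range(reference, k=3.0):
    """Return (low, high) = reference mean -/+ k sample standard deviations."""
    m = mean(reference)
    s = stdev(reference)
    return m - k * s, m + k * s


def fraction_within(test, low, high):
    """Fraction of test values that fall inside [low, high]."""
    return sum(low <= x <= high for x in test) / len(test)


def equivalence_by_ci(reference, test, eac, alpha=0.05):
    """Check whether the (1 - 2*alpha) CI for the mean difference lies inside (-eac, +eac).

    Assumption: uses a normal quantile (large-sample approximation)
    and unequal-variance standard error; not a t-based TOST.
    """
    diff = mean(test) - mean(reference)
    se = (stdev(test) ** 2 / len(test) + stdev(reference) ** 2 / len(reference)) ** 0.5
    z = NormalDist().inv_cdf(1 - alpha)
    lower, upper = diff - z * se, diff + z * se
    return (-eac < lower and upper < eac), (lower, upper)


# Hypothetical measurements (e.g. a potency attribute, in percent)
reference_lots = [98.0, 100.0, 102.0, 99.0, 101.0]
test_lots = [99.5, 100.5, 101.0]

low, high = quality_range(reference_lots, k=3.0)
share_inside = fraction_within(test_lots, low, high)

# The EAC of 3.0 is an arbitrary placeholder, not a recommended value
is_equivalent, ci = equivalence_by_ci(reference_lots, test_lots, eac=3.0)

print(f"Quality range: {low:.2f} to {high:.2f}, share of test lots inside: {share_inside:.0%}")
print(f"90% CI for mean difference: {ci[0]:.2f} to {ci[1]:.2f}, equivalent: {is_equivalent}")
```

The quality range only asks whether individual test results fall inside the reference interval, while the equivalence check asks whether the whole confidence interval for the mean difference fits inside the EAC, which is why the choice of EAC and the sample sizes matter so much more for the second approach.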