Stepwise regression follows a path in the forward direction, backward direction, or in mixed directions. It considers changing the model by a single term based on the current model and the candidate terms. So there is a chance that the best model was not visited and the chance that this might occur increases with the number of candidate terms. This approach allows you to consider many possible terms in most practical situations.
The All Possible Models option, on the other hand, visits all possible models up to the complexity that you specified. Your available memory and time might limit the practical number of terms that can be considered.
The Generalized Regression option is somewhere in between. Here is the explanation from Help:
Best Subset Computes parameter estimates by increasing the number of active effects in the model at each step. In each step, the model is chosen among all possible models with a number of effects given by the step number. The values on the horizontal axes of the Solution Path plots represent the number of active effects in the model. Step 0 corresponds to the intercept-only model. Step 1 corresponds to the best model of the ones that contain only one active effect. The steps continue up to the value of Max Number of Effects specified in the Advanced Controls in the Model Launch report. See āAdvanced Controlsā
on page 309. Tip: The Best Subset Estimation Method is computationally intensive. It is not recommended for large problems.
Hope this explanation helps.
So you won't necessarily get the same model from all these methods and the distribution has nothing to do with the outcome, assuming that you use the same distribution in all cases.
Remember that these methods are aids to selecting a model. None of them guarantees that it will find the best model.