It is a hypothesis test, so it works like the Shapiro-Wilk test. The null hypothesis is that the data come from the fitted distribution, and the sample test statistic is A². It is an upper-tailed test, like the chi-square and F tests: a large statistic is evidence against the null. The stated probability is the probability of observing an A² at least as large as the one computed from your sample when the null hypothesis is true.
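For concreteness, here is a minimal sketch in Python of what such a check might look like, assuming SciPy 1.10+ (whose `stats.goodness_of_fit` reports the A² statistic with a Monte Carlo p-value); the sample and the normal model below are placeholders, not your fitted model.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
data = rng.normal(loc=10.0, scale=2.0, size=200)   # placeholder sample

# Fit a normal model and compute A^2 plus a Monte Carlo p-value under the null.
res = stats.goodness_of_fit(stats.norm, data, statistic='ad')

print(f"A^2     = {res.statistic:.3f}")   # large values are evidence against the null
print(f"p-value = {res.pvalue:.3f}")      # P(A^2 >= observed | fitted model is true)
```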
It is meant to help you decide whether to reject the fitted model. It is not intended for model selection; a criterion such as AICc would be better for that purpose.
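If it helps, here is a rough sketch of what model selection by AICc could look like; the candidate families and data are made up for illustration, and the formula used is AICc = -2 log L + 2k + 2k(k+1)/(n - k - 1) with k estimated parameters and n observations.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
data = rng.gamma(shape=3.0, scale=1.5, size=150)   # placeholder sample

def aicc(dist, x):
    """AICc for a scipy.stats distribution fitted to x by maximum likelihood."""
    params = dist.fit(x)
    loglik = np.sum(dist.logpdf(x, *params))
    k = len(params)                 # number of estimated parameters
    n = len(x)
    return -2.0 * loglik + 2 * k + 2 * k * (k + 1) / (n - k - 1)

# Smaller AICc indicates the better-supported candidate.
for name, dist in [("normal", stats.norm), ("gamma", stats.gamma), ("lognormal", stats.lognorm)]:
    print(f"{name:9s} AICc = {aicc(dist, data):.1f}")
```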
The three examples you provided each report a p-value for this decision. First choose the type I error risk you are willing to accept (i.e., the alpha level), then compare the p-value to alpha. If the p-value is less than alpha, reject the null hypothesis; otherwise, continue to assume the null is true. For example, with alpha = 0.1 I would not reject the null in the first case, but I would reject it in the other two.
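As a sketch of that decision rule (the p-values below are placeholders standing in for your three reported values, not the actual ones):

```python
alpha = 0.10                     # type I error risk chosen in advance
p_values = [0.25, 0.04, 0.01]    # hypothetical values: only the first exceeds alpha

for i, p in enumerate(p_values, start=1):
    decision = "reject the null" if p < alpha else "do not reject the null"
    print(f"test {i}: p = {p:.2f} -> {decision}")
```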
Note that applying the same hypothesis test to the same sample for many different candidate models increases the chance of a type I error. Such a practice is an example of multiple comparisons.
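To see why, under the (optimistic) assumption that the tests are independent, the family-wise error rate grows quickly with the number of tests; reusing the same sample makes the exact rate harder to pin down, but the qualitative point stands.

```python
alpha = 0.10
for m in (1, 3, 5, 10):
    familywise = 1 - (1 - alpha) ** m   # P(at least one false rejection) for m independent tests
    print(f"{m:2d} tests at alpha = {alpha}: family-wise error rate ~ {familywise:.2f}")
```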