Your question is the most frequently asked question of a statistician: what n should I use?
The most frequent reply is "it depends."
There are numerous web sites with zero failure acceptance plans that require AQL, acceptable quality level, and other details to be defined before using a formula or a table for N. A very simple one is http://asq.org/quality-progress/2007/11/basic-quality/zero-defect-sampling.html.
However, I recommend you find your company statistician or maybe work with the local university statistics professor. There are many details that would need to be discussed before providing an answer (it depends). For example, how is the sample collected: the end of line from multiple production lines? If you collect 12 samples every 30 minutes, does that mean all 12 were collected at one time? Will your plan capture the multiple source tools? Does the bar weight depend only on one tool set or a series of tools? [Since you said bars, I was thinking of coated granola bars, yumm! For a scenario like that, the weight problem be due to the bar "press" or the coater.]
Do you have control charts/monitors for each of the tool processes? If yes, what is their stability? When you assume a random normal distribution, have you looked at the data by time and tool and is that a reasonable assumption? Just like cancer studies on nude mice, sometimes testing/sampling from the most likely to fail can provide information, and other times it can miss valuable information. If you have many tools, you could use simulation to "what if" a bar was created from the series of tools on the low, low, low end.
When you have seen failures before were they truly random or clustered in time?
Many sampling plans, especially if something has changed an there is no true baseline, use a double sampling method. For example if one of your samples is far away from the spec limits (both means and sample standard deviation) , sample as usual, but if a sample is "near" the spec limit, increased sampling and testing.
I wish I could be more specific, but after many years of statistics and supporting a manufacturing line, N alone is never the complete answer. And if you review my questions, you'll notice they are asking details about what is known, the factors that influence any sampling plan.
I have no one size formula, but maybe some of these questions give you some ideas and avenues to pursue. With the simple ASQC rule of thumb provided in the link.
Good Luck!