Perhaps it's unfair to say that lower grades are polished turds. They are the inevitable result of the ammo production process. In other words, they make a considerable amount of ammo and only a small portion of it is top-of-line. Most of it is the lesser grade (unless, in the case of SK, Magazine is something else).I think, if anything, this info suggests we pay too much for their 'seconds'. It's a great business model ... set out to make Rifle Match and if you make a complete mess of it ... voila, it's Magazine. Talk about polishing a turd.
There really should be disclosure of the grading criteria.
When any of the match ammo makers produce .22LR ammo, they don't produce ammo lot by lot, determining later exclusively by testing what variety it is. They load the machines with the necessary components -- casings (pre-primed?), bullets, components, lube -- and produce a considerable amount of ammo. This is enough to ultimately produce many lots, the smallest portion of which is top tier, the largest portion of which is bottom.That is, again the machine is loaded with components and the results of a run become one lot, but a lot of what? Well, that could be Center X, Midas Plus, or Exact depending on testing.
While the grading methodology is kept secret and never publicly divulged, there's a speculative explanation that might make some sense. It's possible that the ammo makers have a good idea where in the production run the best ammo is typically produced. For example, it may be at the beginning, the very middle, or near the end. To continue with the example, if the best ammo is usually found at the middle of the production run, it becomes the top graded variety.
If it is like this, or something similar, Lapua, for example, may expect to identify certain segments of the production run as X-Act, Midas +, and Center X. The smallest part would be X-Act, the largest CX. No actual testing by means of verifying a grade by shooting in a test tunnel would be necessary.
In other words, grading may be achieved by a process other than actual performance. If it is graded in a way such as this, it wouldn't necessarily guarantee that all "higher" grades are in fact better performing with better ES and SD than all "lesser" grades. It would explain why not all lots of X-Act outperform all lots of Center X.