It combines several tests so it would be nice to explicitly specify what do we measure there and compared to what (e.g. baseline as it is now or some implementation of one of the previous tests by then may be in production?).
Topic on Talk:Wikipedia.org Portal A/B testing
Appearance