Hi Castle of Argh.
You are correct. Short samples with direct line level matched A/B comparison are MUCH more accurate than long period listening. Human ears have a very short memory... In the range of a few seconds at most. Go beyond that and anything remotely close sounds the same. Or bias creeps in. Either way it isn't correct.