I guess that comes down to how we view the value of the results.
If, overall, across several series of the same test, you cannot demonstrably show a clear indication one way or the other the results indicate that, overall, there is nothing that you can reliably differentiate.
If you repeated the same test across multiple sessions and overall indicated say 80 out of 100 that would be pretty solid evidence, if across that same 100 repetitions you got 50 that is clearly rather less compelling even if on one day you got 18/20.
Again, I believe you take from the tests what suits your feelings rather than trust what the the test demonstrates and use that as a data point, good, bad or inconclusive.
If you did in fact have say 80 out of 100 then we are wasting time with this conversation because I would agree with you there is something that you are differentiating.