2. is most often used to dismiss the negative results of others ("yes, of course you cannot hear it on your cheap ear buds and onboard audio, but that does not mean the difference does not exist"); audiophiles are of course free to use whatever expensive equipment they have, but they usually simply refuse to participate in these tests. The "golden ears" argument also falls into this group.
3. is statistically very unlikely to hide large errors, one would not expect a DAC to have 10% distortion, for example, and then the ADC to almost perfectly un-distort it so that the result becomes less than 0.001%. Not to mention some errors like noise are not even reversible. It also does not make much sense to deliberately design hardware that way, I doubt it would be really cheaper than making both the DAC and ADC perform well in the first place; it would also easily be exposed when using a combination of DAC and ADC from different manufacturers. I have seen conspiracy theories that the Windows drivers of sound cards "cheat" to achieve better RMAA results, but when the cards perform just as well on Linux with open source drivers, these theories are debunked. Limited and partial "synergy" can occur, but not to an extent to make a major difference, and of course more often than not the ADC just degrades the sound even more, therefore making it overall easier (but in practice not unlikely still impossible) to hear a difference.
One potential problem is recording a sound card DAC with the ADC on the same card; since both use the same clock, low frequency jitter from the clock will be eliminated. Therefore, two separate devices should preferably be used, but then ground loops need to be avoided. Another possibility is running the DAC and ADC at the same sample rate, and the imaging of the DAC filter being aliased back into the audio band by the ADC filter, and correcting the frequency response to some extent (as an example, think of an upsample-downsample loop that simply duplicates samples first, and then removes samples; even though each conversion is of poor quality separately, the loop has perfect lossless overall performance).