For me, detail and clarity are different.
To keep it basic:
Clarity - shimmer, sparkle, air, bell-like quality, realism, life-like, serene, and so on.
Detail - information, raw data, replication of the recording, fine microdetail of an instrument, presence of an instrument (or synthetic sound).
For me, the ER-4S had very high detail, but what the CK10 and UE700 do better is imaging and layering: they divide the details into different pathways so I can hear them more clearly. On the ER-4S there's a high level of detail coming from the same direction, with fairly low imaging and layering. Perhaps this is why I don't understand "coherency": I don't really look for it. I prefer the imaging and separated pathways, even if this is an artificial feature (but so are 5 speakers and a sub-woofer in a living room).
Coherency is something in between. I don't really consider the ER4S to have really good coherency, nor the DBA-02 / B2 type of sound. The latter is obviously too resolved / layered to be considered coherent, but the former is also lacking in space and air, as if everything is compressed into one single point. In that sense the RE272 sounds more coherent because it still retains that part of the information. A good way to understand coherency is to listen to live music, then compare it to a recording. You will never find live music separated into layers or compressed into one point.