Have you guys tried comparing headphones and loudspeakers by quick switching? I was amazed by my measurements before, but now I feel quite the opposite. The simulation sounds too bright, even though the localization is quite accurate. I haven't done any careful tweaking yet, but I think there is about 5 dB too much in the highs, with a Q of 0.5. I use @musicreo's setup, where the ear canals are partially blocked, and I tried to insert the mics as deep as I can, so they sit less than 1 cm from the eardrum. This problem doesn't come from Impulcifer, because when I play binaural recordings of the speakers, they sound just as bright. I tried the manloud method proposed by David Griesinger, but I cannot get consistent results the way he does, and it doesn't sound good.
I measured the HpTF after equalizing the headphones, and it stays very flat across multiple re-seatings, except for some small dips at high frequencies. So the problem can only be in the measurement itself. But if my understanding is correct, measuring in an open ear canal should not cause a large systematic error like this. So at which step might I have gone wrong?
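For anyone who wants to audition the roughly 5 dB of excess highs mentioned above before re-measuring, here is a minimal sketch of a high-shelf cut using the RBJ Audio EQ Cookbook biquad formulas. It is not part of Impulcifer. The 4 kHz corner frequency and 48 kHz sample rate are assumptions picked for illustration, since the post gives only the gain and the Q, and the function names are made up for this example.

```python
import cmath
import math


def high_shelf_coeffs(fs, f0, gain_db, q):
    """RBJ cookbook high-shelf biquad, normalized so that a[0] == 1."""
    a_lin = 10 ** (gain_db / 40)          # square root of the linear shelf gain
    w0 = 2 * math.pi * f0 / fs
    cos_w0 = math.cos(w0)
    alpha = math.sin(w0) / (2 * q)
    k = 2 * math.sqrt(a_lin) * alpha
    b = [
        a_lin * ((a_lin + 1) + (a_lin - 1) * cos_w0 + k),
        -2 * a_lin * ((a_lin - 1) + (a_lin + 1) * cos_w0),
        a_lin * ((a_lin + 1) + (a_lin - 1) * cos_w0 - k),
    ]
    a = [
        (a_lin + 1) - (a_lin - 1) * cos_w0 + k,
        2 * ((a_lin - 1) - (a_lin + 1) * cos_w0),
        (a_lin + 1) - (a_lin - 1) * cos_w0 - k,
    ]
    return [x / a[0] for x in b], [x / a[0] for x in a]


def gain_db_at(b, a, fs, f):
    """Magnitude response of the biquad in dB at frequency f (Hz)."""
    z1 = cmath.exp(-1j * 2 * math.pi * f / fs)   # z^-1 on the unit circle
    h = (b[0] + b[1] * z1 + b[2] * z1 * z1) / (a[0] + a[1] * z1 + a[2] * z1 * z1)
    return 20 * math.log10(abs(h))


FS = 48000        # assumed sample rate
CORNER_HZ = 4000  # hypothetical corner frequency, not given in the post
b, a = high_shelf_coeffs(FS, CORNER_HZ, gain_db=-5.0, q=0.5)

for f in (100, 1000, 4000, 10000, 20000):
    print(f"{f:>6} Hz: {gain_db_at(b, a, FS, f):+.2f} dB")
```

The coefficients can be applied to the simulated signal with any biquad or IIR filter routine. If the cut makes the simulation match the speakers, the corner frequency and gain that worked give a rough picture of the measurement error.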
One big problem with the measurement is getting the timbre right. This talk from David Griesinger does explain the problem. But I have some measurements where the timbre was OK although the measurement was not taken at the eardrum. Still, I believe that deeper mic insertion improves the final result.