You are right. Practical is an ambiguous word.
I have to define my premises better.
But before let me state this clearly: I am a consumer and I would never say to any engineer that he is currently doing something right or wrong.
Thanks, but you are nothing like the typical consumer.
I am not asking questions about what is currently correct, but about what would be viable in the near future, since my questions are all related to playback devices that are just hitting the market.
Unfortunately, we've had systems that have been able to measure and apply correction to consumer sound systems for almost 15 years. While millions of units have been sold the actual proper application of the auto-cal technology has been abysmal. And those are low, or percieved "no-cost" features included with units that would have been purchased without them. I use auto-cal as an example of something that offers significant improvement to the user experience but has not been widely accepted.
Expecting the general application of something as advanced as the Realizer is just fantasy.
Now to my understanding of practical.
Measurement of HRTF in anechoic chambers is definitely not practical.
Living in a room with very low reverberation is definitely not practical.
Measuring a binaural impulse response only once with only one professional monitor (that is specified to have high directivity and flat frequency response on axis in an anechoic chamber) in the near field while stuffing fist reflections on ceiling, floor and walls seems to me more practical than the first two options. Currently possible with Smyth Research Realiser.
Measuring a BRIR in your acoustically controlled mixing and mastering room seems as practical as the prior option.
How many of your costumers are going to have such controlled environment? I wouldn't mind at least listening in the same way you are.
So I agree with you, very few. You are right, you made your point.
But then how many of your customers have a listening room with the acoustical treatment your mixing and mastering room has?
Translation of a mix done in a well designed studio to playback in a typical home is actually pretty good. Mostly what happens at home is listening outside the sweet spot (something that can be checked in the studio), and poor frequency response (again, easily simulated). There are compromises, but the correlation is actually fairly good, and at least known.
Buying beamforming transducers arrays have just been demonstrated by Comhear (beamforming or yarra 3dx). I would expect the percentage of your costumers with access to that kind of playback environment to rise.
Well, it's at zero now, so a 100% increase would be fairly simple to accomplish. But do you have any idea what "typical consumer" means? There will never be that kind of technology in the home of a "typical consumer".
If that occurs, then ITD starts to become important.
It's important now, just not in the way you seem to think. ITD can add, if done right, a sense of depth and space to any stereo mix, and the expense of mono compatibility.
Buying crosstalk cancellation DSP is currently expensive, but the playback environment is similar to the prior option.
Crosstalk cancellation has been around for 37 years in various forms, nothing as involved as DSP, but some actually pretty astounding. It's never penetrated the market beyond the boutique level for the same reasons and problems we still have. We've also had two significant attempts at getting more than two-channel stereo into the home. First with Quad, which we thought failed because of format confusion, a tiny sweet spot, and the expense and inconvenience of 4 speakers. But later, 5.1 music taught us that none of that was the problem. With 5.1 music we had standardized format, standardized speaker plans, a huge listening window, clear improvement over stereo, a huge pre-installed base of 5.1 systems (the extra speakers were already there for movie surround sound) and it
still can't penetrate the market beyond the boutique.
Sorry if you don't see it this way, but DSP crosstalk cancellation systems will never be more in the market than at the boutique level, and as such cannot be considered in music production beyond the specialized demo material, etc. An obvious parallel is binaural, again, around for 50 years. Today there are billions of headphone listeners, more than ever. But can you buy your favorite music in binaural today? Not even a little. It's still at the curiosity, and boutique stage, and not likely to change.
Having head and torso scanned or photographed and a HRTF derived from such data is not currently possible. If it becomes available, then virtually all headphone listeners will have access to controlled externalized playback environment.
I think your view of "virtually all headphone listeners" is way, way out of whack. You know what "every headphone listener" has access to now? Precision headphone-specific EQ developed from high resolution measurements. It's available on the most common headphone listening platform in the world, and is so cheap as to be a non issue. Ever heard of that? Didn't think so.
"Have access to" and actually having something proliferate are two very different things.
If your answer to my last written question is no, then you believe or that third order ambisonics render elevation in a better way than binaural encoded files OR the other way around.
I didn't say that at all, did I? I believe they are two different things and don't present the same way.
With such uncertainty, ambisonics choice with convolved HRTF would at least work around the potential problem of matching the HRTF of the dummy head (or the standard HRTF of the mixing code) with the listener HRTF.
That's not a small problem, though, it's actually a deal-breaker. And because it's hard to get it right without customizing, it severely limits the commercial application of the technology. And yet, without a good listener HRTF in there somewhere, the whole thing just doesn't work very well.
I would like to know if such matching is so critical or it just introduces a negligible tonal inaccuracy.
I have not played much with Ambisonics, but from my binaural work, matching the HRTF to the listener is critical to the palpability of the resulting image. It's not so much a tonal issue as an imaging accuracy issue. There's also an artistic problem too. It isn't always desirable to have that full 3D sound field.