bigshot
Headphoneus Supremus
The subjective sound experience is the same with recordings as real life. That is the baseline, not the sound to be tested.
Should be pinned as a must-read / must-ask for anyone posting on the science forum…“There is no hope or point in quantifying anything for guys who think of themselves as an infallible sensory machine with infinite range. No amount of research will validate that delusion.”
But doesn't therein lie the rub?If we can measure the physical sound isn’t the rest of the audio experience down to perception and obviously just preferences ?
That perception will vary from person to person with the listening experience for some people being dramatically more influenced by stimuli beyond the sound pressure waves entering their ears than others ?
This is exactly what we do and we do know how it works. Where does this sudden "Humans don't know science" in all areas come from? We are able to build the JWST and place it in the L2 but we do not how electricity works when it comes to audio...But doesn't therein lie the rub?
Science has come up with some metrics for sound (or, more relevant perhaps, audio signals), but they are far from a complete descriptor. E.g. if total harmonic distortion is zero, we do indeed know that the signal is replicated perfectly. But if it is non-zero, there are many different permutations of the original signal possible that give the same non-zero THD value, but they are all different when examined in detail. The sound signal metrics that we use suffer from multiplicity; thus we can have two audio components that quantify identical in terms of frequency response, THD, IMD, noise levels etc, but that do produce different signal errors when examined in more detail, and may therefore sound different. The scientific knowledge as to how those different signal errors my affect perception in different individuals is as yet incomplete.
We can set some stochastically determined boundaries on what should/should not be audible to the vast majority of people, but until we know how our sound/audio signal metrics are misaligned with respect to the way humans process auditory information, absolutist statement re. perceived fidelity of sound reproduction are suspect, especially if we are looking at statistics that are close to the boundary of what is scientifically deemed perceptible and what isn't. (and since 16/44.1 PCM seems to come up again and again: that topology is very close to the boundary of what is scientifically deemed perceptible, so close that any flaw in its implementation may well be audible to a good number of people).
Oh, we do know how electricity works when it comes to audio, we know it very well indeed.This is exactly what we do and we do know how it works. Where does this sudden "Humans don't know science" in all areas come from? We are able to build the JWST and place it in the L2 but we do not how electricity works when it comes to audio...
This is really fantastic. Thanks for it.But doesn't therein lie the rub?
Science has come up with some metrics for sound (or, more relevant perhaps, audio signals), but they are far from a complete descriptor. E.g. if total harmonic distortion is zero, we do indeed know that the signal is replicated perfectly. But if it is non-zero, there are many different permutations of the original signal possible that give the same non-zero THD value, but they are all different when examined in detail. The sound signal metrics that we use suffer from multiplicity; thus we can have two audio components that quantify identical in terms of frequency response, THD, IMD, noise levels etc, but that do produce different signal errors when examined in more detail, and may therefore sound different. The scientific knowledge as to how those different signal errors my affect perception in different individuals is as yet incomplete.
We can set some stochastically determined boundaries on what should/should not be audible to the vast majority of people, but until we know how our sound/audio signal metrics are misaligned with respect to the way humans process auditory information, absolutist statement re. perceived fidelity of sound reproduction are suspect, especially if we are looking at statistics that are close to the boundary of what is scientifically deemed perceptible and what isn't. (and since 16/44.1 PCM seems to come up again and again: that topology is very close to the boundary of what is scientifically deemed perceptible, so close that any flaw in its implementation may well be audible to a good number of people).
Precisely.The idea that sound signal quality/fidelity can be adequately described by frequency curve/THD/ID/noise levels etc. is a sign of hubris
The whole "16/44.1 is sufficient" debate I have followed on and off for years, and a lot seems to get lost in translation between the people who understand the underlying mathematical theory, and those who just pick up on the headline statements. Many extend the ideas put forward in Nyquist/Shannon too far beyond what the theory was aiming to prove.This is really fantastic. Thanks for it.
Great. And presumably bit depth as well?The whole "16/44.1 is sufficient" debate I have followed on and off for years, and a lot seems to get lost in translation between the people who understand the underlying mathematical theory, and those who just pick up on the headline statements. Many extend the ideas put forward in Nyquist/Shannon too far beyond what the theory was aiming to prove.
The gap between the theory and what is practically possible is that Nyquist/Shannon proves that in context of Fourier decomposition of the orginal signal versus its PCM sampled counterpart no relevant information has been lost, and that therefore perfect reconstruction of the original signal from the PCM data samples is indeed possible. However whilst it encompasses the Whittaker-Shannon interpolation formula for reconstruction it does not address more practical methods of reconstruction, and it also requires perfect band filtering in the frequency (Fourier) domain. The conventional sample-and hold DAC reconstruction with analogue low-pass filters in the time domain as found in early digital audio is theoretically flawed in this respect, although in practice it results in quite acceptable performance. Oversampling filters with multiple coefficient parameterisation improve matters quite a bit.
The 16/44.1 proponents (and TBH, it works perfectly well enough for me) should have a look at how the same theory panned out in the world of digital image capture. Mathematically the two are quite comparable (compare audio frequency vs. spacial frequency, our upper hearing limit vs. visual acuity, etc.) In practice it turns out the most practical solution to improve image quality in digital photography is to work with a higher pixel count than visual acuity would suggest as sufficient. This reduces the need for an aggressive image-softening anti-alias filtering in front of the digital sensor and post-capture software filtering processing (its audio equivalent is low-pass filtering of the audio prior to sampling, and post DAC filtering.) Likewise, the most practical way to improve audio quality is to increase the sampling rate to one above what Nyquist/Shannon stipulates as sufficient, e.g. 48kHz or 96kHz.
As long as the information density of the data stream is reasonably consistent, then in theory bit depth and bit rate/sampling rate are somewhat interchangeable using e.g. dithering and noise shaping technologies. After a fashion the 1-bit DSD very high bit rate topology is an extreme example of that (only 1 bit, but very high bit rate, enabling very similar if not better performance than 16/44.1). Technics had a half-way house MASH technology.Great. And presumably bit depth as well?
Good point.As long as the information density of the data stream is reasonably consistent, then in theory bit depth and bit rate/sampling rate are somewhat interchangeable using e.g. dithering and noise shaping technologies. After a fashion the 1-bit DSD very high bit rate topology is an extreme example of that (only 1 bit, but very high bit rate, enabling very similar if not better performance than 16/44.1).
Wrong. It doesn’t have to measure zero to sound perfect, it just needs to fall below the threshold of audibility, which with human ears is a long way from zero. I don’t think you’ve bothered to research what is audible and what isn’t, so you’re assuming that everything is audible. That just isn’t true. There is a point where error is too small to matter. If you don’t understand that, you have no way of predicting how something sounds.if total harmonic distortion is zero, we do indeed know that the signal is replicated perfectly.
This is really fantastic. Thanks for it.
Precisely.
Good point.