KeithEmo
Member of the Trade: Emotiva
- Joined
- Aug 13, 2014
- Posts
- 1,698
- Likes
- 868
There seem to be several conversations going on here....
To address your first comment - simply - if the original acoustic source didn't have any ringing, but the digital representation of it does, then the digital representation is NOT perfect - and the ringing added by the digital proces is indeed a flaw. (Since there shouldn't be any ringing at all, but some ringing is inevitable, and even inherent, in any currently available digital representation, we much admit that none of the solutions are perfect, and so we are picking a compromise. The best we may even hope for is "the best digital representation, given the sample rate, and the other limitations of the system we've chosen.) However, even beyond that, we can't absolutely choose a single "best compromise" because the choice depends on our parameters; one filter gives the flattest power response, another gives the best phase response, and yet another gives the most accurate impulse response... name your poison. (And, in the end, the question returns to "which of the flaws is least audible? And, of course, we are hoping to be able to choose an option where there are NO AUDIBLE FLAWS. )
My comment about how some DACs do in fact process different sample rates differently was directed at your comment about "the results of conversions performed by Sox sounding identical to you". And my point was that, when comparing files at different sample rates, you can't assume that listening to both on the same DAC constitutes "a level playing field". For example, assuming that you started with a 24/192k file and converted it to 16/44k using Sox, it's possible that the 16/44k file has actually been seriously altered, but that the DAC itself is seriously altering the 24/192k file when it plays it, and so their apparent similarity is simply the result of two similar errors. (This isn't as far fetched as it sounds - because many DACs - especially older ones - actually do have significantly higher levels of various types of distortion at higher sample rates. Therefore, because of design limitations, the DAC you've chosen could actually be "cancelling out" some positive difference in the higher sample rate file by adding more distortion to it during playback.) And, in the particular case of the example you've chosen, the 44k version of the file you converted using Sox might sound identical BECAUSE THERE'S SOMETHING WRONG, and, if so, then doing a better quality conversion might just produce a 16/44k file that sounded BETTER than the 24/192k original ON THAT DAC. (I would agree that it's somewhat unlikely, and the only way I could imagine to determine if the different result was better would be to repeat the same experiment with several different converters and DACs, but such a result is not at all beyond the range of possibility, which means that it's just one more factor that makes the results uncertain.)
There absolutely ARE DACs in existence which process and filter source files at different sample rates differently enough that the differences in how the DAC handles the files exceeds the differences in the files themselves. (I owned a well-known non-oversampling DAC whose frequency response was 20-20k +/- 0.25 dB at 96k, but 20-20k +0/-3 dB at 44k. If you'd compared your two files on that DAC, and they were audibly identical, then that would have meant that the files were in fact far different... )
As for the fact that you seem to not "believe that ringing matters" - there seem to be a lot of voices to the contrary. (While you can feel free to argue at what level it becomes audible, the vast majority of DAC chip manufacturers, and people who write CODECS, all seem to agree that, at some point, pre-ringing becomes audible and unpleasant. We're not talking about something that's being claimed by a few fringe companies; we're talking about something that Wolfson and several DAC manufacturers consider important, that virtually all DAC chip vendors specify, and that Dolby Labs considers to be an important feature in their latest professional level encoder. Therefore, I really don't think it's reasonable to dismiss it out of hand.)
I would suggest that, if you want to establish reasonably that a given sample rate converter is "inaudible", a good start would be to take a 16/44k file, upsample it to 96k, then down-sample it back to 44k - using the same converter. If they sound identical, then, excluding the possibility that the conversions might introduce errors that cancel out, you will have at least established the possibility that the conversions are "perfect". To me, simply converting one to another, using an "arbitrarily good enough" conversion, then comparing the two on an "arbitrarily good enough" DAC and playback system, is leaving far too many variables not adequately controlled.
Now, again, we need to remember that we're discussing "proving something to a reasonable degree of certainty to make it a scientific claim" here.
If the question is simply of whether "there seems to be enough evidence to convince you or me that it probably doesn't matter to us" - then the level of proof required is FAR lower.
(I do apologize, to a degree, for "trying to pick you apart on scientific details" - but, at least to me, you seem to be getting dangerously close the the line of "I don't hear a difference - therefore there can't possibly be one". I personally don't have especially "good pitch", so I can't hear when a guitar is slightly out of tune... but I still can't rule out that many of the people I know who claim that they CAN hear when one is even slightly out of tune might still be telling the truth... )
I did know someone once who tried the experiment of taking a 44k file, upsampling it to 96k, and then downsampling it to 44k again - using one of the popular audio editing programs (I believe it was an early version of Adobe Audition.) Note that he used actual music, and not steady state test tones or pink noise. He was horrified at the level and type of differences that existed between the original file and the double-converted one - which should have been "identical". (I've never tried that experiment, and it might be interested to see how Sox would fare with it.)
Quote:
To address your first comment - simply - if the original acoustic source didn't have any ringing, but the digital representation of it does, then the digital representation is NOT perfect - and the ringing added by the digital proces is indeed a flaw. (Since there shouldn't be any ringing at all, but some ringing is inevitable, and even inherent, in any currently available digital representation, we much admit that none of the solutions are perfect, and so we are picking a compromise. The best we may even hope for is "the best digital representation, given the sample rate, and the other limitations of the system we've chosen.) However, even beyond that, we can't absolutely choose a single "best compromise" because the choice depends on our parameters; one filter gives the flattest power response, another gives the best phase response, and yet another gives the most accurate impulse response... name your poison. (And, in the end, the question returns to "which of the flaws is least audible? And, of course, we are hoping to be able to choose an option where there are NO AUDIBLE FLAWS. )
My comment about how some DACs do in fact process different sample rates differently was directed at your comment about "the results of conversions performed by Sox sounding identical to you". And my point was that, when comparing files at different sample rates, you can't assume that listening to both on the same DAC constitutes "a level playing field". For example, assuming that you started with a 24/192k file and converted it to 16/44k using Sox, it's possible that the 16/44k file has actually been seriously altered, but that the DAC itself is seriously altering the 24/192k file when it plays it, and so their apparent similarity is simply the result of two similar errors. (This isn't as far fetched as it sounds - because many DACs - especially older ones - actually do have significantly higher levels of various types of distortion at higher sample rates. Therefore, because of design limitations, the DAC you've chosen could actually be "cancelling out" some positive difference in the higher sample rate file by adding more distortion to it during playback.) And, in the particular case of the example you've chosen, the 44k version of the file you converted using Sox might sound identical BECAUSE THERE'S SOMETHING WRONG, and, if so, then doing a better quality conversion might just produce a 16/44k file that sounded BETTER than the 24/192k original ON THAT DAC. (I would agree that it's somewhat unlikely, and the only way I could imagine to determine if the different result was better would be to repeat the same experiment with several different converters and DACs, but such a result is not at all beyond the range of possibility, which means that it's just one more factor that makes the results uncertain.)
There absolutely ARE DACs in existence which process and filter source files at different sample rates differently enough that the differences in how the DAC handles the files exceeds the differences in the files themselves. (I owned a well-known non-oversampling DAC whose frequency response was 20-20k +/- 0.25 dB at 96k, but 20-20k +0/-3 dB at 44k. If you'd compared your two files on that DAC, and they were audibly identical, then that would have meant that the files were in fact far different... )
As for the fact that you seem to not "believe that ringing matters" - there seem to be a lot of voices to the contrary. (While you can feel free to argue at what level it becomes audible, the vast majority of DAC chip manufacturers, and people who write CODECS, all seem to agree that, at some point, pre-ringing becomes audible and unpleasant. We're not talking about something that's being claimed by a few fringe companies; we're talking about something that Wolfson and several DAC manufacturers consider important, that virtually all DAC chip vendors specify, and that Dolby Labs considers to be an important feature in their latest professional level encoder. Therefore, I really don't think it's reasonable to dismiss it out of hand.)
I would suggest that, if you want to establish reasonably that a given sample rate converter is "inaudible", a good start would be to take a 16/44k file, upsample it to 96k, then down-sample it back to 44k - using the same converter. If they sound identical, then, excluding the possibility that the conversions might introduce errors that cancel out, you will have at least established the possibility that the conversions are "perfect". To me, simply converting one to another, using an "arbitrarily good enough" conversion, then comparing the two on an "arbitrarily good enough" DAC and playback system, is leaving far too many variables not adequately controlled.
Now, again, we need to remember that we're discussing "proving something to a reasonable degree of certainty to make it a scientific claim" here.
If the question is simply of whether "there seems to be enough evidence to convince you or me that it probably doesn't matter to us" - then the level of proof required is FAR lower.
(I do apologize, to a degree, for "trying to pick you apart on scientific details" - but, at least to me, you seem to be getting dangerously close the the line of "I don't hear a difference - therefore there can't possibly be one". I personally don't have especially "good pitch", so I can't hear when a guitar is slightly out of tune... but I still can't rule out that many of the people I know who claim that they CAN hear when one is even slightly out of tune might still be telling the truth... )
I did know someone once who tried the experiment of taking a 44k file, upsampling it to 96k, and then downsampling it to 44k again - using one of the popular audio editing programs (I believe it was an early version of Adobe Audition.) Note that he used actual music, and not steady state test tones or pink noise. He was horrified at the level and type of differences that existed between the original file and the double-converted one - which should have been "identical". (I've never tried that experiment, and it might be interested to see how Sox would fare with it.)
Quote:
Ringing is technically not a flaw. The *ideal*, meaning technically perfectly correct filter rings indefinitely at the cutoff frequency. This is why having a cutoff higher than we can hear is important, and why non-minimum phase filters *in the audible range* can be problematic. I am well aware of how bad some SRCs are, but SoX (which was the program under question) isn't one of them. And why are you trying to expand into how DACs work at different rates? The whole point here was to get two sources of the same content to the same rate so they can be compared. I'm also well aware of what people "claim" to hear, which somehow suddenly disappears under certain situations (that are being well hashed to death in the other thread so we'll leave that be).