zareliman
100+ Head-Fier
- Joined
- Mar 29, 2012
- Posts
- 111
- Likes
- 24
Hi
A while ago I discovered the 192/24 formats, downloaded some, read some stuff. According to the nyquist theorem there should be no audible difference between the 48/24 and 192/24 so I decided to downsample the ultra-big files I had (since I'm running low on storage).
Even though I know and trust the nyquist theorem I still wanted to try if I could spot the differences between a downsampled track and the original using ABX tests so I'm not losing original quality with my conversion.
I read some stuff about the 3 options for foobar2000, the PPHS, the dbpowerAMP and the SoX. The SoX was supposed to be the better one in terms of quality (still I don't know how can a resampler be different than other in terms of quality, I can understand one being faster or more efficient due to better coding but the quality shouldn't be an issue when you're supposed to just delete half the samples). There's supposed to be filters that eliminate some undesired frequencies theorically resulting in better quality but, again, conceptually I think the best conversion method shouls just downsample without altering it any further.
I decided to use: The David Hazeltine Trio - Impromptu (Binaural+) record, track 2.
I started my tests by transcoding the original 96/24 to Mp3. I could spot the difference within seconds after I knew "where to look". At least my hearing/focus can be trusted at that level (8/8 ABX correct)
Then decided to transcode from 96/24 to the highest quality Nero AAC. I scored 8/8 in the ABX test without being doubtful in any choice.
Now the 96/24 versus the 48/24 (FLAC). To my surprise I could hear differences here (I have the ABX tests to prove it 8/8 correct).
So either I have a very special sense of hearing (I still can't hear above 18khz when I tested) or there's something wrong with my conversion method. I used the foobar2000 built-in tool, the latest FLAC encoder without dither and only the SoX resample DSP with 98% Passband and 40% Phase response, no antialiasing, Best Quality and Downsample 2x.
I suspect the SoX resampler for 2 reasons, the Passband and Phase response must be doing something to the original signal (I lack the technical knowledge to know what they do exactly). The second reason is that the gain changes when the file is resampled which I don't think should be happening in a downsampling.
If that's the case, what would be the best way to conduct such test ? Is there a mathematically accurate downsampling (just delete 1 every 2 samples) ? Or could I win the golden ear prize they're offering...
EDIT: The problem was identified. Through WASAPI or DS the playback of big files had a singular noise when played in y system (I couldn't find about my issue in the net I assume it must be a very particular issue produced by some incompatiblity in my system). Though ASIO the issue dissapears.
A while ago I discovered the 192/24 formats, downloaded some, read some stuff. According to the nyquist theorem there should be no audible difference between the 48/24 and 192/24 so I decided to downsample the ultra-big files I had (since I'm running low on storage).
Even though I know and trust the nyquist theorem I still wanted to try if I could spot the differences between a downsampled track and the original using ABX tests so I'm not losing original quality with my conversion.
I read some stuff about the 3 options for foobar2000, the PPHS, the dbpowerAMP and the SoX. The SoX was supposed to be the better one in terms of quality (still I don't know how can a resampler be different than other in terms of quality, I can understand one being faster or more efficient due to better coding but the quality shouldn't be an issue when you're supposed to just delete half the samples). There's supposed to be filters that eliminate some undesired frequencies theorically resulting in better quality but, again, conceptually I think the best conversion method shouls just downsample without altering it any further.
I decided to use: The David Hazeltine Trio - Impromptu (Binaural+) record, track 2.
I started my tests by transcoding the original 96/24 to Mp3. I could spot the difference within seconds after I knew "where to look". At least my hearing/focus can be trusted at that level (8/8 ABX correct)
foo_abx 2.0.1 report
foobar2000 v1.3.9 beta 1
2015-09-12 14:02:46
File A: 02. Jesu Joy of Man's Desiring.mp3
SHA1: 560d3c0b13e9ae880014b08a33fa989326f972e7
Gain adjustment: +1.60 dB
File B: 02. Jesu Joy of Man's Desiring.flac
SHA1: 09ed7a6da74d0de725715e0519fee6eb262bcb36
Gain adjustment: +1.62 dB
Output:
WASAPI (event) : Speakers (ASUS Xonar D1 Audio Device), 24-bit
Crossfading: NO
14:02:46 : Test started.
14:03:14 : 01/01
14:03:23 : 02/02
14:03:30 : 03/03
14:04:17 : 04/04
14:04:23 : 05/05
14:04:28 : 06/06
14:04:35 : 07/07
14:04:38 : 08/08
14:04:38 : Test finished.
----------
Total: 8/8
Probability that you were guessing: 0.4%
-- signature --
8d64736d68ba51ad702056666775f2ffc2795ceb
Then decided to transcode from 96/24 to the highest quality Nero AAC. I scored 8/8 in the ABX test without being doubtful in any choice.
foo_abx 2.0.1 report
foobar2000 v1.3.9 beta 1
2015-09-12 14:06:04
File A: 02. Jesu Joy of Man's Desiring.m4a
SHA1: 3d3dc295ba147cc35915b582a7267fe37399b9a1
Gain adjustment: +1.63 dB
File B: 02. Jesu Joy of Man's Desiring.flac
SHA1: 09ed7a6da74d0de725715e0519fee6eb262bcb36
Gain adjustment: +1.62 dB
Output:
WASAPI (event) : Speakers (ASUS Xonar D1 Audio Device), 24-bit
Crossfading: NO
14:06:04 : Test started.
14:06:18 : 01/01
14:06:23 : 02/02
14:06:27 : 03/03
14:06:33 : 04/04
14:06:41 : 05/05
14:06:51 : 06/06
14:06:54 : 07/07
14:06:59 : 08/08
14:06:59 : Test finished.
----------
Total: 8/8
Probability that you were guessing: 0.4%
-- signature --
e8009cbfbd07428cbd6c0da3cb27a0071f7e6674
Now the 96/24 versus the 48/24 (FLAC). To my surprise I could hear differences here (I have the ABX tests to prove it 8/8 correct).
foo_abx 2.0.1 report
foobar2000 v1.3.9 beta 1
2015-09-12 14:11:47
File A: 02. Jesu Joy of Man's Desiring.flac
SHA1: 09ed7a6da74d0de725715e0519fee6eb262bcb36
Gain adjustment: +1.62 dB
File B: 02. Jesu Joy of Man's Desiring.flac
SHA1: b1c8bb800b8f4a9915954dc641d2d797d0d6b821
Gain adjustment: +1.60 dB
Output:
WASAPI (event) : Speakers (ASUS Xonar D1 Audio Device), 24-bit
Crossfading: NO
14:11:47 : Test started.
14:11:55 : 01/01
14:12:03 : 02/02
14:12:16 : 03/03
14:12:22 : 04/04
14:12:29 : 05/05
14:12:38 : 06/06
14:12:42 : 07/07
14:12:47 : 08/08
14:12:47 : Test finished.
----------
Total: 8/8
Probability that you were guessing: 0.4%
-- signature --
54e5e721bebd7810ae4ac4bd26664e66ed770107
So either I have a very special sense of hearing (I still can't hear above 18khz when I tested) or there's something wrong with my conversion method. I used the foobar2000 built-in tool, the latest FLAC encoder without dither and only the SoX resample DSP with 98% Passband and 40% Phase response, no antialiasing, Best Quality and Downsample 2x.
I suspect the SoX resampler for 2 reasons, the Passband and Phase response must be doing something to the original signal (I lack the technical knowledge to know what they do exactly). The second reason is that the gain changes when the file is resampled which I don't think should be happening in a downsampling.
If that's the case, what would be the best way to conduct such test ? Is there a mathematically accurate downsampling (just delete 1 every 2 samples) ? Or could I win the golden ear prize they're offering...
EDIT: The problem was identified. Through WASAPI or DS the playback of big files had a singular noise when played in y system (I couldn't find about my issue in the net I assume it must be a very particular issue produced by some incompatiblity in my system). Though ASIO the issue dissapears.