Lossy Audio Codec's Comparison [HUGE amount of pics] [iTunes UPDATE on p.7]
Feb 28, 2007 at 6:23 PM Thread Starter Post #1 of 225

Sir Nobax

500+ Head-Fier
Joined
Feb 1, 2007
Posts
627
Likes
10
Since the day i started 'listening' to music instead of 'hearing' it i'm worried about the quality of lossy codecs like MP3, AAC and WMA. Every time i hear a song i wonder if it is really how it has to sound, because of the loss of data in the compression.
Of couse i could A-B every codec but still this is a very uneffective way to hear how much a file suffered from the lossy compression.

Enter The Spectrograph.

A while back i tested every major lossy codec with encoding a complex wave with an enormous amount of sweeps, because i think some of you would like to see how a MP3 compressed file really "looks". I encoded the file several times to different codecs at different bitrate's with dbPowerAmp and i made spectographs of them with an old version of CoolEdit Pro. (i will upload the sample tomorrow)

[WARNING]
All encodings including MP3 Lame (i could be totally wrong but i think it was 3.47) [UPDATE: The Lame codec is version 3.97] are mostly CBR, this was because VBR wouldnt work well with just a 6 second long sample. I cant tell how much the quality is different compared to "real" MP3 Lame encoded files with VBR.



[size=large]THE ORIGINAL WAVE[/size]
WAVOriginal.jpg

Grab it HERE


[size=large]AAC[/size]


AACCBRHE96kbpsFast.jpg

HE-AAC at 96kbps Fast setting

AACCBRLC96kbpsFastPNS.jpg

LC-AAC at 96kbps Fast setting with PNS

AACCBRLC96kbpsFast.jpg

LC-AAC at 96kbps Fast setting without PNS

AACCBRLC128kbpsFast.jpg

LC-AAC at 128kbps Fast setting

AACCBRLC128kbpsGood.jpg

LC-AAC at 128kbps Good setting (notice that the "Good" quality setting is worse then the "Fast" setting.)

AACVBRLC128kbpsFast.jpg

LC-AAC at ~128kbps Fast setting in VBR

AACCBRLC196kbpsFast.jpg

LC-AAC at 196kbps Fast setting

AACCBRLC320kbpsFast.jpg

LC-AAC at 320kbps Fast setting (99% perfect copy)


[size=large]MP3 Blade[/size]


MP3Blade96kbps.jpg

MP3 Blade at 96kbps

MP3Blade128kbps.jpg

MP3 Blade at 128kbps

MP3Blade196kbps.jpg

MP3 Blade at 192kbps

MP3Blade256kbps.jpg

MP3 Blade at 256kbps

MP3Blade320kbps.jpg

MP3 Blade at 320kbps (almost as good as perfect)


[size=large]MP3 Lame[/size]


MP3LAMECBR96kbps.jpg

MP3 Lame at 96kbps

MP3LAMEABR128kbps.jpg

MP3 Lame at ~128kbps in ABR (average bitrate)

MP3LAMECBR128kbps.jpg

MP3 Lame at 128kbps

MP3LAMECBR192kbps.jpg

MP3 Lame at 192kbps

MP3LAMECBR256kbps.jpg

MP3 Lame at 256kbps

MP3LAMECBR320kbps.jpg

MP3 Lame at 320kbps (in CBR at 320kbps Lame is worse then Blade, i'm sure that a ~320kbps VBR Lame encoded MP3 would be an near-perfect copy though


[size=large]MP3Pro[/size]


MP3Pro64kbps.jpg

MP3Pro at 64kbps (notice the funny methode to compress tones above 6khz, it causes a lot of jittering and ringing but sounds better then a normal 64kbps MP3 encoding.)

MP3Pro96kbps.jpg

MP3Pro at 96kbps (this encoding is very bad, still the same amount of jittering and ringing as 64kbps, HE- and LC-AAC sounds way better at this bitrate)

.. 30 pic limit reached ...
 
Feb 28, 2007 at 6:24 PM Post #3 of 225
More Pics

[size=large]OGG Vorbis[/size]


OGGCBR96kbps.jpg

OGG at 96kbps (some strange thing happens everytime i encode a OGG file, especially high bitrate ones, the bitrate is WAY lower then others, i sounds fine but the graphs shows its still flawed, probably OGG is forcing VBR)

OGGCBR128kbps.jpg

OGG at 128kbps


OGGCBR192kbps.jpg

OGG at 192kbps


OGGCBR256kbps.jpg

OGG at 256kbps


OGGCBR350kbps.jpg

OGG at 350kbps


OGGCBR500kbps.jpg

blink.gif
OGG at 500kbps (with this file you can see that this can never be 500kbps, the filesize was 170KB, the 320kbps AAC was 190KB, confirming that OGG forces VBR)


[size=large]WMA 9.1[/size]


WMA9.1CBR96kbps.jpg

WMA 9.1 at 96kbps

WMA9.1CBR128kbps.jpg

WMA 9.1 at 128kbps


WMA9.1CBR256kbps.jpg

WMA 9.1 at 256kbps


[size=large]WMA 9.2[/size]


WMA9.2CBR96kbps.jpg

WMA 9.2 at 96kbps

WMA9.2CBR128kbps.jpg

WMA 9.2 at 128kbps

WMA9.2CBR192kbps.jpg

WMA 9.2 at 192kbps

WMA9.2CBR256kbps.jpg

WMA 9.2 at 256kbps

WMA9.2CBR320kbps.jpg

WMA 9.2 at 320kbps (compare this to any other 320kbps file, except OGG, and you'll see that this is really bad)


[size=large]WMA 10 Pro[/size]


WMA10ProCBR96kbps.jpg

WMA 10 Pro at 96kbps, the best 96kbps file

WMA10ProCBR128kbps.jpg

WMA 10 Pro at 128kbps, nothing special

WMA10ProCBR192kbps.jpg

WMA 10 Pro at 192kbps

WMA10ProCBR256kbps.jpg

WMA 10 Pro at 256kbps, this just amazed me, near perfect at 256kbps, trully amazing
rs1smile.gif
.



Well this is it so far, i hope you guys enjoy watching the spectographs and get an impression of the quality between the codecs.

And again i think it is a shame that LAME could'nt be encoded this way. Maybe i'll try it again with the newer versions.

If anyone wich to request an codec, just ask
cool.gif
.
 
Feb 28, 2007 at 6:31 PM Post #4 of 225
Quote:

Originally Posted by Sir Nobax /img/forum/go_quote.gif
If anyone wich to request an codec, just ask
cool.gif
.



Thank you so much for your time
icon10.gif
Actually I have one request. AAC-LC @224 with CBR and VBR, since that seems to be the limit before the ipod buffer overfills and there is hard disk interaction.
 
Feb 28, 2007 at 6:49 PM Post #6 of 225
Quote:

Originally Posted by hYdrociTy /img/forum/go_quote.gif
i love you


OMG, that just made me feel all warm and fuzzy inside
blink.gif
.



I'm going to reinstall all my stuff and i'll post the results for the AAC-LC @ 224 with CBR and VBR file.
 
Feb 28, 2007 at 7:03 PM Post #7 of 225
Quote:

Originally Posted by Sir Nobax /img/forum/go_quote.gif
OMG, that just made me feel all warm and fuzzy inside
blink.gif
.



I'm going to reinstall all my stuff and i'll post the results for the AAC-LC @ 224 with CBR and VBR file.



wow, just wow

thank you so much for dong this
 
Feb 28, 2007 at 7:20 PM Post #8 of 225
If only we could do a similar test for headphones - e.g. feed in a reference signal and then record the output with a mic. Even on the best cans, I bet the graphs would be pretty bad with lots of holes.

Such a test would be hard though - even the microphone would create imperfections in the graph of it's own?

Ps. the graphs are good for technicals. But the reason compression works is because the human ear isn't sensitive to the missing black holes in certain places. A graph full of black holes doesn't mean it will sound bad.
 
Feb 28, 2007 at 7:23 PM Post #9 of 225
Quote:

Originally Posted by Sir Nobax /img/forum/go_quote.gif
Of couse i could A-B every codec but still this is a very uneffective way to hear how much a file suffered from the lossy compression.


Actually, that is the most effective way to hear how much a file suffered from lossy compression.

These spectrographs are interesting to look at, but as I mentioned in the other thread where you brought this up, they really don't give you any useful information about how the lossy codec sounds.

I think that it is a mistake to look at a spectrograph, see that a codec leaves out significant amount of data, and decide from that spectrograph that the codec sounds bad. I would expect the opposite to be true. All other things being equal, wouldn't the best codec be the one that leaves out that most data and still sounds indistinguishable from the original?
 
Feb 28, 2007 at 8:28 PM Post #11 of 225
Quote:

Originally Posted by Febs /img/forum/go_quote.gif
Actually, that is the most effective way to hear how much a file suffered from lossy compression.

These spectrographs are interesting to look at, but as I mentioned in the other thread where you brought this up, they really don't give you any useful information about how the lossy codec sounds.

I think that it is a mistake to look at a spectrograph, see that a codec leaves out significant amount of data, and decide from that spectrograph that the codec sounds bad. I would expect the opposite to be true. All other things being equal, wouldn't the best codec be the one that leaves out that most data and still sounds indistinguishable from the original?



my thoughts exactly. how is looking at pictures more effective than listening when you are trying to judge the quality of an audio reproduction? now, if you would ABX and then explain your findings with visual representations, that would be useful. but simply attempting to judge based on the images is illogical. this line points out the flaw in your method
Quote:

(notice the funny methode to compress tones above 6khz, it causes a lot of jittering and ringing but sounds better then a normal 64kbps MP3 encoding.)


 
Feb 28, 2007 at 9:15 PM Post #12 of 225
Quote:

Originally Posted by Febs /img/forum/go_quote.gif
Actually, that is the most effective way to hear how much a file suffered from lossy compression.

These spectrographs are interesting to look at, but as I mentioned in the other thread where you brought this up, they really don't give you any useful information about how the lossy codec sounds.

I think that it is a mistake to look at a spectrograph, see that a codec leaves out significant amount of data, and decide from that spectrograph that the codec sounds bad. I would expect the opposite to be true. All other things being equal, wouldn't the best codec be the one that leaves out that most data and still sounds indistinguishable from the original?





Even if AAC leaves a lot more of data at 128Kbps than Mp3 at 128Kbps the AAC is not a bigger file, I guess this test is about the best codec takes away the most unhearable sounds in music or just not messes up the signal.

One thing that i didn't understand was that WMA9.2 was worse than AAC
blink.gif
I can hear the difference from AAC and lossless at much higher bitrates than WMA9.2.

What program did you use to encode with? Did you use WMP11? no you could not have done that because it can't encode higher than 192kbps.

Try to do it with WMP11.
 
Feb 28, 2007 at 9:43 PM Post #14 of 225
Quote:

Originally Posted by Sir Nobax
Of couse i could A-B every codec but still this is a very uneffective way to hear how much a file suffered from the lossy compression.


Based on this statement, it seems as though you don't understand the nature of lossy encoding. It's not designed to be lossy in a perceptual sense; merely in a quantitative sense. That's the whole point of psychoacoustic modeling and its application in the form of so-called 'lossy' encoding. What your charts show is the impact of each codec's psy model quantitatively, but the charts don't show the qualitative impact on their own, and only arguably do so in concert with a working understanding of psychoacoustics and ability to analyze this data through that knowledge and framework.

When you set the bitrate of any of these codecs, you have a determined, finite bandwidth with which the data is represented. Analyzing the data in this manner (a particular frequency-domain analysis) only even gives you part of the quantitative story, let alone qualitative. A great example of the fallacy demonstrated here was old analyses of VQF (a vector quantization based codec developed by NTT and Yamaha) where 96kbps VQF was claimed to exceed the quality of MP3 at 128kbps and beyond (sometimes going so far as claiming better than 192+kbps), and then data was supplied via a spectrogram. I tried VQF, and the qualitative performance of it was relatively poor, even against MP3 at that time (which was in a substantially less refined state compared to today). I suspect there was a substantial amount of quantization noise, based on the overall sound, and it also seemed to lose a lot of positional information...perhaps due to lousy VQ.

The charts are interesting, but I do not recommend choosing a codec based on their superficial appearance. Blade MP3, unless it has been substantially updated and improved in a manner I'm completely unaware of (as far as I know, development was halted in 2002), is not of the refinement of the LAME mp3 encoder and I would not use it in place of LAME even at 320kbps.
 
Feb 28, 2007 at 9:53 PM Post #15 of 225
Another vote for FLAC and how about ATRAC 256/292/352 and lossless.

I'm not entirely sure what I'm looking at in these pictures. Anyone care to explain one of these images?
 

Users who are viewing this thread

Back
Top