[1] That said, the internationally mandated Loudness metrics (1770 et al) do a very good job at what they were designed for: broadcast loudness normalization with an emphasis on advertisements.
[2] Likewise, DRi does a very good job at what it was designed for: an easy to understand (hence the single integer representation) measurement of the dynamic density of popular music.
[3] [Please excuse the promo, but we feel strongly about this…] DRO2 and DRM2 also include metrics not found in other products ...
[4] The upshot of all this awareness of Loudness and the Loudness Wars is that artists, engineers and producers are starting to realize that, if you make content artificially loud, it will be gain reduced on playback through any popular realtime distribution service. So, by trying to make it “stand out,” you’re actually making it quieter, aurally receding into the background, relative to the tracks before and after it.
1. The ITU 1770 was NOT specifically aimed at broadcast with an emphasis on advertisements, just at recorded audio loudness in general! It agree that the ATSC largely used it for the purpose of controlling advertisement levels, due to the introduction of the CALM Act in the USA, in addition to controlling levels between different channels (and so did the EBU). However a year or two ago, the AES published a recommended set of loudness specifications (based on the same ITU 1770 measurement) specifically for music production/distribution! Furthermore, there's some discussion of developing specs based on ITU 1770 for theatrical film production and distribution. Of all the loudness measurement paradigms, the one's based on ITU 1770 are by far the most accurate but even these only address what the perceived relative loudness (and/or dynamic range) actually is, NOT what it should be. For example, a Hayden string quartet should not be the same loudness as say a song by Metallica or have the same dynamic range as say a Mahler symphony. There's currently no solution for this aspect of loudness or dynamic range and none even on the horizon, as far as I'm aware such a solution is well beyond even the most sophisticated AI. Furthermore, this is NOT purely a music issue, a romantic comedy or other inter-personal drama should obviously not be as loud or dynamic as say a big action blockbuster.
2. For me, that is it's problem or rather, problems!:
A. I don't exactly know what you mean by the term "dynamic density" but more importantly; if it was designed as an easy to understand measurement of "dynamic density", why is it labelled and marketed as an easy to understand measurement of "dynamic range"? I assume it's a far easier "sell" to use the term "dynamic range", which many are familiar with, than to have to explain the term "dynamic density" and then "sell" an idea/concept which no one is familiar with and which most wouldn't bother to familiarise themselves with? In other words, it's simplicity and ease of use broadens it appeal but the downside is that most would naturally assume that the DR database is actually a database of dynamic range measurements, which it isn't, and so it's effectively misleading to most and confusing to others, like the OP, who observe the discrepancy.
B. If it's designed as a measurement of the "dynamic density of popular music" why does the DR database contain measurements of other genres, such as classical music for example? Even if it were restricted to only popular music, it would still have essentially the same problem as the "Loudness Normalisation" (ITU 1770) measurement: Namely, what is the right amount? A DR measurement of say "6" might be entirely appropriate for one piece (or even a whole popular music sub-genre) but entirely inappropriate for another, IE. Some sub-genres require more compression than others. Furthermore, there's the natural dynamic range of popular music genres to consider, which the DR database doesn't. For example, it would be entirely feasible to create a piece of ambient music with a DR measurement of say "7", using little/no compression, while another genre might need a quite severe amount of compression to achieve a measurement of "8". In other words, one piece with little/no compression, which is totally fine might measure into the Red, "Bad" region, while another piece with unwanted, quite heavy compression artefacts might measure in the Orange, "Not too bad" region!
C. There seems to be a problem with the DR measurement with certain media types, vinyl for example. It appears that certain distortions are introduced in the cutting and/or playback process with vinyl which causes a higher DR measurement than is actually the case, up to "4" higher I believe. So vinyl releases (or rather, rips of vinyl releases) cannot be compared with releases in a digital format.
If it were me, at the very least I'd have an easy to access Wiki/explanation page on the DR Database site, explaining these issues and providing more detailed guidance on how to use and interpret the presented DR measurements!
3. Yes, I can see that! IMO though, the DR database is clearly misrepresenting both dynamic range and the loudness war. btw, what is DRO2 and DRM2?
4. Firstly, let's not forget that it was the engineers who first raised the issue of the loudness war, more than 25 years ago. Secondly, we're not there yet, there's still the issues of: A. How loud a piece/song should be, relative to another type of piece/song. B. No single standard. Youtube is still probably the world's largest music distribution platform and with an integrated loudness specification equivalent to around -12 to -13LUFS, that's not low enough to really punish excessive compression.
G