This isn't really so much of a DIY-related question, but (yet another) question from me for technically-minded head-fiers.
If the human perception of loudness is roughly logarithmic, why is linear PCM used for almost all digital music reproduction?
This is the mental exercise I keep playing out in my head:
Start with a certain loudness that we'll call mezzo forte, define a second loudness that is one unit of perceptual loudness more loud, called forte, and then a third loudness, fortissimo, that is one more unit of perceived loudness greater than forte.
If the perceptual space is logarithmic and we were to map these units into a linear space, the difference between fortissimo and forte would be a degree of magnitude greater than the difference between forte and mezzo forte:
Giving the three values arbitrary values in logarithmic space that make them evenly spaced apart:
mezzo forte = 3
forte = 4
fortissimo = 5
Solving for the corresponding values in linear space, assuming that loudness perception is base-10 logarithmic (which it isn't, but it makes the math easier):
mezzo forte:
3 = log(x)
x = 10^3 = 1000
forte:
4 = log(x)
x = 10^4 = 10000
fortissimo:
5 = log(x)
x = 10^5 = 100000
In the above linear space, the difference between mezzo forte and forte is (10000 - 1000) = 9000 linear units, while the difference between forte and fortissimo is (100000 - 10000) = 90000 linear units, ten times more.
If these linear units were analogous to linear PCM quantization levels, wouldn't that mean that linear PCM has considerably higher, er... perceptual loudness resolution (is there a better word for that?) at the loud end than at the quiet end?
Is there a flaw in my thinking? I have a degree for Computer Science and another for Photography, which basically means I understand the basics of how PCM works and I have a decent idea of how visual perception works, but I'm mostly talking out of my ass when I try to apply what I know towards psychoacoustics.
I starting thinking down this path while I was pondering why the dynamics (micro and macro) of vinyl sound so much better to me than those of CD audio, despite some valid technical arguments against that being the case. I started thinking about how the goals of engineering and mastering for the two processes would differ, which got me thinking about the linear nature of PCM used in most audio production. I'm not really interesting in starting a vinyl vs. CD debate (ugh), so let's try to avoid that...
So why do we use linear quantization to model phenomenona that we perceive logarithmically?
If the human perception of loudness is roughly logarithmic, why is linear PCM used for almost all digital music reproduction?
This is the mental exercise I keep playing out in my head:
Start with a certain loudness that we'll call mezzo forte, define a second loudness that is one unit of perceptual loudness more loud, called forte, and then a third loudness, fortissimo, that is one more unit of perceived loudness greater than forte.
If the perceptual space is logarithmic and we were to map these units into a linear space, the difference between fortissimo and forte would be a degree of magnitude greater than the difference between forte and mezzo forte:
Giving the three values arbitrary values in logarithmic space that make them evenly spaced apart:
mezzo forte = 3
forte = 4
fortissimo = 5
Solving for the corresponding values in linear space, assuming that loudness perception is base-10 logarithmic (which it isn't, but it makes the math easier):
mezzo forte:
3 = log(x)
x = 10^3 = 1000
forte:
4 = log(x)
x = 10^4 = 10000
fortissimo:
5 = log(x)
x = 10^5 = 100000
In the above linear space, the difference between mezzo forte and forte is (10000 - 1000) = 9000 linear units, while the difference between forte and fortissimo is (100000 - 10000) = 90000 linear units, ten times more.
If these linear units were analogous to linear PCM quantization levels, wouldn't that mean that linear PCM has considerably higher, er... perceptual loudness resolution (is there a better word for that?) at the loud end than at the quiet end?
Is there a flaw in my thinking? I have a degree for Computer Science and another for Photography, which basically means I understand the basics of how PCM works and I have a decent idea of how visual perception works, but I'm mostly talking out of my ass when I try to apply what I know towards psychoacoustics.
I starting thinking down this path while I was pondering why the dynamics (micro and macro) of vinyl sound so much better to me than those of CD audio, despite some valid technical arguments against that being the case. I started thinking about how the goals of engineering and mastering for the two processes would differ, which got me thinking about the linear nature of PCM used in most audio production. I'm not really interesting in starting a vinyl vs. CD debate (ugh), so let's try to avoid that...

So why do we use linear quantization to model phenomenona that we perceive logarithmically?







. I can understand if the practicality of implementation makes linear PCM a better solution. Is my general idea logically/mathematically sound, though?
