TrevC
Well-known member
unsleepable said:TrevC said:Setting the same volume level for all tracks means reducing the dynamic range. It's doing exactly what it's supposed to do. They would need to store the peak level for every track and apply attenuation appropriately according to that level, an impossible task. I think it works pretty well and they have it pretty much spot on. It's certainly better than that used on radio.
I think it's more complex than just storing the peak level for each track. And in my opinion, it's not only a perfectly possible task, but I also don't doubt for a second that that's actually how it's currently done.
The necessary information to normalise the volume must be sent along with each track, so that it's the player that normalises the audio. It would be pointless for the Spotify servers to do this on the fly, when there is so much CPU time available at their disposal in all the devices where their players are running. And it would be equally pointless to double their necessary storage space.
Furthermore, I can locally cache tracks for offline playback, and then enable or disable normalisation in settings and don't have to re-download all cached tracks, no?
Finally, we may understand differently normalisation and dynamic range reduction. I don't think they are really the same thing.
The only difference between the two are the time constants used. The volume levelling on Spotify uses a fairly long time constant when raising the gain, slightly shorter in reducing it, which works fine for most tracks but is caught out by longish periods of quiet music with a sudden crescendo, the beginning of Wish you were Here by Pink Floyd sounds pretty bad, for example. It's a bit like the old automatic level circuits in cassette recorders without level adjustment.. Audio compression or dynamic range reduction as used by radio stations like Radio 2 uses the shortest possible time constant that changes the levels instantly squashing the dynamic range to a few dBs. An extreme version of dynamic range crushing is used on AM radio, where audio processing units such as the Optimod reduce dynamic range to almost zero. Certain music sounds better with poor dynamic range IMO, old Motown records for example, but that might be because I always used to hear them that way. :O)