There isn't any error or discrepancy for the discrete fourier transform (DFT). The DFT can be applied to the audio signal by 64 samples at a time or by 2048 samples at a time or by 100000 samples at a time, it doesn't matter. The inverse transform will give the audio samples back perfectly as long as the window size of the inverse transform is the same. However, there can be a discrepancy if one incorrectly assumes that the samples are periodic. By definition, the DFT shows the spectrum of a discrete
periodic signal with a period time of the chosen window size (number of samples transformed). Audio is virtually never periodic. On top of that, there still can be an error even if the analyzed signal is periodic.
Consider the following example:

The DFT is applied to a 600Hz sampled cosine wave. 48000 samples are taken every second. If we take 200 samples and calculate the DFT of that, we've effectively sampled 2 and a half periods of the cosine wave. The DFT shows the spectrum of a signal with the period time of its window size (200 samples) so once we apply the IDFT to that spectrum and add in the rest of the periods, we end up with an incorrect signal because the actual analyzed signal is periodic by 80 samples (and its integer multiples as well). As can be seen, this error shows up as "discontinuities" in the time domain.
The discontinuity is particularly bad in this case because the sampling is off by exactly half cycle of the cosine wave. This discontinuity shows as spectral leakage in the frequency domain. The spectral leakage is the direct consequence of the discontinuity so essentially the spectrum just correctly shows the quick and abrupt change of the assumed signal even though there were no such "jumps" in the originally sampled signal.
If we don't make this assumption of periodicity, we could instead take the DFT from the following 200 samples (instead of assuming they are the same 200 samples as before), apply the IDFT to that, this way we would get back the correct waveform up to the first 400 samples with no discontinuity at all.
In a more broad view, the reason we have to assume something about the signal outside of the analyzed part is because if the signal isn't completely defined in the time domain, it can't be defined in the frequency domain either. The fourier transform in its most general form is an integral over negative infinity to infinity which I'm sure you've seen plenty of times by now. Whenever the fourier transform is restricted to a time period between t1 and t2, there's an implicit assumption of the function outside of the time interval between t1 and t2. By far the most used assumption is that the function has the same period time as the analyzed interval because that is what most often gives useful results. Less often, the function is assumed to be zero instead but nonetheless, the fourier transform doesn't exist for functions that aren't defined for all the real numbers.