User Tools

Site Tools


atrac3p:atrac_spectrum_anatomy

This is an old revision of the document!


ATRAC3+ anatomy

ATRAC3+ spectrum layout

The spectrum (0..22.5kHz) for 44.1kHz sample rate is divided into 16 bands of equal spectral width of 1406,25Hz. The division into bands of same width is a direct consequence of the DCT synthesis scheme. Some of these 16 bands are further devided into bandlets, as at low frequencies the same spectral width occupies a higher relative spectrum range (i.e. a higher pitch interval). The bandlet data has to be combined into band data before the IDCT synthesis can be performed, so all processing steps in time domain work on band basis or even more coarse, while frequency-domain processing often is performed on the bandlet level.

The selection of spectral dither noise amplitude uses an even more coarse scheme of dither groups that contain several bandlets. Borders of dither groups coincide with borders of bands except for the first dither group that leaves out the first two bandlets of the first band.

The layout is like this

dither group none 0 1 2 3 4
band 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
bandlet 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Data in an ATRAC3+ substream block by level

  • Per substream data:
    • highest non-zero bandlet (used to calculate highest non-zero band and highest non-zero dither group)
    • Mute flag
  • Per channel data
  • Per dither-group data
    • Frequency-domain dither level
  • Per band data
    • Joint-stereo info
    • Envelopes
    • Tone data
    • Time-domain dither info
  • Per bandlet data
    • Spectral quantizers
    • Spectral levels
    • Coefficient encoding tree selection
    • Coefficients
atrac3p/atrac_spectrum_anatomy.1269089847.txt.gz · Last modified: 2010/03/20 12:57 by megadiscman

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki