atrac3p:serialized_tone_data
                Differences
This shows you the differences between two versions of the page.
| atrac3p:serialized_tone_data [2010/07/23 14:58] – Explain tone linking - way easier than expected ;) megadiscman | atrac3p:serialized_tone_data [2010/10/25 22:28] (current) – Fix tree name megadiscman | ||
|---|---|---|---|
| Line 3: | Line 3: | ||
| Tonal components are sine-shaped signals that get added after IMDCT on decoding. | Tonal components are sine-shaped signals that get added after IMDCT on decoding. | ||
| + | ===== Header info ===== | ||
| Tonal components have their own stereo processing, which is independent of the residual spectrum stereo processing; this is why the serialized tone data has a header that is not replicated per channel. The first bit in the tone info header selects the dynamic range. In high-dynamic-range mode, the level of each tone is chosen on a 64-step exponential amplitude scale, while in low-dynamic-range mode, only an overall level is chosen on the exponential scale, and the individual tones have their level chosen on a 16-level linear scale. | Tonal components have their own stereo processing, which is independent of the residual spectrum stereo processing; this is why the serialized tone data has a header that is not replicated per channel. The first bit in the tone info header selects the dynamic range. In high-dynamic-range mode, the level of each tone is chosen on a 64-step exponential amplitude scale, while in low-dynamic-range mode, only an overall level is chosen on the exponential scale, and the individual tones have their level chosen on a 16-level linear scale. | ||
| Line 47: | Line 48: | ||
| === 1: variable-length-encoded difference-to-master === | === 1: variable-length-encoded difference-to-master === | ||
| - | The slave pitch is encoded as difference to the master pitch with the same tone index (if present) or the master tone with the highest index that is present, or, if there are no master tones at all, as difference to 0. Difference application wraps at 1024. | + | In each band, the slave pitches of all tones are encoded as a difference to the master pitch with the same tone index in that band (if present), or to the master tone with the highest index that is present, or, if there are no master tones at all, as a difference to 0. Difference application wraps at 1024. | 
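The base selection and the wrap at 1024 described above can be sketched as follows. This is only an illustration: the function name and the list layout are hypothetical, the differences are assumed to be already decoded into signed integers, and master tones in a band are assumed to occupy consecutive indices starting at 0.

```python
PITCH_MODULUS = 1024  # difference application wraps at 1024


def slave_pitches_from_deltas(master_pitches, deltas):
    """Reconstruct the slave pitches of one band from decoded differences.

    master_pitches: master pitches of this band, in tone-index order.
    deltas: one decoded signed difference per slave tone (assumed input).
    """
    slave = []
    for index, delta in enumerate(deltas):
        if index < len(master_pitches):
            base = master_pitches[index]   # master tone with the same index
        elif master_pitches:
            base = master_pitches[-1]      # highest master index present
        else:
            base = 0                       # no master tones at all
        slave.append((base + delta) % PITCH_MODULUS)
    return slave
```

For example, with master pitches 100 and 1020 and three slave differences, the third slave tone falls back to the master tone with the highest index.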
| ===== Tone Linking ===== | ===== Tone Linking ===== | ||
| - | For the compression of level and phase information in the slave channel, tones in the slave channel are linked to tones in the master channel based on their pitch. The linking algorithm is like this: | + | For the compression of level information in the slave channel, tones in the slave channel are linked to tones in the master channel based on their pitch. The linking algorithm is like this: | 
| * Look for the tone(s) in the master channel which have the lowest absolute deviation in pitch from the current slave tone. If more than one tone with the same absolute deviation is found, pick the first one. If the absolute deviation in pitch is less than 8, link to that tone. | * Look for the tone(s) in the master channel which have the lowest absolute deviation in pitch from the current slave tone. If more than one tone with the same absolute deviation is found, pick the first one. If the absolute deviation in pitch is less than 8, link to that tone. | ||
| Line 57: | Line 58: | ||
| * Otherwise, don't link the tone at all. | * Otherwise, don't link the tone at all. | ||
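A minimal sketch of the nearest-pitch rule from the first bullet of the linking algorithm is given below. It covers only that rule and the final "don't link" fallback; any intermediate rules that this diff does not show are not modelled. The function name is hypothetical, and the deviation is assumed to be a plain absolute difference without wrap-around.

```python
LINK_THRESHOLD = 8  # link only if the absolute pitch deviation is below 8


def link_by_nearest_pitch(slave_pitch, master_pitches):
    """Return the index of the master tone to link to, or None.

    Implements only the nearest-pitch rule; further rules are not covered.
    """
    best_index = None
    best_deviation = None
    for index, pitch in enumerate(master_pitches):
        deviation = abs(pitch - slave_pitch)
        # Strict '<' keeps the first tone when several deviations tie.
        if best_deviation is None or deviation < best_deviation:
            best_index, best_deviation = index, deviation
    if best_deviation is not None and best_deviation < LINK_THRESHOLD:
        return best_index
    return None
```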
| - | ===== Tone Levels ===== | + | ===== Tone Levels | 
| - | depending on high/low dynamic range mode different, | + | In high-dynamic range mode, the level of each tone is stored on a logarithmic scale. The value, between 0 and 63, can be encoded in one of four modes, the last two being available only on the slave channel (as they refer to linked tones). | ||
| + | |||
| + | ==== Encoding Modes ==== | ||
| + | |||
| + | === 0: direct encoding === | ||
| + | The level of each tone in each band is encoded as a plain 6-bit number. | ||
| + | |||
| + | === 1: variable-length encoding === | ||
| + | The level of each tone in each band is stored using a variable-length code. The possible level values range between 20 and 51 in this case. | ||
| + | |||
| + | === 2 (only in slave channel): variable-length encoded difference to master === | ||
| + | The level of each tone in each band is stored as variable-length-encoded difference to the level of the linked tone in the master channel, which is assumed as 34 if there is no linked tone. The difference application does not wrap. | ||
| + | |||
| + | === 3 (only in slave channel): clone master channel data === | ||
| + | The level of each tone that is linked to a master tone is copied from the master tone level, unlinked tones get a default level of 32. | ||
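The two slave-only HDR modes differ only in how the linked master level is used. The sketch below illustrates that, assuming the differences have already been decoded into signed integers and that `links` holds, for each slave tone, the index of its linked master tone or `None`. All names are hypothetical.

```python
HDR_DELTA_DEFAULT_BASE = 34   # mode 2 base for tones without a linked master tone
HDR_CLONE_DEFAULT_LEVEL = 32  # mode 3 level for tones without a linked master tone


def hdr_slave_levels(mode, links, master_levels, deltas=None):
    """Compute slave tone levels for the slave-only HDR modes 2 and 3.

    links[i] is the master tone index linked to slave tone i, or None.
    deltas[i] is the decoded signed difference for slave tone i (mode 2 only).
    """
    levels = []
    for i, link in enumerate(links):
        if mode == 2:
            base = HDR_DELTA_DEFAULT_BASE if link is None else master_levels[link]
            levels.append(base + deltas[i])  # difference application does not wrap
        elif mode == 3:
            levels.append(HDR_CLONE_DEFAULT_LEVEL if link is None else master_levels[link])
        else:
            raise ValueError("only the slave-only modes 2 and 3 are sketched here")
    return levels
```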
| + | |||
| + | ===== Band Base Levels (LDR mode) ===== | ||
| + | In low-dynamic range mode, the base level of all tones in a band is described by one common value on a logarithmic scale, with each individual tone having a linearly scaled additional level. This allows more fine-grained level control if all tones in a band have approximately the same level. The encoding modes are similar, but not equivalent, to the level info of the HDR mode: | ||
| + | |||
| + | ==== Encoding Modes ==== | ||
| + | |||
| + | === 0: direct encoding === | ||
| + | The base level of each band is encoded as a plain 6-bit number. | ||
| + | |||
| + | === 1: variable-length encoding === | ||
| + | The base level of each band is stored using a variable-length code. The possible level values range between 24 and 55 in this case. | ||
| + | |||
| + | === 2 (only in slave channel): variable-length encoded difference to master === | ||
| + | The base level of each band is stored as a variable-length-encoded difference to the base level of the corresponding band in the master channel, which is assumed to be 44 if the master channel has no tones. Difference application does not wrap. | ||
| + | |||
| + | === 3 (only in slave channel): clone master channel data === | ||
| + | The base level of each band is copied from the corresponding master base level; bands without tones in the master channel get a base level of 49. | ||
| + | |||
| + | ===== Tone Levels (LDR mode) ===== | ||
| + | The individual linearly-scaled tone levels (on a scale between 0 and 15) in LDR mode are stored similarly to the logarithmic tone levels in HDR mode: | ||
| + | |||
| + | ==== Encoding Modes ==== | ||
| + | |||
| + | === 0: direct encoding === | ||
| + | The level of each tone is encoded as a plain 4-bit number. | ||
| + | |||
| + | === 1: variable-length encoding === | ||
| + | The level of each tone is encoded using a variable-length code. Bands with just one tone use a different | ||
| + | |||
| + | === 2 (only in slave channel): variable-length encoded difference to master === | ||
| + | The level of each tone in each band is stored as a variable-length-encoded difference to the level of the linked tone in the master channel, which is assumed to be 12 if there is no linked tone. Difference application wraps around. | ||
| + | |||
| + | === 3 (only in slave channel): clone master channel data === | ||
| + | The level of each tone that is linked to a master tone is copied from the master tone level, unlinked tones get a default level of 14. | ||
| ===== Tone Phase ===== | ===== Tone Phase ===== | ||
| The phase is encoded as plain 5-bit-number for each tone. | The phase is encoded as plain 5-bit-number for each tone. | ||
| + | ===== Encoding ===== | ||
| + | ==== Tone info header ==== | ||
| + | * 1 bit: Chooses high-dynamic-range mode if set, low-dynamic-range mode if clear | ||
| + | * symbol from [[tone info trees#bands with tones tree]] (in the range 1..16 bands). | ||
| + | * in two-channel substreams: | ||
| + | * 1 bit: If clear, never clone information from master to slave for any band, otherwise | ||
| + | * 1 bit: If clear, clone information from master to slave for all bands, otherwise | ||
| + | * 1 bit per band with tone data: If set, clone data for this band | ||
| + | * 1 bit: If clear, master channel is the left channel for all bands | ||
| + | * 1 bit: If clear, master channel is the right channel for all bands | ||
| + | * 1 bit per band with tone data: If set, master channel is the right channel | ||
| + | * 1 bit: If clear, use right channel as-is | ||
| + | * 1 bit: If clear, apply 180° phase shift (=negation) to right channel | ||
| + | * 1 bit per band with tone data: If set, right channel is negated. | ||
| + | * for each channel: | ||
| + | * per-channel tone info (see below) | ||
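Each of the three stereo flag groups in the header (clone, master channel selection, right-channel negation) appears to follow the same "none / all / per-band" pattern. The sketch below reads one such group. The `BitReader` class, the MSB-first bit order and the nesting of the three bits are assumptions made for illustration, not something the page states.

```python
class BitReader:
    """Hypothetical helper: reads bits MSB-first from a bytes object."""

    def __init__(self, data):
        self.data = data
        self.pos = 0

    def read(self, count):
        value = 0
        for _ in range(count):
            byte = self.data[self.pos // 8]
            value = (value << 1) | ((byte >> (7 - self.pos % 8)) & 1)
            self.pos += 1
        return value


def read_band_flags(reader, band_count):
    """Read one 'none / all / per-band' flag group as a list of booleans."""
    if reader.read(1) == 0:
        return [False] * band_count   # first bit clear: applies to no band
    if reader.read(1) == 0:
        return [True] * band_count    # second bit clear: applies to all bands
    # Both bits set: one explicit bit per band with tone data.
    return [reader.read(1) == 1 for _ in range(band_count)]
```

Under these assumptions, a two-channel header would call `read_band_flags` three times in a row, once per flag group.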
| + | |||
| + | ==== Per-channel tone info ==== | ||
| + | * per-band data: | ||
| + | * start/end positions | ||
| + | * counts of tones in the bands | ||
| + | * per-tone data: | ||
| + | * pitches | ||
| + | * if in HDR mode | ||
| + | * tone levels (HDR) | ||
| + | * otherwise | ||
| + | * band base levels (LDR) | ||
| + | * tone levels (LDR) | ||
| + | * phase | ||
| + | |||
| + | ==== Start/End Positions ==== | ||
| + | * if on slave channel: | ||
| + | * 1 bit to choose cloning mode if set, or plain encoding if clear | ||
| + | * if in plain encoding mode (always on master channel) | ||
| + | * for each band that might have tone data (all bands up to the highest band with tones in the master channel, all uncloned bands in the slave channel) | ||
| + | * 1 bit: enable start ramp, if set: | ||
| + | * 5 bits: start ramp position. | ||
| + | * 1 bit: enable end ramp, if set: | ||
| + | * 5 bits: end ramp position. | ||
| + | * otherwise (in clone mode) | ||
| + | * no data | ||
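The plain encoding of the start/end positions can be sketched as below. The bit source is a hypothetical helper working on a string of '0'/'1' characters, chosen only to keep the example short; the real bitstream reader and its bit order are not described here.

```python
def make_bit_source(bit_string):
    """Hypothetical helper: returns read(n) over a string of '0'/'1' characters."""
    position = 0

    def read(count):
        nonlocal position
        chunk = bit_string[position:position + count]
        if len(chunk) != count:
            raise EOFError("not enough bits")
        position += count
        return int(chunk, 2)

    return read


def read_ramp_positions(read, band_count):
    """Plain encoding: per band, an optional 5-bit start and end ramp position."""
    ramps = []
    for _ in range(band_count):
        start = read(5) if read(1) else None  # enable bit, then position
        end = read(5) if read(1) else None
        ramps.append((start, end))
    return ramps
```

A band without a ramp is represented as `None` in this sketch; in clone mode nothing is read at all.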
| + | |||
| + | ==== Band Tone Count ==== | ||
| + | * on master channel | ||
| + | * 1 bit: coding mode | ||
| + | * on slave channel | ||
| + | * 2 bits: coding mode | ||
| + | * encoded tone counts (see directly below) | ||
| + | |||
| + | === Coding Mode 0: direct encoding === | ||
| + | * for each band that might have tone data: | ||
| + | * 4 bit tone count | ||
| + | |||
| + | === Coding Mode 1: variable-length encoding === | ||
| + | * for each band that might have tone data: | ||
| + | * symbol from [[tone info trees#tone per band tree]] | ||
| + | |||
| + | === Coding Mode 2 (slave only): variable-length delta-to-master encoding === | ||
| + | * for each band that might have tone data: | ||
| + | * symbol from [[tone info trees#tone per band delta to master tree]], encoding the difference to master as signed 3-bit number. Delta application wraps after sign-extending to 4 bits. | ||
| + | |||
| + | === Coding Mode 3 (slave only): clone master === | ||
| + | * no data | ||
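For coding mode 2 of the band tone count, the wrap after sign extension can be illustrated as follows. The sketch assumes the decoded symbol is the 3-bit two's-complement pattern of the difference; how tree symbols map to that pattern is not specified here, and both function names are hypothetical.

```python
def sign_extend_3bit(symbol):
    """Interpret a 3-bit value (0..7) as a two's-complement number (-4..3)."""
    return symbol - 8 if symbol & 0b100 else symbol


def slave_tone_count(master_count, symbol):
    """Apply a signed 3-bit delta to the master count, wrapping at 4 bits."""
    return (master_count + sign_extend_3bit(symbol)) & 0xF
```

With these assumptions, a delta of -1 applied to a master count of 0 wraps to 15.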
| + | |||
| + | ==== Tone pitches ==== | ||
| + | * On slave channel only: | ||
| + | * 1 bit: coding mode (master always uses mode 0) | ||
| + | * encoded pitch info (see directly below) | ||
| + | |||
| + | === Coding mode 0: near-direct encoding === | ||
| + | * for each band that has a non-zero tone count: | ||
| + | * if the tone count is bigger than 1: | ||
| + | * 1 bit, chooses " | ||
| + | * for each tone in that band: | ||
| + | * up to ten bits: near-direct encoded pitch. In ascending mode, leading 1 bits of the current tone pitch that were already set in the previous pitch are omitted (note that the count of leading one bits is monotonically increasing). Analogously, | ||
| + | |||
| + | === Coding mode 1: difference-to-master === | ||
| + | * for each tone in each band that has a non-zero tone count: | ||
| + | * symbol from [[tone info trees#tone pitch delta tree]]. | ||
| + | |||
| + | ==== Tone levels (HDR) ==== | ||
| + | * on master channel | ||
| + | * 1 bit: coding mode | ||
| + | * on slave channel | ||
| + | * 2 bits: coding mode | ||
| + | * encoded level info (see directly below) | ||
| + | |||
| + | === Coding mode 0: direct encoding === | ||
| + | * for each tone in each band: | ||
| + | * 6 bits: binary encoded tone level | ||
| + | |||
| + | === Coding mode 1: variable-length encoding === | ||
| + | * for each tone in each band: | ||
| + | * symbol from [[tone info trees#HDR level tree]]. | ||
| + | |||
| + | === Coding mode 2 (slave only): variable-length encoded difference to master === | ||
| + | * for each tone in each band: | ||
| + | * symbol from [[tone info trees#HDR level difference tree]]. Use linked tone level or 34 (if not linked) as base. | ||
| + | |||
| + | === Coding mode 3 (slave only): clone master === | ||
| + | * no data | ||
| + | |||
| + | ==== Band base levels (LDR) ==== | ||
| + | * on master channel | ||
| + | * 1 bit: coding mode | ||
| + | * on slave channel | ||
| + | * 2 bits: coding mode | ||
| + | * encoded level info (see directly below) | ||
| + | |||
| + | === Coding mode 0: direct encoding === | ||
| + | * for each band: | ||
| + | * 6 bits: binary encoded base level | ||
| + | |||
| + | === Coding mode 1: variable-length encoding === | ||
| + | * for each band: | ||
| + | * symbol from [[tone info trees#LDR band level tree]]. Add **24** to the symbol number to obtain the real level. | ||
| + | |||
| + | === Coding mode 2 (slave only): variable-length encoded difference to master === | ||
| + | * for each band: | ||
| + | * symbol from [[tone info trees# | ||
| + | |||
| + | === Coding mode 3 (slave only): clone master === | ||
| + | * no data | ||
| + | |||
| + | ==== Tone levels (LDR) ==== | ||
| + | * on master channel | ||
| + | * 1 bit: coding mode | ||
| + | * on slave channel | ||
| + | * 2 bits: coding mode | ||
| + | * encoded level info (see directly below) | ||
| + | |||
| + | === Coding mode 0: direct encoding === | ||
| + | * for each tone in each band: | ||
| + | * 4 bits: binary encoded tone level | ||
| + | |||
| + | === Coding mode 1: variable-length encoding === | ||
| + | * for each band: | ||
| + | * if that band has one tone: | ||
| + | * symbol from [[tone info trees# | ||
| + | * otherwise for each tone in that band | ||
| + | * symbol from [[tone info trees#LDR level tree]]. | ||
| + | |||
| + | === Coding mode 2 (slave only): variable-length encoded difference to master === | ||
| + | * for each tone in each band: | ||
| + | * symbol from [[tone info trees#HDR level difference tree]]. Use linked tone level or 12 (if not linked) as base. | ||
| + | |||
| + | === Coding mode 3 (slave only): clone master === | ||
| + | * no data | ||
| + | |||
| + | ==== Tone phase ==== | ||
| + | * for each tone in each band: | ||
| + | * 5 bit phase value | ||
atrac3p/serialized_tone_data.1279897127.txt.gz · Last modified: 2010/07/23 14:58 by megadiscman
                
                