                           Audio
                             |
                             v
                     +---------------+
                     | RDOVAE encoder|
                     +---------------+
                             |
                             v
 +---+---+---+---+---+---+---+---+---+---+---+---+---+---+
 | L | L | L | L | L | L | L | L | L | L | L | L | L | L |
 +---+---+---+---+---+---+---+---+---+---+---+---+---+---+
 | S | S | S | S | S | S | S | S | S | S | S | S | S | S |
 +---+---+---+---+---+---+---+---+---+---+---+---+---+---+
                                       |       |       |
                                       v       |       |
             +---+---+---+---+---+---+---+     |       |
  decoder <--| L |   | L |   | L |   | L |     |       |
             +---+---+---+---+---+---+---+     |       |
                                     | S |     |       |
                                     +---+     |       |
                                               v       |
                     +---+---+---+---+---+---+---+     |
          decoder <--| L |   | L |   | L |   | L |     |
                     +---+---+---+---+---+---+---+     |
                                             | S |     |
                                             +---+     |
                                                       v
                             +---+---+---+---+---+---+---+
                  decoder <--| L |   | L |   | L |   | L |
                             +---+---+---+---+---+---+---+
                                                     | S |
                                                     +---+

          /
          |  p0                 , if X = 0
          |
   P(X) = <              |X|
          |  (1 - p0) * r       , if X != 0
          |  ---------------
          \    2 * (1 - r)
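Returning to the redundancy figure: each packet handed to the decoder appears to carry every other latent vector (L), counted back from the most recent chunk, together with the state (S) of that most recent chunk only. The following is a minimal sketch of that selection, not the reference implementation. It assumes the columns of the figure are in time order and indexed from 0 on the left; the function name redundancy_packet and the parameters num_latents and stride are hypothetical, chosen only to match the four latents per packet drawn in the figure.

```python
def redundancy_packet(newest, num_latents=4, stride=2):
    """Pick the chunks carried by one redundancy packet.

    newest      -- index of the most recent encoder output chunk
    num_latents -- how many latent vectors (L) to include (hypothetical knob)
    stride      -- spacing between included latents (every other one here)
    """
    # Walk backwards in time from the newest chunk, skipping `stride - 1`
    # chunks between picks, and drop indices that fall before the start.
    latents = [newest - stride * i for i in range(num_latents)]
    latents = [i for i in latents if i >= 0]
    # Only the state (S) of the newest chunk is included.
    return latents, newest


if __name__ == "__main__":
    # The three packets drawn in the figure end at columns 9, 11 and 13.
    for newest in (9, 11, 13):
        print(newest, redundancy_packet(newest))
```

Under these assumptions, the three packets in the figure correspond to newest = 9, 11 and 13. The latents are listed newest first, which matches a decoder that starts from the single state and works backwards through the latents.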