Case Study: A Serial Pass Mode Architecture for BPC

Case Study: A Serial Pass Mode Architecture for BPC

A simplified block diagram of the serial pass mode EBCOT tier-1 architecture [3,4] is presented in Figure 81.19. To begin, the “Memories” block actually contains three individual memories, each the same dimensions as the code-block and 1-bit per entry. The significant status memory (σ) maintains

VLSI Architectures for JPEG 2000 EBCOT-Design Techniques and Challenges-0084

VLSI Architectures for JPEG 2000 EBCOT-Design Techniques and Challenges-0085

information on the significance status of each bit in the current bit-plane of the code-block. The second memory, first refinement status memory (σmr), uses a single bit to indicate whether or not the coefficient has been processed during an earlier magnitude refinement pass. The final memory subblock, new significant status memory (σnew), contains the most recent updates to the coefficient’s significance status and is similar to σ. For example, once a coefficient is found significant, the corresponding bit in σnew is set to indicate its new significance status. At the start of a bit-plane, the contents of σnew are copied into σ to reflect all status changes that occurred in the previous bit-plane. Using the column-processing concept, the “neighbor processing” block reads the data from the memories and calculates H, V, and D for each bit in a column simultaneously. These values along with the significance information of the coefficient are used to determine which coefficients are coded during the current pass (NBC) and thus enable pixel skipping. The “context formation” block contains the circuitry for generating the context labels for those NBC bits. The control unit included several submodules that manage the operation of the entire architecture.

One obstacle encountered with column processing is that the state variable processing element (PE) produces the NC for each of the four coefficients in parallel. However, the PEs for context formation operate in serial. To resolve this discrepancy, the selector block uses the NBC information to serially choose which of the coefficients will be sent to the context formation PEs. It is possible to implement parallel processing in the context formation blocks; however, this requires several duplications of the hardware, which may be underutilized when only a few coefficients from a column are coded during a particular pass.

The components of this architecture were implemented using Synopsis and 0.5 µm technology. The results are shown in Table 81.3.

Comments

Popular posts from this blog

SRAM:Decoder and Word-Line Decoding Circuit [10–13].

ASIC and Custom IC Cell Information Representation:GDS2

Timing Description Languages:SDF