- Open Access
- Total Downloads : 483
- Authors : P.Sowjanya, P.Pushpalatha
- Paper ID : IJERTV2IS80437
- Volume & Issue : Volume 02, Issue 08 (August 2013)
- Published (First Online): 24-08-2013
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License: This work is licensed under a Creative Commons Attribution 4.0 International License
Transposed Form Fir Filter Implementation Using Reconfigurable Architectures
P.Sowjanya 1 , P.Pushpalatha 2
Department of ECE, University college of engineering, JNTU Kakinada 533003, Kakinada,Andhra Pradesh
Abstract
FIR filters require two factors Low complexity and reconfiguability it is main concern especially in case of higher order filters. Complexity is depend on coefficient multipliers in filters. Multiplication complexity is based on number of adders used for multiplication. The well known Architectures CSM based on filter coefficient partitioning and PSM implemented by usig BCSE algorithms are used in this paper to achieve the requirements. The BCSE algorithm eliminates the same bit patterns present in filter coefficients so that number of computations are reduced this will efficiently reduce the number of adders so the complexity of hardware circuitary is reduced . The proposed architectures are implemented and tested on FPGA and synthesized using Xilinx ISE. The obtained results show that this methods offers good area and power reduction compared to the existing methods.
KeywordsConstant shift method, programmable shift method, Coefficient partitioning, Binary common subexpression elimination.
-
Introduction
Finite impulse response filters are digital filters, which have a finite impulse response. FIR filters do not employ any feedbacks and are also known as non-recursive filters, convolution filters, or moving-average filters because you can express the output of a FIR filter as a finite
convolution as shown in equation ( 1)
for an extra pipeline stage for the adder of the products to achieve high throughput.
The complexity of FIR filters is mainly depend on coefficient multiplication. The two key factors that determine the complexity of coefficient multiplications in FIR filters are the number of LOs and LD. LOs represent the adders required in computing the sum of partial products in the multiplier, it will determine the area and power requirements of the filter circuit. LD is the number of adder steps in a maximal path of decomposed multiplications, which determines the speed of filtering operations. Therefore the focus of low-complexity FIR filter implementation algorithms is on reducing the number of LOs and LD in coefficient multipliers.
Many algorithms proposed in literature [4] for the implementation of low-power and high-speed FIR filters with a minimum number of LOs and LD. Among these approaches ,the CSE techniques based on binary formate produced the best hardware reduction compared to the CSD based CSE and graphical methods in [3] . The proposed CSM architecture depends on partitioning the filter coefficients into fixed groups and the PSM architecture is implemented by eliminating redundancy with the help of BCSE. CSM offer good power reduction but it will slightly increase the area requirement. PSM offer good tradeoff between power and area and by usig PSM architecture the length of the filter coefficients can be changed without any modification in hardware circuitry.
-
Existing method architecture
y h x
– ( 1 )
The existing method uses adder for the multiplication .
Where y is the output of FIR filter, h is filter coefficient and x is the input value.
The different architectures of FIR filters are 1.Direct Form FIR Filter
-
Transposed Form FIR Filter 3.Symmetric Form FIR Filter 4.Distributed Arithmetic FIR Filter
-
A variation of the direct FIR model is called the transposed FIR filter. It can be constructed from the direct form FIR filter by following the steps
-
Exchanging the input and output
-
Inverting the direction of signal flow
-
Substituting an adder by a fork
Transposed FIR filter is the preferred implementation of an FIR filter. The benefit of this filter is that there is no need
Fig.I. shows the existing method architecture.
Fig. I. Existing method architecture
In Fig.I. x[n] represents the input, h(0),h(1)—h(5)
represents the filter coefficients and
z 1 represents the
delay element. In this architecture the number of
multipliers and and adders will increase with increase in number of filter order so the complexity of FIR filter is increased. This method uses CSD representation of filter coefficients the number of unpaired bits required to represent coefficients are more compared the proposed method in [6]. The look ahead method is used in CSD fror selecting patterns and to eliminate redundant horizontal and vertical common subexpressions in filter coefficients. The general representation of CSD for the ith order filter coefficient is expressed in equation (2).
B1
Vol. 2 Issue 8, August – 2013
hi 2aij j0
-(2)
Where i,j are integers from 0 to n here n is length of filter coefficient. The number of bits required to represent filter coefficients and the complexity of FIR filter is reduced by usig proposed method.
-
Proposed Method Architecture
The proposed architectures based on transposed direct form FIR filter it is mainly consists of shift and add unit, multiplexer block and adder unit as shown in Fig. II.
Fig. III. Architecture of shift and add unit.
In the figure x>>k represents the right shift of input by k times this shift and add unit require 3 adder stages but the conventional methods of multiplication require 5 adder stages so the proposed methods CSM and PSM in [6] use shift and add units for the multiplication purpose.
-
Multiplexer unit
This unit is to multiply the output of shift and add unit with the filter coefficients stored in look up table the input is variable and filter coefficients are constants and the multiplication here is MCM in [8]. The number of multiplexers required in CSM are based on number of groups that the filter coefficient is partitioned and in PSM the number of multiplexers required depends upon the number of non zero terms in the filter coefficients.
-
Final shifter unit
-
-
This unit shifts the result after all intermediate additions are performed the output of the final shifter unit is shown in equation (3)
y 24 x 26 x 215 x 216 x After coefficient partitioning the output is y 24 (x 22 x) 215 (x 21 x)
(3)
-(4)
The final shifter unit is different for the architectures CSM and PSM for CSM the shifter is constant but for PSM the shifter is programmable.
Fig. II. Architecture of the proposed method.
The dotted portion in Fig .II represents the processing element in the proposed architectures and these processing elements are different for CSM and PSM.
2.1.Shift and add unit
Shift and add unit is similar to the multiplier this unit adds the multiplicand x to itself y number of times where y is multiplier term. The shift and add unit architecture is shown in Fig.3.
-
Architecture of Constant Shift Method
CSM architecture is implemented by partitioning the filter coefficients and those coefficients are directly stored in LUT those grouped coefficients are used as select signals for the multiplexer. The number of multiplexers required for the CSM are n/3 here n is the filter coefficient length. If filter coefficient length is 6 then number of multiplexers required are 2. For better understanding consider the below example.
h=0.111111
Number of non zero bits are 6so n=6and number of multiplexers are 6/3=2 and h can be expressed as
y 2 x 22 x 23 x 24 x 25 x 26 x
After partitioning y can be written as
y 21 (x 21 x 22 x 23 x 24 x 25 x)
The Fig. IV shows the CSM architecture.
Fig. IV. CSM Architecture
The steps involved in CSM are as follows Step 1: Take the input x.
Step 2: Read the coefficients from the LUT and use as the select signal for the multiplexers.
Step 3: Perform the final shifting function on the output of the multiplexer.
Step 4: Perform the addition of intermediate sums using the final adder unit.
Step 5: Store the final result, h*x, in the delay unit D. Step 6: Go to step 2 if the coefficients in the LUT are not finished, else go to 1
-
Architecture of Programmable Shift Method
The PSM method is implemented based on CSE algorithm. In this architecture instead of constant shifting used in CSM programmable shifters are used.Filter coefficients are coded and stored in LUT.The coding procedure is as follows
Consider h=[1010011001010011]
Assign 2=[101] and 3=[101]
By substituting the above values h will become as h=[3000020003000020]
The h will be stored in LUT as 000001101011011110 and 100111111010000000 each coefficient in coded format. Consists of sign bit, shift values . As the coded formate of filter coefficients are in coded formate the number of binary digits required to represent filter coefficients are reduced compared to the other existing methods used in [2],[3]. The architecture of PSM is shown in Fig. IV.
Vol. 2 Issue 8, August – 2013
Fig.V. PSM architecture
The steps involved in PSM implementation are as
Step 1: Obtain the BCSs from filter coefficients using CSE algorithm.
Step 2: Store the resultant coefficients in the LUT. Step 3: Take the input x.
Step 4: Read the coefficients from the LUT and use as the select signal for the multiplexers and the programmable shifters.
Step 5: Perform the final shifting function on the output of the multiplexer using PS
Step 6: Perform the addition of intermediate sums using the final adder unit.
Step 7: Store the final result in the delay unit D.
Step 8: Go to step 4 if the coefficients in the LUT are not finished, else go to 3.
-
Comparison between CSM and PSM
-
In CSM filter coefficients are grouped and are used as select signals for multiplexer.
-
In PSM coding of filter coefficients are done by using BCSE.
-
By using PSM Number off adders required are reduced to great extent compared to CSM.
-
PSM is independent on filter coefficient length but CSM is depend on length of filter coefficient
-
-
Experimental results
-
Synthesis Results
The VHDL code is developed for the proposed architectures and are synthesized on Xilinx 9.2i.The below table shows the comparison between CSM and PSM in terms of Gate count and area occupation.
Table. 1
Synthesis Results for Existing method and proposed method
Existing method
Proposed method
parameter
CSD-CSM
CSD-PSM
BCSM
BPSM
Gate count
1534
1523
1420
1404
Area
5.6
5.2
4.3
4.1
From the Table I, it is clear that the number of gates required for Proposed method are less than that of the Existing method so the hardware complexity is reduced by using Proposed architecture and it is also clear by looking into the table the area requirement is also less for the Proposed method so the proposed methods CSM and PSM are best suitable methods for the design of low complex FIR filters.
Table. 2
Comparison between CSM and PSM in terms of delay
Parameter
CSM
PSM
Power
84
48.3
Delay
9.08
9.87
From Table. 2 it is clear that the power consumption is also reduced in PSM compared to that of CSM.
-
-
Conclusion
Hence the main objective of this project is to design FIR filter with the features of reconfigurability and Low Complexity. With the proposed architectures CSM and PSM the requirements are achieved. So it is clear that these architectures are well suited to design higher order filters compared to that of the other existing methods.
-
References
cellular base stations, IEEE J. Sel. Areas Commun., vol. 17, no. 4, pp. 561573, Apr. 1999.