Design of Delay-Efficient Configurable Booth Multiplier for High Speed Applications

Shabeer Ahmad Ganiee; Sajad Ahmad Ganiee; Dr. Faroze Ahmad

doi:10.17577/IJERTV3IS10785

Volume 03, Issue 01 (January 2014)

Design of Delay-Efficient Configurable Booth Multiplier for High Speed Applications

DOI : 10.17577/IJERTV3IS10785

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 676
Total Downloads : 463
Authors : Shabeer Ahmad Ganiee, Sajad Ahmad Ganiee, Dr. Faroze Ahmad
Paper ID : IJERTV3IS10785
Volume & Issue : Volume 03, Issue 01 (January 2014)
Published (First Online): 28-01-2014
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Design of Delay-Efficient Configurable Booth Multiplier for High Speed Applications

Shabeer Ahmad Ganiee

Assistant Professor, ECE

Sajad Ahmad Ganiee

M.Tech (Electronic Circuit and System)

Dr. Faroze Ahmad

HOD, Electronics & Communication

IUST, Awantipora, J&K

Abstract Multipliers, play an important role in the design of microprocessor, graphical systems, multimedia systems, DSP system etc .Nearly 15 percent of total IC power is consumed by multiplication alone. It is therefore very important to have an efficient design in terms of performance, area, speed of the multiplier, and for the same Booths multiplication algorithm provides a very fundamental platform for all the new advances made for high end multipliers meant for faster multiplication with higher performance. The algorithm provides an efficient encoding of the bits during the first steps of the multiplication process. In this paper an attempt has been made to design configurable logic for 4/8/12/16-bit booth multiplier. This multiplier can be configured to perform multiplication on 4 or 8 or 12 or 16 bit operands. The multiplier will detect the range of the operands through configuration register. The configuration register can be configured through input ports. The multiplier has been synthesized on Vertex 7 technology and it has achieved a maximum combinational delay of 1.846ns.

Keywords Booth Multiplier, Booth Multiplier Algorithm, Configurable Booth Multiplier (CBM)

INTRODUCTION

Arithmetic and logic operations like addition, multiplication, exponensation play a vital role in digital circuits and have wide applications in the field of engineering. The demand for high speed processing has been increasing day by day as a result of expanding computer and signal processing applications. Higher throughput arithmetic operations are important to achieve the desired performance in many real-time signal and image processing applications [1]. Among these arithmetic operations, multiplication is the key of almost every digital circuit and finds applications in many Digital Signal Processing (DSP) systems such as Convolution, Fast Fourier Transform (FFT), filtering , in microprocessors in its arithmetic and logic unit and in graphics [2].

Digital multipliers are the most commonly used components in any digital circuit design. They are fast, flexible, reliable and efficient components that are utilized to implement any operation. Depending upon the arrangement of the components, there are different types of multipliers available. Particular multiplier architecture is chosen based on the application. Development of fast multiplier circuit has been a subject of interest over decades. Since multiplication

dominates the execution time of most DSP algorithms, so there is a need of high speed multiplier[3,4].

Currently, execution time of multiplication is still the dominant factor in determining the instruction cycle time of a DSP chip. Many multiplication algorithms have been proposed in literature to perform multiplication, each offering different advantages and having tradeoff in terms of speed, circuit complexity, area and power consumption. Reducing the time delay and power consumption are very essential requirements for many applications [1, 5].

Low power multipliers with high clock frequencies play an important role in todays digital signal processing [6,7,8]. Thats why if one also aims to minimize power consumption, it is of great interest to reduce the delay by using various delay optimizations.

Various multiplication algorithms such as Booth, Array, Wallace tree, Braun and Baugh Wooley have been proposed in literature from time to time. Among these Donald Booth made an improvement in the multiplier by reducing the number of partial products generated. Booth used desk calculators that were faster at shifting than adding and created the algorithm to increase their speed [9]. To speed up the multiplication Booth encoding performs several steps of multiplication at once. Booths multiplication algorithm takes advantage of the fact that an adder, subtractor is nearly as fast and small as a simple adder.

In this paper, an attempt has been made to configure the Booth multiplier using configuration register that can supports single 4-bit, single 8-bit, single 12-bit or single 16- bit data. This CBM depends upon the output of configuration register that can be configured through input ports.Since there are sequential and combinational multiplier implementations but combinational case will be considered here because the scale of integration is large enough to consider parallel multiplier implementations in digital VLSI systems.
BOOTH MULTIPLIER

Andrew Donald Booth in 1951, devised a multiplication algorithm which was named after his name as Booths Algorithm. Signed multiplication is a vigilant process. Through unsigned multiplication there is no need to take the sign of the number into consideration. Same procedure cannot be applied for signed multiplication due to the reason that the signed numbers are in a 2s compliment form which would

give us inaccurate result if multiplied in an analogous manner to unsigned multiplication [10].Unsigned multipliers cannot be applied to most of the multimedia and DSP applications due to their signed multiplication operation [11]. Thus here Booths algorithm comes in rescue. The Booth recording multiplier scans the three bits at a time to reduce the number of partial products generated [12].Booths algorithm conserves the sign of the end result, thus showing better performance in terms of operating speed ,time delay, power dissipation and area. From the basics of Booth Multiplication it can be proved that the addition/subtraction operation can be skipped if the successive bits in the multiplicand are same, thus reducing the delay to a greater extent.

2.1 Booth Multiplication Algorithm

Place the result so obtained from arithmetic operations in the left half of the beginning product.

Step 4: Perform an arithmetic right shift (ASR) on the entire product for each pass. After X-passes we will get the required result, where X is the number of bits in the input operands. For the above example the result can be thus obtained after five passes.

Start

Booths algorithm multiplies two signed binary numbers in 01

twos complement notation. Various steps involved in the

Test multiplier [i: i-1]
10

Booths multiplication are as:

Step 1: From the two numbers under test decide which operand will be multiplier and which will be the multiplicand

.Note that the number with smallest difference between a series of consecutive numbers is chosen as a multiplier.

For example if we have to multiply 10 (01010) and -5 (11011). For 10 (01010) we have —– 0 to 1 one change,1 to 0 one change,0 to 1 one change and 1 to 0 one change .So , in total there are four changes in binary form of 10.

00

Add multiplicand to the left half of the product and place the result in the left half of the product register

11

Subtract multiplicand to the left half of the product and place the result in the left half of the product register.

< 32 rep

For -5(11011),we have —- 1 to 1 no change,1 to 0 one change ,0 to 1 one change and 1 to 1 no change ,so in overall there are only two change in -5.

Thus -5 (11011) is chosen as the multiplier and 10 (01010) as the multilicand.

Step 2: Begin with the product that consists of the multiplier with an additional zero padding bits.Since our multiplier is 11011, after adding 5 leading zeros to the multiplier we get our beginning product.

00000 11011—— Beginning product

Step 3: Use least significant bit LSB and previous LSB to determine the arithmetic action.Intially 0 is chosen as the previous LSB.Thus our initial product and previous LSB becomes

00000 11011 0 —— initial partial product

Prior to the shifting, the multiplicand may be added to partial product, subtracted from the partial product, or left unchanged according to the following rules:
1. No arithmetic action
2. Add multiplicand to the left half of the product.
1. Subtract multiplicand from the left half of the product.
2. No arithmetic action
Arithmetic Right Shift (ASR)

= 32 rep

Done

Figure 1 Flow Chart of Booth Multiplier
CONFIGURABLE BOOTH MULTIPLIER
The configuration register will detects the effective dynamic range of input data and then generates the control signal to determine the flow of data. To simplify the implementation range detection can be realized by using the group of input bits. The data detection starts from the most significant bits, examining each four bit group. In the range detection technique, both the input 16-bit operands A [16:0] and B [16:0] are divided into four parts or subexpressions that are A [15:12], A [11:8], A [7:4] and A [3:0] and B [15:12], B
[11:8], B [7:4] and B [3:0]. The size of both operands is checked separately and simultaneously whether they come in the range of 4 bits, 8 bits, 12 bits or 16 bits using the following relation:

If (| (A[15:12]) ==1) (i.e. Performing Bitwise OR operation on four MSB.)

range A=2'b11

else if( |(A[11:8] ) ==1 ) rangeA=2'b10;

else if ( | (A[7:4] )==1) range A=2'b01;

else if ( |(A[3:0] )==1) range A=2'b00;

else if( |(A[11:8] )==1) range A=2'b10;

else if( |(A[15:12] )==1) range A=2'b11

In the similar manner range is detected for B and the final range is decided by the relation:

if (range A > range B) range=range A else

range= range B

This procedure of range detection minimizes the generation of partial products to a greater extent by suppressing the most significant bits if they are zero. Calculation will become shorter, faster and efficient..
RESULTS

Simulation Results

For the model under consideration the simulation results are carried out using Verilog HDL as simulation tool and Modelsim as simulator. Simulation results for various input operands A [15:0] and B [15:0] to obtain P [31:0] have been verified.

Figure 4 Simulation Result for A=1010 1000 0011 0011

and B= 0011 0101 0100 0000

Figure 5 Simulation Result for A=0000 0101 0100 0011 and

Before you begin to format your paper, first write and sa

B= 0000 0001 1111 1110

TABLE 1

.

RESULT FOR VARIOUS INPUTS OF DIFFERENT RANGE

A[15:0]	B[15:0]	P[31:0]	Range
1010 1000 0011 0011	0011 0101 0100 0000	11101101101111001 001101111000000	16
(Adec= -22477)	(Bdec = 13632)	(Pdec= -306406464)
0000 0101 0100 0011	0000 0001 1111 1110	00000000000010100 111101101111010	12
(Adec = 1347)	(Bdec = 510)	(Pdec= 686970)
0000 0000 0101 1111	0000 0000 0011 1010	00000000000000000 001010110000110	8
(Adec= 95)	(Bdec = 58)	(Pdec= 5510)
000 0000 0000 0110	0000 0000 0000 0111	00000000000000000 000000000101010	4
(Adec= 6)	(Bdec= 7)	(Pdec= 42)

Figure 6 Simulation Result for A=0000 0000 0101 1111 and

the content as a

B=0000 0000 0011 1010

Figure 7 Simulation Result for A=0000 0000 0000 0110 and

B=0000 0000 0000 0111

Synthesis

The multiplier has been synthesized on Vertex 7 FPGA Board. A detailed summary of devices utilized and timing summary has been shown below:

TABLE 2.

SUMMARY OF DEVICES UTILIZED IN CONFIGURABLE BOOTH MULTIPLIER

8

7

6

5

4

3

2

1

Booth Multiplier

Combinational Path Delay

7-10ns







				1.84ns

Configurable Booth

Multiplier

Parameter		Value
Latches	1-bit Latch	22
Comparators	2-bit Comparator	1
	1-bit 2-to-1 multiplexer	47
Multiplexers	2-bit 2-to-1multiplexer	13
	33-bit 3-to-1 multiplexer	1

Figure 8: Graphical representation of Combinational Delays

TABLE 3.

TIMING SUMMARY

Delay	Value
Minimum input arrival time before clock:	1.042ns
Maximum output required time after clock:	1.575ns
Maximum combinational path delay:	1.846ns

From the above analysis of delay, Configurable booth multiplier is delay efficient .The Combinational delay of Configurable booth Multiplier is 1.84ns which is quiet low than the simple booth multiplier having a delay of 7-10ns.

5:CONCLUSION

We have presented a 4/8/12/16-bit configurable booth multiplier. This multiplier can be configured to perform 4 or 8 or 12 or 16 bit operands depending upon the output of configuration register. The multiplier will detect the range of the operands through configuration register. The configuration register can be configured through input ports. This process of configuration not only reduces the combinational path delay but also reduces power consumption to a larger extent. It also deactivates the redundant switching activities in ineffective ranges as much as possible. The proposed multiplier is very suitable for portable multimedia and DSP applications which require flexible processing ability, lesser switching activity and short design cycle. The multiplier has been synthesized on Vertex 7 FPGA Board and it has achieved a maximum combinational delay of 1.846ns.

REFERENCES

Himanshu Thapliyal and Hamid R. Arabnia, A Time-Area- Power Efficient Multiplier and Square Architecture Based On Ancient Indian Vedic Mathematics, Department of Computer Science, The University of Georgia, 415 Graduate Studies Research Center Athens, Georgia 30602-7404, U.S.A.
Purushottam D. Chidgupkar and Mangesh T. Karad, The Implementation of Vedic Algorithms in Digital Signal Processing, Global J. of Engng. Educ., Vol.8, No.2 Â© 2004 UICEE Published in Australia.
Low Power and High speed 8*8 bit Multiplier using non clocked Pass Transistor Logic,C.Senthilpari Ajay Kumar Singh and K Diwadkar 14244-1355-9/2007,IEEE
Kiat-Sang Yeo and Kanshik Roy Low voltage Low Power VLSI Sub System Mc Graw- Hill Publication
E. Abu-Shama, M. B. Maaz, M. A. Bayoumi, A Fast and Low Power Multiplier Architecture, The Center for Advanced Computer Studies, The University of Southwestern Louisiana Lafayette, LA 70504.
Padmanabhan Balasubramanian and Nikos E. Mastorakis, High Speed Gate Level Synchronous Full Adder Designs, WSEAS TRANSACTIONS on CIRCUITS and SYSTEMS Issue 2, Volume 8, 290-300, February 2009.
Sanjiv Kumar Mangal,Rahul M.Badghare, FPGA Implementation of Low Power Parallel Multiplier,10th International Conference on VLSI Design,2007.
Yingtao Jiang, Abdulkarim Al-Sheraidah, Yuke Wang, Edwin Sha, and Jin-Gyun Chung, A Novel Multiplexer-Based Low-Power Full Adder, in IEEE transactions on circuits and systems vol. 51, no. 7, July 2004
L.D. Van and C. C. Yang, Generalized low-error area efficient fixed width multipliers, IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 52, no. 8, pp. 16081619, Aug. 2005.
Laxman S, Darshan Prabhu R, Mahesh S Shetty ,Mrs. Manjula BM, Dr. Chirag Sharma,FPGA Implementation of Different Multiplier Architectures, International Journal of Emerging
Shiann-Rong Kuang and Jiun-Ping Wang Design of Power efficient Configurable Booth Multiplier Vol.57,No3,Marcp010
Tam Anh Chu, Booth Multiplier with Low Power High Performance Input Circuitary, US Patent, 6.393.454 B1,May 21, 2002.
ECE/Comp. Sci. 352 Digital System Fundamentals, Project #2 (Spring 2000), Department of Electrical and Computer Engineering, University of Wisconsin Madison, April 2000

Design of Delay-Efficient Configurable Booth Multiplier for High Speed Applications

Leave a Reply