Comparative Analysis on Heuristic Based Load Balancing Algorithms in Grid Environment

Manthan Brahambhatt; Dakshata Panchal

doi:10.17577/IJERTV4IS041010

Volume 04, Issue 04 (April 2015)

Comparative Analysis on Heuristic Based Load Balancing Algorithms in Grid Environment

DOI : 10.17577/IJERTV4IS041010

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 76
Total Downloads : 207
Authors : Manthan Brahambhatt, Dakshata Panchal
Paper ID : IJERTV4IS041010
Volume & Issue : Volume 04, Issue 04 (April 2015)
DOI : http://dx.doi.org/10.17577/IJERTV4IS041010
Published (First Online): 23-04-2015
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Comparative Analysis on Heuristic Based Load Balancing Algorithms in Grid Environment

Manthan Brahambhatt

M.E.Student Computer Department,

St. Francis Institute of Technology Mumbai, India

Dakshata Panchal Assistant Professor Computer Department,

St. Francis Institute of Technology Mumbai, India

AbstractThe solution to increasing demands of computational resources for completing computational jobs were found in terms of grid computing. Grid computing is all new technology in the field of distributed systems, parallel computing and cluster computing. Grid environment is a collection of multiple resources, geographically widely distributed at multiple locations to reach over a common goal. This environment allows multi owner resources to solve large scale applications. The major concern and challenge in grid computing environment is to balance the load of jobs on resources over the entire network. The criticality lies in the issue of network aware load balancing algorithms that needs to be quick and dynamic. Taking dynamicity as a consideration, heuristic based algorithms are well suited for such applications. The objective of this paper is to analyze different heuristic based load balancing algorithms, such as Genetic Algorithm, ant Colony Optimization, Particle Swarm Optimization and Tabu Search.

Keywords Grid computing, load balancing, genetic algorithm,tabu search, particle swarm optimization, ant colony optimization.

INTRODUCTION

Grid computing is defined as a type of parallel and distributed system that enables the sharing, selection, and aggregation of geographically distributed autonomous resources dynamically at runtime depending on their availability, capability, performance, cost, and quality of service. A computational grid is a hardware and software infrastructure that provides a dependable, consistent and inexpensive access to high end capabilities. This grid environment supports sharing and coordinated use of resources, independently of their physical type and location that share the common goal. All resources come together to work upon a common task and achieve the maximum performance. This grid environment allows the use of geographically widely distributed and multi-owner resources to solve large scale applications.

In grid computing there are two major parties [1], namely resource consumers (users) and resource providers (facilitators). Resource consumers are the one who submit various applications to the environment, whereas resource providers are the one who share their resources with the consumers and help them to achieve their goal. With respect to the parties there are two objective functions set. These two objective functions can be classified as Application centric and Resource centric. Application centric: It functions at

application level. It aims to optimize the performance of each individual application. E.g.: makespan. Resource centric: It functions at system level. It aims to optimize the performance of resources. E.g.: throughput, utilization etc.

There are two different types of grid computing environment

a.) Homogeneous environment

b.) Heterogeneous environment

Focusing on heterogeneous environment, heterogeneity exist in two categories
- Networks that are used to interconnect computational resources may differ significantly in terms of their communication protocols and bandwidth.
- Computational resources may differ and have different computer architectures, memory size, different hardware, CPU speed, RAM capability etc. and may also have different software applications such as operating system, grid management software and so on.
This grid computing environment is used to solve large scale applications such as meteorological simulations, data intensive applications, research of DNA sequences etc. The best examples of grid computing are Garuda by C-DAC India, seti@home by university of California, Berkeley.

A typical distributed system involves the use of geographically widely distributed network. This widely distributed network involves different nodes connected to each other in the network. Each node possesses some amount of initial load which represent amount of work to be performed. This work needs to be evenly balanced in the whole network and this is called load balancing.

This paper is organized as follows. In section II, different methods of performing load balancing for grid environment is introduced. In section III, heuristic based approach of load balancing is explained. In section IV, comparative analyses of all algorithms are done. Finally, in the next session, conclusion and future work is given.
DIFFERENT METHODS OF LOAD BALANCING FOR

GRID ENVIRONMENT

As per the literature survey we got to know that load balancing mainly deals with distributing set of independent jobs among all the computing nodes of the grid such that the jobs are uniformly distributed over the entire grid computing environment and none of the nodes are under loaded or overloaded [2]. There are two methods of performing load balancing,

Static method: In Static load balancing algorithms, it assumes that a priori information about all the characteristics of jobs the computing nodes, the computing nodes and the communication network is known. It has two major disadvantages,
- Workload distribution of many applications cannot be predicted before program execution.
- Assumptions made may not apply to distributed environment.
Because of static approach cannot respond to dynamic runtime environment, it may lead to load imbalance on some nodes and significantly increase the load balancing time.

Dynamic Method: In dynamic load balancing algorithms, it attempts to use the runtime state information to make more informative decisions in sharing the system load. Though dynamic algorithms have higher runtime complexity than static algorithms, they give better performance than latter. These kinds of algorithms are preferred for heterogeneous network where network elements may vary in capacity or number at runtime.

Dynamic load balancing algorithms are further classified into two different approaches.
1. Centralized Approach: In this method only one node acts as the central controller for the whole grid computing environment. Job allocation to the all slave nodes is done by this controller. It is a simple approach and beneficial when the communication cost is less significant. This kind of approach is mainly used for small grid computing environment. The main drawbacks of this kind of approach are, it limits the scalability of the grid by becoming bottleneck and Failure of central controller node causes entire system to fail.
2. Decentralized Approach: In this approach all nodes get involved in making load balancing decision. They are scalable and have better fault tolerance. This approach is preferred because elements of network may vary in capacity or number during run time. The major drawback of this approach is that it increases the communication overhead to a large extent.
This approach is again classified into three different categories: Receiver initiated where heavily loaded nodes take the initiative, Sender initiated where lightly loaded nodes takes the initiative, symmetrically initiated combines the advantage of both the above types.
HEURISTIC BASED APPROACH FOR LOAD BALANCING

ALGORITHM

This section describes different types of heuristics and evolutionary based algorithms. These search techniques are sed to find solution to optimization and search problem [3]. These techniques are inspired from evolutionary biology and apply features such as inheritance, selection, crossover and mutation. These methods have been proved to work better as compared to classical algorithms. We will look four different evolutionary based load balancing algorithms those are Genetic Algorithm, Tabu Search, Particle Swarm optimization, Ant Colony Optimization.

Genetic Algorithm:

A Genetic Algorithm is a biologically inspired optimization and search technique [3] [4]. The behavior of Genetic Algorithm mimics the evolution of simple, single celled organisms. This algorithm is particularly useful in situations where the solution space to be searched is so huge, making sequential search, time consuming and computationally very expensive. It is a type of guided random search technique, able to find efficient solutions in variety of cases.

The evolution of the GA population from one generation to the next is usually achieved through the use of three operators that are fundamental in GA: selection, crossover, and mutation.

Selection: It is a process of selecting chromosomes from the current generation for processing to the next generation.

Crossover: Once chromosomes are selected, crossover is applied to the chosen individuals. The crossover operator usually operates on two individuals or parents to produce two children. It ensures that characteristics of each parent are inherited in the children.

Mutation: While the crossover operator works on a pair or more of chromosomes to produce two or more offspring, the mutation operator works on each individual offspring. The mutation operator helps prevent early convergence of the genetic algorithm by changing characteristics of chromosomes in the population.

Algorithm: Begin

Initialize the population, P. Evaluate P.

While stopping conditions not true do

Select Elite in P consisting of k(1<k<population size) best individuals.

Apply selection from individuals in P to create Pmating, consisting of (population size-k) individuals.

Crossover Pmating. Mutate Pmating.

Copy the whole individuals of Pmating to P, replace the worst (population size-k) individuals in P.

Evaluate P.

If Escape condition true then Escape

End While

End

Tabu search:

Tabu search (TS) was first proposed in its current form by Glover. It has been successfully applied to a wide range of theoretical and practical problems, including graph coloring, vehicle routing, job shop scheduling, course scheduling, and maximum independent set problem. One main ingredient of Tabu search (TS) is the use of adaptive memory to guide problem solving. One may argue that memory is a necessary component for intelligence, and intelligent problem solving. Tabu search uses a set of strategies and learned information to mimic human insights for problem solving, creating essentially an artificial intelligence unto itselfthough

problem specific it may be. In its most basic sense, a Tabu search can be thought of as a local search procedure, whereby it moves from one solution to a neighboring solution. In choosing the next solution to move to, however, Tabu search uses memory and extra knowledge endowed about the problem. A basic Tabu search algorithm is shown below.

velocity vector of each particle randomly by using equation

XK0 = xmin + (xmax-xmin)* r VK0 =Vmin + (Vmax-Vmin)*r

Where xmax, xmin, Vmax, Vmin are any random

values and r is random number between 0 and 1. Apply SPV rule to find the permutation for the tasks
- Step 2.:
  - Update iteration variable
- Step 3.:
  - Update inertia weight
    
    w = wend + (wstart-wend) * where = (1/ 1+(x / xmax))
    - Step 4.:
      - Update velocity
        
        V k+1= wV k +c rand () x (pbest –
        
        i i 1 1 i
    - Step 5.:
      
      sik) + c2rand2() x (gbest-sik) And update velocity of each particle
      - Update position
S k+1 = S k + V k+1

i i i

Fig 1. Tabu Search Framework

Algorithm

Input : Parameter for the Tabu Search. Output: A feasible Solution to the Problem. Begin

Generate an initial solution s.

while stopping condition not true do

select next solution neighboring s. update memory.

End

Three things are most important in Tabu search Framework i.e. Tabu list, Search Intensification, Search Diversification. For detail view please follow figure 1. of Tabu Search Framework.

Particle swarm optimization:

Particle swarm optimization (PSO) is an algorithm modelled on swarm intelligence, that finds a solution to an optimization problem in a search space, or model and predict social behaviour in the presence of objectives [5]. The PSO is a stochastic, population-based computer algorithm modelled on swarm intelligence. Swarm intelligence is based on social- psychological principles and provides insights into social behaviour, as well as contributing to engineering applications. The particle swarm optimization algorithm was first described in 1995 by James Kennedy and Russell C. Eberhart.

Algorithm:
- Step 1.:
  
  Initialization: Set the contents for this PSO
  
  And update particle of each position
- Step 6.:
  - Apply the SPV rule to find the permutation.
- Step 7.:
  - Update personal best, by evaluating the particle.
- Step 8.:
  - Update global best
- Step 9.:
  - Stopping criterion. If the number of iteration exceeds the maximum number of iteration, then stop, otherwise go to Step 2.
    
    Ant Colony Optimization:
    
    It is a meta-heuristic using artificial ant to find desirable solutions to difficult combinatorial optimization problems [6]. The behaviour of artificial ants is based on the traits of real ants as described above, plus additional capabilities that make them more effective, such as a memory of past actions. Each ant of the colony builds a solution to the problem under consideration and uses information collected on the problem characteristics and its own performance to change how other ants see the problem.
    
    Algorithm:
- Step 1.:
  - Initialize the value of , , ,, N, T, RUi and also set pheromone trails for each resource.
- Step 2.:
  - Select the next task t.
- Step 3.:
  
  j
  
  r
  
  j
  - Determine the transition probability (load) of each resource Rj as:
    
    algorithm. Define the active resource and the list
    
    of tasks. The dimension of PSO algorithm is the
    
    p (t)
    [ j
    
    (t)] *[ ]
    number of tasks. Initialize position vector and
    
    r [ r
    
    (t)] *[ ]
- Step 4.:
  - Find resource Ri with high transition probability among all resources:
  - i.e. resource Ri is having minimum load
- Step 5.:
  - Assign task t to Ri .
- Step 6.:
  - Set T = T – 1
- Step 7.:
  - Check whether any task completion or failure reported. If no, go to Step 11.
1. If number of task increases then execution time also increases
2. If number of agent increases then execution time decreases
  
  Whenever we increase number of tasks keeping all resources same and number of resources also same then resource utilization per resources also increases and hence execution time increases though overall effect of makespan of that grid remains same.
  1. Comparing with respect to number of tasks, If number of task increases then
    td>
    Resource utilization
    
    Properties
    
    GA
    
    TS
    
    PSO
    
    ACO
    
    generalized
    
    Execution time
    
    Table2. Comparison with respect to number of tasks
    - Step 8.:
      - If (task completion at any resource Ri ) then Increase pheromone of Ri as:
        
        i(t) = i(t)+
      - reporting Ri as lightly loaded.
    - Step 9.:
      
      i
      - RUi =RUi + FT t
    - Step 10.:
      - If (task failure at any resource Ri ) then decrease pheromone of Ri as:
        
        i(t) = i(t) –
      - reporting Ri as heavily loaded
    - Step 11.:
      - If (T>0) then go to Step 2.
    - Step 12.:
      - For each resource Ri , 1 i N
        
        Whenever we increase number of resources keeping number of tasks same then task distribution is done in such a manner that resource utilization per resource decreases and hence decreasing resource time effecting overall makespan time and decreases makespan too.
  2. With respect to number of resources, if number of resources increases then
Table3. Comparison with respect to number of resources

Compute RU

RUi

K

Properties

GA

TS

PSO

ACO

Resource utilization

Execution time

i N RU

K 1

Print resource utilization of Ri.

COMPARATIVE ANALYSIS

When we compare all the algorithms with their properties like working and stopping conditions the results are as follows in the following table

PROPERTIES	GA	TS	PSO	ACO
WORKING	Fitness function	Fitness function and moves made	Objective function	Pheromone value
ITERATION/ INERTIA VALUE	Evolution period	Iterations	Iteration value Inertia value	No of tasks
AGENTS	Not present	Not present	Present	Present
STOPPING CONDITIONS	Evolution period	Iterations	Iterations	No of tasks

Table1. Comparison of Properties of Heuristics and Evolutionary Based Algorithm

CONCLUSION AND FUTURE WORK

Load balancing in grid computing is really important to be taken into consideration for any grid environment. It has been convincingly proved that load balancing and task scheduling is best solved by heuristics and evolutionary algorithms. These heuristics and evolutionary algorithm s give far better results than traditional load balancing algorithms like receiver broadcasting algorithms, bidding approach, dynamic scheduling using weights etc. It may happen that these algorithms may take extra processing requirement and incur extra storage space but eventually performs better load balancing than traditional algorithms.

In future, hybridization of the techniques will improve the load balancing and utilization of the grid further. This hybrid technique then can be implemented in the real world problem on large scale and then performance can be measured such that the next generation grid computing environment must be intelligent enough and autonomous to meet requirements of self-management.

In all the following algorithms there are two things very important and common in all of them.

REFERENCES

Belabbas Yagoubi, and Meriem Meddeber, Distributed Load Balancing model for Grid Computing, Revue ARIMA journal volume. 12, pp.43-60, September 2010.
Janhavi B., Sunil Surve, and Sapna Prabhu,Comparison Of Load Balancing in a Grid, IEEE Computer Society, 2010 International

Conference on Data Storage and Data Engineering, pp. 20-23, doi:10.1109/DSDE.2010.13
R. Rajeswari, Dr. N.Kasthuri ,Comparative Survey on Load Balancing Techniques in Computational Grids , International Journal of Scientific & Engineering Research (IJSER), Volume 4, Issue 9,

September-2013 ISSN 2229-5518
Riky Subrata, Albert Y. Zomaya, Bjorn Landfeldt Artificial life techniques for load balancing in computational grids Journal of Computer and System Sciences 73 (2007), Elsevier, doi:10.1016/j.jcss.2007.02.006
Mr. P.Mathiyalagan, U.R.Dhepthie ,Dr. S.N.Sivanandam Grid Scheduling using enhanced PSO algorithm, International Journal on Computer Science and Engineering(IJCSE), Vol. 02, No. 02, 2010, ISSN : 0975-3397, 140-145.
Sandip Kumar Goyal , Manpreet Singh Adaptive and Dynamic Load Balancing in Grid Using Ant Colony Optimization International Journal of Engineering and Technology (IJET) ISSN : 0975-4024 Vol. 4 No 4 Aug-Sep 2012.

Properties	GA	TS	PSO	ACO
			generalized
Execution time

Comparative Analysis on Heuristic Based Load Balancing Algorithms in Grid Environment

Leave a Reply