- Open Access
- Total Downloads : 28
- Authors : R. Pavithra, R. Kalaivani
- Paper ID : IJERTCONV4IS05012
- Volume & Issue : NCETCS – 2016 (Volume 4 – Issue 05)
- Published (First Online): 24-04-2018
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License: This work is licensed under a Creative Commons Attribution 4.0 International License
Overview of Distributed Database Indexing Techniques
R. Pavithra
Department of Computer science KMCPGS
Puducherry, India
R. Kalaivani
Department of Computer Science KMCPGS
Puducherry, India
Abstract This paper studies the concept of indexing techniques in Distributed Database Management System (DDBMS). This paper starts with small introduction about Distributed Database, its indexing concepts and the types of indexing. The main objective of this paper to find the new concept in indexing process. Finally, this paper concludes with the new idea in indexing.
Keywords Distributed Database Architecture, Indexing Techniques, StarSparse Indexing.
-
INTRODUCTION
-
Distributed Database Management System
Distributed Database is a database in which all individual devices having separate Central Processing Unit (CPU) and connected with Centralized computer network equipment. This overall system controlled by Distributed Database Management System (DDBMS) also called Distributed Database System. It may be stored in multiple computers, located in the same physical location. Unlike parallel computing systems, in which the processors are closely coupled and constitute a single database system, a distributed database system consists of loosely coupled sites that share no physical components. If updating is made in one database should automatically replicated in other databases called consistency. Consistency plays the major role in Distributed Database. Distributed Database has two processes to involve that the distributed databases for update. They are replication and duplication. Both replication and duplication should have the data in all distributive systems.
-
Characteristics of Distributed Database
-
Each individual system has its own memory.
-
Parallel processing is possible in the distributed database.
-
It should always maintain consistency over the information
stored in database.
-
It should increase reliability and availability.
-
In distributed database Distribution transparency maintained.
-
-
Architecture of Distributed Database :
The below figure 1 explains the architecture of distributed database. It consists of individual nodes called systems, and each has separate database connected through computer network.
DB DB
Node 1
Node 1
Node 2
Node 2
Computer Network System
Node 4
Node 4
Node 3
Node 3
DB DB
Fig 1: Distributed Database Architecture
-
-
INDEXING TECHNIQUES
-
Definition of Indexing
Indexing is a data structure technic to efficiently retrieve records from the database files based on some attributes on which the indexing has been done. Example, index page presents in a text book.
-
Three major categories of indexing
Primary index: Primary index is defined on an ordered data file. The data file is ordered on a key field. It is generally the primary key of the relation.
Secondary index: Secondary index may be generated from a field which is a candidate key and has a unique value in every record, or a non-key with duplicate values.
Clustering index: Clustering index is defined on an ordered data file the data file is ordered on a non-key field.
-
Classificiation of Primary Index
-
Bit Map Index: Bit map index is a special kind of index that stores the bulk of its data as bit arrays and answers most queries by performing bit wise logical operation on this bit maps.
-
Reverse Index: Reverse key index reverse the key value before entering it the index. Example: the value 47853 becomes 35874 in the index. Reversing the key value is particularly useful for indexing data such as sequence numbers where new key values monotonically increase.
-
Dense Index: A dense index in databases is a file with pairs of keys and pointers for every record in the data file. Every key in this file is associated with a particular pointer to a record in the sorted data file.
emp_id
e_name
Designa tion
salary
10500
Bala
Accoun tant
5000
28674
Joan
Attender
3000
38745
Mogan
Manager
10000
46512
Prithivraj
Sales Excutive
7000
56789
Suresh
Super visor
8000
emp_id
e_name
Designa tion
salary
10500
Bala
Accoun tant
5000
28674
Joan
Attender
3000
38745
Mogan
Manager
10000
46512
Prithivraj
Sales Excutive
7000
56789
Suresh
Super visor
8000
The below figure 2 explains the dense index, the employee details relations using emp_id as primary key.
emp_id 10500
28674
38745
46512
56789
Fig 2: employee details relation
rol_no
St_name
dept
Marks
%
11100
Adhi
Comp.Sci
90
11518
Pavithra
Comp.Sci
85
18125
Ramya
Physics
86
19133
Kalivani
History
89
21100
Suba
Maths
75
21115
Indu
Commerce
55
22289
Banu
Comp.Sci
85
23587
Nancy
Maths
75
31100
Geetha
History
72
34567
Gomathi
Biology
85
37423
Kavery
Maths
90
38574
Megala
English
88
45876
Subashini
Commerce
60
87426
Akila
commerce
75
rol_no
St_name
dept
Marks
%
11100
Adhi
Comp.Sci
90
11518
Pavithra
Comp.Sci
85
18125
Ramya
Physics
86
19133
Kalivani
History
89
21100
Suba
Maths/p>
75
21115
Indu
Commerce
55
22289
Banu
Comp.Sci
85
23587
Nancy
Maths
75
31100
Geetha
History
72
34567
Gomathi
Biology
85
37423
Kavery
Maths
90
38574
Megala
English
88
45876
Subashini
Commerce
60
87426
Akila
commerce
75
-
Sparse index: Sparse index in database is a file with pairs of keys and pointers for every block in the data file. Every key in the file is associated with a particular pointer to the block in the sorted data file. In clustered indices with the duplicate keys the sparse index points to the lowest search key in a search in each block. Below the figure 3 illustrate the sparse index search process.
The above diagram student details relation use the sparse index to search the records within particular range for example the range rol_no 11100 to 21100 and final range between 31100. It does not search the records above the range. For example rol_no 45876 and 87426. Each number range is considered a Block so each Block as its records only. The below figure 4 is illustrate blocks of sparse index.
.
.
Block 0
Block 1
Fig 4: Sparse index blocks
-
Multilevel Index: Multilevel index first breaks the entire index into two again the first patitions is devided into two.If the database is grows automatically the number of indices also grows. Multilevel indexing processed like linear search process. The below figure 5 illustrate the structure of multilevel indexing.
Block 0
rol_no 11100
21100
31100
Figure 3: Sparse Index Search Process
Blocks 04
Block2
Block 4
Block 1
Block 2
Block 3
Fig 5: Multilevel Indexing structure
-
-
Disadvantages of sparse and multilevel indexing
-
The major disadvantage of multilevel indexing is it requires more storage space than other techniques.
-
When the database is large the number of indices required also increase.
-
In sparse indexing it checks the blocks in sequential order when the number is out of its range finding of record is becomes difficult process.
-
The time spent to search the particular range also become waste.
So, to overcome the disadvantage of sparse indexing this paper produce the new concept called StarSparse indexing. Star Sparce indexing process also include sparse Index with some added features.
-
-
STARSPARSE INDEX
-
INTRODUCTION
In this star parse indexing, flag is used when the process of indexing started , if the flag is set to 1 or true that particular block only gets permission to further indexing process. All the blocks having the records between some ranges this range is considered as maximum value and minimum value. The search records are between the maximum and minimum value mean the indexing process successfully completed. In other words the flag set to 1.Oherwise the flags set to 0 mean the search records are not present that maximum and minimum value
-
STUCTURE OF STARSPARSE INDEX
-
ALGORITHM FOR STARSPARSE INDEX
The algorithm of StarSparse indexing consists of the variables such as search records , flag.
StarSparse( sr, b[],flag)
{
sr= searchrecord;
/ search record assigned by the user.
b[n]= {block1,block2,block3}
/block are assigned to array b.n is assigned to number of
/ blocks. for(i=0;b[i]<=n;i++)
{
If (sr<max && sr>min) then print(block i consist the search record) flag=1;
else
print (block i does not consist the search record)
}
return block;
}
End;
-
EXAMPLE OF STARSPARSE INDEX
This example contains the search record as number 28579,
there are 7 blocks and maximum minimum value as 10000 to
30000.
-
Steps followed in StarSparse Index
-
Assign the variable to
Block 1
Block 1
StarSparse Indexing
Block 1
Block 1
sr=28579,max=30000 and min=10000,n=7.
-
Send the max ,min values to all the 7 blocks.
-
Each block check there header with max and min range.
-
If any block header match with the max and min value then, that particular block flag is set to 1.
-
Here the block 1 header has matched with the max and min values.
-
So, block1s flag is 1.
-
The further search process is take place in block 1 only not remaining blocks.
-
-
Flowchart of the above example:
-
-
The figure 7 illustrates the flowchart of above example.
Block 1
Fig 6: StarSparse index structure
start
CONCLUSION
Assign sr=28579, max=30000,min=10000,n=7
Assign sr=28579, max=30000,min=10000,n=7
This paper tried to give a new concept about indexing. StarSparse indexing is a useful concept for indexing process using this finding a particular record easily. It contains only few additional features of sparse indexing. So, it can be easy to implement. It effectively reduces the amount of time requires to overall indexing process.
REFERENCES:
flag=1
Search record 28579
Search record 28579
Find record 28579
If block header
=
max min
flag=0
stop
-
Cyrus Shahabi, Latifur Khan, Dennis Mcleod. A probe based techniques to optimize join queries in distributed internet bases, knowledge and information Systems.2000, 2.
For(i=0;b[i]<=n;i++)
For(i=0;b[i]<=n;i++)
-
Syam Menon. Allocating fragments in distributed databases[J]. IEEE transactions on parallel and distributed Systems, 2005 16: 5577-585.
-
TSAIPSM CHINALP, Optimizing queries with foreign function in a distributed environment, IEEE trans on knowledge and Data Eng. 2002, 14(2):809-824.
-
LEEE Chiang, CHIH Chi-Sheng, CHEN yaw-huei. Optimizing large join queries using a graph based approach, IEEE Trans on Knowledge and Data Eng.2001, 13(20:298-315
-
S. Pramanik, D. Vineyard. Optimization Join Queries in Distribute Database. IEEE TSE. 2004.14(9):19:1326
-
Peng- Yeng Yin, Shiuh_Sheng Yu, Pei-Pei Wang and Yi-Te Wang. A hybrid particle swarm optimization algorithm for optimal task assignment in distributed systems[J].Computer Standards & Interfaces, Volume 28, issue 4, April 2006,444-450.
-
D. Kossamann, K. Stocker. Iterative Dynamic Programming: A New Class of Query Optimization Algorithms [J]. ACM Transactions on Database Systems, 2000, 25(1):43-82.
-
Prof. (Dr.) Anand K. Tripathi and Mrs. Monica Tripathi: A Frame: Why Information System Fails, Proceeding of the National Conference on Managing complexity in Information World, publication Bhilai Institute of Technology, Durg,Bhilai(CG) dated 7-8 Nov- 2008. Edition-1,vlo-1,pf 112- 115.
-
Prof. (Dr) Anadn K.Tripathi and Mrs. Monica Tripathi AICTE Sponsored, National Conference on Next generation Computing & iInformation systems at Model Institute of Engineering &
Fig 7: flow chart of example
E . ADVANTAGES OF STARSPARSE INDEXING
-
The traditional sparse indexing checks all the blocks and all the records but the StarSparse index check only one block.
-
The time required for searching procss is less than sparse index.
-
The best case search time required the number records present in i th block.
-
It requires same data structure used in sparse indexing.
Technilogy, Jammu Feb.14-15,2009.edition2,vol-2,pg.724-731
-
- [Balt82] Balter. R.,Beard, P., and Decitre. P., why Control of the concurrency Level in Distributed Systems is More Fundamental than Deadlock Management, Proc . 1 st ACM SIGACT-SIGOPS Symp.on Principles of Dist.Comp., Aug 1982.
-
J. Clerk Maxwell, A Treatise on Electricity and Magnetism, 3rd
ed., vol-1., Oxford: Clarendom, 1892, pp.68-73.
-
I. S Jacobs and C.P Bean, Fine Particuls , thin films and exchange anisotroy,in Magnetism vol. III G.T Rado and H. Suhl,
Eds. New York: Academic 1963,pp 271-350
-
K. Elissa, Title of Paper if Known, unpublished.