APPLICATION OF HADOOP MAPREDUCE TECHNIQUE TO VIRTUAL DATABASE SYSTEM DESIGN

Abstract

Today in the world of cloud and grid computing integration of data from heterogeneous databases is inevitable. Virtual Database Technology (VDB) is one of the effective solutions for integration of data from heterogeneous sources. This will become complex when size of the database is very large. MapReduce is a new framework specifically designed for processing huge datasets on distributed sources. Apache’s Hadoop is an implementation of MapReduce. Currently Hadoop has been applied successfully for file based datasets. This paper proposes to utilize the parallel and distributed processing capability of Hadoop MapReduce for handling heterogeneous query execution on large datasets. So, Virtual Database Engine built on top of this will result in effective high performance distributed data integration.

Keywords: Database integration, Hadoop MapReduce, Heterogeneous databases, Query optimization, Virtual Database Technology

Article Review Status: Published

Pages: 1-7 (Download PDF)

Creative Commons Licence
GB Journals by www.gbjournals.org is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

  • Our Journal Publishing Partners