Volume 18, Special Issue on Information Retrieval and Web Search, 2021

Comparative Analysis for Content Defined Chunking Algorithms in Data Deduplication


D. Viji and Dr.S. Revathy

Abstract

Data deduplication works on eliminating redundant data and reducing storage consumption. Nowadays more data generated and it was stored in the cloud repeatedly, due to this large volume of storage will be consumed. Data deduplication tries to reduce data volumes disk space and network bandwidth can be to reduce costs and energy consumption for running storage systems. In the data deduplication method, data broken into small size of chunk or block. Hash ID will be calculated for all the blocks then it’s compared with existing blocks for duplication. Blocks may be fixed or variable size, compared with a fixed size of block variable size chunking gives a better result. So the chunking process is the initial task of deduplication to get an optimal result. In this paper, we discussed various content defined chunking algorithms and their performance based on chunking properties like chunking speed, processing time, and throughput.


Pages: 255-268

DOI: 10.14704/WEB/V18SI02/WEB18070

Keywords: Data Deduplication, Content-defined Chunking, Cloud Storage System.

Full Text