Volume 18, Special Issue on Information Retrieval and Web Search, 2021
Comparative Analysis for Content Defined Chunking Algorithms in Data Deduplication
D. Viji and Dr.S. Revathy
Abstract
Data deduplication works on eliminating redundant data and reducing storage consumption. Nowadays more data generated and it was stored in the cloud repeatedly, due to this large volume of storage will be consumed. Data deduplication tries to reduce data volumes disk space and network bandwidth can be to reduce costs and energy consumption for running storage systems. In the data deduplication method, data broken into small size of chunk or block. Hash ID will be calculated for all the blocks then it’s compared with existing blocks for duplication. Blocks may be fixed or variable size, compared with a fixed size of block variable size chunking gives a better result. So the chunking process is the initial task of deduplication to get an optimal result. In this paper, we discussed various content defined chunking algorithms and their performance based on chunking properties like chunking speed, processing time, and throughput.
Pages: 255-268
DOI: 10.14704/WEB/V18SI02/WEB18070
Keywords: Data Deduplication, Content-defined Chunking, Cloud Storage System.