An approximation to the greedy algorithm for differential compression of very large files

Ramesh C. Agarwal; Suchitra Amalapurapu; Shaili Jain

Publication

DCC 2004

Conference paper

Abstract

A new differential compression algorithm that combines hash-value techniques with the suffix array techniques of Manber and Myers was discussed. Earlier differential compression algorithms did not find the longest match at every offset of the version file. The performance of the greedy algorithm was based on three data structures: the block hash table, the quick index array, and the pointer array. Computing the suffix array on block hash values, rather than on the raw data, reduced the storage and computing requirements by a factor equal to the block size used. Differential compression was found to be useful in compression applications, web crawling, and computational biology.
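The idea of building a suffix array over block hashes rather than over raw bytes can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the use of CRC-32 as the block hash, the naive sort-based suffix array construction, and the block-aligned query are all assumptions made for illustration. It shows only why the index shrinks by a factor of the block size and how a longest run of matching blocks might be looked up.

```python
import zlib


def block_hashes(data: bytes, block_size: int) -> list[int]:
    """Hash each non-overlapping, full-size block of `data` (CRC-32 is an assumed choice)."""
    return [zlib.crc32(data[i:i + block_size])
            for i in range(0, len(data) - block_size + 1, block_size)]


def suffix_array(seq: list[int]) -> list[int]:
    """Naive suffix array: suffix start indices in lexicographic order of the suffixes."""
    return sorted(range(len(seq)), key=lambda i: seq[i:])


def longest_block_match(ref: list[int], sa: list[int], query: list[int]) -> tuple[int, int]:
    """Return (reference block index, number of matching blocks) for the
    longest prefix of `query` that occurs in `ref`, or (-1, 0) if none."""
    # Binary search for the first suffix that is >= query.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if ref[sa[mid]:] < query:
            lo = mid + 1
        else:
            hi = mid
    # The longest common prefix is with one of the two neighbours of that position.
    best = (-1, 0)
    for pos in (lo - 1, lo):
        if 0 <= pos < len(sa):
            start, n = sa[pos], 0
            while (n < len(query) and start + n < len(ref)
                   and ref[start + n] == query[n]):
                n += 1
            if n > best[1]:
                best = (start, n)
    return best


reference = b"AAAABBBBCCCCDDDD"
version = b"BBBBCCCCXXXX"
ref_hashes = block_hashes(reference, 4)
sa = suffix_array(ref_hashes)
match = longest_block_match(ref_hashes, sa, block_hashes(version, 4))
print(len(reference), len(sa), match)
```

With a block size of 4, the 16-byte reference yields a suffix array of only 4 entries, and the lookup reports that the version data matches two consecutive reference blocks starting at block 1. The sketch skips matches that begin at unaligned offsets in the version file and any byte-level verification of hash matches, both of which a practical tool would need to handle.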

Date

14 Jun 2004

Authors

IBM-affiliated at time of publication
