Performance of Generalized Deduplication Under Different Input Conditions

Research output: Contribution to book/anthology/report/proceedingArticle in proceedingsResearchpeer-review


In the era of exponential growth in data, compression of data is needed to limit the growth in the required storage space and supporting infrastructure. Recently, generalized deduplication has been proposed as a better way of performing data deduplication. We use Golomb Rice transform for providing generalized deduplication. With this setup, we evalute the performance of generalized deduplication for the input conditions modeled using binomial, geometric, poisson and uniform distributions. We derive the closed form expressions for pmf of the data after generalized deduplication. For all the input conditions, generalized deduplicaton transforms the input pmf into a new pmf that is highly suitable for deduplication, reduces size of the deduplication table, provides comparable compression gain for fewer number of input chunks.
Original languageEnglish
Title of host publicationEuropean Wireless Conference : 26th European Wireless Conference
Number of pages7
Publication date2021
ISBN (Electronic)978-3-8007-5672-8
Publication statusPublished - 2021
EventEuropean Wireless 2021 - Verona , Italy
Duration: 10 Nov 202112 Nov 2021


ConferenceEuropean Wireless 2021
City Verona


Dive into the research topics of 'Performance of Generalized Deduplication Under Different Input Conditions'. Together they form a unique fingerprint.
  • Scale-loT

    Lucani Rötter, D. E.


    Project: Research

Cite this