Aarhus University Seal

Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings

Research output: Contribution to book/anthology/report/proceedingArticle in proceedingsResearchpeer-review

Standard

Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings. / Sehat, Hadi; Pagnin, Elena; Lucani Rötter, Daniel Enrique.

ICC 2021 - IEEE International Conference on Communications, Proceedings. IEEE, 2021.

Research output: Contribution to book/anthology/report/proceedingArticle in proceedingsResearchpeer-review

Harvard

Sehat, H, Pagnin, E & Lucani Rötter, DE 2021, Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings. in ICC 2021 - IEEE International Conference on Communications, Proceedings. IEEE, IEEE International Conference on Communications, 14/06/2021. https://doi.org/10.1109/ICC42927.2021.9500816

APA

Sehat, H., Pagnin, E., & Lucani Rötter, D. E. (2021). Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings. In ICC 2021 - IEEE International Conference on Communications, Proceedings IEEE. https://doi.org/10.1109/ICC42927.2021.9500816

CBE

Sehat H, Pagnin E, Lucani Rötter DE. 2021. Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings. In ICC 2021 - IEEE International Conference on Communications, Proceedings. IEEE. https://doi.org/10.1109/ICC42927.2021.9500816

MLA

Sehat, Hadi, Elena Pagnin, and Daniel Enrique Lucani Rötter "Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings". ICC 2021 - IEEE International Conference on Communications, Proceedings. IEEE. 2021. https://doi.org/10.1109/ICC42927.2021.9500816

Vancouver

Sehat H, Pagnin E, Lucani Rötter DE. Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings. In ICC 2021 - IEEE International Conference on Communications, Proceedings. IEEE. 2021 doi: 10.1109/ICC42927.2021.9500816

Author

Sehat, Hadi ; Pagnin, Elena ; Lucani Rötter, Daniel Enrique. / Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings. ICC 2021 - IEEE International Conference on Communications, Proceedings. IEEE, 2021.

Bibtex

@inproceedings{eef55b275f97402890f8537424114727,
title = "Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings",
abstract = "This paper proposes Yggdrasil, a protocol for privacy-aware dual data deduplication in multi-client settings. Yggdrasil is designed to reduce Cloud storage space while safeguarding the privacy of Clients' data. This is achieved by exploiting a dual setting, where both the Cloud and the Clients store a fraction of the data. Yggdrasil combines two innovative techniques to achieve this goal. First, generalized deduplication, an emerging technique to reduce data footprint; second, nondeterministic lightweight transformations that insures a high level of privacy while improving the degree of cross-user data compression in the Cloud. Our Client preprocessing guarantees that an honest-but-curious Cloud storage provider faces a high degree of uncertainty as to determine what the original Clients' data are. We introduce an uncertainty metric to measure the privacy of the Client's outsourced data and three compression metrics to investigate the compression potential of Yggdrasil. Our experiments with a dataset of DVI files show that Yggdrasil achieves an overall compression rate of 43%, which means that Yggdrasil can represent the same database using less than half of the original space. Moreover, for the same experiment Clients only store 17% of the original data, the Cloud hosts the remaining 26%, and the Client preprocessing ensures each outsourced fragment has 10^293 possible original strings. Higher uncertainty is possible, with the expanse of reducing the compression capability.",
keywords = "Data Compression, Data Privacy, Deduplication, Generalized Deduplication",
author = "Hadi Sehat and Elena Pagnin and {Lucani R{\"o}tter}, {Daniel Enrique}",
year = "2021",
month = jun,
doi = "10.1109/ICC42927.2021.9500816",
language = "English",
booktitle = "ICC 2021 - IEEE International Conference on Communications, Proceedings",
publisher = "IEEE",
note = "IEEE International Conference on Communications, Icc 2021 ; Conference date: 14-06-2021 Through 23-06-2021",

}

RIS

TY - GEN

T1 - Yggdrasil: Privacy-Aware Dual Deduplication in Multi Client Settings

AU - Sehat, Hadi

AU - Pagnin, Elena

AU - Lucani Rötter, Daniel Enrique

PY - 2021/6

Y1 - 2021/6

N2 - This paper proposes Yggdrasil, a protocol for privacy-aware dual data deduplication in multi-client settings. Yggdrasil is designed to reduce Cloud storage space while safeguarding the privacy of Clients' data. This is achieved by exploiting a dual setting, where both the Cloud and the Clients store a fraction of the data. Yggdrasil combines two innovative techniques to achieve this goal. First, generalized deduplication, an emerging technique to reduce data footprint; second, nondeterministic lightweight transformations that insures a high level of privacy while improving the degree of cross-user data compression in the Cloud. Our Client preprocessing guarantees that an honest-but-curious Cloud storage provider faces a high degree of uncertainty as to determine what the original Clients' data are. We introduce an uncertainty metric to measure the privacy of the Client's outsourced data and three compression metrics to investigate the compression potential of Yggdrasil. Our experiments with a dataset of DVI files show that Yggdrasil achieves an overall compression rate of 43%, which means that Yggdrasil can represent the same database using less than half of the original space. Moreover, for the same experiment Clients only store 17% of the original data, the Cloud hosts the remaining 26%, and the Client preprocessing ensures each outsourced fragment has 10^293 possible original strings. Higher uncertainty is possible, with the expanse of reducing the compression capability.

AB - This paper proposes Yggdrasil, a protocol for privacy-aware dual data deduplication in multi-client settings. Yggdrasil is designed to reduce Cloud storage space while safeguarding the privacy of Clients' data. This is achieved by exploiting a dual setting, where both the Cloud and the Clients store a fraction of the data. Yggdrasil combines two innovative techniques to achieve this goal. First, generalized deduplication, an emerging technique to reduce data footprint; second, nondeterministic lightweight transformations that insures a high level of privacy while improving the degree of cross-user data compression in the Cloud. Our Client preprocessing guarantees that an honest-but-curious Cloud storage provider faces a high degree of uncertainty as to determine what the original Clients' data are. We introduce an uncertainty metric to measure the privacy of the Client's outsourced data and three compression metrics to investigate the compression potential of Yggdrasil. Our experiments with a dataset of DVI files show that Yggdrasil achieves an overall compression rate of 43%, which means that Yggdrasil can represent the same database using less than half of the original space. Moreover, for the same experiment Clients only store 17% of the original data, the Cloud hosts the remaining 26%, and the Client preprocessing ensures each outsourced fragment has 10^293 possible original strings. Higher uncertainty is possible, with the expanse of reducing the compression capability.

KW - Data Compression

KW - Data Privacy

KW - Deduplication

KW - Generalized Deduplication

U2 - 10.1109/ICC42927.2021.9500816

DO - 10.1109/ICC42927.2021.9500816

M3 - Article in proceedings

BT - ICC 2021 - IEEE International Conference on Communications, Proceedings

PB - IEEE

T2 - IEEE International Conference on Communications

Y2 - 14 June 2021 through 23 June 2021

ER -