## Ordered and Unordered Top-K Range Reporting in Large Data Sets

Research output: Contribution to book/anthology/report/proceedingArticle in proceedingsResearchpeer-review

• Department of Computer Science
We study the following problem: Given an array A
storing N real numbers, preprocess it to allow fast
reporting of the K smallest elements in the subarray
A[i, j] in sorted order, for any triple (i, j,K) with 1 ≤ i ≤ j ≤ N and 1 ≤ K ≤ j − i + 1. We are interested
in scenarios where the array A is large, necessitating an
I/O-efficient solution.
For a parameter f with 1 ≤ f ≤ logm n, we
construct a data structure that uses O((N/f) logm n)
space and achieves a query bound of O(logB N +
fK/B) I/Os,1 where B is the block size, M is the
size of the main memory, n := N/B, and m :=
M/B. Our main contribution is to show that this
solution is nearly optimal. To be precise, we show that
achieving a query bound of O(logα n + fK/B) I/Os,
for any constant α, requires ΩN f−1 logM n
log(f−1 logM n) space,
assuming B = Ω(logN). ForM ≥ B1+ε, this is within a
log logm n factor of the upper bound. The lower bound
assumes indivisibility of records and holds even if we
assume K is always set to j − 1 + 1.
We also show that it is the requirement that the
K smallest elements be reported in sorted order which
makes the problem hard. If the K smallest elements
in the query range can be reported in any order, then
we can obtain a linear-size data structure with a query
Original language English Proceedings of the 22nd Annual ACM-SIAM Symposium on Discrete Algoritms. SODA 2011 11 Society for Industrial and Applied Mathematics 2011 390-400 Published - 2011 Symposium on Discrete Algorithms. SODA 2011 - Duration: 17 Dec 2010 → …Conference number: 22

### Conference

Conference Symposium on Discrete Algorithms. SODA 2011 22 17/12/2010 → …

Citationformats

ID: 22229106