TY - GEN
T1 - Elevating annotation summaries to first-class citizens in InsightNotes
AU - Ibrahim, Karim
AU - Xiao, Dongqing
AU - Eltabakh, Mohamed
N1 - Publisher Copyright: © 2015, Copyright is with the authors.
PY - 2015
Y1 - 2015
N2 - Most scientific and modern applications generate, in addition to the base data, valuable annotations and metadata at unprecedented scale and complexity. Such annotations warrant advanced annotation management techniques that not only propagate the raw annotations to end-users, but also mine, summarize, and extract useful knowledge from them. Towards this goal, we proposed the InsightNotes system, the first summary-based annotation management engine in relational databases [22]. InsightNotes relies on creating concise representations of the raw annotations, called annotation summaries. InsightNotes addresses several unique challenges related to the maintenance, propagation, and zooming of these summaries. However, a key limitation is that the annotation summaries are treated as propagate-only (report-only) objects that cannot be directly queried or manipulated. This limitation prevents higher-level applications from applying complex processing over both the base data and its attached annotation summaries, even within a single query. In this paper, we propose new extensions to InsightNotes for treating the annotation summaries as first-class citizens. We address the challenges of: (1) developing new manipulation functions and query operators specific to the annotation summaries, (2) designing summary-based index structures and access methods for efficient retrieval and predicate evaluation, and (3) extending the query optimizer to optimize queries accessing both the data and the annotation summaries. The proposed extensions not only make it feasible to natively query and manipulate the annotation summaries, but also achieve more than two orders of magnitude speedup in query evaluation.
UR - http://www.scopus.com/inward/record.url?scp=84976274807&partnerID=8YFLogxK
U2 - 10.5441/002/edbt.2015.06
DO - 10.5441/002/edbt.2015.06
M3 - Conference contribution
AN - SCOPUS:84976274807
T3 - EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings
SP - 49
EP - 60
BT - EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings
A2 - Popa, Lucian
A2 - Alonso, Gustavo
A2 - Van den Bussche, Jan
A2 - Barcelo, Pablo
A2 - Teubner, Jens
A2 - Paredaens, Jan
A2 - Ugarte, Martin
A2 - Geerts, Floris
PB - OpenProceedings.org, University of Konstanz, University Library
T2 - 18th International Conference on Extending Database Technology, EDBT 2015
Y2 - 23 March 2015 through 27 March 2015
ER -