TY - BOOK
T1 - Supporting annotated relations
AU - Eltabakh, Mohamed Y.
AU - Ouzzani, Mourad
AU - Aref, Walid G.
AU - Elmagarmid, Ahmed Khalifa
AU - Laura-Silva, Y
PY - 2007
Y1 - 2007
N2 - Annotations and provenance data play a key role in under-standing and curating scientific databases. However, cur-rent database management systems lack adequate support for managing annotations and provenance data including: (1) handling annotations at multiple granularities, i.e., at the table, tuple, column and cell levels, (2) propagating an-notations along with query answers, (3) querying data based on their annotations, and (4) providing declarative ways to add, archive, and restore annotations. In this paper, we propose to treat multi-granular annotations and provenance as first class objects inside the database. We introduce the concept of "Annotated Relations" along with new operators and extended semantics for the standard relational oDera-tors in support of annotated relations. We present an ex-pressive and declarative extension to SQL to support the processing and querying of annotated tables. We study sev-eral schemes for storing and indexing annotations based on annotation granularity and annotation size. Extensions to PostgreSQL are introduced to support annotated relations and implementation challenges are discussed. Performance analysis illustrates the potential of annotated relations as they achieve up to an order-of-magnitude reduction in stor-age and I/O costs.
AB - Annotations and provenance data play a key role in under-standing and curating scientific databases. However, cur-rent database management systems lack adequate support for managing annotations and provenance data including: (1) handling annotations at multiple granularities, i.e., at the table, tuple, column and cell levels, (2) propagating an-notations along with query answers, (3) querying data based on their annotations, and (4) providing declarative ways to add, archive, and restore annotations. In this paper, we propose to treat multi-granular annotations and provenance as first class objects inside the database. We introduce the concept of "Annotated Relations" along with new operators and extended semantics for the standard relational oDera-tors in support of annotated relations. We present an ex-pressive and declarative extension to SQL to support the processing and querying of annotated tables. We study sev-eral schemes for storing and indexing annotations based on annotation granularity and annotation size. Extensions to PostgreSQL are introduced to support annotated relations and implementation challenges are discussed. Performance analysis illustrates the potential of annotated relations as they achieve up to an order-of-magnitude reduction in stor-age and I/O costs.
M3 - Commissioned report
BT - Supporting annotated relations
ER -