ROLE
BOOKMARK & SHARE
|
|
1
Mison: a fast JSON parser for data analytics
Yinan Li, Nikos R. Katsipoulakis, Badrish Chandramouli, Jonathan Goldstein,
Donald Kossmann
June 2017
Proceedings of the VLDB Endowment: Volume 10 Issue 10, June 2017
Publisher: VLDB Endowment
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 0, Downloads (12 Months): 0, Downloads (Overall): 0
Full text available:
PDF
The growing popularity of the JSON format has fueled increased interest in loading and processing JSON data within analytical data processing systems. However, in many applications, JSON parsing dominates performance and cost. In this paper, we present a new JSON parser called Mison that is particularly tailored to this class ...
2
Concerto: A High Concurrency Key-Value Store with Integrity
May 2017
SIGMOD '17: Proceedings of the 2017 ACM International Conference on Management of Data
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 300, Downloads (12 Months): 327, Downloads (Overall): 327
Full text available:
PDF
Verifying the integrity of outsourced data is a classic, well-studied problem. However current techniques have fundamental performance and concurrency limitations for update-heavy workloads. In this paper, we investigate the potential advantages of deferred and batched verification rather than the per-operation verification used in prior work. We present Concerto , a ...
Keywords:
key-value stores, main memory, merkle trees, recovery, concurrency, verification, indexing, integrity, secure hardware
3
ParTime: Parallel Temporal Aggregation
Markus Pilman,
Martin Kaufmann,
Florian Köhl,
Donald Kossmann,
Damien Profeta
June 2016
SIGMOD '16: Proceedings of the 2016 International Conference on Management of Data
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 21, Downloads (12 Months): 319, Downloads (Overall): 319
Full text available:
PDF
This paper presents ParTime, a parallel algorithm for temporal aggregation. Temporal aggregation is one of the most important, yet most complex temporal query operators. It has been extensively studied in the past, but so far there has only been one attempt to parallelize this operator. ParTime supports data parallelism and ...
Keywords:
query processing, in-memory databases, shared scans, temporal data, temporal query processing
4
Daniel Abadi,
Rakesh Agrawal,
Anastasia Ailamaki,
Magdalena Balazinska,
Philip A. Bernstein,
Michael J. Carey,
Surajit Chaudhuri,
Jeffrey Dean,
AnHai Doan,
Michael J. Franklin,
Johannes Gehrke,
Laura M. Haas,
Alon Y. Halevy,
Joseph M. Hellerstein,
Yannis E. Ioannidis,
H. V. Jagadish,
Donald Kossmann,
Samuel Madden,
Sharad Mehrotra,
Tova Milo,
Jeffrey F. Naughton,
Raghu Ramakrishnan,
Volker Markl,
Christopher Olston,
Beng Chin Ooi,
Christopher Ré,
Dan Suciu,
Michael Stonebraker,
Todd Walter,
Jennifer Widom
January 2016
Communications of the ACM: Volume 59 Issue 2, February 2016
Publisher: ACM
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 68, Downloads (12 Months): 4,249, Downloads (Overall): 11,146
Database researchers paint big data as a defining challenge. To make the most of the enormous opportunities at hand will require focusing on five research areas.
5
MQJoin: efficient shared execution of main-memory joins
January 2016
Proceedings of the VLDB Endowment: Volume 9 Issue 6, January 2016
Publisher: VLDB Endowment
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 3, Downloads (12 Months): 43, Downloads (Overall): 65
Full text available:
PDF
Database architectures typically process queries one-at-a-time, executing concurrent queries in independent execution contexts. Often, such a design leads to unpredictable performance and poor scalability. One approach to circumvent the problem is to take advantage of sharing opportunities across concurrently running queries. In this paper we propose Many-Query Join (MQJoin), a ...
6
Indexing and selecting hierarchical business logic
Alessandra Loro,
Anja Gruenheid,
Donald Kossmann,
Damien Profeta,
Philippe Beaudequin
August 2015
Proceedings of the VLDB Endowment - Proceedings of the 41st International Conference on Very Large Data Bases, Kohala Coast, Hawaii: Volume 8 Issue 12, August 2015
Publisher: VLDB Endowment
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 1, Downloads (12 Months): 12, Downloads (Overall): 15
Full text available:
PDF
Business rule management is the task of storing and maintaining company-specific decision rules and business logic that is queried frequently by application users. These rules can impede efficient query processing when they require the business rule engine to resolve semantic hierarchies. To address this problem, this work discusses hierarchical indexes ...
7
On the Design and Scalability of Distributed Shared-Data Databases
May 2015
SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data
Publisher: ACM
Bibliometrics:
Citation Count: 4
Downloads (6 Weeks): 27, Downloads (12 Months): 287, Downloads (Overall): 1,039
Full text available:
PDF
Database scale-out is commonly implemented by partitioning data across several database instances. This approach, however, has several restrictions. In particular, partitioned databases are inflexible in large-scale deployments and assume a partition-friendly workload in order to scale. In this paper, we analyze an alternative architecture design for distributed relational databases that ...
Keywords:
decoupled storage, transaction processing, optimistic concurrency control, shared-data
8
Analytics in Motion: High Performance Event-Processing AND Real-Time Analytics in the Same Database
Lucas Braun,
Thomas Etter,
Georgios Gasparis,
Martin Kaufmann,
Donald Kossmann,
Daniel Widmer,
Aharon Avitzur,
Anthony Iliopoulos,
Eliezer Levy,
Ning Liang
May 2015
SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 14, Downloads (12 Months): 132, Downloads (Overall): 715
Full text available:
PDF
Modern data-centric flows in the telecommunications industry require real time analytical processing over a rapidly changing and large dataset. The traditional approach of separating OLTP and OLAP workloads cannot satisfy this requirement. Instead, a new class of integrated solutions for handling hybrid workloads is needed. This paper presents an industrial ...
Keywords:
oltp/olap engine, analytics, event-processing
9
May 2015
SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data
Publisher: ACM
Bibliometrics:
Citation Count: 9
Downloads (6 Weeks): 29, Downloads (12 Months): 287, Downloads (Overall): 909
Full text available:
PDF
Database systems running on a cluster of machines, i.e. rack-scale databases, are a common architecture for many large databases and data appliances. As the data movement across machines is often a significant bottleneck, these systems typically use a low-latency, high-throughput network such as InfiniBand. To achieve the necessary performance, parallel ...
Keywords:
distributed query processing, join processing with rdma, rack scale databases, distributed join
10
StoryPivot: Comparing and Contrasting Story Evolution
May 2015
SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 9, Downloads (12 Months): 49, Downloads (Overall): 175
Full text available:
PDF
As the world evolves around us, so does the digital coverage of it. Events of diverse types, associated with different actors and various locations, are continuously captured by multiple information sources such as news articles, blogs, social media etc. day by day. In the digital world, these events are represented ...
Keywords:
event management, event processing, story detection, story evolution
11
Daniel Abadi,
Rakesh Agrawal,
Anastasia Ailamaki,
Magdalena Balazinska,
Philip A. Bernstein,
Michael J. Carey,
Surajit Chaudhuri,
Jeffrey Dean,
AnHai Doan,
Michael J. Franklin,
Johannes Gehrke,
Laura M. Haas,
Alon Y. Halevy,
Joseph M. Hellerstein,
Yannis E. Ioannidis,
H. V. Jagadish,
Donald Kossmann,
Samuel Madden,
Sharad Mehrotra,
Tova Milo,
Jeffrey F. Naughton,
Raghu Ramakrishnan,
Volker Markl,
Christopher Olston,
Beng Chin Ooi,
Christopher Ré,
Dan Suciu,
Michael Stonebraker,
Todd Walter,
Jennifer Widom
December 2014
ACM SIGMOD Record: Volume 43 Issue 3, September 2014
Publisher: ACM
Bibliometrics:
Citation Count: 5
Downloads (6 Weeks): 14, Downloads (12 Months): 142, Downloads (Overall): 888
Full text available:
PDF
Every few years a group of database researchers meets to discuss the state of database research, its impact on practice, and important new directions. This report summarizes the discussion and conclusions of the eighth such meeting, held October 14- 15, 2013 in Irvine, California. It observes that Big Data has ...
12
Distributed snapshot isolation: global transactions pay globally, local transactions pay locally
Carsten Binnig,
Stefan Hildenbrand,
Franz Färber,
Donald Kossmann,
Juchang Lee,
Norman May
December 2014
The VLDB Journal — The International Journal on Very Large Data Bases: Volume 23 Issue 6, December 2014
Publisher: Springer-Verlag New York, Inc.
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 4, Downloads (12 Months): 43, Downloads (Overall): 191
Full text available:
PDF
Modern database systems employ Snapshot Isolation to implement concurrency control and isolationbecause it promises superior query performance compared to lock-based alternatives. Furthermore, Snapshot Isolation never blocks readers, which is an important property for modern information systems, which have mixed workloads of heavy OLAP queries and short update transactions. This paper ...
Keywords:
Snapshot isolation, Concurrency control, Distributed databases
13
High availability, elasticity, and strong consistency for massively parallel scans over relational data
August 2014
The VLDB Journal — The International Journal on Very Large Data Bases: Volume 23 Issue 4, August 2014
Publisher: Springer-Verlag New York, Inc.
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 1, Downloads (12 Months): 21, Downloads (Overall): 157
Full text available:
PDF
An elastic and highly available data store is a key component of many cloud applications. Existing data stores with strong consistency guarantees are designed and optimized for small updates, key-value access, and (if supported) small range queries over a predefined key column. This raises performance and availability problems for applications ...
Keywords:
Dynamic partial replication, High availability, Scan-only query processing, Atomic multicast
14
Randomly Partitioned Encryption for Cloud Databases
July 2014
DBSec 2014: Proceedings of the 28th Annual IFIP WG 11.3 Working Conference on Data and Applications Security and Privacy XXVIII - Volume 8566
Publisher: Springer-Verlag New York, Inc.
With the current advances in Cloud Computing, outsourcing data has never been so tempting. Along with outsourcing a database comes the privacy versus performance discussion. Order-Preserving Encryption OPE is one of the most attractive techniques for database encryption since it allows to execute range and rank queries efficiently without decrypting ...
Keywords:
Domain Attack, Randomly Partitioned Encryption, Efficient Query Processing, Query Log Attack, Database Encryption, Frequency Attack
15
Towards indexing functions: answering scalar product queries
June 2014
SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 9, Downloads (12 Months): 30, Downloads (Overall): 351
Full text available:
PDF
We consider a broad category of analytic queries, denoted by scalar product queries, which can be expressed as a scalar product between a known function over multiple database attributes and an unknown set of parameters. More specifically, given a set of d-dimensional data points, we retrieve all points x which ...
Keywords:
moving object indexing, scalar product query, function indexing, planar index
16
Shared workload optimization
February 2014
Proceedings of the VLDB Endowment: Volume 7 Issue 6, February 2014
Publisher: VLDB Endowment
Bibliometrics:
Citation Count: 7
Downloads (6 Weeks): 3, Downloads (12 Months): 26, Downloads (Overall): 53
Full text available:
PDF
As a result of increases in both the query load and the data managed, as well as changes in hardware architecture (multicore), the last years have seen a shift from query-at-a-time approaches towards shared work (SW) systems where queries are executed in groups. Such groups share operators like scans and ...
17
TPC-BiH: A Benchmark for Bitemporal Databases
August 2013
Revised Selected Papers of the 5th TPC Technology Conference on Performance Characterization and Benchmarking - Volume 8391
Publisher: Springer-Verlag New York, Inc.
An increasing number of applications such as risk evaluation in banking or inventory management require support for temporal data. After more than a decade of standstill, the recent adoption of some bitemporal features in SQL:2011 has reinvigorated the support among commercial database vendors, who incorporate an increasing number of relevant ...
Keywords:
Benchmark, Data Generator, Bitemporal Databases
18
Secure database-as-a-service with Cipherbase
June 2013
SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Publisher: ACM
Bibliometrics:
Citation Count: 7
Downloads (6 Weeks): 6, Downloads (12 Months): 70, Downloads (Overall): 500
Full text available:
PDF
Data confidentiality is one of the main concerns for users of public cloud services. The key problem is protecting sensitive data from being accessed by cloud administrators who have root privileges and can remotely inspect the memory and disk contents of the cloud servers. While encryption is the basic mechanism ...
Keywords:
trusted hardware, encryption, privacy, security
19
Workload optimization using SharedDB
June 2013
SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 4, Downloads (12 Months): 32, Downloads (Overall): 262
Full text available:
PDF
This demonstration presents SharedDB, an implementation of a relational database system capable of executing all SQL operators by sharing computation and resources across all running queries. SharedDB sidesteps the traditional query-at-a-time approach and executes queries in batches. Unlike proposed multi-query optimization ideas, in SharedDB queries do not have to contain ...
Keywords:
shared query processing, main memory
20
SODA: generating SQL for business users
June 2012
Proceedings of the VLDB Endowment: Volume 5 Issue 10, June 2012
Publisher: VLDB Endowment
Bibliometrics:
Citation Count: 11
Downloads (6 Weeks): 1, Downloads (12 Months): 17, Downloads (Overall): 202
Full text available:
PDF
The purpose of data warehouses is to enable business analysts to make better decisions. Over the years the technology has matured and data warehouses have become extremely successful. As a consequence, more and more data has been added to the data warehouses and their schemas have become increasingly complex. These ...
|
|