CRUCIBLE : towards unified secure on- and off-line analytics at scale
Coetzee, Peter and Jarvis, Stephen A. (2013) CRUCIBLE : towards unified secure on- and off-line analytics at scale. In: The 2013 International Workshop on Data-Intensive Scalable Computing Systems, Denver, Colorado, USA, 18 Nov 2013. Published in: Proceedings of the 2013 International Workshop on Data-Intensive Scalable Computing Systems pp. 43-48. ISBN 9781450325066. doi:10.1145/2534645.2534649
PDF: WRAP_Coetzee_CRUCIBLE - Towards Unified Secure On- and Off-Line Analytics at Scale (2).pdf - Accepted Version (594Kb)
Official URL: http://dx.doi.org/10.1145/2534645.2534649
Abstract
The burgeoning field of data science benefits from the application of a variety of analytic models and techniques to the oft-cited problems of large volume, high velocity data rates, and significant variety in data structure and semantics. Many approaches make use of common analytic techniques in either a streaming or batch processing paradigm.
This paper presents progress in developing a framework for the analysis of large-scale datasets using both of these pools of techniques in a unified manner. This includes: (1) a Domain Specific Language (DSL) for describing analyses as a set of Communicating Sequential Processes, fully integrated with the Java type system, including an Integrated Development Environment (IDE) and a compiler which builds idiomatic Java; (2) a runtime model for execution of an analytic in both streaming and batch environments; and (3) a novel approach to automated management of cell-level security labels, applied uniformly across all runtimes.
The paper concludes with a demonstration of the successful use of this system with a sample workload developed in (1), and an analysis of the performance characteristics of each of the runtimes described in (2).
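As a rough illustration of points (2) and (3) in the abstract, the sketch below defines one analytic step, runs it unchanged through a batch runner and a streaming runner, and propagates security labels automatically by taking the union of the input labels. It is written in Python for brevity rather than in the Java-targeting DSL the abstract describes, and every name in it (`Cell`, `lift`, `run_batch`, `run_streaming`, `visible_to`) is hypothetical. The union rule is an assumption made for illustration; the abstract does not specify how CRUCIBLE combines labels.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, Iterable, Iterator, List


@dataclass(frozen=True)
class Cell:
    """A value paired with the labels a reader must hold to see it (hypothetical type)."""
    value: object
    labels: FrozenSet[str] = frozenset()


def lift(fn: Callable) -> Callable[..., Cell]:
    """Wrap a plain function so its result carries the union of its inputs' labels.

    Assumption: union is the combination rule; the source does not state one.
    """
    def labelled(*cells: Cell) -> Cell:
        merged = frozenset().union(*(c.labels for c in cells))
        return Cell(fn(*(c.value for c in cells)), merged)
    return labelled


def run_batch(step: Callable[[Cell], Cell], dataset: List[Cell]) -> List[Cell]:
    """Batch-style runtime: the whole dataset is available up front."""
    return [step(cell) for cell in dataset]


def run_streaming(step: Callable[[Cell], Cell], source: Iterable[Cell]) -> Iterator[Cell]:
    """Streaming-style runtime: cells are processed one at a time as they arrive."""
    for cell in source:
        yield step(cell)


def visible_to(cell: Cell, clearances: FrozenSet[str]) -> bool:
    """A reader may see a cell only if they hold every label attached to it."""
    return cell.labels <= clearances


# The analytic is written once, with no knowledge of labels or runtimes.
double = lift(lambda x: x * 2)
add = lift(lambda a, b: a + b)

records = [Cell(1, frozenset({"A"})), Cell(2, frozenset({"B"}))]
batch_out = run_batch(double, records)
stream_out = list(run_streaming(double, iter(records)))
total = add(records[0], records[1])
```

Because the label handling lives in `lift` rather than in the analytic itself, both runners apply the same labelling policy without the analytic author writing any security code, which is the general idea behind applying labels uniformly across runtimes.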
Item Type: | Conference Item (Paper)
---|---
Subjects: | Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software
Divisions: | Faculty of Science, Engineering and Medicine > Science > Computer Science
Library of Congress Subject Headings (LCSH): | Electronic data processing
Journal or Publication Title: | Proceedings of the 2013 International Workshop on Data-Intensive Scalable Computing Systems
Publisher: | ACM
ISBN: | 9781450325066
Book Title: | Proceedings of the 2013 International Workshop on Data-Intensive Scalable Computing Systems - DISCS-2013
Official Date: | 18 November 2013
Page Range: | pp. 43-48
DOI: | 10.1145/2534645.2534649
Status: | Peer Reviewed
Publication Status: | Published
Access rights to Published version: | Restricted or Subscription Access
Date of first compliant deposit: | 28 July 2016
Date of first compliant Open Access: | 28 July 2016
Funder: | Engineering and Physical Sciences Research Council (EPSRC)
Conference Paper Type: | Paper
Title of Event: | The 2013 International Workshop on Data-Intensive Scalable Computing Systems
Type of Event: | Workshop
Location of Event: | Denver, Colorado, USA
Date(s) of Event: | 18 Nov 2013