The Library
Goal-based analytic composition for on- and off-line execution at scale
Tools
Coetzee, Peter and Jarvis, Stephen A. (2015) Goal-based analytic composition for on- and off-line execution at scale. In: The 9th IEEE International Conference on Big Data Science and Engineering, Helsinki, Finland, 20-22 Aug 2015. Published in: 2015 IEEE Trustcom/BigDataSE/ISPA ISBN 9781467379526. doi:10.1109/Trustcom.2015.562
|
PDF
WRAP_1257419-cs-140915-mendeleev.pdf - Accepted Version - Requires a PDF viewer. Download (492Kb) | Preview |
Official URL: https://doi.org/10.1109/Trustcom.2015.562
Abstract
Crafting scalable analytics in order to extract actionable business intelligence is a challenging endeavour, requiring multiple layers of expertise and experience. Often, this expertise is irreconcilably split between an organisation’s engineers and subject matter or domain experts. Previous approaches to this problem have relied on technically adept users with tool-specific training. These approaches have generally not targeted the levels of performance and scalability required to harness the sheer volume and velocity of large-scale data analytics.
In this paper, we present a novel approach to the automated planning of scalable analytics using a semantically rich type system, the use of which requires little programming expertise from the user. This approach is the first of its kind to permit domain experts with little or no technical expertise to assemble complex and scalable analytics, for execution both on- and offline, with no lower-level engineering support. We describe in detail (i) an abstract model of analytic assembly and execution; (ii) goal-based planning and (iii) code generation using this model for both on- and off-line analytics. Our implementation of this model, MENDELEEV, is used to (iv) demonstrate the applicability of our approach through a series of case studies, in which a single interface is used to create analytics that can be run in real-time (on-line) and batch (off-line) environments. We (v) analyse the performance of the planner, and (vi) show that the performance of MENDELEEV’s generated code is comparable with that of hand-written analytics.
Item Type: | Conference Item (Paper) | ||||||
---|---|---|---|---|---|---|---|
Subjects: | Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software | ||||||
Divisions: | Faculty of Science, Engineering and Medicine > Science > Computer Science | ||||||
Library of Congress Subject Headings (LCSH): | Web usage mining, Data mining | ||||||
Journal or Publication Title: | 2015 IEEE Trustcom/BigDataSE/ISPA | ||||||
Publisher: | IEEE | ||||||
ISBN: | 9781467379526 | ||||||
Official Date: | 3 December 2015 | ||||||
Dates: |
|
||||||
DOI: | 10.1109/Trustcom.2015.562 | ||||||
Status: | Peer Reviewed | ||||||
Publication Status: | Published | ||||||
Access rights to Published version: | Restricted or Subscription Access | ||||||
Date of first compliant deposit: | 29 May 2016 | ||||||
Date of first compliant Open Access: | 29 May 2016 | ||||||
Funder: | Engineering and Physical Sciences Research Council (EPSRC) | ||||||
Conference Paper Type: | Paper | ||||||
Title of Event: | The 9th IEEE International Conference on Big Data Science and Engineering | ||||||
Type of Event: | Conference | ||||||
Location of Event: | Helsinki, Finland | ||||||
Date(s) of Event: | 20-22 Aug 2015 | ||||||
Related URLs: |
Request changes or add full text files to a record
Repository staff actions (login required)
View Item |
Downloads
Downloads per month over past year