Online transfer learning for concept drifting data streams

[thumbnail of WRAP_Theses_McKay_2022.pdf]
Preview
PDF
WRAP_Theses_McKay_2022.pdf - Submitted Version - Requires a PDF viewer.

Download (17MB) | Preview

Request Changes to record.

Abstract

Online Transfer Learning (TL) allows knowledge to be learnt from a data rich source domain to aid predictions in an online target domain. However, when all domains are online, and a data rich source domain does not exist, we must determine what to transfer, how to combine transferred knowledge, and whether to transfer knowledge. To ensure the feasibility of online TL methods in real-world applications, they should not only aid predictions in receiving domains, but should consider the communication and computational overheads of knowledge transfer. To address these challenges, this thesis presents methods for online TL when all domains are online, which are evaluated using synthetic and real-world regression-based datasets. First, the BOTL framework is introduced, which enables knowledge transfer to be conducted bi-directionally between online data streams, where knowledge is transferred in the form of predictive models, and combined using an OLS metalearner. Second, two methods of selecting a relevant yet diverse subset of transferred and locally learnt models are presented, namely parameterised thresholding and conceptual clustering. These approaches help to prevent over_tting when the number of models transferred is large in comparison to the window of available data. To reduce the computational overhead of selecting subsets of models, a static diversity metric is introduced, which estimates the conceptual similarity between models using the Principal Angles (PAs) between their underlying subspaces. Third, two methods for determining whether to transfer knowledge are presented, namely IdDT and IdCS, which maintain comparable predictive performances to when all models are transferred, while reducing the number of models received in each domain by 47:1% and 30% respectively across the experiments conducted for this thesis.

Item Type: Thesis [via Doctoral College] (PhD)
Subjects: Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software
Library of Congress Subject Headings (LCSH): Transfer learning (Machine learning) , Machine learning, Streaming technology (Telecommunications), Data transmission systems, Artificial intelligence, Adaptive computing systems
Official Date: January 2022
Dates:
Date
Event
January 2022
UNSPECIFIED
Institution: University of Warwick
Theses Department: Department of Computer Science
Thesis Type: PhD
Publication Status: Unpublished
Supervisor(s)/Advisor: Griffiths, Nathan
Sponsors: Engineering and Physical Sciences Research Council ; Jaguar Land Rover (Firm)
Format of File: pdf
Extent: xv, 191 leaves : coloured charts
Language: eng
URI: https://wrap.warwick.ac.uk/171339/

Export / Share Citation


Request changes or add full text files to a record

Repository staff actions (login required)

View Item View Item