Lifelong Infinite Mixture Model Based on Knowledge-Driven Dirichlet Process

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Recent research efforts in lifelong learning propose to grow a mixture of models to adapt to an increasing number of tasks. The proposed methodology shows promising results in overcoming catastrophic forgetting. However, the theory behind these successful models is still not well understood. In this paper, we perform the theoretical analysis for lifelong learning models by deriving the risk bounds based on the discrepancy distance between the probabilistic representation of data generated by the model and that corresponding to the target dataset. Inspired by the theoretical analysis, we introduce a new lifelong learning approach , namely the Lifelong Infinite Mixture (LIMix) model, which can automatically expand its network architectures or choose an appropriate component to adapt its parameters for learning a new task, while preserving its previously learnt information. We propose to incorporate the knowledge by means of Dirichlet processes by using a gat-ing mechanism which computes the dependence between the knowledge learnt previously and stored in each component , and a new set of data. Besides, we train a compact Student model which can accumulate cross-domain representations over time and make quick inferences. The code is available at https://github.com/dtuzi123/ Lifelong-infinite-mixture-model.
Original languageEnglish
Title of host publicationIEEE/CVF International Conference on Computer Vision (ICCV)
PublisherIEEE
Pages10695-10704
Number of pages10
DOIs
Publication statusPublished - 28 Feb 2022

Bibliographical note

© IEEE, 2022. This is an author-produced version of the published paper. Uploaded in accordance with the publisher’s self-archiving policy. Further copying may not be permitted; contact the publisher for details

Cite this