Dynamical Gaussian Process Latent Variable Model for Representation Learning from Longitudinal Data

Many real-world applications involve longitudinal data: observations of several variables in which different subsets of variables are sampled at irregularly spaced time points. We introduce the Longitudinal Gaussian Process Latent Variable Model (L-GPLVM), a variant of the Gaussian Process Latent Variable Model, for learning compact representations of such data. L-GPLVM overcomes a key limitation of the Dynamic Gaussian Process Latent Variable Model and its variants, which assume that the data are fully observed at every sampled time point. We describe an effective approach to learning the parameters of L-GPLVM from sparse observations: the dynamical model is coupled with a Multitask Gaussian Process model that samples the missing observations at each step of the gradient-based optimization of the variational lower bound. We further show that the Sparse Process Convolution framework makes it possible to learn the latent representation of sparsely and irregularly sampled longitudinal data with minimal computational overhead relative to a standard Latent Variable Model. Experiments with synthetic data, as well as with variants of MOCAP data with varying degrees of sparsity of observations, show that L-GPLVM substantially and consistently outperforms the state-of-the-art alternatives in recovering the missing observations, even when the available data exhibit a high degree of sparsity. The resulting compact representations of irregularly sampled and sparse longitudinal data can be used for a variety of machine learning tasks, including clustering, classification, and regression.

© Le. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in Proceedings of the 2020 ACM-IMS on Foundations of Data Science Conference, https://dx.doi.org/10.1145/3412815.3416894.

Metadata

Work Title: Dynamical Gaussian Process Latent Variable Model for Representation Learning from Longitudinal Data
Access: Open Access
Creators: Thanh Le, Vasant Honavar
License: In Copyright (Rights Reserved)
Work Type: Article
Publisher: ACM
Publication Date: October 18, 2020
Publisher Identifier (DOI): 10.1145/3412815.3416894
Source: Proceedings of the 2020 ACM-IMS on Foundations of Data Science Conference
Deposited: September 09, 2021

Work History

Version 1
published

  • Created
  • Added ACM_Conference_Proceedings__Master__Template-3-1.pdf
  • Added Creator Thanh Le
  • Added Creator Vasant Honavar
  • Published
  • Updated