Synthesis of sequential data
Information
Author: Joel Viklund
Expected completion: 2021-05
Supervisor: Douglas Torres
Supervisor's company/department: Syndata AB
Subject reviewer: Thomas Schön
Other: -
Presentation
Presenter: Joel Viklund
Presentation time: 2021-04-28 15:15
Opponent: Alva Larsson
Abstract
Good generative models for short time series data exist and have been applied to both data augmentation and privacy protection. A common trait of existing generative models is that they all use a recurrent neural network (RNN) architecture, which limits the length of the sequences they can handle. Real-world problems may involve data containing longer sequences, and it is such data that this thesis attempts to synthesize. By combining the recently successful TimeGAN framework with a temporal convolutional network architecture for its components, we generate synthetic sequential data for two toy data sets: sequential MNIST and multivariate sine waves. The results strongly indicate, although they rely solely on visual inspection, that the model manages to capture long-range temporal dynamics as well as relations between different features in the multivariate sine waves data set. To make our model applicable to real-world data sets, we suggest two improvements. Firstly, the validation of the generated data should not rely on visual inspection alone, but should also verify that the synthetic data follows the same statistical distribution as the real data. Secondly, depending on the task, the model should be refined so that the synthetic samples look even more realistic.
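To illustrate why a temporal convolutional network can be a natural fit for long sequences, the following minimal sketch computes the receptive field of a stack of dilated causal convolutions. It is not the architecture used in the thesis: the function names, the kernel size of 3, the doubling dilation schedule, the single convolution per layer, and the 784-step sequence length (a 28 x 28 image read one pixel at a time) are all assumptions made for illustration.

```python
def receptive_field(kernel_size, dilations):
    """Number of time steps (including the current one) visible to the last
    layer of a stack of dilated causal convolutions, one per dilation."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


def layers_needed(kernel_size, sequence_length):
    """Smallest number of layers with dilations 1, 2, 4, ... whose
    receptive field covers the whole sequence."""
    n_layers = 0
    while receptive_field(kernel_size, [2 ** i for i in range(n_layers)]) < sequence_length:
        n_layers += 1
    return n_layers


# Hypothetical setting (not taken from the thesis): kernel size 3 and a
# 784-step sequence, i.e. a 28 x 28 image read one pixel at a time.
dilations = [2 ** i for i in range(9)]
print(receptive_field(3, dilations))  # 1023
print(layers_needed(3, 784))          # 9
```

Under these assumptions the receptive field grows exponentially with depth, so nine layers would already span a 784-step sequence, whereas a recurrent model has to carry information through every intermediate step.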