Abstract
Current data-driven methods for diagnosing gas path faults in aero-engines often rely on large, costly fault sample sets and face challenges related to class imbalances. These include disparities in the quantity of normal and fault data, differences among fault types, and variations in fault severity levels. This paper proposes a novel generative model, TL-GMVAE, which integrates Gaussian Mixture Models (GMM) with Variational Autoencoders (VAE) and incorporates a transfer learning (TL) strategy. Using a large dataset of normal operational data, the VAE learns latent feature mappings of engine behavior and establishes a joint probability distribution across sensor measurements. To account for complexity in engine operating conditions, the GMM is used as the sampling distribution of the VAE. This enhances the model's ability to represent diverse operating scenarios. The pre-trained model is then fine-tuned with a small dataset of gas path fault data, transferring knowledge to fault domains. Each fault-specific TL-GMVAE model serves as an independent generator for synthetic fault samples. The proposed approach is validated using several established classifiers. The impact of different Fault-Normal ratios and imbalances across fault categories on classification accuracy is analyzed. Additionally, the robustness of the method to individual engine variability is evaluated. Results demonstrate that the TL-GMVAE generates high-quality fault samples and significantly improves fault diagnosis accuracy. These findings highlight its potential application in aero-engine health monitoring and fault diagnosis systems.