Performance Degradation Analysis Method using Big Data

DOI : 10.17577/IJERTV5IS090313

Download Full-Text PDF Cite this Publication

Text Only Version

Performance Degradation Analysis Method using Big Data

M. Bhargavi, Assistant Professor, Department of CSE,

Vidyanikethan Engineering College, Tirupati,

  1. Venkata Lokesh,

    Student Scholar,

    1. Vijayalakshmi,

      Assistant Professor,

      Sree Vidyanikethan Engineering College, Tirupati,

      IV CSE, Department of CSE,

      Sree Vidyanikethan Engineering College, Tirupati,

      Abstract. Satellites have features of high control integration, various working modes, and complex telemetry big data, which make it difficult to evaluate their performance degradation. In this paper, a novel data mining analysis method is proposed to analyze the satellites telemetry big data, in which sample en- tropy is calculated to characterize states and support vector data description is utilized to analyze the satellite performance degradation process. The experimental results show that our proposed method could generally describe the performance degradation process of satellites. Meanwhile, it also provides an important approach for the ground-station-monitor to ana- lyze the performance of satellites.

      Keywords: Performance degradation; Telemetry big data; Sample entropy; Support vector data description

      1. INTRODUCTION

        With more and more satellites being sent into space these years, the ground in-orbit managements have to handle such challenges as satellites high control precision, various working modes, and high complexity. As advanced technologies and new materials are utilized in satellites, the sudden failure is not the primary failure mode for most satellite failures, which is replaced by performance degradation. The theory of analyzing satellite performance degradation only focuses on the overall performance of equipment, regardless of failure modes, which is different from analyzing sudden failures.

        In 2001, the University of Wisconsin and the University of Michigan, together with other 40 industry partners, were united to establish the Intelligent Maintenance Systems (IMS) research center under the U.S. National Science Foundation. After then, many methods of performance degradation assessment have been proposed, such as the pattern discrimination model (PDM) based on a cerebellar model articulation controller (CMAC) neural network [1], self-organizing map (SOM) and back propagation neural network methods [2], hidden Markov model (HMM) and hidden semi-Markov model (HSMM) [3], etc. However, these methods are deficient in some aspects. For example, the results of CMAC assessment method are greatly influenced by parameter setting, and the assessment results of the SOM, neural network method and hidden Markov model cannot directly reflect degradation degree. In order to

        accommodate the characteristics of assessment for different key components, the analysis theory of performance degradation has been developed from single degradation variable to a more diverse practical direction. Although some new theories and methods have emerged, the researches on the performance degradation of satellite are still limited. M Tafazoli [4] studied in-orbit failures for more than 130 different spacecraft and revealed that the spacecraft are vulnerable to failures occurring in key components. MA W

        [5] analyzed the space radiation environment of thermal coatings and proposed degradation models for the optical properties of thermal coatings. However, these methods mainly focus on failure data and also require relevant experience.

        The conventional analysis methods for satellite perfor- mance degradation have some shortcomings such as exper- imental difficulties and high cost. According to expert knowledge, large amounts of telemetry big data are generat- ed during the in-orbit operation and monitoring process. Satellites telemetry big data contain monitoring information, abnormal states, space environment, and others, which re- flect the operational status and payload of satellites. A nov- el analysis method for satellite performance degradation with telemetry big data is proposed in this paper. This meth- od uses data mining techniques and provides a quantitative description for satellite performance degradation process. Furthermore, it also can be extended to apply to failure pre- diction.

      2. RELATED CONCEPTS

          1. Sample Entropy

            The sample entropy [6] (SamEn) is an improved algorithm of approximate entropy (ApEn) proposed by Pincus [7]. The advanced algorithm is able to quantify the complexity rate of a nonlinear time series.

            For a data series X N x 1, x 2,…x n , where N is the length of the series, two parameters are defined: m is the embedded dimension of the vector to be formed and r is the threshold that serves as a noise filter. The steps to calculate SamEn are shown as follows:

            1. N m 1 patterns (vectors) are generated, and each pattern owns m dimensions. The pattern is represented as following:

              a,R=R2 . (6)

              As the distance from xi to the core a should not be

              X m i x i, x i 1…x i m 1 i=1, ,N-m+1

              (1)

              larger than radius R for all the samples of the target class X , the constraint of the minimization problem can be

            2. The distance, d X m i, X m j between each two patterns can be computed by using Eq. (2).

              described as Eq. (7):

              i

              i

              x 2 R2

              (7)

              d X m i, X m j max x i k x j k

              k =0,m – 1

              To account for the possibility of outliers in the training

              set, the distance between xi and the core a should not be

              (2)

            3. For each pattern X m i , the number of matching

              strictly smaller than R , but larger distances should be penalized. Therefore, slack variable i is brought in, and the

              pattern, Nm i , i.e., number of

              d X m i, X m j r is

              minimization problem is transformed into

              i

              i

              achieved. Cm i Nm i / N m 1 is the probability that N

              i

              i

              pattern X m

              r

              j matches X m

              i . And the matching probability

              min

              R,a, =R2 C

              of two sequences with m points can be achieved by using Eq.(3):

              s.t xi -a

              R2

              i 1

              (8)

              m r

              1

              N m 1

              N m1

              C i

              C i

              m r

              i 1

              (3)

              i 0 i=1,2, ,N

            4. When the dimension expands to m 1 , steps 1-3 are repeated to find out m +1 r . The theoretical value of the SamEn is defined as follows:

              The penalty factor C makes a trade-off between the volume and the errors. The minimization problem in Eq.(8) can be calculated by using Eq.(9).

              L R,a, , =R2 C

              SamEn m, r lim ln m r m1 r (4)

              i i i

              i

              N

              For a finite length of data points N , the estimated value of the SamEn is given by using Eq.(5):

              R2 2 x 2ax a2

              i i i

              i i i

              i

              (9)

              SamEn m, r, N ln m r m1 r (5)

              ii

              i

              i 0, i 0

              Experiments conducted by Pincus [8] indicate that a reasonable statistical character can be achieved

              In Eq. (9), i and i are the Lagrange multipliers. L

              should be minimized with respect to R , a , and and

              when m 2 , r 0.1 ~ 0.25 std X

              , where

              std X i

              denotes the standard deviation of X x

              1 , x2 , xN .

              maximized with respect to i and i . Respectively taking their partial derivtives equal to zero, and then get the

              Compared with the general nonlinear dynamics, SamEn has more advantages, such as immunity to noise and inference as well as independence from the length of time series. SamEn has been widely used in physiological signal processing because of its excellent characteristics [8, 9]. In

              following constraint Eq.(10):

              i =1

              i

              i xi

              (10)

              a= i = x

              consideration of these characteristics, SamEn is a promising method in describing the performance features for a large amount of telemetry big data. The SamEn is also used in this paper to extract satellite performance features.

              i i

              i i i

              C i i =0 i

          2. Support Vector Data Description

        Support Vector Data Description [10] (SVDD) is inspired

        Substituting (10) into (9),we obtain max L :

        max L=i xi xi i j xi xj

        (11)

        by the Support Vector Classifier. The method is robust

        i =1

        i ,j

        against outliers in the training set and is capable of tightening the description by using negative examples.

        A hypersphere that contains all or most samples of the target class is defined as X =x1 ,x2 , xn . The hypersphere

        is bounded by the core of the hypersphere a and radius R . If the hypersphere covers all the training samples of target class, the classification is established by the empirical error which is equal to zero, and the structural error is defined as follows:

        According to the theory proposed by Vapnik [11], the Kernel trick can be adopted to take the place of dot product. Using the kernel function enables SVDD to handle the mapping of low-dimensional original space to high- dimensional feature space without dimensional disaster. Any function satisfies the Mercers condition can be regarded as the kernel function, and RBF [12] is used as the kernel function in this work:

        x-y 2

        hypersphere bounded by the core (model.a) and the radius

        KG x,y, = exp 2

        (12)

        (model.R)

        The optimization problem described by Eq. (11) can be further transformed into the following explicit form:

        Definition 3 (Performance Degradation Degree)

        Here, dec denotes the distance between the performance eigenvector of satellite and the core of hypersphere. The

        max L=1 i j KG xi ,xj ,

        i ,j

        (13)

        performance degradation degree deg is defined by the difference between dec and the radius of hypersphere

        Equation (13) shows that the core of the hypersphere is a

        linear combination of the objects. Only objects xi with i 0 are needed in the description. Therefore, these objects are called the support vectors of the description (SVs). To test an object z , the distance to the core of the hypersphere and the radius R are respectively calculated by Eq. (14).

        model.R, that is, deg = dec model.R (in Figure 1).

        It means that performance degradation process of the objective equipment may occurs when the value of deg is larger than 0. When the value increases monotonously, the performance degradation process of the objective equipment increases accordingly. As the degree cannot be negative, set

        deg = 0 when dec model.R <0.

        d = z-a =KG z, z -2i KG z, xi +i j KG xi , xj

        i i ,j

        R2 = x -a 2 =1-2 K x , x K x , x

        (14)

        sv i G i sv i j G i j i i ,j

        (15)

        The test object z is accepted when this distance is not greater than the radius (i.e. d R ).

        SVDD has the advantage of requiring only one category as the learning sample, whereas the degradation analysis itself plays down the distinction between specific patterns. The sample points of health status are extracted as the learn- ing samples. Therefore, the process of moving away from the health status with time for the testing samples can be regarded as the degradation process. Only the core of the hypersphere is used to detect the target class of testing sam- ple. And the core can be determined by a few support vec- tors. Moreover, the satisfactory computational speed of SVDD to classify the testing samples makes SVDD a prom- ising alternative method for analyzing satellite performance degradation.

      3. METHOD TO ANALYZE THE PERFORMANCE DEGRADATION OF SATELLITE

          1. Definition description

            Definition 1 (Performance Eigenvector)

            The SamEn of a time period is taken as its performance feature. And the vector composed of the performance features of parameters within the same time period is called performance eigenvector.

            In this study, parameters are not limited to those of the objective equipment, but they also contain a number of closely related equipment parameters. As parameters are relative to specialized knowledge, their selections are conducted based on the domain and expert knowledge.

            Definition 2 (Health Model)

            With SVDD method, the model obtained by training the performance eigenvector of satellite in the healthy status is called health model (model).

            According to the theory of SVDD, the model described in definition 2 is composed of the support vectors of healthy state vector (model.SV), corresponding coefficients ( model. ),number of support vectors (model.len),

            1. Performance states and eigenvectors

            2. Performance degradation degree

              Fig. 1. Principle of the performance degradation degree

              Figure 1 shows the principle of performance degradation degree. However, the model cannot contain all the health status features of the satellite for the operating mode of satellite is complex, and the training sets in healthy status of each operating mode are limited. A satellite may remain in the healthy status under other operating modes, especially when deg is positive. Therefore, Definition 3 is appropriate for the parameters less affected by the operating mode of satellite.

          2. Framework description

        Figure 2 shows the overall framework of the analysis for satellite performance degradation presented in this study, which has four main steps.

        Satellite telemetry data

        Parameter selection Expert knowledge

        Telemetry data processing

        Processing

        (2) The values of time series are normalized into the range [-1,1] for each parameter and each time series are equally divided into 800 groups. The performance features of each group are extracted by Definition 1. Finally, seven performance feature sequences are obtained with a length of 800. The performance eigenvector is composed of the features of seven parameters in the group with same number.

        4.2. Modeling and degradation analysis

        Local Performance Eigenvector

        Sample Entropy extraction

        Sample Entropy extraction

        Median Filter

        1. Performance eigenvector under healthy status are selected as the training data, SVDD method is used by

          Eigenvectors in healthy states

          Support Vector Data

          Description

          Eigenvectors for analysis

          Health Model

          setting =1 in this experiment, and then the health model of satellite is established.

        2. The remaining dataset is used as test data to verify the obtained health model, and the degradation degree is calculated according to Definition 3. Figure 3 shows the final results.

        degradation degree sequence wavelet denoise sequence

        degradation degree sequence wavelet denoise sequence

        0.35

        Fig. 2. Framework of the satellite performance degradation analysis

        Step 1. Selectparameters of the satellite according to expert knowledge. Then, median filter method is used to reduce the noise in satellite telemetry big data so as to generate a new clean dataset.

        Step 2. Extract the performance features from the selected parameters through Step 1 according to Definition 1. And compose the final set of the performance eigenvectors.

        <>Step 3. Select the performance eigenvectors in the healthy status as the training set, and build a health model with SVDD method.

        Performance Degradation Degree

        Performance Degradation Degree

        0.3

        degradation degree

        degradation degree

        0.25

        0.2

        0.15

        0.1

        0.05

        0

        1 100 200 300 400 500 600 700 800

        group number

        Fig. 3. Degradation degree

        Step 4. To measure the degradation status of the new performance eigenvector, calculate the performance degradation degrees according to Definition 3 and the results of the model obtained in Step 3.

        Considering the features of satellite telemetry big data, the median filter method in Step 1 is used to reduce noise in the data.

      4. EXPERIMENTAL RESULTS AND ANALYSIS The telemetry big data of one satellite is used as

        experimental data, which recorded from 2011-05-01 00:00:00.0 to 2011-12-29 18:16:59.987, 14 million data

        frames that contain several failures and performance degradation information. In our experiments, seven important parameters in this dataset are selected by expert knowledge.

        The telemetry big data is stored in Oracle 11g, and the algorithms are coded by Java. The operating system used is Windows Server 2008 R2 Standard with the Intel (R) Xeon

        (R) Eight-core E5606 processor with 8 G RAM.

          1. Telemetry big data processing

            The experimental dataset is processed as the following steps:

            1. The outliers caused by decoding or other errors are removed according to the ranges of the seven parameters. And further, the median filter method in every 30s is used to reduce the noise in the dataset. Finally, a new dataset is achieved.

        The degradation degrees are unsteady, and the curve is not smooth but fluctuant. This is mainly due to the recognition accuracy of SVDD and cyclical factors of original data that does not affect the overall reaction on the degradation process of satellite. In order to reduce the interference of these factors, a relative algorithm [13] is employed and the wavelet denoising sequence is obtained as Figure 3 shows. Overall, the average degradation degree presents an increasing trend. Given the long period, the accidental factors cannot influence the degradation degree all the time. Therefore, we conclude that the satellite has entered the performance degradation state based on Definition 3.

        Aerospace experts confirm that two major failures of satellite did occur from late July to late August (between the 246th group and 370th group) for unknown reasons, and these two failures are corresponding to the two peaks nearby. That proves the correctness of our proposed definition, especially explaining the degradation peak and the high degradation degree level after the peak. In conclusion, the proposed method can efficiently describe the performance degradation process of satellite.

      5. CONCLUSIONS

A method for satellite performance degradation with telemetry big data is proposed in this paper while studies for solving this problem are limited. The experimental analysis shows that the proposed method can extract effective state information from the parameters and provide a quantitative description for satellite performance degradation. Moreover, the analysis on the performance degradation of satellite with telemetry big data has a significant meaning in in-orbit research and management for satellites.

In our study, the definitions may have some limitations; for example, the degradation degree of the experiment is unstable but fluctuant. The sample entropy algorithm may take much time to trim redundant parameters in massive data, which will be improved in our future work.

ACKNOWLEDGMENT

This paper is supported by the National Natural Science Foundation of China (Grant No. U1433116).

REFERENCES

  1. Lee J. Measurement of machine performance degradation using a neural network model[J]. Computers in Industry, 1996, 30(3): 193- 209.

  2. Huang R, Xi L, Li X, et al. Residual life predictions for ball bearings based on self-organizing map and back propagation neural network methods[J]. Mechanical Systems and Signal Processing, 2007, 21(1): 193-207.

  3. Si X S, Wang W, Hu C H, et al. Remaining useful life estimationA review on the statistical data driven approaches[J]. European Journal of Operational Research, 2011, 213(1): 1-14.

  4. Tafazoli M. A study of on-orbit spacecraft failures[J]. Acta Astronautica, 2009, 64(2): 195-205.

  5. MA W, XUAN Y, HAN Y, et al. Degradation Performance of Long- life Satellite Thermal Coating and Its Influence on Thermal Character [J]. Journal of Astronautics, 2010, 2: 043.

  6. Widodo A, Shim M C, Caesarendra W, et al. Intelligent prognostics for battery health monitoring based on sample entropy[J]. Expert Systems with Applications, 2011, 38(9): 11763-11769.

  7. Pincus S M. Assessing serial irregularity and its implications for health[J]. Annals of the New York Academy of Sciences, 2001, 954(1): 245-267.

  8. Alcaraz R, Rieta J J. A review on sample entropy applications for the non-invasive analysis of atrial fibrillation electrocardiograms[J]. Biomedical Signal Processing and Control, 2010, 5(1): 1-14.

  9. Yang A C, Huang C C, Yeh H L, et al. Complexity of spontaneous BOLD activity in default mode network is correlated with cognitive function in normal male elderly: a multiscale entropy analysis[J]. Neurobiology of aging, 2013, 34(2): 428-438.

  10. Weinshall D, Zweig A, Hermansky H, et al. Beyond novelty detection: Incongruent events, when general and specific classifiers disagree[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(10): 1886-1901.

  11. Yu H, Xie T, Paszczynski S, et al. Advantages of radial basis function networks for dynamic system design[J]. IEEE Transactions on Industrial Electronics, 2011, 58(12): 5438-5450.

  12. Chang C C, Lin C J. LIBSVM: a library for support vector machines[J]. ACM Transactions on Intelligent Systems and Technology (TIST), 2011, 2(3): 27.

  13. Shao Q, Zhang X, Qi X, et al. Optical wavelet de-noising applied in multi-span nonlinear fiber links[J]. Optics Communications, 2010, 283(7): 1261-1267.

Leave a Reply