T-Test, Wavelet Transform and Fuzzy Logic in Load Curve Forecasting

DOI : 10.17577/IJERTV5IS110198

Download Full-Text PDF Cite this Publication

Text Only Version

T-Test, Wavelet Transform and Fuzzy Logic in Load Curve Forecasting

P. T.T.Binh

Faculty of Electrical and Electronics Engineering, HoChiMinh city University of Technology,

Ho chi Minh City, Viet Nam

Trong Nghia Le

Dinh X.Thu

HoChiMinh city University of Technology, Ho chi Minh City, Viet Nam

Faculty of Electrical and Electronics Engineering, HoChiMinh city University of Technology and Education, Ho chi Minh City, Viet Nam

Abstract The set of load curves can be regarded as the no stationary time series with some specialties due to the repetition of consumption behavior. These yield some correlation between the loads at specified hour and at other hours in the past. The T- test allows finding these correlations. The set of load curve may be treated as one times series if there are the correlations between load at t-hour and previous hours in the same day, load at t -hour but of the previous days. This set will be separated into two sets of load curves for working days and weekends in the case with eminent load difference between two kinds of day.

This paper will use the MODWT (maximal overlap discreet wavelet transform) and fuzzy logic with subtractive clustering [6] for load curve forecasting

II. T-TEST AND WAVELET DECOMPOSITION

  1. MODWT

    For a time series X with N samples, the MODWT yields an additive decomposition or MRA given by:

    J0

    And finally, 24 time series corresponded to 24 hours are examined if there is not correlation between load at t-hour and previous hours in the same day. This paper presented the load curve forecasting based on MODWT and fuzzy logic. The

    X D j SJ0

    j 1

    where Dj is the detail and

    (1)

    S

    S

    J

    J

    is the trend:

    0

    forecasting is carried out for one utility in the South of Vietnam

    Keywords T-test, time series, MODWT; fuzzy-rules; cluster centers

    N 1 ~ ~

    N 1 ~ ~

    D j,t h j,lW j,t l mod N l 0

    N 1 ~ ~

    1. INTRODUCTION

      Recently the application of modern modeling techniques

      S j ,t

  2. T-test

    g j ,lVj ,t l mo d N

    l 0

    (2)

    like Wavelet decomposition and fuzzy logic in forecasting time series is carried out. It is because theses techniques can overcome the troubles of no stationary series [1]. These techniques are also very successful for load forecasting [2][3][4].

    The daily load curve set can be treated as the time series and the forecasting has some specialties due to the repetition of consumption behavior. In [5], the load forecasting was based on the maximal overlap discrete wavelet transform (MODWT) and the correlation between loads at one moment with another moment is carried out by the experiences. But basically, this correlation must be based on some mathematic analysis. Beside, regarding the load curve set as one time

    T-test is necessary to find out the relationship between one variable with another variables. And if there is some relationship between one variable with itself in the past, the auto regression model will be used. For daily load curve forecasting, T-test will find out whether the load at t-hour is depended on the loads at some previous hours, for example: h previous hours in the same day, t-hour but of d-previous days or w weeks before.

  3. The load curve set is regarded as one time series

    The set of daily load curves may be treated as one time series. From the T-test result, for each series Dj,t, at the current time t, there will be a relationship as the following:

    series may lead to a raw error when there is the eminent difference between load curves of working days and weekends.

    Dj,t f j (Dj,t 24*7*w , , Dj,t 24*d , Dj,t h )

    The same way will be applied for Sj,t.

    (3)

    In this paper, we will use the T-test to find out the correlation between the loads at different moments and propose three ways to load curve forecasting when there is the significant difference between load curves of working days and weekends. For implementation, the forecasting of one utility in the South of Vietnam is carried out.

    Equation (3) means that each Dj,t is related to itself at moment t-1,..t-h in the same day, to itself in 1,2d- days before and 1,2,w weeks before.

  4. The load curve set is divided in two time series

    When the load curves have the eminent difference between the work days and weekends, separating the original time series is necessary to get higher accuracy. If there are strong

    The output will be:

    z

    z

    c

    *

    i i

    correlations between load at t-hour with t-1 hour and t-24 hour, the following considerations are included:

    The working day load curves time series: The forecasting for the t hour of the first working day in a week must use the load of the last weekend. So the load curve of this last day of

    z i1

    c

    c

    i

    i1

    Where c-number of centers

    (7)

    weekend days must be included to involve the influence of t-1 or t-24 hours in this series. The left hand value in (3) or forecasted value will belong to the working days, not to weekends.

    The weekend day load curves time series: Load curve of

    Yager và Filev suggested that Zij in (7) will be the linear function of the inputs as following:

    z* G y h

    Friday must be included because the hourly loads of Saturday

    ij i i

    (8)

    are related to Friday. The left hand value in (3) or forecasted belonged to the weekends, not to Friday.

    To include the influence of w week, it is necessary to say that the day number in one week in (3) is not seven. Suppose the working day load curves time series consists of 5 days a week, then (3) will be:

    Here Gi matrix of constants with (N-1)x1-dimension; h- column vector of constants with N-1 elements where. N-1- dimension of input

    i

    i

    i c

    j

    Dj,t

    f j

    (Dj,t 24*5*w

    , , D

    j,t 24*d

    , Dj,t h )

    (4)

    Now denoting

    j1

    (9)

  5. The load curve set is regarded as 24 time series

If there is no correlation between load at t-hour and t-1 hour, the load curve set may be regarded as 24 time series as the following:

Then (8) is rewritten as:

z z (G y h )

z z (G y h )

c c

*

i i i i i

Series 1 consists of the loads at first hour. Series 2 consists of the loads at second hour and so on. Equation (3) shows only the influence of d previous days and w weeks before.

Or:

i1

i1

GT

(10)

  1. DETERMINING FUZZY RULES

    Equation (3) can be approximated by some rules. The

    1

    1

    1

    hT

    number of rules is the number of cluster centers. The paper

    zT yT yT

    will develop the subtractive clustering in [6]. The elements in

    (3) will be considered as members of the following vector:

    1 1

    c c

    c

    c

    GT

    {Dj,t 24*7*w ,, Dj24*d ,, Dj,t h , Dj,t 1 Dj,t }

    (5)

    hT

    c (11)

    With a set of n inputs {y1,,yn}, the set of outputs will be:

    GT

    1

    1

    Input y output z

    zT yT

    yT

    hT

    Examining the set of vectors x, each vector consists of two

    1

    1,1 1 1,1

    c,1 1

    c,1 1

    parts: input y and output z.

    zT yT

    yT

    GT

    Consider a collection of n data points {x1, x2, xn} in an

    n

    1,n n

    1,n c,n n c,n c

    M dimensional space. Using the subtractive clustering

    hT

    c (12)

    proposed by Chiu [6], the set of centers

    determined.

    {xi } will be

    The estimation of G and h in (12) can be realized by mean least square method. After evaluating G and h, for given y at

    Each centre xi * of input y * and output z* will be regarded as one fuzzy rule. For each input vector y, its degree to satisfying the i-fuzy rule is:

    moment t+1, we can calculate the output zt+1 as the one step ahead forecasting using (10).

    The load forecasting for next moment will be carried out

    as:

    e i

    e i

    ||y y*||2

    i

    (6)

    Xt 1

    J

    J

    j1

    Djt 1

    • SJt 1

      D

      1

      1

      t 1

    • D

      2

      2

      t 1

      … D

      J

      J

      t 1

    • SJt 1

    (13)

    For the case of 2.3 and 2.4, the forecasted load at moment t+1 will be used for forecasting load at t+2 and so on. That means this it the 24 steps ahead forecasting.

  2. APPLICATION

    1. The load curve set is an time series

      The data for training are hourly loads of one utility in the South of Vietnam from 3/1/2011 to 7/18/2011. The data for testing is 15 days, from 7/11 to 8/2. The T-test shows that the load at t hour is related to the loads at the following hours by decreased order of importance: t-1, t-24, t-168, t-338, t-672 hour.

      The forecasted load at t+1 hour will be used for forecasting load at the next hour to get the whole daily load curve. The errors are displayed in the Table 1 with the mean error is 6.10%

    2. The load curve set is divided in two time series

      The T-test for the load curves from 03/01/2011 to 12/19/2011 has the best correlation with h=1; d=1; w=4

      These load curves have the eminent difference between the working days (excluded Monday) and weekend. Here the load curve of Monday looks like those of Weekends

      The first time series is the hourly loads for Mondays, Tuesdays, Wednesdays, Thursdays and Fridays. Monday is included to get the influence of the day before Tuesday. The results are presented in Table 2. The mean error is 3.07%. The concrete details of 24 steps ahead forecasting for one day are shown in Tabl.3 and Figure 1.

      For the time series of Fridays, Saturdays, Sundays and Mondays, the load forecasting is carried out for Saturday, Sunday and Monday from 11/12 /2011 to 12/19/2011. The errors are shown in Table 4. The mean error for 18 weekends is 3.43%. The detail forecasting for one day are presented in Table 5.

      TABLE I. The forecasting errors for 15 days (from 7/11

      Hour

      1h

      2h

      3h

      4h

      5h

      6h

      Forecasting

      1636.142

      1571.178

      1532.683

      1513.131

      1522.08

      1580.978

      Real load

      1590.97

      1564.42

      1528.86

      1498.32

      1494.82

      1539.51

      Hour

      7h

      8h

      9h

      10h

      11h

      12h

      Forecasting

      1726.921

      2164.257

      2419.847

      2481.365

      2487.47

      2300.3

      Real load

      1685.25

      2169.66

      2366.97

      2458.47

      2451.23

      2224.27

      Hour

      13h

      14h

      15h

      16h

      17h

      18h

      Fore- casting

      2362.799

      2500.566

      2532.633

      2527.214

      2414.654

      2374.787

      Real load

      2358.43

      2545.47

      2577.82

      2523.32

      2369.02

      2396.61

      Hour

      19h

      20h

      21h

      22h

      23h

      24h

      Forecasting

      2306.952

      2259.074

      2230.387

      2125.045

      1949.436

      1759.001

      Real load

      2308.53

      2260.67

      2212.38

      2093.63

      1900.8

      1722.37

      Hour

      1h

      2h

      3h

      4h

      5h

      6h

      Forecasting

      1636.142

      1571.178

      1532.683

      1513.131

      1522.08

      1580.978

      Real load

      1590.97

      1564.42

      1528.86

      1498.32

      1494.82

      1539.51

      Hour

      7h

      8h

      9h

      10h

      11h

      12h

      Forecasting

      1726.921

      2164.257

      2419.847

      2481.365

      2487.47

      2300.3

      Real load

      1685.25

      2169.66

      2366.97

      2458.47

      2451.23

      2224.27

      Hour

      13h

      14h

      15h

      16h

      17h

      18h

      Fore- casting

      2362.799

      2500.566

      2532.633

      2527.214

      2414.654

      2374.787

      Real load

      2358.43

      2545.47

      2577.82

      2523.32

      2369.02

      2396.61

      Hour

      19h

      20h

      21h

      22h

      23h

      24h

      Forecasting

      2306.952

      2259.074

      2230.387

      2125.045

      1949.436

      1759.001

      Real load

      2308.53

      2260.67

      2212.38

      2093.63

      1900.8

      1722.37

      to 8/2/2011).

      TABLE III. The errors for 20 working days.

      Date

      Tuesday

      Wednesday

      Thursday

      Friday

      Date

      11/01

      11/02

      11/03

      11/04

      Error

      3.93

      4.67

      4.09

      4.53

      Date

      11/08

      11/09

      11/10

      11/11

      Error

      2.77

      3.35

      2.16

      5.01

      Date

      11/15

      11/16

      11/17

      11/18

      Error

      2.48

      2.83

      4.89

      3.77

      Date

      11/22

      11/23

      11/24

      11/25

      Error

      1.93

      3.16

      2.20

      1.66

      Date

      11/29

      11/30

      12/01

      12/02

      Error

      3.04

      3.74

      2.11

      1.40

      Date

      12/06

      12/07

      12/08

      12/09

      Error

      1.89

      1.89

      2.57

      3.57

      Fig.1. Real load curve and forecasted load curve at 12/2/2011 TABLE III. The forecasting load (MW) for 12/2/2011

      Date

      Day

      Error

      7/19/11

      Tuesday

      0.019175

      7/20/11

      Wednesday

      0.038723

      7/21/11

      Thursday

      0.029726

      7/22/11

      Friday

      0.025508

      7/23/11

      Saturday

      0.020794

      7/24/11

      Sunday

      0.047853

      7/25/11

      Monday

      0.05848

      7/26/11

      Tuesday

      0.064154

      7/27/11

      Wednesday

      0.076005

      7/28/11

      Thursday

      0.068913

      7/29/11

      Friday

      0.059469

      7/30/11

      Saturday

      0.104449

      7/31/11

      Sunday

      0.172467

      8/1/11

      Monday

      0.081312

      8/2/11

      Tuesday

      0.04847

    3. The load curve set is expressed as 24 time series

    The forecasting errors are higher than the previous case. The mean error for working days is 3.7%. It was not good because in this model, the influence of the previous hour was neglected meanwhile according to T-test, there is the correlation between load at t hour and t-1 hour.

    TABLE IV. The errors for weekends and mondays from 11/12/2011 to 12/19/2011

    Saturday

    Sunday

    Monday

    Day

    11/12

    11/13

    11/14

    Error (%)

    3.4

    5.35

    2.93

    Day

    11/19

    11/20

    11/21

    Error (%)

    3.16

    1.87

    3.05

    Day

    11/26

    11/27

    11/28

    Error (%)

    1.51

    2.21

    2.55

    Day

    12/3

    12/4

    12/5

    Error (%)

    3.04

    2.27

    2.06

    Day

    12/10

    12/11

    12/12

    Error (%)

    3.38

    5.48

    7.53

    Day

    12/17

    12/18

    12/19

    Error (%)

    3.82

    3.4

    4.81

    TABLE V. Forecasting loads (MW) for 11/26/2011

  3. CONCLUSION

The T-test is necessary for finding the correlation between load at one moment and at the previous moments. It also leads to make the load curve set be treated as one time series, as two time series or 24 time series. If there are the eminent differences between the load curves of working days and weekend, separating in two time series will improve the forecasting accuracy. The MODTW allows finding the series of details and trend for time series of load curves. The details and trend at forecasted moment will be related with themselves in the past. These correlations are expressed by fuzzy rules based on the subtractive methods. Examining for one utility shows that the proposed approach based Wavelet Transform and Fuzzy Logic with T-test has the good result.

ACKNOWLEDGMENT

This work was supported by the Ho Chi Minh City University of Technology and Ho Chi Minh City University of Technology and Education.

REFERENCES

  1. Saif Ahmad, Ademola Popoola, and Khurshid Ahmad, Wavelet-based Multiresolution Forecasting, UniS Technical Report(June 2005).

  2. Wuhan, Hubei, Electrical Load Forecasting Based on Fuzzy Wavelet Neural Networks, Conference on Future Biomedical Information Engineering , pp.122-125, Dec. 2008

  3. Bhavesh Kumar Chauhan1, Madasu Hanmandlu2 , Load forecasting using wavelet fuzzy neural network , Journal International Journal of Knowledge-Based and Intelligent Engineering Systems, IOS Press, Volume 14, Number 2 / 2010.

    Hours

    1h

    2h

    3h

    4h

    5h

    6h

    Forecasted

    1619.956

    1561.104

    1518.818

    1492.393

    1493.631

    1524.365

    Real

    1659.51

    1569.69

    1541.4

    1518.2

    1516.9

    1546.8

    Hours

    7h

    8h

    9h

    10h

    11h

    12h

    Forecasted

    1660.854

    2028.948

    2287.335

    2380.874

    2418.591

    2212.458

    Real

    1688.3

    2142.33

    2335.52

    2398.77

    2407.61

    2169.51

    Hours

    13h

    14h

    15h

    16h

    17h

    18h

    Forecasted

    2252.476

    2367.698

    2364.746

    2363.908

    2177.314

    2135.019

    Real

    2241

    2384.9

    2330.8

    2305.82

    2137.96

    2181.98

    Hours

    19h

    20h

    21h

    22h

    23h

    24h

    Forecasted

    2140.362

    2099.144

    2084.593

    2007.182

    1842.413

    1652.698

    Real

    2110.05

    2095.67

    2113.4

    2022.87

    1844.55

    1684

    Hours

    1h

    2h

    3h

    4h

    5h

    6h

    Forecasted

    1619.956

    1561.104

    1518.818

    1492.393

    1493.631

    1524.365

    Real

    1659.51

    1569.69

    1541.4

    1518.2

    1516.9

    1546.8

    Hours

    7h

    8h

    9h

    10h

    11h

    12h

    Forecasted

    1660.854

    2028.948

    2287.335

    2380.874

    2418.591

    2212.458

    Real

    1688.3

    2142.33

    2335.52

    2398.77

    2407.61

    2169.51

    Hours

    13h

    14h

    15h

    16h

    17h

    18h

    Forecasted

    2252.476

    2367.698

    2364.746

    2363.908

    2177.314

    2135.019

    Real

    2241

    2384.9

    2330.8

    2305.82

    2137.96

    2181.98

    Hours

    19h

    20h

    21h

    22h

    23h

    24h

    Forecasted

    2140.362

    2099.144

    2084.593

    2007.182

    1842.413

    1652.698

    Real

    2110.05

    2095.67

    2113.4

    2022.87

    1844.55

    1684

  4. Yuancheng Li; Bo Li; Tingjian Fang; Short-term load forecast based on fuzzy wavelet, Intelligent Control and Automation, 2004. WCICA 2004. Fifth World Congress on , 15-19 June 2004.

  5. P.T.T.Binh, N.T.Hung, P.Q.Dung, Lee-Hong Hee, Load Forecasting Based on Wavelet Transform and Fuzzy Logic, POWERCON 2012, Aukland,2012.

  6. Chiu, S.. Fuzzy Model Identification Based on Cluster Estimation.

Journal of Intelligent & Fuzzy Systems 2 (3), 1994, 267-278.

Leave a Reply