Forecasting project (Meandering pattern)
PA 453 - Forecasting Project
(CASE)

(Figure 1) drawing the graph at the beginning

Meandering pattern, which does not contain any systematic or repeating pattern
(APPROACH)
Two ways in forecasting analysis on the meandering pattern
1. Trend analysis
2. Autoregressive modeling
Which forecasting ways I choose will fundamentally depend on policy goal/objectives and environment (such as time, resources and purpose). Herein I will try to carry out both ways.
I will carry out trend analysis to detect the underlying pattern in the data.
(Procedure)
1. Creating regression model
2. Check the model’s figure and R² in each regression model
3. Residual analysis for determining residual values
4. Choosing best model and put independent variables for forecasting
|
|
Regression model |
Standard error of the estimate |
Linear |
Y=0.2529x+2.6705 |
R²=0.1251 |
|
Logarithmic |
Y=0.767LN(x)+3.2668 |
R²=0.0352 |
|
Polynomial |
Y=0.0685x²-0.8427x+5.7745 |
R²=0.2604 |
|
Power curve |
Y=1.973x^0.2224 |
R²=0.0185 |
|
Exponential curve |
Y=1.4625e^0.0891x |
R²=0.0971 |





Selection criteria 1: we choose the curve with the highest R²
Under the criteria 1, I can choose polynomial curve (R²=0.2604).
Selection criteria 2: minimum MAPE (we select the curve that has smallest error over the most recent several months)
The result from computing the Mean Absolute Percentage Error (MAPE) is as following.
(Figure2) Calculation of MAPE

|
|
Linear |
Logarithmic |
Polynomial |
Power |
Exponential |
|
MAPE |
13.60% |
26.50% |
15.66% |
50.70% |
28.87% |
For example, over the most recent three months, on average, the predicted liquid cash based on the liner model differed from the actual liquid cash by 13.60%. In conclusion, based on the second criteria, liner regression model is best among the five models.
I therefore choose the liner model for two reasons. First, its MAPE was lowest value. In addition, I reject the polynomial model because it indicated that liquid cash had peaked and was going down. This situation is not consistent.
(Figure 3) result of forecasting on the basis of polynomial model

Before I use the liner model for forecasting, I should conduct a residual analysis to determine if the model satisfies residual value. The residual plot for a model that satisfies the statistical independence assumption should be horizontal band of residual centered at zero that contain no pattern. Figure 5 suggests there is no pattern to the residuals over time.
(Figure 4) residual analysis

(Figure 5) drawing residual graph

I can use liner model for short-term forecasting. The forecasting result is as following.

On the other, I will check the result of autoregressive model approach by using Excel data-analysis regression.
(The Regression Equation)
“The regression equation is the equation for the straight line that best describes a set of data point. The equation is asymmetric; therefore, the independent and dependent variables must be designated before calculating the equation” (Sullivan & Rassel, 429).
Y = a + b*x
Y= dependent variable (in this case, yield (t-1))
a= intercept or constant
b= regression coefficient for X associated with Y
x= independent variable (in this case, yield (t))
The regression equation gives the independent effect of x (yield t-1) over increase (or decrease) of yield t in this case. (Articulating relationships between yield t and yield t-1 during study period)
(Procedure of forecasting in the autoregressive model)
1. Getting data on two interval variables
2. Plotting the data on a scatter graph (the dependent variable Y is graphed on the vertical axis while the independent variable x is graphed on the horizontal axis.)
3. Deciding whether the plotted data can be appropriately accommodate in a straight line
4. Calculating the regression equation
5. Doing Analysis of Variance (ANOVA) to make the correlation clear
6. Putting data into the regression equation for forecasting implementation
(Figure 6) assignment of two variables (yield t and yield t-1)

(Figure 7) drawing a scatter graph (It looks like there will be no statistically clear relationship between two variables)

(Figure 8) ANOVA

Figure 7 is the scatter plot of yield t for a period versus yield t-1 for the previous period for month 2-15. The swarm of the 15 data points is neither upward nor downward. That is the figure is scattered. It suggest that yield t and yield t-1 are not positively and linearly related. In addition, From the ANOVA table (figure 8), the p-value for the model is 0.8804. In terms of hypothesis testing (null= there is no relationship between yield t and yield t-1), 0.8804 is bigger than 0.05 % significant level. So, I cannot reject the null. Consequently, there is no possibility to use data-analysis regression tool to fit a liner regression model to this case.
In conclusion, I will recommend the forecasting data based on trend analysis, but not autoregressive analysis.