It has been shown that a neural network with sufficient hidden units can approximate any continuous function defined on a closed and bounded set. This has inspired the use of neural networks as general nonlinear regression models. As with other nonlinear regression models, tools of conventional statistical analysis can be applied to neural networks to yield a test for the relevance or irrelevance of a free parameter. The test, a version of Wald’s test, can be extended to yield a test for the relevance or irrelevance of an input variable. This test was applied to the building energy use data of the Energy Prediction Shootout II contest. Input variables were selected by initially constructing a neural network model which had many inputs, then cutting out the inputs which were deemed irrelevant on the basis of the Wald test. Time-lagged values were included for some input variables, with the time lag chosen by inspecting the autocovariance function of the candidate variable. The results of the contest entry are summarized, and the benefits of applying Wald’s test to this problem are assessed.

1.
Haberl
,
J.
, and
Thamilseran
,
S.
,
1996
, “
Predicting Hourly Building Energy Use: The Great Energy Predictor Shootout II: Measuring Retrofit Savings
,”
ASHRAE Trans.
,
102
,
Part 2
Part 2
.
2.
MacKay
,
D.
,
1994
, “
Bayesian Nonlinear Modeling for the Prediction Competition
,”
ASHRAE Trans.
,
100
(
2
), pp.
1053
1062
.
3.
Ohlsson
,
M.
,
Peterson
,
C.
,
Pi
,
H.
,
Rognvaldsson
,
T.
, and
Soderberg
,
B.
,
1994
, “
Predicting System Loads with Artificial Neural Networks
,”
ASHRAE Trans.
,
100
(
2
), pp.
1063
1074
.
4.
Feuston
,
B.
, and
Thurtell
,
J.
,
1994
, “
Generalized Nonlinear Regression with Ensemble of Neural Nets
,”
ASHRAE Trans.
,
100
(
2
), pp.
1075
1080
.
5.
Cybenko
,
G.
,
1989
, “
Approximation by Superpositions of a Sigmoidal Function
,”
Math. Control, Signals, Syst.
,
2
,
303
314
.
6.
Hornik
,
K.
,
Stinchcombe
,
M.
, and
White
,
H.
,
1989
, “
Multilayer Feedforward Networks are Universal Approximators
,”
Neural Networks
,
2
, pp.
359
366
.
7.
Dodier, R., 1995, “Statistical Properties of Neural Networks, with Application to Building Energy Prediction,” Master’s thesis. U. Colorado, Boulder, Colorado.
8.
Kreider
,
J.
,
Claridge
,
D.
,
Curtiss
,
P.
,
Dodier
,
R.
,
Haberl
,
J.
, and
Krarti
,
M.
,
1995
, “
Building Energy Use Prediction and System Identification using Recurrent Neural Networks
,”
J. Sol. Energy Eng.
,
117
(
3
), pp.
161
166
.
9.
Seber, G. A. F., and Wild, C. J. 1989, Nonlinear Regression, John Wiley and Sons, New York.
10.
Raftery, A., 1994, Bayesian Model Selection in Social Research, in Sociological Methodology 1995, edited by P. Marsden, Cambridge, Mass., Blackwells.
11.
Katipamula
,
S.
,
1996
, “
The Great Energy Predictor Shootout II: Modeling Energy Use in Large Commercial Buildings
,” ASHRAE Trans., 100(2).
12.
Abarbanel
,
H. D. I.
,
Brown
,
R.
,
Sidorowich
,
J.
, and
Tsimring
,
L. S.
,
1993
, “
Analysis of Observed Chaotic Data in Physical Systems
,”
Rev. Mod. Phys.
,
65
(
4
), pp.
1331
1392
.
13.
Press, W., Flannery, B., Teukolsky, S., and Vetterling, W., 1988, Numerical Recipes in C, Cambridge, Cambridge University Press.
14.
Raftery
,
A.
,
Madigan
,
D.
, and
Hoeting
,
J. A.
,
1997
, “
Bayesian Model Averaging for Linear Regression Models
,”
J American Statistical Assoc
,
437
, pp.
179
191
.
You do not currently have access to this content.