^{1}

^{*}

^{2}

An analytical model,
*T*
_{A}
(
*t*
), for the observed outside air temperature change,
*T*
_{a}
(
*t*
), with time is developed using two components: one for the variation caused by the Earth’s movement, plus any other quasi-stationary thermodynamic effects due to industrialization; and one for the random variation caused by stochastic and/or chaotic, local environmental changes. The first component,
*T*
_{R}
(
*t*
), describes a regular trend, expressed by periodic functions of time and constants unchanged with time. The second component,
*T*
_{S}, is a random, stochastic variation. For the observed outside air temperature, the analytical model of
*T*
_{A}
(
*t*
)=
*T*
_{R}
(
*t*
) +
*T*
_{S} is such as to give a statistically best approximation for the observed time period with
=
min. Several versions for the
*T*
_{R}
(
*t*
) functions are defined and tested in the study for an example location for 20 years. The best model for
*T*
_{R}
(
*t*
) t is found as a linear function with time plus a variable-coefficient Fourier series with linearly changing amplitude with time. It is found that the final analytical temperature,
*T*
_{A}
(
*t*
), can be used not only to represent the historical daily mean temperature but also to predict the future daily mean temperature at the given location. The upper and lower boundaries give safety limits for the temperature prediction. The stochastic component identified in the model is stable and stationary. The method of model identification for
*T*
_{A}
(
*t*
) can be used for determining input temperature functions for supporting engineering design; or for an unbiased scientific inquiry of temperature change with time in climate studies.

Climate is one of the main elements of the natural environment. Temperature has a direct impact on atmospheric stability, evaporation, precipitation, and many other conditions of life [

Based on the long-term trends study of maximum, minimum and mean annual air temperature, e.g., in the northwest Himalayan region during the twentieth century, increasing trends are seen both in the mean and the diurnal range of temperature. The daily maximum temperatures have increased more rapidly than the decrease in the low temperatures in the last century resulting in a risen mean temperature of about 1.6˚C [

The observed, outside air temperature, T a ( t , τ ) , includes both seasonal components represented by t (days of the year) and τ (the hours in day t). When tabulated, T a ( t , τ ) is a matrix with t rows (the total number of days) and τ columns (24, the number of hours in a day). The outside air temperature has a regular, seasonal variation for the daily average temperature during every year, (more exactly, every four years as a true time period) and an hourly variation during any given day for the hourly mean temperature. These two components are regular and periodic in nature caused by the earth movement in the solar system. Other regular changes superimposed to that of the known movement of the Earth may also be present, such as caused by the heat balance of the globe by industrialization.

The goal of the paper is to separate the observed outside air temperature into variation caused by the Earth’s movement, plus any other quasi-stationary thermodynamic effects, and random variation caused by stochastic and/or chaotic, local environmental changes. It’s necessary to separate the hourly temperate variations from that of the seasonal first. For describing the seasonal temperature variation, the daily average temperature T a ( t ) must be defined. The T a ( t ) may be defined as the integral mean value:

T a ( t ) = 1 T ∫ 0 T T a ( t , τ ) d τ (1)

In (1) t denotes days, τ is the hours in a day, τ ∈ [ 0 , T ] and T = 24 . If hourly average temperature is available, (1) gives the accurate value for the daily average temperature meaning the thermodynamic energy of the air. If the daily mean temperature is obtained from weather service, the data may be given pre-calculated from the Standard Model that provides the average of the daily maximum and minimum air temperatures, T a ( t ) ≅ [ T max ( t ) + T min ( t ) ] / 2 . The Standard Model, reviewed by Bilbao et al. [

Two components of the air temperature are distinguished in the present paper for modeling daily mean air temperature variation, T a ( t ) , with time. The first component, T R ( t ) , describes a regular trend, expressed by functions of time and constants unchanged with time over which the model is defined. The regular trend is defined as stationary for a long period of time, characteristic to a given physical location governed by deterministic causes such as the Earth’s movement in the solar system. The second component, T S , is a random, stochastic variation around the regular trend. The T S component is caused by the stochastic and/or chaotic process in the atmosphere, defined as difference between the observed outside air temperature, T a ( t ) , and the temperature from the stationary trend model as T S = T a − T R . The daily mean value of the outside temperature at any given day is the sum of the regular trend component, T R , and a stochastic variation part, T S :

T a ( t ) = T R ( t ) + T S (2)

Note that the stochastic component is stationary and irrespective of the seasonal variation, a simplification for model formulation. However, the stochastic temperature variation in some part of the year may be more disturbed than in another, raising the possibility for improvement of the assumption used in the current work, a task left for the interested reader.

The analytic function for T R ( t ) must be the best fit to the measured outside temperature data for a given location. The concept of Fourier’s series approximation [

Applying the concept for T R with a mean temperature, T m , and a harmonic variation component, ∑ A ω i sin ( ω i t + α i ) , where A ω i is the amplitude of the harmonic variation component of f i ( t ) = sin ( ω i t + α i ) . The amplitude, A ω i , may be a linear function of time in some models, A ω i ( t ) = d 1 , i + d 2 , i t .

There are various choices to model the T R component, listed as M1 through M5. The M1-type model is a linear function. It has the least coefficients and can be used to describe the yearly mean temperature change. However, it does not have the ability to reflect any periodic temperature variation. The M2-type model is the general Fourier series function. It assumes that the yearly mean temperature and amplitudes for the pre-selected, finite number of frequencies are constant. It might be accurate for a short period of time, such as one year. The problem with the M2-type model is that it cannot reflect the long-term average, the maximum and the minimum temperature changes with time as in Bhutiyani’s study [

Therefore, the five different models tested are as follows:

M1. Variable mean temperature

T R ( t ) = T m + c ⋅ t (3)

M2. Constant mean temperature and constant amplitude series:

T R ( t ) = T m + ∑ A ω i f ( t ) (4)

M3. Variable mean temperature and constant amplitude series:

T R ( t ) = T m ( t ) + ∑ A ω i f ( t ) (5)

M4. Constant mean temperature and variable amplitude series:

T R ( t ) = T m + ∑ A ω i ( t ) f ( t ) (6)

M5. Variable mean temperature and variable amplitude series:

T R ( t ) = T m ( t ) + ∑ A ω i ( t ) f ( t ) (7)

The M1-type model is used for comparison with other models for the yearly mean temperature variation evaluation. Bhutiyani [

The first task is to depress the stochastic temperature variation in order to find the statistically most significant trend for a base stationary temperature model. Second, the stochastic or chaotic deviations must be defined to match the observed temperature. Therefore, the designers or analysts may conduct their studies or design safely without missing the expected, maximum or minimum temperature values for the study time period.

Daily mean temperature measurements are used in the study from a middle-west location in North America for 20 years. The data, T a ( t ) , are taken for 7305 days from 04/01/1996 to 03/31/2016, downloaded from https://www.wunderground.com, and plotted in Figures 2(a).

In the first usage of the data, T a ( t ) , the measured mean daily temperatures for 20 years are divided into twenty sets for single year from 1 to 20 to be able to analyze the model validity for model type M1 and M3. Individual yearly data, T a , y ( t ) , y ∈ [ 1 , 20 ] , t ∈ [ 1 , 365 ] are grouped for 15 regular years and t ∈ [ 1 , 366 ] for 5 leap years.

In the second usage of the data, T a ( t ) , the 20 years measured mean daily temperatures are divided into 4-year period sets, giving 5 groups as 1), 2), 3), 4) and 5). The justification of employing four years as the true solar time period is that the yearly time period for regular years is distorted by the deficiency of 0.25 days while the time period in leap years is longer by 0.75 days and affecting the averaging. The 4-year time sequence of 1461 days is considered as the repeating time period for the T S stationary temperature component in the model. Therefore, properties of temperature (average, etc.) also must be considered distinguished when evaluating for the yearly time period.

1) For year 1 - 4:

T a , a = T a ( t ) , t ∈ [ 1 , 1461 ] (8)

2) For year 5 - 8:

T a , b = T a ( t ) , t ∈ [ 1462 , 2922 ] (9)

3) For year 9 - 12:

T a , c = T a ( t ) , t ∈ [ 2923 , 4383 ] (10)

4) For year 13 - 16:

T a , d = T a ( t ) , t ∈ [ 4384 , 5844 ] (11)

5) For year 17 - 20:

T a , e = T a ( t ) , t ∈ [ 5846 , 7305 ] (12)

In the third usage of the data, T a ( t ) , the total 20 years 7,305 data, T a ( t ) , t ∈ [ 1 , 7305 ] , are used for testing and establishing properties of T R ( t ) in model type M4 and M5.

To determine the model coefficients, the obvious choice is to use the Least-squares (LSQ) fit method. The LSQ method optimally fits the data to a given function with unknown constant parameters in such a way that the root-mean-square of the error between model and measured data is minimalized. The expected, fitted equations represent the significant, regular temperature trend, T R . Determining the significant part of the data out of a noisy observation may also be done by filtering or neural network. The advantage of using the LSQ method is to be able to define function T R in advance, whereas signal processing does not give an analytical form of such a function [

For supporting the ways of using measured data, different fitting function may be defined with a number of unknown parameters to be determined by the LSQ fitting algorithm. Five different fitting functions may be considered for M1, M3 and M5 model types as follows.

The LSQ linear function is:

T m t = T m + c ⋅ t (13)

For single year data in the M2-type model, a regular year of 365 days and a leap year of 366 days must be distinguished. The LSQ function for regular year is written as:

T R , y ( t ) = T m + c × t + A 1 × sin ( 2 π × t × a 1 + b 1 ) + A 2 × sin ( 2 π × t × a 2 + b 2 ) + A 3 × sin ( 2 π × t × a 3 + b 3 ) + ⋯ + A 15 × sin ( 2 π × t × a 15 + b 15 ) (14)

where a k = 1 365 × { { 2 i − 1 } , 3 × { 2 j − 1 } } ; for k ∈ [ 1 , 15 ] , k = i + j ; i = i ∈ [ 1 , 8 ] ;

j ∈ [ 1 , 7 ] , all fixed frequency components. Unknown parameters are T_{m}, c, A_{1} through A_{15}, and b_{1} through b_{15}.

The LSQ function for leap year is derived from (14) by changing 365 days to 366 days as:

T R , y ( t ) = T m + c × t + A 1 × sin ( 2 π × t × a 1 + b 1 ) + A 2 × sin ( 2 π × t × a 2 + b 2 ) + A 3 × sin ( 2 π × t × a 3 + b 3 ) + ⋯ + A 15 × sin ( 2 π × t × a 15 + b 15 ) (15)

where a k = 1 365 × { { 2 i − 1 } , 3 × { 2 j − 1 } } ; for k ∈ [ 1 , 15 ] , k = i + j ; i = i ∈ [ 1 , 8 ] ;

j ∈ [ 1 , 7 ] , all fixed frequency components. Unknown parameters are T_{m}, c, A_{1} through A_{15}, and b_{1} through b_{15}.

The LSQ function for 4 years data and 20 years data will add the four-year and two-year period frequencies, also will change the one-year period to 365.25 days, the function is written as:

T R ( t ) = T m + c × t + A 1 × sin ( 2 π × t × a 1 + b 1 ) + A 2 × sin ( 2 π × t × a 2 + b 2 ) + A 3 × sin ( 2 π × t × a 3 + b 3 ) + ⋯ + A 17 × sin ( 2 π × t × a 17 + b 17 ) (16)

where a k = 1 365.25 × { { 2 i − 3 } , 3 × { 2 j − 1 } } ; for k ∈ [ 1 , 17 ] , k = i + j ; i = i ∈ [ 1 , 10 ] ;

j ∈ [ 1 , 7 ] , all fixed frequency components. Unknown parameters are T_{m}, c, A_{1} through A_{17} and b_{1} through b_{17}.

With the assumption that the amplitudes may also vary with time, a modified LSQ function over (16) is established as:

T R ( t ) = T m + c × t + ( A 1 c + A 1 v × t ) × sin ( 2 π × t × a 1 + b 1 ) + ( A 2 c + A 2 v × t ) × sin ( 2 π × t × a 2 + b 2 ) + ⋯ + ( A 17 c + A 17 v × t ) × sin ( 2 π × t × a 17 + b 17 ) (17)

where a k = 1 365.25 × { { 2 i − 3 } , 3 × { 2 j − 1 } } ; for k ∈ [ 1 , 17 ] , k = i + j ; i = i ∈ [ 1 , 10 ] ;

j ∈ [ 1 , 7 ] , all fixed frequency components. Unknown parameters are T_{m}, c, A 1 c through A 17 c , A 1 v through A 17 v , and b_{1} through b_{17}.

The LSQ fitting method is used to determine the mean temperature trend, T R ( t ) , of the measured data, T a ( t ) . The LSQ method provides the statistically most significant result for T R ( t ) as a regular, deterministic trend, depressing the random variation component of temperature around T R ( t ) with assumed, normal distribution as a noise due to stochastic or chaotic causes.

First, the best LSQ fit is determined on all single year data separately. The parameters of function (14) are applied for the regular years, and (15) is applied for the leap years. The fitted results are shown in

Second, the best LSQ fit is found using function (16) for five 4-year temperature data sets separately. The fitted results are shown in

result are shown in

Third, the best LSQ fit is determined using the M3-type function (16) for all 20 years data, T a ( t ) , together. The fitted function is depicted in

The stochastic variation must be defined by subtracting the statistic periodic function from the measured data. First, the stochastic variation, T S ( t ) , is defined by the difference between deterministic function result, T R ( t ) and measured data T a ( t ) as:

T S ( t ) = T a ( t ) − T R ( t ) (18)

The data of (18) is depicted in

σ = 1 n ∑ ( T S ( t ) − m u ) 2 and n is the number of days, t. Using m u = 0.00 and

σ = 4.0012 , a normally-distributed random noise series is generated to represent T S ( t ) . The N o r m R a n d o m ( m u , σ , t ) function is used in Matlab that

T_{m} | c | A_{1} | a_{1} | A_{2} | a_{2} | A_{3} | a_{3} | A_{4} | a_{4} |
---|---|---|---|---|---|---|---|---|---|

0.731 | −2.48 × 10^{−4} | 0.527 | 0.982 | 0.282 | 2.468 | 10.616 | −0.343 | 2.281 | 3.248 |

A_{5} | a_{5} | A_{6} | a_{6} | A_{7} | a_{7} | A_{8} | a_{8} | A_{9} | a_{9} |

0.257 | 2.759 | 0.584 | −0.816 | 0.798 | 2.985 | 0.072 | −0.525 | 0.399 | 4.295 |

A_{10} | a_{10} | A_{11} | a_{11} | A_{12} | a_{12} | A_{13} | a_{13} | A_{14} | a_{14} |

0.285 | −0.807 | 0.181 | −17.310 | 0.154 | 1.797 | 0.117 | 0.950 | 0.092 | 3.044 |

A_{15} | a_{15} | A_{16} | a_{16} | A_{17} | a_{17} | ||||

0.080 | 1.639 | 0.142 | 4.827 | 0.043 | 1.059 |

T_{m} | c | A_{1} | a_{1} | A_{2} | a_{2} | A_{3} | a_{3} | A_{4} | a_{4} |
---|---|---|---|---|---|---|---|---|---|

9.208 | 1.038 × 10^{−3} | 0.377 | −1.097 | 0.529 | 3.677 | 11.764 | −0.306 | 1.836 | 3.450 |

A_{5} | a_{5} | A_{6} | a_{6} | A_{7} | a_{7} | A_{8} | a_{8} | A_{9} | a_{9} |

0.231 | 2.328 | 0.723 | 2.227 | 0.520 | 2.349 | 0.158 | 0.536 | 0.496 | 3.325 |

A_{10} | a_{10} | A_{11} | a_{11} | A_{12} | a_{12} | A_{13} | a_{13} | A_{14} | a_{14} |

0.287 | 0.278 | 0.116 | 2.199 | 0.234 | −17.177 | 0.472 | 3.009 | 0.221 | 3.941 |

A_{15} | a_{15} | A_{16} | a_{16} | A_{17} | a_{17} | ||||

0.178 | −0.255 | 0.124 | 3.226 | 0.025 | 10.918 |

T_{m} | c | A_{1} | a_{1} | A_{2} | a_{2} | A_{3} | a_{3} | A_{4} | a_{4} |
---|---|---|---|---|---|---|---|---|---|

10.735 | −1.715 × 10^{−3} | 1.028 | 3.156 | 0.756 | 2.810 | 11.843 | −0.268 | 1.931 | 3.252 |

A_{5} | a_{5} | A_{6} | a_{6} | A_{7} | a_{7} | A_{8} | a_{8} | A_{9} | a_{9} |

1.239 | 2.354 | 0.431 | −0.535 | 0.766 | −1.907 | 0.558 | 1.876 | 0.381 | −0.040 |

A_{10} | a_{10} | A_{11} | a_{11} | A_{12} | a_{12} | A_{13} | a_{13} | A_{14} | a_{14} |

0.372 | 1.158 | 0.195 | 3.605 | 0.341 | −0.427 | 0.073 | 4.363 | 0.157 | 1.924 |

A_{15} | a_{15} | A_{16} | a_{16} | A_{17} | a_{17} | ||||

0.167 | 2.334 | 0.036 | 4.023 | 0.044 | 4.296 |

T_{m} | c | A_{1} | a_{1} | A_{2} | a_{2} | A_{3} | a_{3} | A_{4} | a_{4} |
---|---|---|---|---|---|---|---|---|---|

9.242 | 2.539 × 10^{−4} | 0.645 | 0.197 | 0.427 | −0.565 | 11.355 | −0.357 | 2.566 | 3.116 |

A_{5} | a_{5} | A_{6} | a_{6} | A_{7} | a_{7} | A_{8} | a_{8} | A_{9} | a_{9} |

0.543 | 3.556 | 0.790 | 0.205 | 0.723 | 3.246 | 0.252 | 2.302 | 0.482 | 3.649 |

A_{10} | a_{10} | A_{11} | a_{11} | A_{12} | a_{12} | A_{13} | a_{13} | A_{14} | a_{14} |

0.417 | 2.377 | 0.592 | 1.153 | 0.175 | 2.700 | 0.153 | 2.907 | 0.084 | −7.709 |

A_{15} | a_{15} | A_{16} | a_{16} | A_{17} | a_{17} | ||||

0.077 | 2.546 | 0.046 | 0.520 | 0.100 | 3.479 |

T_{m} | c | A_{1} | a_{1} | A_{2} | a_{2} | A_{3} | a_{3} | A_{4} | a_{4} |
---|---|---|---|---|---|---|---|---|---|

10.289 | 1.073 × 10^{−4} | 0.883 | 3.050 | 0.372 | 0.298 | 11.798 | −0.288 | 2.138 | 2.958 |

A_{5} | a_{5} | A_{6} | a_{6} | A_{7} | a_{7} | A_{8} | a_{8} | A_{9} | a_{9} |

1.234 | 3.131 | 0.412 | 10.828 | 0.110 | 1.318 | 0.206 | 4.330 | 0.308 | −0.624 |

A_{10} | a_{10} | A_{11} | a_{11} | A_{12} | a_{12} | A_{13} | a_{13} | A_{14} | a_{14} |

0.081 | 2.438 | 0.126 | 2.924 | 0.125 | 4.272 | 0.147 | 7.643 | 0.243 | 2.356 |

A_{15} | a_{15} | A_{16} | a_{16} | A_{17} | a_{17} | ||||

0.038 | 1.193 | 0.080 | 3.917 | 0.063 | 2.760 |

T_{m} | c | A_{1} | a_{1} | A_{2} | a_{2} | A_{3} | a_{3} | A_{4} | a_{4} |
---|---|---|---|---|---|---|---|---|---|

9.489 | 7.367 × 10^{−5} | 0.095 | 2.478 | 0.092 | 3.072 | 11.489 | −0.311 | 2.113 | 3.194 |

A_{5} | a_{5} | A_{6} | a_{6} | A_{7} | a_{7} | A_{8} | a_{8} | A_{9} | a_{9} |

0.622 | 2.836 | 0.222 | −0.292 | 0.418 | 3.237 | 0.122 | 1.909 | 0.197 | 4.259 |

A_{10} | a_{10} | A_{11} | a_{11} | A_{12} | a_{12} | A_{13} | a_{13} | A_{14} | a_{14} |

0.123 | 1.122 | 0.153 | 1.723 | 0.041 | 1.358 | 0.121 | 2.682 | 0.093 | 2.957 |

A_{15} | a_{15} | A_{16} | a_{16} | A_{17} | a_{17} | ||||

0.047 | −4.723 | 0.052 | 4.109 | 0.031 | 3.388 |

T_{m} | c | A_{1c} | A_{1v} | b_{1} | A_{2c} | A_{2v} | b_{2} | |
---|---|---|---|---|---|---|---|---|

9.477 | 7.367 × 10 − 5 | 0.392 | − 1.254 × 10 − 4 | 0.066 | 0.611 | − 1.409 × 10 − 4 | 3.017 | |

A_{3c} | A_{3v} | b_{3} | A_{4c} | A_{4v} | b_{4} | A_{5c} | A_{5v} | b_{5} |

11.099 | 1.066 × 10 − 4 | −0.312 | 1.953 | 4.541 × 10 − 5 | 3.196 | 0.020 | 1.635 × 10 − 4 | 2.989 |

A_{6c} | A_{6v} | b_{6} | A_{7c} | A_{7v} | b_{7} | A_{8c} | A_{8v} | b_{8} |

0.163 | 1.524 × 10 − 5 | −0.244 | 0.731 | − 8.600 × 10 − 5 | 3.187 | 0.216 | − 2.764 × 10 − 5 | 1.628 |

A_{9c} | A_{9v} | b_{9} | A_{10c} | A_{10v} | b_{10} | A_{11c} | A_{11v} | b_{11} |

0.281 | − 2.284 × 10 − 5 | 4.153 | 0.311 | − 9.252 × 10 − 5 | -0.670 | 0.134 | 4.996 × 10 − 6 | −4.559 |

A_{12c} | A_{12v} | b_{12} | A_{13c} | A_{13v} | b_{13} | A_{14c} | A_{14v} | b_{14} |

0.248 | − 5.649 × 10 − 5 | 1.295 | 0.142 | − 5.864 × 10 − 5 | 2.677 | 0.049 | 1.188 × 10 − 5 | 2.816 |

A_{15c} | A_{15v} | b_{15} | A_{16c} | A_{16v} | b_{16} | A_{17c} | A_{17v} | b_{17} |

0.014 | 8.843 × 10 − 6 | 1.729 | 0.099 | − 1.302 × 10 − 5 | 4.222 | 0.029 | − 1.647 × 10 − 5 | 0.078 |

generates t number of random values from the normal distribution with a mean value m u , and standard deviation value σ . Applying it to T S ( t ) , it gives:

T S ( t ) = N o r m R a n d o m ( m u , σ , t ) (19)

For T A ( t ) , (19) is added to (17):

T A ( t ) = T R ( t ) + N o r m R a n d o m ( m u , σ , t ) (20)

However, (20) is not an analytical function since it includes an algorithm. To overcome this and understanding that the daily variation for random causes is a sample of T S ( t ) , the maximum and minimum values can be generated with a 99 per cent confidence by a fluctuating temperature with a 2-day cycle time:

T A ( t ) = T R ( t ) + 3 σ ( − 1 ) t (21)

Substituting the preferred model in (17), the final analytical temperature model, T A ( t ) is:

T A ( t ) = T m + c × t + ( A 1 c + A 1 v × t ) × sin ( 2 π × t × a 1 + b 1 ) + ( A 2 c + A 2 v × t ) × sin ( 2 π × t × a 2 + b 2 ) + ⋯ + ( A 17 c + A 17 v × t ) × sin ( 2 π × t × a 17 + b 17 ) + 3 σ ( − 1 ) t (22)

Comparison between simulated temperature, T A ( t ) , from (20) and measured data, T a ( t ) , is show in

For comparison purposes, the linear regression function (13) for the entire 20-year data set is applied to various, fitted model results; or original, unprocessed data. The fitted model results for the shorter time periods represent the significant part of the repeated trends whereas the noise is intentionally depressed in the LSQ norm sense. Therefore, a linear regression evaluation for the 20-year long time period is assumed to evaluate the most significant, time-average of the linear change in the magnitude of T a ( t ) . Common expectation dictates that the linear regression evaluation for the 20-year long time period of the original data may provide an un-biased result for the linear change in the magnitude of T a ( t ) . The following studied are completed for fitting a longer-time linear regression to model results, T R ( t ) , of shorter time periods:

a) Yearly mean temperatures, T R ( t ) , (for 15 regular and 5 leap years) evaluated from fitted function to single-years data, T a , y ( t ) with M1-type model;

b) Yearly mean temperatures, T R ( t ) , evaluated from fitted function data to 4-year data sets, T a , a ( t ) , ⋯ , T a , e ( t ) with M1-type model;

c) Yearly mean temperatures, T R ( t ) , from fitted function to continuous 20 years data, T a ( t ) with M1-type model;

d) 20 years measured data, T a ( t ) , used unprocessed.

The results from the evaluation are listed in

Data and Model Type | T_{m} | c | RMS |
---|---|---|---|

a) Yearly mean temperature evaluated from fitted function data, T R ( t ) , using single-years data, T a , y ( t ) and M1-type model | 9.456 | 7.88 × 10 − 5 | 0.603 |

b) Yearly mean temperature evaluated from fitted function data, T R ( t ) , using 4-year sets, T a , a ( t ) , ⋯ , T a , e ( t ) and M1-type model | 9.481 | 7.23 × 10 − 5 | 0.390 |

c) Yearly mean temperature from fitted function data, T R ( t ) , using continuous 20 years data, T a ( t ) and M1-type model | 9.456 | 7.88 × 10 − 5 | 0.603 |

d) 20 years measured data, T a ( t ) , unprocessed; M1-type model | 9.929 | − 4.66 × 10 − 5 | (9.206) |

e) 20 years measured data, T a ( t ) , unprocessed; M3-type model | 9.489 | 7.37 × 10 − 5 | (4.026) |

f) 20 years measured data, T a ( t ) , unprocessed; M5-type model | 9.477 | 7.42 × 10 − 5 | (4.001) |

g) 20 years model output data, T R ( t ) from M5-type model | 9.929 | − 4.66 × 10 − 5 | (8.291) |

temperature is 365.25 days, giving a rounding error with a weight of −0.25/4 days for the regular years and of +0.75/4 day for the leap year in the single-year model fits. The model fit to the 4-year time periods does not have the rounding error problem and, therefore, a smoother fit is expected. Indeed, the RMS error of 0.39 is lower for case b) than value of 0.603 for case a).

The results in case c) is identical to those of case a) for obvious reason of using the same linear regression repeated two times sequentially, the second time obtaining zero RMS value. The result for case d) is very different from those in cases a) through c). Why does a 20-year long data set gives an average decrease of temperature change negative that would translate to “global cooling” as opposed to “global warming” for the example location? The answer is the wrong-type function choice for the most significant variation trend, T R ( t ) , being a linear function with time. This exercise highlights the importance of the selection for the shape of T R ( t ) . If a form as inadequate as a linear function is selected for T R ( t ) for estimating the periodic nature of the outside temperature, the coefficients of the function cannot be trusted even for the general slope, as demonstrated with case d).

Two more choices are also studied for comparison for evaluating the linear trend which the models already include as the mean value, T m , and the slope, c. Due to these built-in components, no additional, linear regression fit is needed for determining the values of T m and c:

e) 20 years measured data, T a ( t ) , unprocessed; M3-type model;

f) 20 years measured data, T a ( t ) , unprocessed; M5-type model.

The results from the evaluation are listed in

Re-fitting another linear regression model to the model output data, T R ( t ) , from the M5-type model in case f) for re-capturing the mean value, T m , and the slope, c, does not give back the same values as those built in the best-fit model, shown in case g) in

The air temperature model component for the description of the random part due to stochastic or chaotic causes is simplified to be time-independent. The stochastic component, T R , satisfies the zero mean value and zero slope with time. No attempt has been made to vary the magnitude of randomness with the seasons. Refinement for this component is left for the interested reader. The observed histogram for the example shows a close-to normal distribution, allowing to estimate the error limit for daily mean temperature fluctuations from the standard deviation, σ, obtained from model identification.

The complete temperature model is given in (21) and (22). The model predicts the daily average temperature variation with time as well as the expected the maximum and minimum temperatures due to stochastic process components. The comparison between measured data and model prediction with ±3σ amplitude around the T R ( t ) function from the M5-type model in (22) is illustrated in

w Analytical functional forms and their numerical algorithms are presented for representing the measured time-variable outside air temperature, T a ( t ) for engineering design and analysis of the human environment. The algorithms for T R ( t ) and T S are easy to use for processing the available data sets, T a ( t ) , at any physical location from the weather service, typically using several tens of thousands of measured values. In the final functional form of the outside air temperature function, T A ( t ) , only a few dozens of constants are needed.

w The final analytical temperature, T A ( t ) , can be used not only to represent the historical temperature data but also to predict the future temperature variations at any given location from which the input data is used from measurements. The upper and lower boundaries may be used for safe temperature prediction.

w The regular component of temperature change with time, T R ( t ) , in the M5-type model is described by a linear function plus a time-variable Fourier series to represent the long term linear change both in mean temperature and amplitude. Only 53 constants are needed, obtainable from the presented method, to represent the outside mean air temperature at any day of the year as long as need over decades of time.

w The confidence interval for the stochastic variation may be selected by the user via the multiplication factor of the standard deviation of the model match between measured, time-variable outside air temperature, T a ( t ) and the regular component in the analytical mode, T R ( t ) .

w The stochastic component used in the final model, T A ( t ) , is stable and stationary. The variability of the stochastic component over the season of the year may be considered in a future study, but presently is omitted for simplicity.

w The study shows that the prediction of temperature trends such as for cooling or warming in the future can only be evaluated using an M5-type model fit to the data. The trend-setting components, such as the annual change of the mean temperature or the variation of the amplitude change with time of the periodic components can only be evaluated with a model which has these components built into the structure of the model.

w The minimum, adequate time period for building an outside air temperature model is 4 years, the periodic cycle time of the solar environment. It is recommended to use a multiple of the 4-year periods for model-building (e.g., the 5 × 4 = 20 years period in present study) preferably for as long a time period as data are available.

A research grant from National Institute of Occupational Safety and Health (NIOSH) is gratefully recognized. The research was thankfully supported by the GINOP-2.3.2-15-2016-00010 “Development of enhanced engineering methods with the aim at utilization of subterranean energy resources” project of the Research Institute of Applied Earth Sciences of the University of Miskolc in the framework of the Széchenyi 2020 Plan, funded by the European Union, co-financed by the European Structural and Investment Funds.

The authors declare no conflicts of interest regarding the publication of this paper.

Danko, G. and Lu, C. (2018) Variable Daily Air Temperature Model for Analysis and Design. Applied Mathematics, 9, 1015-1038. https://doi.org/10.4236/am.2018.98069