$$ \newcommand{\esp}[1]{\mathbb{E}\left(#1\right)} \newcommand{\var}[1]{\mbox{Var}\left(#1\right)} \newcommand{\deriv}[1]{\dot{#1}(t)} \newcommand{\prob}[1]{ \mathbb{P}\!(#1)} \newcommand{\eqdef}{\mathop{=}\limits^{\mathrm{def}}} \newcommand{\by}{\boldsymbol{y}} \newcommand{\bc}{\boldsymbol{c}} \newcommand{\bpsi}{\boldsymbol{\psi}} \def\pmacro{\texttt{p}} \def\like{{\cal L}} \def\llike{{\cal LL}} \def\logit{{\rm logit}} \def\probit{{\rm probit}} \def\one{{\rm 1\!I}} \def\iid{\mathop{\sim}_{\rm i.i.d.}} \def\simh0{\mathop{\sim}_{H_0}} \def\df{\texttt{df}} \def\res{e} \def\xomega{x} \newcommand{\argmin}[1]{{\rm arg}\min_{#1}} \newcommand{\argmax}[1]{{\rm arg}\max_{#1}} \newcommand{\Rset}{\mbox{$\mathbb{R}$}} \def\param{\theta} \def\setparam{\Theta} \def\xnew{x_{\rm new}} \def\fnew{f_{\rm new}} \def\ynew{y_{\rm new}} \def\nnew{n_{\rm new}} \def\enew{e_{\rm new}} \def\Xnew{X_{\rm new}} \def\hfnew{\widehat{\fnew}} \def\degree{m} \def\nbeta{d} \newcommand{\limite}[1]{\mathop{\longrightarrow}\limits_{#1}} \def\ka{k{\scriptstyle a}} \def\ska{k{\scriptscriptstyle a}} \def\kel{k{\scriptstyle e}} \def\skel{k{\scriptscriptstyle e}} \def\cl{C{\small l}} \def\Tlag{T\hspace{-0.1em}{\scriptstyle lag}} \def\sTlag{T\hspace{-0.07em}{\scriptscriptstyle lag}} \def\Tk{T\hspace{-0.1em}{\scriptstyle k0}} \def\sTk{T\hspace{-0.07em}{\scriptscriptstyle k0}} \def\thalf{t{\scriptstyle 1/2}} \newcommand{\Dphi}[1]{\partial_\pphi #1} \def\asigma{a} \def\pphi{\psi} \newcommand{\stheta}{{\theta^\star}} \newcommand{\htheta}{{\widehat{\theta}}} $$


1 Introduction

A model for continuous data for a single individual can be represented mathematically as follows: \[ y_{j} = f(t_j) + e_j, \quad \quad 1\leq j \leq n, \] where \(y_j\) is the observation made at time \(t_j\), \(f\) is the structural model that describes the underlying process, and \(e_j\) is the residual error, assumed to have mean zero.

Then, \(f(t_j)\) is the predicted value of \(y_j\):

\[ \tilde{y}_j = f(t_j) \]

We usually state continuous data models in a slightly more flexible way: \[ y_{j} = f(t_j) + g(t_j ;\xi)\varepsilon_j , \quad \quad 1\leq j \leq n, \] where now the \((\varepsilon_j)\) are standardized residual errors, with mean 0 and variance 1, and \(g(t_j;\xi)\) is the standard deviation of the residual error at time \(t_j\), which depends on a vector of parameters \(\xi\). The function \(g\) is called the residual error model.
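As a concrete illustration, here is a minimal Python sketch of how observations can be simulated from this general model. The helper name `simulate` is ours, and the \(\varepsilon_j\) are drawn as standard normals (the model above only requires mean 0 and variance 1); the structural model \(f\) and error model \(g\) used below are arbitrary placeholders, anticipating the examples of the following sections.

```python
import numpy as np

def simulate(f, g, t, rng):
    """Simulate y_j = f(t_j) + g(t_j) * eps_j, with eps_j standard normal."""
    eps = rng.standard_normal(len(t))
    return f(t) + g(t) * eps

# example usage with placeholder models (arbitrary choices):
rng = np.random.default_rng(0)
t = np.linspace(0.5, 10, 10)
y = simulate(lambda t: 25 * (np.exp(-0.6 * t) - np.exp(-t)),  # structural model f
             lambda t: 0.5 * np.ones_like(t),                 # constant error model g = a
             t, rng)
```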

The choice of the residual error model \(g\) is very flexible and allows us to account for many different hypotheses we may have about the distribution of the errors.


2 Constant error model

\[y_j=f(t_j)+a\varepsilon_j\] Here, \(g=a\) and \(\xi=a\).

In the following example, \(f(t) = 25( e^{-0.6\, t} - e^{-t})\).

Given a standard deviation \(a\) for the residual errors and a number \(n\) of sample times, we can simulate observations and plot them together with the predicted median and quartiles of their distribution.

The median is obtained with \(\varepsilon_j=0\) while the first and third quartiles are obtained, respectively, with \(\varepsilon_j=-0.6745\) and \(\varepsilon_j=0.6745\).
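For instance, here is a minimal NumPy sketch of this example; the values of \(a\) and of the sample times are arbitrary choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
f = lambda t: 25 * (np.exp(-0.6 * t) - np.exp(-t))

a = 2.0                      # chosen standard deviation of the residual errors
n = 10                       # chosen number of sample times
t = np.linspace(0.5, 10, n)

y = f(t) + a * rng.standard_normal(n)   # simulated observations

median = f(t)                # epsilon_j = 0
q1 = f(t) - 0.6745 * a       # epsilon_j = -0.6745
q3 = f(t) + 0.6745 * a       # epsilon_j = +0.6745
```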



3 Proportional error model

\[y_j=f(t_j)+b f(t_j) \varepsilon_j\] Here, \(g=bf\) and \(\xi=b\).
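A sketch of the same simulation under the proportional error model; the value of \(b\) is an arbitrary choice.

```python
import numpy as np

rng = np.random.default_rng(2)
f = lambda t: 25 * (np.exp(-0.6 * t) - np.exp(-t))

b = 0.2                      # chosen proportional error coefficient
t = np.linspace(0.5, 10, 10)

# the residual standard deviation b*f(t_j) scales with the prediction
y = f(t) + b * f(t) * rng.standard_normal(len(t))
```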



4 Combined error model

\[y_j=f(t_j)+(a+b f(t_j)) \varepsilon_j\] Here, \(g=a+bf\) and \(\xi=(a,b)\).


An alternative combined error model assumes that \[y_j=f(t_j)+ a \varepsilon_j^{(1)}+b f(t_j) \varepsilon_j^{(2)},\] where \(\varepsilon_j^{(1)}\) and \(\varepsilon_j^{(2)}\) are independent. If \(\varepsilon_j^{(1)}\) and \(\varepsilon_j^{(2)}\) are normally distributed, then \(a \varepsilon_j^{(1)}+b f(t_j) \varepsilon_j^{(2)}\) is a zero-mean Gaussian random variable with variance \(a^2+b^2f(t_j)^2\), so there exists a sequence of standard normal random variables \((\varepsilon_j)\) such that \[y_j=f(t_j)+ \sqrt{a^2 +b^2 f(t_j)^2}\, \varepsilon_j.\] Here, \(g=\sqrt{a^2+b^2f^2}\) and \(\xi=(a,b)\).


Remark: In this example, these two combined error models are almost indistinguishable.
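To illustrate the remark, we can compare the two residual standard deviation functions, \(g = a+bf\) and \(g=\sqrt{a^2+b^2f^2}\), over the running example; the values of \(a\) and \(b\) below are arbitrary choices.

```python
import numpy as np

f = lambda t: 25 * (np.exp(-0.6 * t) - np.exp(-t))

a, b = 0.5, 0.15             # chosen error parameters
t = np.linspace(0.5, 10, 200)

g1 = a + b * f(t)                      # first combined error model
g2 = np.sqrt(a**2 + b**2 * f(t)**2)    # alternative combined error model

# largest gap between the two standard deviation profiles
print(np.max(np.abs(g1 - g2)))
```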

5 Transforming the data

The assumption that the distribution of any observation \(y_{j}\) is symmetrical around its predicted value is a very strong one. If this assumption does not hold, we may want to transform the data to make it more symmetric around its (transformed) predicted value. In other cases, constraints on the values that observations can take may also lead us to transform the data.

The basic model can be extended to include a transformation of the data: \[ u(y_{j})=u(f(t_{j}))+ g(t_{j},\xi)\varepsilon_{j} , \] where \(u\) is a monotonic transformation (a strictly increasing or decreasing function). As we can see, both the data \((y_{j})\) and the structural model \(f\) are transformed by the function \(u\), so that \(f(t_{j})\) remains the prediction of \(y_{j}\).
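As a minimal sketch (the helper name `simulate_transformed` is ours, and normality of the \(\varepsilon_j\) is again an assumption), the transformed model with \(g=a\) can be simulated by transforming the prediction, adding the noise, and mapping back through \(u^{-1}\):

```python
import numpy as np

def simulate_transformed(u, u_inv, f, t, a, rng):
    """Simulate y_j such that u(y_j) = u(f(t_j)) + a * eps_j."""
    eps = rng.standard_normal(len(t))
    return u_inv(u(f(t)) + a * eps)
```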

Let us now look at some examples of basic transformations, assuming a constant error model in the domain of the transformed data, i.e. \(g=a\).

If the data are known to be positive, we can use a log transformation, \(u(y)=\log(y)\). We can then write the model with one of two equivalent representations: \[ \begin{aligned} \log(y_{j})&=\log(f(t_{j}))+ a\varepsilon_{j} \\ y_{j}&=f(t_{j})\, e^{ a\varepsilon_{j} }. \end{aligned} \] This model is sometimes called the exponential error model. We remark that when \(a\) is small, the exponential error model is similar to the proportional error model, since \(e^{a\varepsilon_j} \approx 1 + a\varepsilon_j\).
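A quick NumPy check that the two representations are indeed the same; the parameter values are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(3)
f = lambda t: 25 * (np.exp(-0.6 * t) - np.exp(-t))

a = 0.1                      # chosen error standard deviation on the log scale
t = np.linspace(0.5, 10, 10)
eps = rng.standard_normal(len(t))

y1 = np.exp(np.log(f(t)) + a * eps)   # additive error on the log scale
y2 = f(t) * np.exp(a * eps)           # multiplicative error on the natural scale

assert np.allclose(y1, y2)            # the two representations coincide
```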



If the data take their values between 0 and 100 (a percentage, for instance), we can use the transformation \(u(y) = \log\left(\frac{y}{100-y}\right)\). We can then write the model with one of two equivalent representations: \[ \begin{aligned} \log\left(\frac{y_{j}}{100-y_{j}}\right)&=\log\left(\frac{f(t_{j})}{100-f(t_{j})}\right)+ a\varepsilon_{j} \\ y_{j}&=\frac{ 100 f(t_{j})}{f(t_{j}) + (100-f(t_{j}))\, e^{ -a\varepsilon_{j}} }. \end{aligned} \]

In the following example, \[ f(t) = \frac{100}{1+e^{-(t-5)}},\] which takes its values between 0 and 100.
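A minimal sketch of this last model, simulated through the second representation, which guarantees that all observations remain between 0 and 100; the value of \(a\) and the sample times are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(4)
f = lambda t: 100 / (1 + np.exp(-(t - 5)))

a = 0.3                      # chosen error standard deviation on the logit scale
t = np.linspace(0, 10, 11)

ft = f(t)
eps = rng.standard_normal(len(t))

# second representation: guarantees 0 < y_j < 100
y = 100 * ft / (ft + (100 - ft) * np.exp(-a * eps))
```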