5Chapter 5 First Difference Model and Lagged Dependent Variable Model

The contents arrangement and the sort of terms are very difficult, energy expending. I have made many revisions, but I have not finished organizing it yet. I am not sure if it's because of my background in psychology that I misunderstood thins wrongly. Maybe I confused the psychometrical models with a econometrical model. I spent a considerable amount of time demostrating the econometric assumptions that the variational model of psychometrics needs to satisfy.

1 Model Space

As chapter 3 has introduced, panel data has a dual dimensional structure (Xi,t,Yi,t)(X_{i,t} , Y_{i,t}): a cross-sectional dimension (i=1,2,...,N)(i=1,2,...,N) and a time dimension (t=1,2,...,T)(t=1,2,...,T). We are ususally interested in changes in the same object and want to expalin the source of those changes. We can model the developemnt and changes of object ΔYi,tΔY_{i,t} by substracting past obervations of the dependent variable from the current observations of the dependent variabel, i.e., the current dependent variable Yi,tY_{i,t} minus the lagged dependent variable Yi,t1Y_{i,t-1}:

ΔYi,t=Yi,tYi,t1ΔY_{i,t}=Y_{i,t}-Y_{i,t-1}

Similarly, we intuitively guess that changes in the dependent variable overtime ΔYi,tΔY_{i,t} may be due to changes in the independent varibale ΔXi,tΔX_{i,t}:

ΔXi,t=Xi,tXi,t1ΔX_{i,t}=X_{i,t}-X_{i,t-1}

The greatest advantage of panel data is that it provides information on units at mutiple points in time, allowing us to use not only the current level values of variables but also their lagged values and first differences to construct models. More importantly, we can combine these variables in different ways toconstruct a variety of model structures, each corresponding to a theoretical assumption and estimation strategy.

First, I will present what we consider to be a reasonable relationship in terms of combination, regardless of how they are derived, which wil be addressed in later chapters.

Level
Change

Level

Yi,t=βXi,t+εi,tY_{i,t}=βX_{i,t}+ε_{i,t}

ΔYi,t=βXi,t+εi,tΔY_{i,t}=βX_{i,t}+ε_{i,t}

ΔYi,t=βYi,t+εi,tΔY_{i,t}=βY_{i,t}+ε_{i,t}

ΔYi,t=βXi,t+γYi,t+εi,tΔY_{i,t}=βX_{i,t}+γY_{i,t}+ε_{i,t}

Lagged Level

Yi,t=βXi,t1+εi,tY_{i,t}=βX_{i,t-1}+ε_{i,t}

ΔYi,t=βXi,t1+εi,tΔY_{i,t}=βX_{i,t-1}+ε_{i,t}

Yi,t=βYi,t1+εi,tY_{i,t}=βY_{i,t-1}+ε_{i,t}

ΔYi,t=βYi,t1+εi,tΔY_{i,t}=βY_{i,t-1}+ε_{i,t}

Yi,t=βXi,t1+Yi,t1+εi,tY_{i,t}=βX_{i,t-1}+Y_{i,t-1}+ε_{i,t}

ΔYi,t=βXi,t1+γYi,t1+εi,tΔY_{i,t}=βX_{i,t-1}+γY_{i,t-1}+ε_{i,t}

Change

Yi,t=βΔXi,t+εi,tY_{i,t}=βΔX_{i,t}+ε_{i,t}

ΔYi,t=βΔXi,t+εi,tΔY_{i,t}=βΔX_{i,t}+ε_{i,t}

Yi,t=βΔYi,t+εi,tY_{i,t}=βΔY_{i,t}+ε_{i,t}

ΔYi,t=βΔYi,t+εi,tΔY_{i,t}=βΔY_{i,t}+ε_{i,t}

Yi,t=βΔXi,t+γΔYi,t+εi,tY_{i,t}=βΔX_{i,t}+γΔY_{i,t}+ε_{i,t}

ΔYi,t=βΔXi,t+γΔYi,t+εi,tΔY_{i,t}=βΔX_{i,t}+γΔY_{i,t}+ε_{i,t}

Lagged Change

Yi,t=βΔXi,t1+εi,tY_{i,t}=βΔX_{i,t-1}+ε_{i,t}

ΔYi,t=βΔXi,t1+εi,tΔY_{i,t}=βΔX_{i,t-1}+ε_{i,t}

Yi,t=βΔYi,t1+εi,tY_{i,t}=βΔY_{i,t-1}+ε_{i,t}

ΔYi,t=βΔYi,t1+εi,tΔY_{i,t}=βΔY_{i,t-1}+ε_{i,t}

Yi,t=βΔXi,t1+γΔYi,t1+εi,tY_{i,t}=βΔX_{i,t-1}+γΔY_{i,t-1}+ε_{i,t}

ΔYi,t=βΔXi,t1+γΔYi,t1+εi,tΔY_{i,t}=βΔX_{i,t-1}+γΔY_{i,t-1}+ε_{i,t}

Level & Change

Yi,t=βXi,t+γΔXi,t+εi,tY_{i,t}=βX_{i,t}+γΔX_{i,t}+ε_{i,t}

Lagged Level & Change

Yi,t=βXi,t1+γΔXi,t+εi,tY_{i,t}=βX_{i,t-1}+γΔX_{i,t}+ε_{i,t}

ΔYi,t=ρYi,t+βΔXi,t+εi,tΔY_{i,t}=ρY_{i,t}+βΔX_{i,t}+ε_{i,t}

Level & Lagged Change

ΔYi,t=ρΔYi,t1+βΔXi,t1+γXi,t+εi,tΔY_{i,t}=ρΔY_{i,t-1}+βΔX_{i,t-1}+γX_{i,t}+ε_{i,t}

The tabel above outlines the mdoel space generated by combining diffirent forms of the dependent and expanatory variables (current level, lagged value, and first difference). In order to uniformly represent and highlight the model structure, all error terms in this table are uniformly denoted as εi,tε_{i,t}. However, it should be noted that in models where ΔYi,tΔY_{i,t} is the dependent variable, this error term in some cases should be interpreted as Δεi,t=εi,tεi,t1 Δε_{i,t}=ε_{i,t}-ε_{i,t-1}, and its statistical properties differ significantly from those of the original error term.

Some of the models — such as change on change or level on lagged level are widely used in applied panel data research, with clear causal logic and estabilished estimation strategies. Others, while algebraically valid, pose logical or econometric challegns. For instance, using YitY_{it} to explain ΔYi,tΔY_{i,t} risks circular reasoning and endogeneity, and should be treated with caution.

There also exist less commonly used model forms that, though rarely applied in prcatice, offer promising theoretical avenues — especially for capturing dynamic effects, inertia, or lagged transmisson. I turely believe that we should not only to master standard models but also to think creatively about variable construction and causal assumptions, paving the way for vovel insights in dynamic panel data modeling.

2 First Change Model

In panel data analysis, we often seek to expalin the change in a variable between two points in time, such as income growth, attitude shifts, or improvements in educational performance. On the surface, these models may appear to differ only in the form or structure of the varibales, but in essence, they represent different theoretical assumptions, error structures, and identification logics.

In this section, we will systematically sort out the following four modeling methods with "change" as the core:

1. Lagged Dependent Variable Model

  1. First Change Model

  2. First Difference Model

  3. Stable Gain Model

Actually, first change model, sometimes also known as change score model, comes from pre-post study in Psychology and Pedagogy. First change model is a descriptive model rather than a explanatory model in panel data analysis. There is no control for fixed effec αi\alpha_i, so this model is suitable for intervention studies, regression trend analysis, but not for panel modeling.

2.1 Model Specification and Assumptions

The First Change Model FC model uses the raw change in the outcome as the dependent variable and regresses it on the current level of an explanatory variable:

ΔYi,t=Yi,tYi,t1=α+βXi,t+ei,t(1)ΔY_{i,t}=Y_{i,t}-Y_{i,t-1}=α+βX_{i,t}+e_{i,t}\quad (1)

The basic idea of FC is that, the change of YY is caused by the current level of XX.

There are 3 main assumptions ffor CSM:

  1. No autocorrelation in error: ei,ti.i.d.(0,σ2)e_{i,t}∼i.i.d. (0,σ^2)

  2. No omitted variable bias form the initial level Yi,t1Y_{i,t-1}

  3. No unobserved confounding between XitX_{it} and unmodeled causes of change

I will use econometric frameworks (I hope oneday I can use the term of sociometric), in particular panel data models and potential outcomes framewokrs, to more rigorously derive and interpret the key 3 assumptions of FCM.

2.2 Assumption 1: Strict exogeneity / Unconfoundedness

Proposition: Given XitX_{it}, the error term eite_{it} is independent of XitX_{it}, and independent of the potential outcomes for all periodes (inparticular tt and t1 t-1). More precisely:

E[eitXit]=0\mathbb{E}[e_{it} | X_{it}]=0

Introduce FCM:

ΔYi,t=Yi,tYi,t1=α+βXi,t+ei,t(1)ΔY_{i,t}=Y_{i,t}-Y_{i,t-1}=α+βX_{i,t}+e_{i,t}\quad (1)

where, eite_{it} contains all the factors that are not modeled.

If OLS is consistent with ββ:

plim β^=β+Cov(Xit,eit)Var(Xit)\text{plim } \hat{\beta} = \beta + \frac{\text{Cov}(X_{it}, e_{it})}{\text{Var}(X_{it})}

If plim β^=β\text{plim } \hat{\beta} = \beta, namely:

Cov(Xit,eit)=0\text{Cov}(X_{it}, e_{it}) = 0

which is equivalent to the form of conditional expectation:

E[eitXit]=0\mathbb{E}[e_{it} | X_{it}]=0

Or the stronger conditional independency in probability theory:

eit ⁣ ⁣ ⁣Xite_{it} \perp\!\!\!\perp X_{it}

But in change modeling, eite_{it} contains the following unmodeled factors:

eit=f(Yi,t1,Zit,αi,uit)e_{it} = f(Y_{i,t-1}, Z_{it}, \alpha_i, u_{it})

Yi,t1Y_{i,t-1}: previous values of YY, which may affect the dependency patrh of YitY_{it};

ZitZ_{it}: unobserved confounding variables;

αi\alpha_i: the unchanged fiexed effect of unit;

uitu_{it}: i.i.d. error term.

In order for E[eitXit]=0\mathbb{E}[e_{it} | X_{it}]=0 to hold, it must be required that:

Xit ⁣ ⁣ ⁣{Yi,t1,Zit,αi}X_{it} \perp \!\!\! \perp \{Y_{i,t-1}, Z_{it}, \alpha_i\}

That is, the explanatory variable XitX_{it} must be independent of each other from all factors that are not modeled but affect ΔYit\Delta Y_{it}.

For example, if the real data generation process is:

Yi,t=αi+β1Xi,t+β2Yi,t1+ui,tY_{i,t}=α_i+β_1X_{i,t}+β_2Y_{i,t-1}+u_{i,t}

which is dynamic panel model. But we use ΔYi,t=Yi,tYi,t1=αi+β1Xi,t+ei,t ΔY_{i,t}=Y_{i,t}-Y_{i,t-1}=α_i+β_1X_{i,t}+e_{i,t},

ei,t=Yi,tYi,t1αiβ1Xi,te_{i,t}=Y_{i,t}-Y_{i,t-1}-α_i-β_1X_{i,t}
ei,t=αi+β1Xi,t+β2Yi,t1+ui,tYi,t1αiβ1Xi,te_{i,t}=α_i+β_1X_{i,t}+β_2Y_{i,t-1}+u_{i,t}-Y_{i,t-1}-α_i-β_1X_{i,t}
ei,t=Yi,t1(β21)+ui,te_{i,t}=Y_{i,t-1}(β_2 -1)+u_{i,t}

then, Cov(Xit,eit)=(β21)Cov(Xit,Yi,t1)\text{Cov}(X_{it}, e_{it}) = (β_2 -1)\text{Cov}(X_{it}, Y_{i,t-1}).

2.3 Assumption 2: No Dynamic Selection / Exogeneity of Initial Conditions

Proposition: Yi,t1Y_{i,t-1} has no impact on ΔYi,tΔY_{i,t}, even hat impact on ΔYi,tΔY_{i,t}, irrel

if the real data generation process is:

Yi,t=αi+β1Xi,t+β2Yi,t1+ui,tY_{i,t}=α_i+β_1X_{i,t}+β_2Y_{i,t-1}+u_{i,t}

but we use ΔYi,t=Yi,tYi,t1=αi+β1Xi,t+ei,t ΔY_{i,t}=Y_{i,t}-Y_{i,t-1}=α_i+β_1X_{i,t}+e_{i,t}, then, ei,t=Yi,t1(β21)+ui,te_{i,t}=Y_{i,t-1}(β_2 -1)+u_{i,t},

E[β^1CSM]=β1+β2δ\mathbb{E}[\hat{\beta}_1^{CSM}] = \beta_1 + \beta_2 \cdot \delta
Yi,t1=γ0+γ1Xit+vitδ=γ1Y_{i,t-1} = \gamma_0 + \gamma_1 X_{it} + v_{it} \quad \Rightarrow \quad \delta = \gamma_1

So, the following conditions must be hold

Yi,t1 ⁣ ⁣ ⁣XitY_{i,t-1} \perp\!\!\!\perp X_{it}
Cov(Yi,t1,eit)=0\text{Cov}(Y_{i,t-1}, e_{it}) = 0
E[β^1CSM]=β1+β2Cov(Xit,Yi,t1)Var(Xit)\boxed{ \mathbb{E}[\hat{\beta}_1^{CSM}] = \beta_1 + \beta_2 \cdot \frac{\text{Cov}(X_{it}, Y_{i,t-1})}{\text{Var}(X_{it})} }

2.4 Assumption 3: Serially Uncorrelated Errors

Proposition: Error term eite_{it} is not serially uncorrelated, that is :

Cov(ei,t1,eit)=0\text{Cov}(e_{i,t-1}, e_{it}) = 0

Consider the change model we hypothese:

ΔYit=β1ΔXit+εit\Delta Y_{it} = \beta_1 \Delta X_{it} + ε_{it}

If the true data generation process model is: Yit=β1Xit+αi+uitY_{it} = \beta_1 X_{it} + \alpha_i + u_{it}, and after the difference we get:

εit=uitui,t1ε_{it} = u_{it} - u_{i,t-1}

let εi,t1=ui,t1ui,t2ε_{i,t-1} = u_{i,t-1} - u_{i,t-2}, then

Cov(εit,εi,t1)=Cov(uitui,t1, ui,t1ui,t2)=σ2\text{Cov}(\varepsilon_{it}, \varepsilon_{i,t-1}) = \text{Cov}(u_{it} - u_{i,t-1},\ u_{i,t-1} - u_{i,t-2}) = -\sigma^2

which indicates that even if ui,ti.i.d.(0,σ2)u_{i,t}∼i.i.d. (0,σ^2), residual error εitε_{it} after difference will result in first order negative serial correlation (MA(1) structure). So, it must be hold: Cov(ei,t1,eit)=0\text{Cov}(e_{i,t-1}, e_{it}) = 0.

3 First Difference Model (FDM)

In contraction to FCM, first differecen model bases its explanation for ΔYit\Delta Y_{it} on ΔXit\Delta X_{it}, instead of XitX_{it}. Before introduce the specification of first difference model, we consider the general linear models in two periods: Yi,t=βXi,t+εi,tY_{i,t}=βX_{i,t}+ε_{i,t} and Yi,t1=βXi,t1+εi,t1Y_{i,t-1}=βX_{i,t-1}+ε_{i,t-1}. And we can differentiate them:

ΔYi,t=Yi,tYi,t1=βXi,tβXi,t1+εi,tεi,t1\Delta Y_{i,t}=Y_{i,t}-Y_{i,t-1}=βX_{i,t}-βX_{i,t-1}+ε_{i,t}-ε_{i,t-1}

in which we have asummed that the coefficient of XX is consistent over time, so

ΔYi,t=βΔXi,t+εi,tεi,t1\Delta Y_{i,t}=β\Delta X_{i,t} + ε_{i,t}-ε_{i,t-1}

where the problem lies in whether we should assume εi,t=εiε_{i,t}= ε_{i}, that is, εi,tεi,t1=0ε_{i,t}-ε_{i,t-1}=0, which means residual term itself does not fluctuate over time. Assuming εi,t=εiε_{i,t}= ε_{i},

ΔYi,t=βΔXi,t(2)\Delta Y_{i,t}=β\Delta X_{i,t} \quad (2)

Clearly, ΔYi,t=βΔXi,t\Delta Y_{i,t}=β\Delta X_{i,t} is not a statistical model, but a structural identity, which makes it impossible to do statistical inference. Naturally, some of us might think of adding an error term ui,ti.i.du_{i,t} ∼ i.i.d to the right side of this identity to make it statistically inferable, i.e.,

ΔYi,t=βΔXi,t+ui,t(3)\Delta Y_{i,t}=β\Delta X_{i,t} +u_{i,t}\quad (3)

Crutially, Model (3) is alomst impossible to hold true in the panel data structure. But

ΔYi=βΔXi+ui(4)\Delta Y_{i}=β\Delta X_{i} +u_{i}\quad (4)

will hold true in the cross-sectional data structure and experimental data structure, equivalent to Y=βX+uY=βX+u, and in essence, formally linear and statistically capable of consistent estimation and standard inference using OLS.

3.1 Model Specification and Estimation

In perspective of DGP: Yi,t=βXi,t+εi,tY_{i,t}=βX_{i,t}+ε_{i,t}, εit=αi+eitε_{it}=\alpha_i+e_{it} , after difference,

ΔYi,t=βΔXi,t+Δei,t(5)\Delta Y_{i,t}=β\Delta X_{i,t} + \Delta e_{i,t} \quad (5)

where Δεi,t=Δei,t\Delta ε_{i,t}= \Delta e_{i,t}, which is actually the specification of First Difference Model FDM.

The estimate for coefficient β\beta of the FCM by OLS is:

β^FD=i(ΔXiΔXˉ)(ΔYiΔYˉ)i(ΔXiΔXˉ)2(6)\hat{\beta}^{FD} = \frac{\sum_i (\Delta X_i - \bar{\Delta X})(\Delta Y_i - \bar{\Delta Y})}{\sum_i (\Delta X_i - \bar{\Delta X})^2} \quad (6)

Substituting (5) into (6) yields:

β^FD=β+i(ΔXiΔXˉ)Δeii(ΔXiΔXˉ)2(7)\hat{\beta}^{FD} = \beta + \frac{\sum_i (\Delta X_i - \bar{\Delta X}) \Delta e_i}{\sum_i (\Delta X_i - \bar{\Delta X})^2} \quad (7)

To make the first difference estimator β^FD\hat{\beta}^{FD} converges to the true value β\beta with probability, that is, β^FDpβ\hat{\beta}^{FD} \xrightarrow{p} \beta, the following condition must be satisfied:

plim1Ni(ΔXiΔXˉ)Δei=0(8)\plim \frac{1}{N} \sum_i (\Delta X_i - \bar{\Delta X})\Delta e_i=0 \quad (8)

and the sufficient condition for proposition (8) to be true is

E[(ΔXiΔXˉ)Δei]=0(9)\mathbb{E}[(\Delta X_i - \bar{\Delta X})\Delta e_i] = 0\quad (9)

(9) is equivalent to (10), as ΔXˉ\bar{\Delta X} is constant:

E[ΔXiΔei]=0(10)\mathbb{E}[\Delta X_i \Delta e_i] = 0 \quad (10)

3.2 Assumption 1: Strict Exogeneity

Proposition: E[eitXi1,Xi2,αi]=0for t=1,2\mathbb{E}[e_{it} \mid X_{i1}, X_{i2}, \alpha_i] = 0 \quad \text{for } t=1,2

As Δei=ei2ei1\Delta e_i = e_{i2}-e_{i1}, then, E[ΔXiΔei]=E[ei2ei1Xi2Xi1]\mathbb{E}[\Delta X_i \mid \Delta e_i]=\mathbb{E}[ e_{i2}-e_{i1} \mid X_{i2}- X_{i1}]

To satisfy E[ΔXiΔei]=0\mathbb{E}[\Delta X_i \Delta e_i] = 0 , the following must be hold:

E[ΔXiΔei]=E[ei2ei1Xi2Xi1]=0\mathbb{E}[\Delta X_i \mid \Delta e_i]=\mathbb{E}[ e_{i2}-e_{i1} \mid X_{i2}- X_{i1}]=0

so, E[eitXi1,Xi2,αi]=0for t=1,2\mathbb{E}[e_{it} \mid X_{i1}, X_{i2}, \alpha_i] = 0 \quad \text{for } t=1,2 . It must be assumed that Xi1X_{i1},Xi2X_{i2} and e𝑖1e_{𝑖1},e𝑖2e_{𝑖2} are uncorrelated, i.e., strict exogeneity is a necessary condition for effective identification by FDM.

3.3 Assumption 2: Fixed Effect

Proposition: αit=αi=constantΔαi=0\alpha_{it}=\alpha_{i}=constant⇒Δ\alpha_{i}=0

If αit\alpha_{it} is not a constant rather changes over time, then:

ΔYi,t=βΔXi,t+Δei,t+αi2αi1=βΔXi,t+Δei,t+Δαit\Delta Y_{i,t}=β\Delta X_{i,t} + \Delta e_{i,t} +\alpha_{i2}-\alpha_{i1}=β\Delta X_{i,t} + \Delta e_{i,t} +\Delta \alpha_{it}

Apparently.

3.4 Error term structurally idependent / Homoskedasti

Proposition: Δeii.i.d.(0,σ2)\Delta e_i \sim \text{i.i.d.}(0,\sigma^2)

Var(ΔeiΔxi)=σ2\text{Var}(\Delta e_i \mid \Delta x_i) = \sigma^2

4 Conditional Change Model CCM

The conditional change model CCM, also known as the lagged dependent variable LDV model, is a widely useed approach in panel data analysis. Unlike the first difference model, which focuses on raw changes over time, the CCM explicitly incorporates the lagged value of the dependent variable as a regressor, allowing us to model dynamic adjustments whlie controlling for past states.

However, astute readers will surely notice that we seem to have a priori assumed true structure of the panel data, such as Yi,t=αi+β1Xi,t+β2Yi,t1+ui,tY_{i,t}=α_i+β_1X_{i,t}+β_2Y_{i,t-1}+u_{i,t}, which will be discussed later.

In Nerlove's (1958) study on farmer's price response, researches found that farmer's supply behavior had hysteresis in reponding to current prices, that is, supply adjustment often lagged behind price changes. The static linear model used at the time could not explain this phenomenon. Nerlove intoduced the lagged dependent variable Yt1Y_{t-1}, which is used to represent the individual's own "adjustment inertia" or "expectation formation lag" machanism. After modeling in this way, the model can naturally expalin the partial adjustment process, that is:

Yi,t=β1Xi,t+β2Yi,t1+εi,tY_{i,t}=β_1X_{i,t}+β_2Y_{i,t-1}+ε_{i,t}

The lag term Yt1Y_{t-1} is actually a minimal approximation of a low-dimensional statae space. In situations where inertia exists but higer-order differential or latent variable dynamics cannot be modeled, Yt1Y_{t-1} provides a highly practical capable of approxmating more complex system memory. This also explains why lagged dependent variables are nearly indispensasble components in social science research across fields.

2.1 Unconditional Change Model

Starting from the most basic linear cross-sectional regression model, we assume that for any unit ii and time point tt, the dependent variable YitY_{it} is determined by the explanatory variable XitX_{it}:

Yi,t=α+βXi,t+εi,t(1)Y_{i,t}=α+βX_{i,t}+ε_{i,t}\quad (1)

where, εi,tε_{i,t} is disturbance term. Now, consider the observed value of the same unit in the previous period t1t-1, and its structure is as follows:

Yi,t1=α+βXi,t1+εi,t1Y_{i,t-1}=α+βX_{i,t-1}+ε_{i,t-1}

If we differentiate the equations at the above two points, we obtain:

Yi,tYi,t=βXi,tβXi,t1+εi,tεi,t1Y_{i,t}-Y_{i,t}=βX_{i,t}-βX_{i,t-1}+ε_{i,t}-ε_{i,t-1}

i.e.,

ΔYi,t=βΔXi,t+Δεi,t(2)ΔY_{i,t}=βΔX_{i,t}+Δε_{i,t}\quad (2)

Δεi,t=εi,tεi,t1Δε_{i,t}=ε_{i,t}-ε_{i,t-1}. This regression of change of Y to the change of X is known as unconditional change score or first difference method of panel analysis.

Assume εi,t=ai+ei,tε_{i,t}=a_i+e_{i,t}, then

Yi,tYi,t=βXi,tβXi,t1+aiai+ei,tei,t1Y_{i,t}-Y_{i,t}=βX_{i,t}-βX_{i,t-1}+a_i-a_i+e_{i,t}-e_{i,t-1}
ΔYi,t=βΔXi,t+Δui,t(3)ΔY_{i,t}=βΔX_{i,t}+Δu_{i,t}\quad (3)

From a statistical perspective, the differencing operation serves not merely as a nummerical transofrmation, bus as a structural reparameterization of the data-generating process. By differenceing observations of the same unit over time, we effetively eliminate all time invariant sources of heterogenity, including both the intercept term αα and unit-specific fixed effects aia_i, assuming they are constant across periods. This transforms the model into one that estimates the marginal efect ββ purely based on within-unit variation over time, which allows for sonstent estimation even when Xi,tX_{i,t} is correlated with the fixed effect aia_i— a common souce of omitted variable bias in cross-sectional analyses.

Crucially, this first-differenced model alters the statitical properties of the error term. The composite error Δεi,t=εi,tεi,t1Δε_{i,t}=ε_{i,t}-ε_{i,t-1} inherits a moving average (MA(1)) struture, due to temporal dependence between adjacent errors. This has direct implications for inference: while the OLS estimator of ββ remains unbiased under standard assumptions, standard errors computed withoutaccounting for this serial correlation may be inconsistent. Robust inference therefore requires either corrected variance estimation or alternative methods such as GMM, which will be discussed in later sections.

The identification of the change score model, which implies the premise of "no dynamic dependence", i.e.,

YtYt1XtY_t ⊥Y_{t-1} | X_t

In other words, lagged dependent variable (or lagged endogenous variable) Yt1Y_{t-1} has no impact on YtY_t and ΔYtΔY_{t}. Im many social processes, the dependent variable exhibits temporal inertia or lagged feefback, whereby Yt1Y_{t-1} carries predictive power for YtY_t even after accounting for XtX_t. Ignoring this leads to model misspecification.

2.2 Conditional change Model

Include the lag-dependent variable in regression model (1) yieds the so called conditional change model, as follows:

Yi,t=α+β1Xi,t+β2Yi,t1+εi,t(4)Y_{i,t}=α+β_1X_{i,t}+β_2Y_{i,t-1}+ε_{i,t}\quad (4)
Yi,tYi,t1=α+β1Xi,t+β2Yi,t1Yi,t1+εi,tY_{i,t}-Y_{i,t-1}=α+β_1X_{i,t}+β_2Y_{i,t-1}-Y_{i,t-1}+ε_{i,t}
ΔYt=α+β1Xi,t+Yi,t1(β21)+εi,t(5)ΔY_{t}=α+β_1X_{i,t}+Y_{i,t-1}(β_2-1)+ε_{i,t}\quad (5)
ΔYi,t=βΔXi,t+Δεi,t(2)ΔY_{i,t}=βΔX_{i,t}+Δε_{i,t}\quad (2)

can be transformed into

Yi,t=βΔXi,t+Yi,t1+Δεi,t(6)Y_{i,t}=βΔX_{i,t}+Y_{i,t-1}+Δε_{i,t}\quad (6)

1.2 Lagged Variation Model

In panel data analysis, we often focus on th impact of the current or lagged level of the explanatory variable on the dependent variable. Whether for explanatory or dependent variables, the most common approaches are estimation their first differences or dynamic adjustment paths. However, one less commonly discussed structure— models with lagges differences as predictors— actually holds siginificant potential for theoretical interpretation, partucularly when analyzing delayed response mechanisms and trend persistence mechanisms.

The basic idea of lagged difference model is that the trend of cahneg in the explanatory variable, rather than its level, is the key driver of changes or levels in the dependent variable. In other words, the behavior of social individuals or institutions is not directly inflenced by the magnitute of a variable's value but rather responds to its past rate or direction of change. This modeling logic can be expressed in two ways, respectively explaining the level or the trend of change in the dependent variable.

The first type of structure is level on lagged change model, whose basic form is as follows:

Yi,t=βΔXi,t1+εi,tY_{i,t}=βΔX_{i,t-1}+ε_{i,t}

The core assumption of this model is that the current statle of the dependent variable is not directly determined by the level of the expalnotory variable, but rather driven by its trend of change in the previous time period. For example, the social impact of a policy change ofther does not manifest immediately but gradually emerges over a period of time following its implementation. Similarly, significant fluctuations in household income may not alter happiness scores in the current period but could lead to a reassessment of psychological well-being in the following year. This model therefore well-suited for capturing institutional lagged response mechanisms, emootional adjustment lags, and lagged path transition processes. Since its explanatory variables are the prevous period' change terms, it has weaker endogeneity risks compared to lagged level models in terms of metrological identification and clearly refects the "change first, then stabilize" response mechanism.

The second type of structure is change on lagged change model. whose bosic form is as follows:

ΔYi,t=βΔXi,t1+εi,tΔY_{i,t}=βΔX_{i,t-1}+ε_{i,t}

Unlike the former model, this model emphasizes the continuity of change trends. Its hypothetical structure is as follows: if the explanatory variable undergoes rapid cahnge at an earlier time point, this trend will continue ti influence the current change in the dependent variable. For example, in the process of information diffusion, the rapid rise in online public participation rates in later stages; similarly, sustained gowth in education spending may manifest itself in the sustained improvement of student academic performance several periods later. This model is suitable for analyzing acceleration effects, lagged mobilization mechanisms, and trend inertia processes in social diffusion. Affitionally, it can serve as a linear approximation embedded within more complex diffusion or mobilization models.

In areas such as institutional change, policy response, behavioral imitaion, groupp polarization, and risk perception, social behavior is often not as a static response to variable levels, but rather a dynamic response to the path of variables chanegs. Lagged difference models provide the possibility of capturing this non-instantaneous causal chain.

Last updated