Class 1: Introduction to Econometrics
Contents
The Summation Operator
Definition
Many elementary propositions in econometrics (and statistics) involve the use of the sums of numbers. Mathematicians often use the summation operator (the greek letter Σ –“sigma”) as a shorthand, rather than writing everything out the long way. It will be worth your time to understand the summation operator, and some of its properties, and how these can provide shortcuts to proving more advanced theorems in econometrics.
Let X be a random variable from which a sample of n observations is observed, so we have a sequence {x1,x2,...,xn} i.e. $x_i, $ for i=1,2,...,n. Then the total sum of the observations (x1+x2+...+xn) can be represented as:
n∑i=1xi=x1+x2+...+xn
- The term beneath Σ is known as the index,’’ which tells us where to begin our adding (at the 1 individual x term, x1)
- Note other letters, such as j, or k may be used (especially if i is defined elsewhere)
- The term above Σ is the total number of x terms we should add (n)
- Essentially, read n∑i=1xi as “add up all the individual x observations from the 1 (x1) to the final n (xn).”
Useful Properties of Summation Operators
Rule 1: The summation of a constant k times a random variable Xi is equal to the constant times the summation of that random variable:
n∑i=1kXi=kn∑i=1Xi
Proof:
n∑i=1kXi=kx1+kx2+...+kxn=k(x1+x2+...xn)=kn∑i=1Xi.
Rule 2: The summation of a sum of two random variables is equal to the sum of their summations:
n∑i=1(Xi+Yi)=n∑i=1Xi+n∑i=1Yi
Proof:
n∑i=1(Xi+Yi)=(X1+Y1)+(X2+Y2)+...(Xn+Yn)=(X1+X2+...+Xn)+(Y1+Y2+...+Yn)=n∑i=1Xi+n∑i=1Yi
Rule 3: The summation of constant over n observations is the product of the constant and n:
n∑i=1k=nk
Proof:
n∑i=1k=k+k+...+k⏟n times=nk
Combining these 3 rules: for the sum of a linear combination of a random variable (a+bX):
n∑i=1(a+bXi)=na+bn∑i=1Xi
Proof: left to you as an exercise!
Advanced: Useful Properties for Regression
There are some additional properties of summations that may not be immediately obvious, but will be quite essential in proving properties of linear regressions.
Using the properties above, we can describe the mean, variance, and covariance of random variables.For more beyond the mere definition, see the appendix on Covariance and Correlation
First, define the mean of a sequence {Xi:i=1,...,n} and {Yi:i=1,...,n} as:
ˉX=1nn∑i=1Xi
Second, the variance of X is:
var(X)=1nn∑i=1(Xi−ˉX)2
Third, the covariance of X and Y is:
cov(X,Y)=1nn∑i=1(Xi−ˉX)(Yi−ˉY)
Rule 4: The sum of the deviations of observations of Xi from its mean (ˉX) is 0:
n∑i=1(Xi−ˉX)=0
Proof:
n∑i=1(Xi−ˉx)=n∑i=1Xi−n∑i=1ˉX=n∑i=1Xi−nˉXSince ˉx is a constant=nn∑i=1Xin⏟ˉX−nˉXMultiply the first term by nn=1=nˉX−nˉXBy the definition of the mean ˉX=0
Rule 5: The squared deviations of X are equal to the product of X times its deviations:
n∑i=1(Xi−ˉX)2=n∑i=1Xi(Xi−ˉX)
Proof:
n∑i=1(Xi−ˉX)2=n∑i=1(Xi−ˉX)(Xi−ˉX)Expanding the square=n∑i=1Xi(Xi−ˉX)−n∑i=1ˉX(Xi−ˉX)Breaking apart the first term=n∑i=1Xi(Xi−ˉX)−ˉXn∑i=1(Xi−ˉX)Since ˉX is constant, not depending on i′s=n∑i=1Xi(Xi−ˉX)−ˉX(0)From rule 4=n∑i=1Xi(Xi−ˉX)Remainder after multiplying by 0
Rule 6: The following summations involving X and Y are equivalent:
n∑i=1Yi(Xi−ˉX)=n∑i=1Xi(Yi−ˉY)=n∑i=1(Xi−ˉX)(Yi−ˉY)
Proof:
n∑i=1(Xi−ˉX)(Yi−ˉY)=n∑i=1Yi(Xi−ˉX)−n∑i=1ˉY(Xi−ˉX)Breaking apart the second term=n∑i=1Yi(Xi−ˉX)−ˉY(0)From rule 4=n∑i=1Yi(Xi−ˉX)Remainder after multiplying by 0
equivalently:
n∑i=1(Xi−ˉX)(Yi−ˉY)=n∑i=1Xi(Yi−ˉY)−n∑i=1ˉX(Yi−ˉY)Breaking apart the first term=n∑i=1Xi(Yi−ˉY)−ˉX(0)From rule 4=n∑i=1Xi(Yi−ˉY)Remainder after multiplying by 0