Grading
Critiques 10%
Replications 30%
Midterm 20%
Final 40%
You get graded based directly on your replications, and also indirectly through the exams, which will have code- and paper-based questions based directly on the replications that you have undertaken. As you can see, the purpose of this course is to get you to do econometrics, via the replications.
Assignments
A Replication is an exercise in which you attempt to repeat the empirical exercise undertaken by the author of the paper. Typically, you will only replicate a small number of lines in a table of results. A good start is to do the stata tutorials linked below. All requests for Stata help should be preceded by you googling "help stata whatever" and typing "help whatever" in Stata. In Stata, the help menus have examples at the bottom. Good starts here "help use", "help recode", "help keep" and "help regress".
Reading
1. Green, William, Econometric Analysis, Prentice Hall, 5th/6th Edition, 2008.
2. Kennedy, Peter, A Guide to Econometrics, 5th Edition (Paperback).
3. Angrist, Joshua and Jorn-Steffen Pischke, Mostly Harmless Econometrics: An Empiricist's Companion (Paperback).
I will try to let you know where to look for relevant supporting material in these textbooks. In addition, I will post links to other material you may find helpful.
Lecture Notes to Introduce Ordinary Least Squares.
Useful links for learning Stata are UCLA economics Stata Tutorials and UCLA statistics Stata Tutorials.
A nice introduction to quantile regression is in Koenker and Hallock.
Assignment 1a due in-class Monday 16 Jan
Assignment 1b due in-class Monday 23 Jan
Here is the Stata code that I used to do the in-class Monte Carlo exercise on Monday 23 Jan.
Assignment 2a due in-class Wednesday 01 Feb
Assignment 2b due in-class Wednesday 8 Feb
These data are given at the state-year level. "nf1" gives the basic no-fault indicator. This replication will use the regress command in Stata, with weights and robust standard errors. "Count" gives the number of observations used to compute each datum in each state-year, and so should be used to weight all regressions (we'll learn about this later). Use hetero-robust standard errors with the ",r" subcommand (we'll learn about this later). Other variables give statistics about the distribution of the age at first marriage, computed at the state-year level. "p_**" gives the ** percentile of age at first marriage. Finally, please note that in Table 7, the regressand (dependent variable) is the log of the interquartile ratio, defined as log(P_75/P_25), and not as described in the text of the paper.
Lecture Notes on Panels.
Lecture Notes on GLS/FGLS.
Assignment 3a: Critique due at the beginning of class Wednesday 29 Feb
Assignment 3b: Replication due at the beginning of class Wednesday 7 March
FYI: 'tot' is 'terms-of-trade'; 'ervol' is 'exchange rate volatility'; 'standard' is a gold-standard classification: 1-gold, 2-silver, 3-bimetallic, 4-neither; 'usder' is 'US dollar exchange rate (in British Pounds)'; 'pusa' is 'price index for US'; 'freight' is the 'freight price index with bandwidth of 20'; 'frt**' are 'freight price index with bandwidth **'; '**_mean' and '**_stddev' are mean and std dev of temperature in ocean quadrants; 'exclude' is an indicator equal to 1 for a country/year combination that should be excluded from regressions due to data-quality issues.
The replication assignment is to replicate all of the upper panel of Table 4 plus whatever elements of the bottom panel you are able. Code to do this part may be based on our code if you wish. Then, replicate the same material improving the IV approach that the authors use. Here, you should think about: can you make a more informative instrument vector?; can you make a more exogenous instrument vector?; can you get a more efficient estimator? In the replication part, please include a paragraph that states whether or not you believe the authors' interpretation of their tests, and why, and how you think your endogeneity correction is better than that in the paper.
Lecture Notes on Endogeneity.
More on Venn Diagrams for Regression, Peter Kennedy, 2002. This paper presents the Ballentine Diagrams discussed in class, relating to Multicollinearity and Endogeneity.
Midterm
Study Questions for the Midterm. The midterm will be comprised of 3-4 questions from the study questions, and 3-4 other questions. I recommend that you work on the study questions in groups. You should NOT delegate study questions and then report back; rather you should work on them together, so that you can understand them better.
Responses for the midterm.
Assignment 4a: Critique due at the beginning of class Wednesday 21 March
Assignment 4b: Replication due at the beginning of class Monday 2 April
Seemingly Unrelated Regression.
Assignment 5: Due at the beginning of class Wednesday 11 April
Useful Reading:
Sources for OLS: Greene, 5th Ed, Chapters 1-3; Kennedy, 5th Ed, Chapters 1-3; Angrist-Pischke Ch 1-3.
Sources for GLS, Heteroskedasticity, Panel Methods: Greene, Chapters 10-13: Kennedy, 5th Ed, Chapters 8, 14, 17, Appendix B; Angrist-Pischke Ch5 (for panel s tuff)
Sources for Endogeneity (and also SUR): Green, Chapters 14-15; Kennedy, 5th Ed, Chapters 9, 10; Angrist-Pischke Ch4.
Sources for Testing: Greene, 5th Ed, Chapters 5,6,4; Kennedy, 5th Ed, Chapter 4.
Sources for SUR: Greene, Chapter 14; Kennedy, 5th Ed, Chapter 10.