Intermediate
Social Statistics
Course providers: Ray Duch (raymond.duch@nuffield.ox.ac.uk)
and
Course Outline in PDF format
Lecture Handouts
Matrices
and Linear Regression
Maximum
likelihood and model selection
Multinomial
logit and Ordered Logit/Probit
Factor
analysis and Principal Component Analysis
Event
Count and Duration Models
Models
with Non-random Selection
FINAL Exam (Click here)
FINAL Exam Datasets (Click here)
Course structure
There will be a two hour
lecture on Wednesday 2-4 pm in Lecture Theatre in the
Topics covered each week
|
Week |
Topic |
|
|
|
|
1 |
Matrices and Linear regression
(Snijders) This week introduces linear
regression in matrix notation, and the necessary knowledge of matrix-algebra
as background. The aim is to increase understanding of linear regression and
to provide a basis for more advanced models. |
|
2 |
Maximum likelihood estimation and
model selection (Duch) This
lecture introduces the idea of maximum likelihood as a method of estimating
parameters in a statistical model and thereby providing further theoretical
background to the models introduced in later weeks. MLE also provides a
method of model selection. Freeman,
chapter 6 Long,
chapter 2 |
|
3 |
Binary logit and probit models with
applications (Duch) This
week introduces regression models for binary dependent variables such as
yes/no or voted/abstained. Long,
J. Scott. 1997 Regression Models for Categorical and Limited Dependent
Variables. Chapter 3 and 4. |
|
4 |
Multinomial and ordered logit/
probit. (Duch) These
models are for ordered categorical variables, e.g. survey questions
where there are ordered categories such as strongly agree, agree, neither
agree nor disagree, disagree, strongly disagree. Long,
J. Scott. 1997 Regression Models for Categorical and Limited Dependent
Variables. Chapter 5 and 6 |
|
5 |
Factor analysis and principal
component analysis (Snijders) These are methods for summarizing
the association between several interval or ratio level variables. They are
often used when it is thought that some of the variables are essentially
measuring the same phenomenon, or that there are a few underlying ('latent')
phenomena affecting the values of the observed variables. |
|
6 |
Scale Construction (Snijders) Methods will be discussed for
constructing scales from binary items, and from items consisting of ordered
categories. The main method discussed will be Mokken scaling. |
|
7 |
Event count and Duration models
(Duch) Models
for count data are used for relatively infrequent events, with say an average
between 0 and 7 (e.g. the number of presidential vetoes in a year). Long,
J. Scott. 1997 Regression Models for Categorical and Limited Dependent Variables.
Chapter 8; and TBA |
|
8 |
Models with Non-random Selection
(Duch) Long,
J. Scott. 1997 Regression Models for Categorical and Limited Dependent
Variables. Chapter 7; and TBA |
Assessment
For those who are taking this
course for credit, 1) half of your grade will be based on weekly assignments
and 2) half your grade will be based on an exam to be set by the relevant
lecturer(s) no later than Friday of the eighth week of Hilary Term. Your homework assignments should be handed
into your class instructors at the date indicated on each of the weekly
assignments. Answers to the exam should
be handed in to the Politics Graduate Admin Office by 12 noon on Friday of 6th
week of Trinity Term. Students should
not write their names on their answers, but identify their scripts with their
candidate number only. Candidates will
be expected to demonstrate skill in the application of, and ability to evaluate
critically, the models and methods discussed in both the lectures and classes.
Principal
Texts:
J Scott Long. 1997. Regression Models for Categorical and
Limited Dependent Variables. Sage.
J Scott Long and Jeremy Freese.
2006. Regression
Models for Categorical Dependent Variables Using Stata. Stata Press.
David J. Bartholomew,
Fiona Steele, Irini Moustaki, and Jane I. Galbraith. The Analysis and
Interpretation of Multivariate Data for Social Scientists. Chapman &
Hall/CRC, 2002.
David A. Freedman.
Statistical Models; Theory and Practice.
David J. Bartholomew,
Fiona Steele, Irini Moustaki, and Jane I. Galbraith. The Analysis and
Interpretation of Multivariate Data for Social Scientists. Chapman &
Hall/CRC, 2002.
Other
John Aldrich. 1984. LPM, Logit, and Probit. Sage.
Vani Kant Borooah. 2001. Logit and Probit: Ordered and Multinomial Models. Sage.
John Fox. 1991. Regression Diagnostics. Sage.
John Fox. 1997. Applied Regression Analysis, Linear Models,
and Related Methods. Sage.
Thomas H. Wonnacott and Ronald
J. Wonnacott. 1990. Introductory
Statistics. Wiley.