ANALYSING MICRO DATA IN STATA

COURSE OVERVIEW

TStat’s Analysing Micro Data in Stata course offers participants a comprehensive introduction to the principle methodologies used in the analysis of micro data. Micro data, data which contains information at the level of a specific unit (such as individuals, firms or entities), has by its very nature become an increasingly important source of information offering researchers and policy makers an effective tool with which to obtain a more in-depth understanding of an array of political, socio-economic and public health phenomena. As such the collection and subsequent analysis of micro data over recent years has proved to be the key to policy formulation, the targeting of interventions and the subsequent monitoring and measurement of the impact of such interventions and policies. Whilst these techniques have been traditionally more applied in the field of economics, the increasing availability of micro data has over recent years resulted in a steady increase in the analysis of micro data by researchers working in Political and Social Sciences, Biostatistics, Epidemiology and Public Health.

COURSE STRUCTURE

TStat’s introduction to micro data analysis course focuses, from both a theoretical and applied point of view, on the following methodologies: count models, binary dependent variable models, multinomial models, Tobit and Interval Regression models, models with treatment variables and Sample Selection and the Control function approach.

In common with TStat’s training philosophy, each individual session is composed of both a theoretical component (in which the techniques and underlying principles behind them are explained), and an applied (hands-on) segment, during which participants have the opportunity to implement the techniques using real data under the watchful eye of the course tutor. Throughout the course, theoretical sessions are reinforced by case study examples, in which the course tutor discusses and highlights potential pitfalls and the advantages of individual techniques. The intuition behind the choice and implementation of a specific technique is of the utmost importance. In this manner, the course leader is able to bridge the “often difficult” gap between abstract theoretical methodologies, and the practical issues one encounters when dealing with real data.

COURSE OUTCOME

At the end of the course, participants are expected to be able to autonomously implement (with the help of the Stata routine templates specifically developed for the course) the appropriate estimation techniques, given both the nature of their data and the analysis in hand.

TARGET AUDIENCE

Researchers and Ph.D. Students, Professionals working in biostatistics, economics, epidemiology, finance, public health, psychology, social and political sciences needing to acquire the necessary statistical requisites required to independently conduct empirical analysis using micro data.

PREREQUISITE

Participants are required to have a working knowledge of:

the classical OLS regression model: Model Assumptions, Estimation and Inference;
Instrumental Variables (IV) and General Method of Moments (GMM) estimation techniques;
the statistical software Stata: including familiarity with Stata variable creation commands and Stata do files.

Those needing to refresh these concepts are referred to the reading lists on the respective course pages and to:

Cameron, A. C. & Trivedi, P. K. (2022). Microeconometrics Using Stata, Volume I: Cross-Sectional and Panel Regression Methods. Second Edition. Stata Press. Chapters: 1-7.

PROGRAM

SESSION I: COUNT MODELS

Count Model Estimators in Stata: The Poisson Model
- Non-Linear Least Squares and GMM Estimators, Maximum Likelihood Estimators in Stata: nl, gmm, poisson
- Models with endogenous regressors: gmm and ivpoisson
Estimation and Specification tests in the presence of overdispersion: the Generalized Negative Binomial Model: nbreg, gnbreg
Estimation and interpretation of marginal effects using the Stata post estimation command margins

SESSION II: DISCRETE DEPENDENT VARIABLE MODELS

Estimating linear models with binary dependent variables – Logit, Probit and the Linear Probability Model: probit, logit, regress
The Heteroskedastic Probit Model and tests of heteroskadicity: hetprobit
Measures of Goodness of Fit and Specification Tests: tabulate, estat classification, estat gof
Independent Latent Heterogeneity in Probit Models
Estimating marginal effects: margins
Numerical problems with Logit and Probit

SESSION III: PROBIT MODELS WITH ENDOGENOUS REGRESSORS

The Control Function (CF) in the presence of continuous endogenous regressors
Testing for exogeneity in the CF framework
Bootstrap standard error estimation in the CF approach
Maximum likelihood estimation in the presence of continuous endogenous regressors: ivprobit
The multivariate recursive Probit estimator as a solution to the problem of the presence of binary endogenous regressors: biprobit, mvprobit, cmp
Measures of Goodness of Fit: tabulate, estat classification, estat correlation
Estimating marginal effects: margins

SESSION IV: MULTINOMIAL MODELS

Ordered categorical variable models (the Ordered Probit and Ordered Logit Estimators): oprobit and ologit
The Heteroskedastic Probit Model and tests of heteroskadicity: hetoprobit
Models with categorical (but unordered) variables – Multinomial Logit and Multinomial Probit estimators: mlogit, mprobit
MacFadden’s Choice Model – categorical variable models with alternative specific regressors: cmclogit, cmcprobit
Measures of Goodness of Fit and Specification Tests
Estimation and interpretation of marginal effects using the Stata post estimation command margins

SESSION V: THE TOBIT MODEL, INTERVAL REGRESSION E SAMPLE SELECTION

The Tobit Model – ML and Two-Step Least Squares: tobit, heckman
The Control Function (CF) approach in the presence of continuous endogenous regressors, exogeneity tests and Bootstrap standard errors
The Maximum Likelihood estimator for Tobit models with endogenous regressors: ivtobit
Interval Regression: a generalization of the Tobit Model: intreg
Estimators for Sample Selection Models: heckman
Estimation and interpretation of marginal effects using the Stata post estimation command margins

DATE AND LOCATION

The 2024 edition of this training course will be offered online on a part-time basis on the 9th-10th, 16th-17th, and 23rd-24th from 10 am to 1:30 pm and the 26th September from 2 pm to 4:30 pm Central European Summer Time.

FEES AND REGISTRATION

Full-time Students*: € 1300.00
Ph.D. Students: € 1670.00
Academic: € 1930.00
Commercial: € 2585.00

*To be eligible for student prices, participants must provide proof of their full-time student status for the current academic year. Our standard policy is to provide all full-time students, be they Undergraduates or Masters students, access to student participation rates. Part-time master and doctoral students who are also currently employed will however, be allocated academic status.

Fees are subject to VAT (applied at the current Italian rate of 22%). Under current EU fiscal regulations, VAT will not however applied to companies, Institutions or Universities providing a valid tax registration number.

The number of participants is limited to 8. Places will be allocated on a first come, first serve basis. The course will be officially confirmed, when at least 5 individuals are enrolled.

Course fees cover: i) teaching materials – copies of lecture slides, databases and Stata programs specifically developed for the course; ii) a temporary licence of Stata valid for 30 days from the day before the course commences.

Individuals interested in attending this workshop must return their completed registration forms by email (training@tstat.eu) to TStat by the 30th of August 2024.

DOWNLOAD THE COURSE IN PDF FORMAT

COURSE OVERVIEW

TARGET AUDIENCE

PREREQUISITE

PROGRAM

DATE AND LOCATION

FEES AND REGISTRATION

ONLINE FORMAT