Econometrics
Econometrics has been defined as "the application of mathematics and statistical methods to economic data" and described as the branch of economics "that aims to give empirical content to economic relations." More precisely, it is "the quantitative analysis of actual economic phenomena based on the concurrent development of theory and observation, related by appropriate methods of inference." The first known use of the term "econometrics" (in cognate form) was by Paweł Ciompa in 1910. Ragnar Frisch is credited with coining the term in the sense in which it is used today.
A central aim of econometrics is to give empirical content to economic theory by formulating economic models in testable form, estimating those models, and testing them as to acceptance or rejection.
For example, consider one of the basic relationships in economics: the relationship between the price of a commodity and the quantities of that commodity that people wish to purchase at each price (the demand relationship). According to economic theory, an increase in the price would lead to a decrease in the quantity demanded, holding other relevant variables constant so as to isolate the relationship of interest. A mathematical equation can be written that describes the relationship between quantity, price, other demand variables like income, and a random term $\varepsilon$ to reflect simplification and imprecision of the theoretical model:
$$Q = \beta_0 + \beta_1\,\text{Price} + \beta_2\,\text{Income} + \varepsilon$$
Regression analysis could be used to estimate the unknown parameters $\beta_0$, $\beta_1$, and $\beta_2$ in the relationship, using data on price, income, and quantity. The model could then be tested for statistical significance as to whether an increase in price is associated with a decrease in the quantity, as hypothesized: $\beta_1 < 0$.
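The estimation and test described above can be sketched in a few lines. This is a minimal illustration, not a definitive implementation: it assumes the linear demand form $Q = \beta_0 + \beta_1\,\text{Price} + \beta_2\,\text{Income} + \varepsilon$, and the data are simulated with made-up parameter values (50, -2, 0.5) and a made-up noise level, none of which come from the text.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 500

# Hypothetical "true" parameters, used only to simulate data.
beta_true = np.array([50.0, -2.0, 0.5])  # intercept, price, income

price = rng.uniform(1.0, 10.0, size=n)
income = rng.uniform(20.0, 80.0, size=n)
eps = rng.normal(0.0, 3.0, size=n)  # random term of the model
quantity = beta_true[0] + beta_true[1] * price + beta_true[2] * income + eps

# Design matrix: constant, price, income.
X = np.column_stack([np.ones(n), price, income])

# Ordinary least squares estimates of (beta_0, beta_1, beta_2).
beta_hat, *_ = np.linalg.lstsq(X, quantity, rcond=None)

# Classical standard errors: s^2 * (X'X)^{-1}.
resid = quantity - X @ beta_hat
dof = n - X.shape[1]
sigma2_hat = resid @ resid / dof
se = np.sqrt(np.diag(sigma2_hat * np.linalg.inv(X.T @ X)))

# t-statistic for H0: beta_1 = 0 against the alternative beta_1 < 0.
t_price = beta_hat[1] / se[1]

print("estimates:", beta_hat)
print("t-statistic on price:", t_price)
```

A strongly negative t-statistic on price is evidence in favor of the hypothesized sign. The standard errors here rely on the classical assumptions (homoskedastic, uncorrelated errors), which real demand data may not satisfy.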
There are complications even in this simple example, and it is easy to mistake statistical significance for economic significance. Statistical significance is neither necessary nor sufficient for economic significance. In order to estimate the theoretical demand relationship, the observations in the data set must be price and quantity pairs that are collected along a demand schedule that is stable. If those assumptions are not satisfied, a more sophisticated model or econometric method may be necessary to derive reliable estimates and tests.
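The gap between statistical and economic significance can be illustrated with a small simulation. The sketch below is only an illustration under stated assumptions: the slope of 0.01 and the sample size of one million are hypothetical values chosen so that a negligible effect is nonetheless estimated very precisely.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 1_000_000
true_slope = 0.01  # hypothetical effect, tiny relative to the noise

x = rng.normal(size=n)
y = true_slope * x + rng.normal(size=n)

# Simple-regression slope, intercept, and classical standard error.
xc = x - x.mean()
slope = (xc @ (y - y.mean())) / (xc @ xc)
intercept = y.mean() - slope * x.mean()
resid = y - intercept - slope * x
se = np.sqrt((resid @ resid) / (n - 2) / (xc @ xc))
t_stat = slope / se

print("slope:", slope, "t-statistic:", t_stat)
```

With a sample this large the t-statistic is far above conventional critical values, yet a one-unit change in x moves y by only about one hundredth of its standard deviation. Whether that magnitude matters is an economic judgment that the test itself cannot make.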
Theoretical econometrics examines the statistical properties of econometric procedures. Such properties include the power of hypothesis tests and the efficiency of estimators and of survey-sampling methods. Applied econometrics uses theoretical econometrics and real-world data for assessing economic theories, developing econometric models, analyzing economic history, and forecasting.
Econometrics may use standard statistical models to study economic questions, but most often these are applied to observational data rather than to data from controlled experiments. In this, the design of observational studies in econometrics is similar to the design of studies in other observational disciplines, such as astronomy, epidemiology, and political science. Analysis of data from an observational study is guided by the study protocol, although exploratory data analysis may be useful for generating new hypotheses. Economics often analyzes systems of equations and inequalities, such as supply and demand hypothesized to be in equilibrium. Consequently, the field of econometrics has developed methods for identification and estimation of simultaneous-equation models. These methods are analogous to methods used in other areas of science, such as the field of system identification in systems analysis and control theory. Such methods may allow researchers to estimate models and investigate their empirical consequences, without directly manipulating the system.
In recent decades, econometricians have increasingly turned to the use of experiments to evaluate the often-contradictory conclusions of observational studies. Here, controlled and randomized experiments provide statistical inferences that may yield better empirical performance than do purely observational studies.
One of the fundamental statistical methods used by econometricians is regression analysis. For an overview of a linear implementation of this framework, see linear regression. Regression methods are important in econometrics because economists typically cannot use controlled experiments. Econometricians often seek illuminating natural experiments in the absence of evidence from controlled experiments. Observational data may be subject to omitted-variable bias and a list of other problems that must be addressed using causal analysis of simultaneous-equation models.
Data sets to which econometric analyses are applied can be classified as time-series data, cross-sectional data, panel data, and multidimensional panel data. Time-series data sets contain observations over time; for example, inflation over the course of several years. Cross-sectional data sets contain observations at a single point in time; for example, many individuals' incomes in a given year. Panel data sets contain both time-series and cross-sectional observations. Multidimensional panel data sets contain observations across time, cross-sectionally, and across some third dimension. For example, the Survey of Professional Forecasters contains forecasts for many forecasters (cross-sectional observations), at many points in time (time-series observations), and at multiple forecast horizons (a third dimension).
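One way to picture the four kinds of data set is by the shape of the array that would hold them. The sketch below is illustrative only: the dimensions (4 units, 6 periods, 3 horizons) and the random values are placeholders, not real survey data.

```python
import numpy as np

rng = np.random.default_rng(2)
n_units, n_periods, n_horizons = 4, 6, 3  # hypothetical sizes

# Time series: one unit observed over many periods.
time_series = rng.normal(size=n_periods)

# Cross-section: many units observed at a single point in time.
cross_section = rng.normal(size=n_units)

# Panel: many units, each observed over many periods.
panel = rng.normal(size=(n_units, n_periods))

# Multidimensional panel: units x periods x a third dimension,
# such as forecast horizon in a survey of forecasters.
multi_panel = rng.normal(size=(n_units, n_periods, n_horizons))

# A panel contains both of the simpler structures as slices.
series_for_unit_0 = panel[0, :]    # a time series
cross_at_period_0 = panel[:, 0]    # a cross-section

print(time_series.shape, cross_section.shape, panel.shape, multi_panel.shape)
```

In practice such data are usually stored in long format with explicit unit and time identifiers. The dense-array layout here assumes a balanced panel, with every unit observed in every period.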
Econometric analysis may also be classified on the basis of the number of relationships modeled. Single-equation methods model a single variable (the dependent variable) as a function of one or more explanatory (or independent) variables. In many econometric contexts, the commonly used ordinary least squares method may not recover the theoretical relation desired or may produce estimates with poor statistical properties, because the assumptions for valid use of the method are violated. One widely used remedy is the method of instrumental variables (IV). For an economic model described by more than one equation, simultaneous-equation methods may be used to remedy similar problems, including two IV variants: Two-Stage Least Squares (2SLS) and Three-Stage Least Squares (3SLS).
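The contrast between ordinary least squares and an instrumental-variables estimator can be sketched with simulated data. This is a minimal two-stage least squares illustration under assumptions that are not in the text: a single endogenous regressor `x`, a single instrument `z`, an unobserved factor `u` that enters both `x` and the outcome, and a hypothetical structural slope of 1.5.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 5000
beta_true = 1.5  # hypothetical structural slope, used only for simulation

z = rng.normal(size=n)           # instrument: shifts x, unrelated to the error
u = rng.normal(size=n)           # unobserved factor in both x and y
x = z + u + rng.normal(size=n)   # endogenous regressor
y = 2.0 + beta_true * x + u + rng.normal(size=n)


def ols(X, target):
    """Least-squares coefficients of target on the columns of X."""
    coef, *_ = np.linalg.lstsq(X, target, rcond=None)
    return coef


const = np.ones(n)
X = np.column_stack([const, x])
Z = np.column_stack([const, z])

# OLS is biased here because x is correlated with the error term (through u).
beta_ols = ols(X, y)

# Stage 1: project the regressors on the instruments.
X_hat = Z @ ols(Z, X)
# Stage 2: regress the outcome on the fitted regressors.
beta_2sls = ols(X_hat, y)

print("OLS slope: ", beta_ols[1])
print("2SLS slope:", beta_2sls[1])
```

In this simulation the OLS slope is pushed upward by the unobserved factor, while the 2SLS slope should land near the value used to generate the data. Standard errors taken naively from the second-stage regression would not be valid, and dedicated econometric software corrects for this.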
Other important unifying or distinguishing methods include the Method of Moments, Generalized Method of Moments (GMM), time series analysis, and Bayesian methods.
Computational concerns are important for evaluating econometric methods and for use in decision making. Such concerns include mathematical well-posedness: the existence, uniqueness, and stability of any solutions to econometric equations. Another concern is the numerical efficiency and accuracy of software. A third concern is the usability of econometric software.
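Numerical stability can be made concrete with the condition number of a regression design matrix. The sketch below is an illustration under assumed data: two regressors that are nearly collinear (a perturbation of size 1e-3 is a hypothetical choice). It compares the conditioning of the design matrix X with that of the cross-product matrix X'X used in the normal equations.

```python
import numpy as np

rng = np.random.default_rng(4)
n = 200

x1 = rng.normal(size=n)
x2 = x1 + 1e-3 * rng.normal(size=n)  # nearly collinear with x1 (hypothetical)
X = np.column_stack([np.ones(n), x1, x2])

# 2-norm condition numbers: ratio of largest to smallest singular value.
cond_X = np.linalg.cond(X)
cond_XtX = np.linalg.cond(X.T @ X)

print("cond(X):   ", cond_X)
print("cond(X'X): ", cond_XtX)
```

Forming X'X squares the condition number, so solving the normal equations directly can lose roughly twice as many digits of accuracy as a method that works on X itself, such as a QR or singular value decomposition. This is one reason numerical least-squares routines typically avoid explicit inversion of X'X.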
Consider, for example, a wage equation of the form
$$\ln(\text{wage}) = \beta_0 + \beta_1\,(\text{years of education}) + \varepsilon$$
This example assumes that the natural logarithm of a person's wage is a linear function of (among other things) the number of years of education that person has acquired. The parameter $\beta_1$ measures the increase in the natural log of the wage attributable to one more year of education. The term $\varepsilon$ is a random variable representing all other factors that may have direct influence on wage. The econometric goal is to estimate the parameters $\beta_0$ and $\beta_1$ under specific assumptions about the random variable $\varepsilon$. For example, if $\varepsilon$ is uncorrelated with years of education, then the equation can be estimated with ordinary least squares.
If the researcher could randomly assign people to different levels of education, the data set thus generated would allow estimation of the effect of changes in years of education on wages. In reality, those experiments cannot be conducted. Instead, the econometrician observes the years of education of and the wages paid to people who differ along many dimensions. Given this kind of data, the estimated coefficient on Years of Education in the equation above reflects both the effect of education on wages and the effect of other variables on wages, if those other variables were correlated with education. For example, people born in certain places may have higher wages and higher levels of education. Unless the econometrician controls for place of birth in the above equation, the effect of birthplace on wages may be falsely attributed to the effect of education on wages.
The most obvious way to control for birthplace is to include a measure of the effect of birthplace in the equation above. Exclusion of birthplace, together with the assumption that the error term $\varepsilon$ is uncorrelated with education, produces a misspecified model. A second technique for dealing with omitted variables is instrumental variables estimation. Still a third technique is to include in the equation an additional set of measured covariates which are not instrumental variables, yet render the education coefficient $\beta_1$ identifiable. An overview of econometric methods used to study this problem can be found in Card (1999).
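The effect of omitting birthplace can be illustrated with simulated data. The sketch below is hypothetical throughout: birthplace is reduced to a single 0/1 indicator, and the return to education (0.08), the birthplace wage premium (0.30), and the education gap between regions (2 years) are invented values chosen only to make the bias visible.

```python
import numpy as np

rng = np.random.default_rng(5)
n = 20_000

# Hypothetical data-generating process.
born_in_region = rng.integers(0, 2, size=n).astype(float)  # 0/1 birthplace indicator
education = 12.0 + 2.0 * born_in_region + rng.normal(0.0, 2.0, size=n)
log_wage = (1.0 + 0.08 * education + 0.30 * born_in_region
            + rng.normal(0.0, 0.3, size=n))


def ols(X, target):
    """Least-squares coefficients of target on the columns of X."""
    coef, *_ = np.linalg.lstsq(X, target, rcond=None)
    return coef


const = np.ones(n)

# Short regression: birthplace omitted, so its effect loads onto education.
b_short = ols(np.column_stack([const, education]), log_wage)

# Long regression: birthplace included as a control.
b_long = ols(np.column_stack([const, education, born_in_region]), log_wage)

print("education coefficient, birthplace omitted: ", b_short[1])
print("education coefficient, birthplace included:", b_long[1])
```

In this simulation the short regression overstates the return to education because people from the higher-wage region also have more schooling. Adding the control should bring the coefficient back near the value used to generate the data. Controlling only helps when the omitted variable is actually observed; otherwise techniques such as instrumental variables are needed.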
Noted econometricians
Econometricians have received major prizes for work in the field, and other influential econometricians have won the John Bates Clark Medal.
Journals
Journals that publish work in econometrics include the Journal of Econometrics, the Review of Economics and Statistics, Econometric Theory, the Journal of Applied Econometrics, Econometric Reviews, the Econometrics Journal, Applied Econometrics and International Development, the Journal of Business & Economic Statistics, and the Journal of Economic and Social Measurement.
Software
EViews: a statistical package for Windows, used mainly for time-series-oriented econometric analysis. It is developed by Quantitative Micro Software, now a part of IHS. Version 1.0 was released in March 1994 and replaced MicroTSP.
gretl: an open-source statistical package, mainly for econometrics. The name is an acronym for Gnu Regression, Econometrics and Time-series Library. It has a graphical user interface and can be used together with X-12-ARIMA, TRAMO/SEATS, R, Octave, and Ox. It is written in C.
MATLAB and Econometrics Toolbox: MATLAB is a numerical computing environment and fourth-generation programming language developed by MathWorks. It allows matrix manipulations, plotting of functions and data, implementation of algorithms, creation of user interfaces, and interfacing with programs written in other languages.
OxMetrics: econometric software that includes the Ox programming language for econometrics and statistics, developed by Jurgen Doornik and David Hendry.
SHAZAM
Stata: a general-purpose statistical software package created in 1985 by StataCorp. It is used by many businesses and academic institutions around the world.
TSP: a programming language for the estimation and simulation of econometric models. TSP stands for "Time Series Processor", although it is also commonly used with cross-section and panel data. The company behind the program is TSP International, which was founded in 1978.
R: a programming language and software environment for statistical computing and graphics, widely used among statisticians for developing statistical software and for data analysis.
PSPP: a free software application for analysis of sampled data. It has a graphical user interface and a conventional command-line interface. It is written in C and uses the GNU Scientific Library for its mathematical routines and plotutils for generating graphs.
RATS: an abbreviation of Regression Analysis of Time Series, a statistical package for time series analysis and econometrics. RATS is developed and sold by Estima, Inc., located in Evanston, IL.
SAS
SPSS: a computer program used for survey authoring and deployment, data mining, text analytics, statistical analysis, and collaboration and deployment.
