DATA MANAGEMENT



Datum
- singular, quantitative pieces of info

Data - plural

Data Set - number of series of data (GDP, UN, INT)

Observations - individual datum in a series. (inflation from 1970.01 to 1979.12 would be 120 individual observations)

Variable - measurable characteristic about which we can collect. Xs, Ys, etc. - consumption, income, # children, marital status, race, etc - MICRO

MACRO - GDP, inflation, ex-rates, etc.

Rectangular/Matrix Data Set. Variables across the top. Observations down the side. Spreadsheet.

i = 1 to n, where n is the total number of observations.

Xi, Yi, Zi. X1 to Xn. Y1 to Yn. Z1 to Zn.

Data collection - In contrast to other disciplines, economists don't usually collect original data. Most data is available from government agencies - Fed Reserve, Dept of Commerce, IMF, World Bank, Dept of Labor, OECD, Dept of Agriculture, Dept of Census, etc. Or available from industry groups - NYSE, NARealtors, National Mortgage Assoc, etc.

CROSS-SECTION DATA - Observations are data for different entities (persons, firms, states, countries) at the same point in time.

Examples:
a. survey data from interviews in response to questions about some variable of interest.
b. survey about state spending on education across the 50 states.
c. sales tax collection by county in Mich
d. per capita income in 50 countries.

Dummy variables - Qualitative variable. 0 or 1.
0 = white 1 = non-white
0 = F    1 = M
0 = union   1=non-union

Can be used for race, sex, marital status, etc. Also for event studies.

TIME SERIES DATA

Observations for the same variable over time - months, quarters, years, etc.

Example: Annual observations for 25 years, 56-80 for 12 variables. GNP, PCE, DPI, M1, etc.

Also: Cross sectional-time series.

Example: Xit, i=US, UK, Japan, GER, FR. t= 56-80.

CHANGE AND GROWTH -

Often we want to generate a variable from another variable.

1. Absolute Change = Delta Yt = Yt - Yt-1
Can be negative, we are NOT dealing with "abs value", but absolute change. Change in a variable from one period to another. Change in the budget, change in the deficit, example.

EVIEWS: genr dx = d(x)
genr dgdp = d(gdp)

Tells us how fast the variable is changing, measured in the units of the variable, example dollars.

2. Percentage Change (or Rate of change, Rate of growth, percentage growth)

% Change at time t = Change Y / Yt-1 = (Yt - Yt-1)/Yt-1 = (Yt/Yt-1) - 1
Multiply by 100 to scale up to a whole number.

EVIEWS: genr grgdp = d(gdp)/gdp(-1) * 100 (for annual data)

Tells us how fast the variable is growing, in relative terms, as a percentage.

Problems with percentage:

100 to 200 to 100

+100% - 50% net= +50%

Impose constant growth, or continuous compounding by using logs.

EVIEWS: genr inf = log(CPI/CPI(-1))*1200

100 to 200 to 100

+69.31%    -69.31%


CONSTRUCTED VARIABLES

Sometimes the desired data has to be constructed from "raw" data. Examples:

Growth rate of GDP, instead of GDP in dollars.

Four examples:

1. Index values - some data are presented in index form. Examples: PPI, CPI, IP, Index of Leading Economic Indicators, etc. Some base year is arbitrarily chosen, Index set at 100 in base year.

Growth rate is then calculated. Inflation = growth rate in the CPI. Percentage change from the period before or the percentage change from a year ago.

2. Nominal vs. Real. (Current vs constant) In many cases, we want to adjust nominal variables into real variables. Real GDP, real budget deficits, real Stock Returns, real income, real wages, etc. We need two variables - Nominal Variable and a Price Index - we can then calculate the variable in REAL TERMS.

Example: We have Nominal GDP (GDP) and GDP Deflator (GD) from 59-96. We want to construct Real GDP (RGDP):

REAL GDPt = NOM GDPt x  (100 /GDt)

where 100 is the GD in the base year.

Example:  YEAR          GDP(current $)  GD
               85                 4200                95
               86                 4400                97
               87                 4500               100
               88                 4700               104
               89                 4800               108

RGDP:           85 = 4200 X 100/95 = 4421 (constant 1987 $)
                      86 = 4400 X 100/97 = 4536
                      87 = 4500 X 100/100 = 4500
                      88 = 4700 X 100/104 = 4519
                      89 = 4800 X 100/108 = 4444


3. Variables as a ratio of two others: DEF/GDP, FED/GDP, utilization rate (Food stamp example), percentage of school district getting free lunch, percentage of children passing exam, etc...

4. Converting all variables to logs. Log form.

Example: genr lgdp = log(gdp)

5. Converting to a different frequency. Monthly to quarterly or annually. Quarterly to monthly.

Example: NIPA accounts are quarterly. GDP, GNP, Deficits, GDP Deflator, Government expenditures, PCE, etc.

Most other statistics are released monthly: IP, CPI, PPI, interest rates, ex-rates, money supply, etc.

If you wanted to test for the effects of deficits on economic activity or interest rates, you would have variables with two different frequencies. You would have to convert everything to quarterly frequency, or annual.

EVIEWS: Converting Quarterly or Monthly Data to Annual

1. "store" the variable you want to convert (e.g. monthly or quarterly frequency)
2. start a new workfile, select annual frequency.
3. select "options" then "Dates-Frequency" to get conversion menu.
(Options: Average, Sum, First Obs, Last Observation).
4. "fetch" the M or Q variable and conversion will happen automatically.

6. Seasonal adjustment. SA/NSA. Randomness issue.

EVIEWS: for Q and monthly data only.
1. type: "seas x" to seasonally adjust variable x.
2. select the type of seasonal adjustment.