Revisions Analysis

Economic indicators are often revised many times after their first publication. This is done since Eurostat and the other statistical agencies put special attention on the quality of the data and at the same time try to back the users’ need of disposing of the most recent data. In particular, there exists a trade-off between the timeliness of data publication, on the one hand, and the reliability and stability of the data on the other. The principles of Eurostat’s publication policy, which are reported in Eurostat’s Code of Practice (Eurostat 2017), are intended to balance the need of reliability and accuracy with that of timeliness. As a first step, Eurostat publishes early releases of the statistics, in order to meet the timeliness principle; afterwards, it revises and updates them, as new information becomes available. The publishing of the early versions of the data allow to meet the timeliness principle: as Principle 13 states, “European Statistics are released in a timely and punctual manner”, and “a standard daily time for the release of statistics is made public”. Then, to address potential inaccuracies in the early versions, “statistical processes are routinely monitored and revised as required” and “revisions follow standard, well-established and transparent procedures” (Principle 8).

Revisions are the differences between the early and the latest value of the statistics and can be considered as a measure of the data reliability. Principle 12 in Eurostat (2017) states that the aim of the European Statistics is to “… accurately and reliably portray reality”, through the integration of different sources of data and revisions which are “… regularly analysed in order to improve data sources, statistical processes and outputs”.

Classification of revisions

Revisions can be carried out for different reasons (Eurostat 2014):

i) Incorporation of additional data: e.g. late responses to surveys, replacement of forecasts with available data, incorporation of data which more closely matches concepts and definitions;

ii) Updating of routine adjustment/treatment or compilation: e.g. adjusting for seasonal factors, changing the base year for the time series;

iii) The introduction of new methods and concepts: e.g. improvements in the estimation methods, changes in classifications, introduction of new definitions;

iv) Correction of errors caused by the incorrect management of source data and/or by wrong answers given by survey respondents.

Revisions occur periodically shortly after the first publication, usually after a few months, and are done to add new information and replace forecasts with actual data. Moreover, periodic revisions occur on a longer time span for wider adjustments. When this is the case, the schedule is regular, defined ex-ante and communicated to the public. Instead, in the case of unexpected errors, revisions are carried out as soon as possible, and are therefore occasional and timely. To sum up, we can distinguish different kind of revisions according to their timing (Eurostat 2014):

i) Routine revisions, generally regarding recent periods;

ii) Annual revisions, carried out when more information emerges;

iii) Major revisions occurring at longer intervals (3/4 years), which depend on the change of classifications, base periods and so on. They require the re-computation of the whole time series;

iv) Unexpected revisions, usually done when previous errors are discovered.

Definition of revisions

Given a particular indicator (that is, a time series) and two subsequent estimates of it, which both refer to the same period inline_formula not implemented (for example, a given month or quarter) a revision can be defined as

formula not implemented

where inline_formula not implemented is a a preliminary (or earlier) estimate and inline_formula not implemented a later (more recent) estimate of the indicator. This definition of revision is used when dealing with indicators measured in growth rates (period-on-period or year-on-year growth rates). Instead, when dealing with revisions to values in level, we define revisions in relative terms:

formula not implemented

Real-time datasets

A useful tool for producers of official statistics to undertake revision analysis and present its results are the so-called “real-time datasets” (or revision triangles). These show how estimates change over time and provide further information about the dissemination policy, the timing of revisions, explanation of revision sources and the status of the published data. To construct a real-time dataset, it is necessary to collect several vintages of the same indicator. A vintage is defined as a “set of data (sequence of values) that represented the latest estimate for each reference point in the time series at a particular moment in time” (Mckenzie and Gamba, 2008). Therefore, a vintage can be thought of as a photograph of the state of the art knowledge on an indicator (relative to a given point in time) taken at a specific point in time. We can define the estimate relative to a particular indicator inline_formula not implemented as

formula not implemented

where inline_formula not implemented represents the point in time to which the indicator refers and inline_formula not implemented represents the point in time when the vintage that contained that particular estimate of the indicator was released, that is, when the indicator relative to time inline_formula not implemented was estimated. The real-time dataset is constructed as follows.

The revision triangle can be read “horizontally”, “vertically” or “diagonally”. When it is read horizontally, it provides time series forecasts released at the available dates (this can be useful to assess the forecasting models currently used). Instead, when it is read vertically, it gives the revision history referred to a given period inline_formula not implemented, from the preliminary estimates to the latest (such information is useful to assess the reliability of the earlier estimates). Finally, when triangles are read along the main diagonal (or the first sub-diagonal, or the second sub-diagonal, …), they give the time series of the first (or second, or third, …) releases. To be more specific, assuming that inline_formula not implemented, the first entry inline_formula not implemented is the preliminary estimate of the indicator relative to time inline_formula not implemented, while the rest of the first row (in grey) are the forecast made at time inline_formula not implemented of the same indicator at the following periods inline_formula not implemented, inline_formula not implemented, inline_formula not implemented It is easy to understand why it is called triangle, since the grey part of the table represents forecasts and not data that is actually available at the moment of its release. The first column collects the subsequent revisions that have been made to update the estimate of the indicator relative to time inline_formula not implemented. However, most often, the real-time dataset is read diagonally, since this allows to build time series of revisions on which the descriptive analysis is carried out. For example, the main diagonal (in red) collects all the data series when it was first released, the first sub-diagonal (in orange) collects all the data series after it was revised once, and so on.

Assuming that the indicator inline_formula not implemented is collected as growth rates, the revision time series can be easily calculated, according to the formula inline_formula not implemented, by subtracting horizontally the sub-diagonals to the main diagonal. So, for instance, the first revision will be the time series obtained as inline_formula not implemented, the second revision will be obtained as inline_formula not implemented, and so on.

Descriptive statistics

Once one has calculated the revisions, it is possible to proceed to the actual analysis of the revisions themselves. This is done by means of a series of descriptive statistics that can be calculated to answer specific questions. In what follows inline_formula not implemented refers to the number of observations used for the analysis.

Mean revision (or arithmetic average).

formula not implemented

The sign of this measure indicates wether on average the estimate of the earlier releases is biased. This bias is positive (negative) if the sign of the statistics is negative (positive). However, since revisions of opposite sign cancel out, this measure, often also called “average bias”, is of limited use.

Statistical significance of the mean revision. A modified t-test can be performed to determine whether the mean revision is statistically different from zero, which may give an insight on wether an actual bias exists in the preliminary estimates. The t statistics is computed as

formula not implemented

where the denominator is the heteroscedasticity and autocorrelation consistent standard deviation of mean revision and it is defined as the square root of inline_formula not implemented with inline_formula not implemented

The critical values of the t statistic and the p-value are computed in the usual way.

Median revision. It can be a piece of useful supplementary information to the mean revision as it is not affected by outliers.

formula not implemented

Pecentage of negative/zero/positive revisions. These measure can also be useful supplementary information to the mean revision and can be computed as

formula not implemented

where inline_formula not implemented if inline_formula not implemented for negative, zero, positive revisions respectively.

Mean absolute revision.

formula not implemented

It indicates the average size of the revisions. However, unlike the mean revision, it does not provide an indication of the directional bias and it is therefore more stable.

Range that 90% of the revisions lie within. This is simply the interval of the inline_formula not implemented to the inline_formula not implemented percentile of the distribution of revisions. It gives information about the "expected" range the revisions usually lie within.

Median absolute revision. It is a measure of central tendency that is little influenced by outliers and can complement the mean absolute revision.

formula not implemented

Standard deviation of revisions.

formula not implemented

This measure is used to assess the spread of revisions around their mean. It is sensitive to outliers and therefore it is better not to use it with skewed distributions. It is useful to compare the volatilities of different revision intervals.

Root mean square revision.

formula not implemented

This is essentially a combination of the mean revision and the variance of revision statistics. It is therefore a broader measure than the standard deviation of revision.

Maximum and minimum revision, interquartile deviation and range of revision. These measures can be used to retrieve additional information regarding the distribution of the revisions.

Skewness.

formula not implemented

This is a formal measure of asymmetry of the distribution of the revisions. When the statistics is negative (positive) the median is greater (smaller) than the mean and the distribution presents a fatter tail towards the left (right).

Correlation between revision and earlier estimate (test if revisions are "noise").

formula not implemented

where inline_formula not implemented

When the correlation between inline_formula not implemented and inline_formula not implemented is statistically significant, this implies that there was information available at the time of the first estimate which was not efficiently exploited and therefore the revisions can be interpreted as "noise".

Correlation between revision and earlier estimate (test if revisions are "news").

formula not implemented

This is in a sense the opposite statistics to the previous one. If the correlation between inline_formula not implemented and inline_formula not implemented is statistically significant, this implies that actual new information is being exploited to compute the revision, and therefore the revision can be interpreted as "news".

Serial correlation of revisions.

formula not implemented

Finally, this correlation measure is used to assess wether there is bias in the revision process (that is, if the revision process is in some way predictable).

Decomposition of the Mean Squared Revision. It can be shown that the following holds:

formula not implemented

where inline_formula not implemented is the mean revision, inline_formula not implemented and inline_formula not implemented are the standard deviations of the latest and preliminary estimates, respectively, and inline_formula not implemented is their correlation. Dividing by inline_formula not implemented yields inline_formula not implemented.

inline_formula not implemented is the proportion of the MSR due to the mean revision being different from zero. It is sometimes called "mean error".

inline_formula not implemented is the proportion of MSR which is due to the slope coefficient inline_formula not implemented being different from inline_formula not implemented, in the linear regression inline_formula not implemented. It is sometimes called "slope error".

inline_formula not implemented is the proportion of MSR which is not caused by systematic differences between earlier and later estimates.

A quick interpretation of these measures is that earlier estimates are "good" if the decomposition gives low values for inline_formula not implemented and inline_formula not implemented and a high value for inline_formula not implemented.

An example in R

Revision triangles for the GDP series of the EU and Euro area can be downloaded from Eurostat's website: https://ec.europa.eu/eurostat/web/national-accounts/data/other.

As an example, the file below reports a revision triangle of the GDP series, and it has been published to demonstrate the reliability of the GDP estimates for the Euro area. The reported revision triangle is relative to the period 2017:Q4-2020:Q3. In particular, this example shows an application to the Quarter on Quarter percentage rates of changes of seasonally and calendar adjusted GDP.

RevisionsTriangles_EuroAreaGDP_QonQ.xlsx

13.83 KBDownload

library(readxl)

## Import the revision triangled <- read_xlsx(RevisionsTriangles_EuroAreaGDP_QonQ.xlsx)d

As a first step, we organize the data into separate vectors of quarterly estimates containing the revisions. Starting from the first day after the end of the considered quarter, the four estimates in our dataset are: the preliminary estimate (released after 30 days), the flash estimate (after 45 days), a second release (after 65 days) and the last version (after 100 days). We can easily isolate the rows referring to each release by using the function grepl on the first column of d. The four series of quarterly GDP belonging to the first to fourth releases are then obtained by isolating the diagonal of the resulting triangles.

## Data preparationd1 <- d[grepl("\\(T\\+30\\)", unlist(d[,1])), ]P <- diag(as.matrix(d1[,-1])) ## Series of the preliminary released2 <- d[grepl("\\(T\\+45\\)", unlist(d[,1])), ]t45 <- diag(as.matrix(d2[,-1])) ## Series of the flash release (after 45 days)d3 <- d[grepl("\\(T\\+65\\)", unlist(d[,1])), ]t65 <- diag(as.matrix(d3[,-1])) ## Series of the second release (after 65 days)d4 <- d[grepl("\\(T\\+100\\)", unlist(d[,1])), ]L <- diag(as.matrix(d4[,-1]))  ## Series of the third release (after 100 days)

## Showing the resulting datadata <- cbind(P, t45, t65, L)row.names(data) <- names(d)[-1]data

For illustrative purposes, we study the first three series of revisions.

## Isolating the relevant revisionsR1 <- t45-PR2 <- t65-t45R3 <- L-t65n <- length(R1)

We can now proceed with the actual analysis of the revisions. In doing so, we try to answer some specific questions.

What is the average size of revisions, or the usual range that revisions lie within?

## Mean absolute revisionMAR1 <- sum(abs(R1))/nMAR2 <- sum(abs(R2))/nMAR3 <- sum(abs(R3))/n## Range that 90% of revisions lie withinrange90_1 <- quantile(R1,0.95) - quantile(R1,0.05)range90_2 <- quantile(R2,0.95) - quantile(R2,0.05)range90_3 <- quantile(R3,0.95) - quantile(R3,0.05)## Median absolute revisionabs.median1 <- median(abs(R1))abs.median2 <- median(abs(R2))abs.median3 <- median(abs(R3))## Organize the results in a tabletable1 <- cbind(  rbind(MAR1, MAR2, MAR3),    rbind(abs.median1, abs.median2, abs.median3),  rbind(range90_1, range90_2, range90_3))colnames(table1) <- c("Mean absolute revision", "Median absolute revision", "90% range")rownames(table1) <- c("Revision 1", "Revision 2", "Revision 3") table1

As we can easily observe by looking at table1, the first revision is the most substantial, as it is natural to expect. Subsequent revisions tend to introduce slighter corrections in the estimates. We can also see that later revisions tend to be more concentrated in a smaller interval for each data point. Moreover, in all three revisions, the median absolute revision is smaller than the mean, suggesting that some outliers (bigger revisions) occur that drive the mean upwards.

Is the average level of revision close to zero, or is there an indication that a possible bias exists in the earlier estimate?

## Mean revisionMR1 <- sum(R1)/nMR2 <- sum(R2)/nMR3 <- sum(R3)/n## Median revisionmedian1 <- median(R1)median2 <- median(R2)median3 <- median(R3)## % of positive/zero/negative revisionsneg1 <- sum(R1 < -0.005)/nzer1 <- sum(-0.005 < R1 & R1 < 0.005)/npos1 <- sum(R1 > 0.005)/nneg2 <- sum(R2 < -0.005)/nzer2 <- sum(-0.005 < R2 & R2 < 0.005)/npos2 <- sum(R2 > 0.005)/nneg3 <- sum(R3 < -0.005)/nzer3 <- sum(-0.005 < R3 & R3 < 0.005)/npos3 <- sum(R3 > 0.005)/n##  Testing whether the mean revision is statistically different from zerot.test_adj <- function(R){   n <- length(R)  e <- R - mean(R)  varR <- (sum(e^2) + 0.75*(sum(e[2:n]*e[1:(n-1)])) +   (2/3)*sum(e[3:n]*e[1:(n-2)]))/(n*(n-1))  t_stat <- mean(R)/sqrt(varR)  p_value = 2*pt(abs(t_stat), n-1, lower.tail=F)  list(t_stat, p_value)}## Table of resultstable2 <- cbind(  rbind(t.test_adj(R1)[1], t.test_adj(R2)[1], t.test_adj(R3)[1]),    rbind(t.test_adj(R1)[2], t.test_adj(R2)[2], t.test_adj(R3)[2]),  rbind(neg1, neg2, neg3),  rbind(zer1, zer2, zer3),  rbind(pos1, pos2, pos3))colnames(table2) <- c("t-statistc", "p-value", "Negative %", "Close to zero %", "Positive %")rownames(table2) <- c("Revision 1", "Revision 2", "Revision 3") table2

As the t-statistics in table2 shows, there is no evidence of any significant deviation of the average revision from zero at a 95% confidence level. This result supports the reliability of Eurostat's estimates for the GDP of the Euro area. However, the percentage of positive revisions in our sample tends to be quite higher that the percentage of negative revisions. This could indicate either that the preliminary estimate is negatively biased or that the revision process is positively biased.

What is the extent of variability of revisions?

## Standard deviation of revisionSDR1 <- sd(R1)SDR2 <- sd(R2)SDR3 <- sd(R3)## Root mean square revisionRMSR1 <- sqrt(sum(R1^2)/(n-1))RMSR2 <- sqrt(sum(R2^2)/(n-1))RMSR3 <- sqrt(sum(R3^2)/(n-1))## Skewnessskew1 <- 3*(MR1-median1)/SDR1skew2 <- 3*(MR2-median2)/SDR2skew3 <- 3*(MR3-median3)/SDR3## Table of results table3 <- cbind(  rbind(SDR1, SDR2, SDR3),    rbind(RMSR1, RMSR2, RMSR3),  rbind(skew1, skew2, skew3))colnames(table3) <- c("Std. dev. of revision", "Root mean square revision", "Skewness of revision")rownames(table3) <- c("Revision 1", "Revision 2", "Revision 3") table3

We can see that the later releases are more concentrated around their mean and we can see this from both the standard deviation of revision and the root mean square revision. Furthermore, the distributions of the revisions are not entirely symmetric. In particular, the first revision stands out as being negatively skewed while the third revision as being positively skewed.

As a general rule of thumb: if the skewness is less than -1 or greater than 1, the distribution is highly skewed; if the skewness is between -1 and -0.5 or between 0.5 and 1, the distribution is moderately skewed; if the skewness is between -0.5 and 0.5, the distribution is approximately symmetric.

What is the average size of revision relative to the estimate itself?

## Relative mean absolute revisionRMAR1 <- sum(abs(R1))/sum(abs(P))RMAR2 <- sum(abs(R2))/sum(abs(P))RMAR3 <- sum(abs(R3))/sum(abs(P))## Average absolute value of first published estimatep_bar <- sum(abs(P))/n## Table of results table4 <- rbind(RMAR1, RMAR2, RMAR3)colnames(table4) <- "Relative mean absolute revision"rownames(table4) <- c("Revision 1", "Revision 2", "Revision 3") table4

The mean absolute revision can be interpreted as the expected proportion of the first published estimate that is likely to be revised over the revision interval being considered. Therefore, within 45 days, around 3% of the estimetes are likely to be revised and in the subsequent 55 days only around an additional 3%.

Is the earlier published estimate a good or ‘efficient’ forecast of the later published estimate?

## Test if revisions are "noise"rho1 <- cor(R1,P)rho2 <- cor(R2,P)rho3 <- cor(R3,P)## Test if revisions are "news"rhor1 <- cor(R1,L)rhor2 <- cor(R2,L)rhor3 <- cor(R3,L)## Serial correlation of revisionsser.cor1 <- cor(R1[2:12],R1[1:11])ser.cor2 <- cor(R2[2:12],R2[1:11])ser.cor3 <- cor(R3[2:12],R3[1:11])## Mean squared revisionMSR1 <- sum(R1^2)/(n-1)MSR2 <- sum(R2^2)/(n-1)MSR3 <- sum(R3^2)/(n-1)## Table of results table5 <- cbind(  rbind(rho1, rho2, rho2),    rbind(rhor1, rhor2, rhor3),  rbind(ser.cor1, ser.cor2, ser.cor3),  rbind(MSR1, MSR2, MSR3))colnames(table5) <- c("Noise", "News", "Serial correlation", "MSR")rownames(table5) <- c("Revision 1", "Revision 2", "Revision 3") table5

We do not find evidence that the revision process is driven by "noise" nor by "news". Rather, since the revisions and the earlier estimates are negatively correlated, it appears that the revision process is actually "smoothing out" the initial estimates. That is, if a given estimate was initially particularly high, the revision process tends to lower that estimate and does the converse for particularly low initial estimates. Futhermore, there is no strong serial correlation, so that there appears not to be strong biases in the revision process.

References

McKenzie, Richard, and Michela Gamba (2008). "Interpreting the results of Revision Analyses: Recommended Summary Statistics." Contribution to OECD/Eurostat Task Force on “Performing Revisions Analysis for Sub-Annual Economic Statistics.
Eurostat (2014), “Memobust Handbook on Methodology of Modern Business Statistics”. https://ec.europa.eu/eurostat/cros/system/files/Quality Aspects-02-T-Revisions of Economic Official Statistics v1.0.pdf
Eurostat (2017). “European Statistics Code of Practice”. https://ec.europa.eu/eurostat/documents/4031688/8971242/KS-02-18-142-EN-N.pdf/e7f85f07-91db-4312-8118-f729c75878c7?t=1528447068000
ESS guidelines on revision policy for PEEIs, 2013 edition. https://ec.europa.eu/eurostat/documents/3859598/5935517/KS-RA-13-016-EN.PDF.pdf/42d365e5-8a65-42f4-bc0b-aacb02c93cf7?t=1558683870000

Revisions Analysis

Classification of revisions

Definition of revisions

Real-time datasets

Descriptive statistics

An example in R

References

Runtimes (1)