A Simple, Positive Semi-Definite, Heteroskedasticity and Autocorrelation Consistent Covariance Matrix
Source: Newey, W. K. & West, K. D. (1987) · Econometrica 55(3), 703–708 · DOI: 10.2307/1913610
Problem it solves
GMM and regression inference needs a consistent estimate of the long-run (frequency-zero) covariance matrix S of the moment conditions / score. The simplest truncated estimator — a sum of the sample autocovariances out to lag m — is consistent but need not be positive semi-definite in finite samples, so estimated variances and t-statistics can come out negative. Ordinary (and even White heteroskedasticity-robust) standard errors are also wrong when errors are serially correlated, which is pervasive in finance with overlapping observations or persistent predictors; uncorrected, naive t-statistics overstate significance and manufacture spurious predictability.
The method
Estimate the long-run variance as a weighted sum of autocovariances up to a chosen lag (bandwidth) m, using modified Bartlett (triangular) weights that decline linearly with lag:
where Ω̂ⱼ is the j-th sample autocovariance of the moment terms. This is numerically 2π times a spectral-density-at-zero estimator smoothed with the Bartlett window. The triangular weighting guarantees Ŝ_T is positive semi-definite by construction (the paper's Theorem 1, proved via positive-semi-definiteness of the Bartlett-weighted autocovariance matrix). The bandwidth m must grow slowly with sample size T for consistency (Theorem 2). The resulting "HAC" standard errors give valid inference.
Assumptions & inputs
Inputs: the moment/score series (e.g., x·residual in a regression) and a bandwidth m.
Requires the moments to have the mixing/moment conditions for a CLT; results are large-sample (asymptotic).
For a single regression coefficient this delivers the familiar Newey–West standard errors.
How to use it
The default fix for inference in time-series and panel finance regressions: predictive regressions, long-horizon/overlapping returns, and the second stage of Fama–MacBeth.
Choose the lag length; a common rule of thumb is m ≈ 4(T/100)^{2/9} or data-driven choices (Andrews 1991; Newey–West 1994).
Directly relevant to honest backtest evaluation — uncorrected autocorrelation is a classic way regressions look significant when they are not. Pairs conceptually with the multiple-testing / deflated-Sharpe literature.
Limitations & pitfalls
Results depend on the bandwidth/lag choice: too few lags under-corrects, too many is noisy (Andrews 1991 gives data-driven selection).
A large-sample correction; small samples can still mislead, and HAC errors can be downward-biased with highly persistent regressors.
The triangular kernel is not the most efficient (the quadratic-spectral kernel is optimal in Andrews 1991), but it is simple and guaranteed PSD.
Key references
Newey, W. & West, K. (1987) — A Simple, Positive Semi-Definite, HAC Covariance Matrix — Econometrica
White, H. (1980) — A Heteroskedasticity-Consistent Covariance Matrix Estimator — Econometrica
Hansen, L. (1982) — Large Sample Properties of Generalized Method of Moments Estimators — Econometrica
Andrews, D. (1991) — Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation — Econometrica
Provenance: verified/generated from the paper's full text.