Abstract
We propose an out-of-sample prediction approach that combines unrestricted mixed-data sampling with machine learning (mixed-frequency machine learning, MFML). We use the MFML approach to generate a sequence of nowcasts and backcasts of weekly unemployment insurance initial claims based on a rich trove of daily Google Trends search volume data for terms related to unemployment. The predictions are based on linear models estimated via the LASSO and elastic net, nonlinear models based on artificial neural networks, and ensembles of linear and nonlinear models. Nowcasts and backcasts of weekly initial claims based on models that incorporate the information in the daily Google Trends search volume data substantially outperform those based on models that ignore the information. Predictive accuracy increases as the nowcasts and backcasts include more recent daily Google Trends data. The relevance of daily Google Trends data for predicting weekly initial claims is strongly linked to the COVID-19 crisis.
Original language | English |
---|---|
Journal | International Journal of Forecasting |
Volume | 39 |
Issue | 3 |
Pages (from-to) | 1122-1144 |
Number of pages | 23 |
ISSN | 0169-2070 |
DOIs | |
Publication status | Published - Jul 2023 |
Keywords
- Elastic net
- Internet search
- LASSO
- Mixed-frequency data
- Neural network
- Unemployment insurance
- Variable importance