EXPLORING THE INAR MODEL ON HEAVY TAILED TIME SERIES DATA WITH OUTLIERS
DOI:
https://doi.org/10.62050/fjst2024.v8n1.281Keywords:
Count Data, Modelling, Outlier, Simulation, INAR(p)Abstract
Count data are intrinsically measures of event frequency; it is clear that there is an intrinsic relationship with recurring time to event. Events are typically tallied within time intervals for practical and convenient reasons. The existence of outliers is one issue that prevents count data from being stationary in time series analysis; this has an impact on the effectiveness of fitting several common stationary models to the count data collected over time. Thus, the purpose of this study was to examine how well the Integer Valued Autoregressive (INAR) model performed while modeling count data that included outliers. While this model has been studied for count time series data, it has not been studied for varying degrees of outliers. A monte-carlo simulation was carried out to select the best INAR(p), where p=1,2,3 and 4 on data with 10%, 20% and 30% outliers at different sample sizes. The INAR (4) has the best fit across the sample sizes at the larger percentages of outliers while INAR (3) at the lowest percentage with smallest information criteria of assessment and they are therefore recommended for such modeling.
Downloads
References
Akeyede, I. Bakari, H.R. and Muhammad, R.B.. (2022): Robustness of ARIMA and ACP Models to Over-dispersion in Analysis of Count Data, Journal of Nigeria Statistical Association. ISSN: 0331-9504. Volume 34, pp 95-105, https://nsang.org/_uploads/uploads/2021/60112b447fdc2_5.pdf
Freeland RK (1998) Statistical analysis of discrete time series with applications to the analysis of workers compensation claims data. PhD Thesis, University of British Columbia, Canada
Johnson, N. L., Kemp, A. W., and Kotz, S. (2005). Univariate Discrete Distribution. Wiley, New York.
Ndwiga A. Macharia, Oscar Ngesa, Anthony Wanjoya and Damaris FelistusMulwa (2019). Comparison of Statistical Models in Modeling Over Dispersed Count Data with Excess Zeros. International Journal of Research and Innovation in Applied Science (IJRIAS) | Volume IV, Issue V, 80-90.
Silva, I.; Silva, M.E.; Pereira, I. and Silva, N. (2005). Replicated INAR(1) Process, Methodology and Computing in Applied Probability, 7, 517–542.
Steutel, F.W. and Van Harn, K. (1979). Discrete analogues of self-decomposability and stability, The Annals of Probability, 5, 893–899
Weiß H. (2009). Modelling time series of counts with overdispersion.Stat Methods Appl. 18:507–519. https://doi.org/10.1007/s10260-008-0108-6