List of Past Projects (Sorted by Date)
Forecasts of Motor Vehicle Traffic Fatalities in the United States using Time Series Regression Models
With an increased accident rate, national road safety has become an issue during COVID-19. The more accidents increase the likelihood of more fatalities. This article considers a time series regression model for forecasting monthly fatality counts in the United States with the publicly accessible traffic fatality data. The time series regression model accounts for the prominent statistical features in the data, including seasonality, autocorrelation, outliers, and the intervention effect of the COVID-19 pandemic. Additionally, total miles traveled and average gas prices are used as the potential covariates to help forecast fatality counts. Overall, forecasts generated by our method are more accurate than those using the seasonal autoregressive integrative moving average models, exponential smoothing methods, and the recurrent neural network model. They are roughly similar to those using the time series cross-sectional regression model implemented by the National Highway Traffic Safety Administration. Moreover, our approach has the capability to forecast monthly fatality counts in the coming years.
Transfer Function Analysis for the Dow Jones and the Australia Ordinaries indices
This project focuses on Transfer Function Analysis applied to the Dow Jones and Australia Ordinaries indices. In multivariate time series analysis, we often encounter situations where one time series is influenced by the past values of another, indicating a relationship between them. This interdependence is characterized by a transfer function relationship, which can be effectively modeled. Our analysis aims to explore this relationship between the indices, providing insights into their dynamic interactions and enhancing our understanding of their respective behaviors in the financial market.
Crime Analysis of the city of Baltimore using Point Pattern Analysis
This study introduces an approach to spatial and temporal crime analysis using point pattern analysis, focusing on crime events in Baltimore. We explore the nature of point processes—random collections of points in continuous space or time—and examine their application to various phenomena, including crime occurrences. Our research emphasizes the importance of effective statistical methods for analyzing data derived from point processes, classifying them into the spatial and spatio-temporal models. We apply three structured point process models: the Inhomogeneous Poisson process, the Log-Gaussian Cox process, and the Hawkes process, to capture the spatial patterns of crime incidents. Additionally, we investigate the impact of the COVID-19 pandemic on these patterns and predict future crime hotspots. This work aims to enhance the understanding of crime dynamics and inform policymakers about the potential future trends.
Does Crime Cause Itself? A Time Series Analysis of Violent Crime Rates in Virginia
Crime has captivated the American public and policymakers for over a century. While numerous theories explore the nature and causes of violent crime, there has been limited attention given to the potential endogeneity of crime. Additionally, quantitative models that assess causality and predictive capability are largely absent. This research aims to address these gaps by applying ARIMA time series techniques to violent crime rates in Virginia, covering a 35-year period from 1985 to 2020. We test several models and find that the AR(1) model demonstrates the best explanatory power for Virginia’s crime rates. Our analysis also indicates that violent crime rates exhibit auto-regressive causality. Using the AR(1) model, we predict a gradual increase in violent crime in the Commonwealth over the next decade. The conclusions of this paper should prove valuable to government officials and other stakeholders affected by crime.
Hierarchical Bayesian Modeling of Binary data: A Heart Failure Clinical Research
Heart failure occurs when the heart is unable to pump blood effectively. Various conditions can weaken the heart’s pumping ability, preventing it from meeting the normal demands of circulating blood throughout the body. Numerous factors may influence a patient’s experience with heart failure, potentially leading to mortality. In this research, we apply hierarchical analysis to a dataset collected over the time in Pakistan, which includes information on individuals suffering from heart failure. Initially, the data is clustered based on patients’ ages, followed by the fitting of a hierarchical logistic regression model to the binary response variable.
Prediction of death due to heart failure using logistic regression
In this project, we begin with a brief exploration of the dataset, describing each variable individually. Next, we fit a logistic regression model to the data and interpret the results. Following this, we apply a subset selection method to develop a reduced model with fewer predictors for the response variable. Finally, we use principal component analysis (PCA) for data reduction, fitting a logistic model using the principal component scores. We perform model comparisons and predict mortality for patients suffering from heart failure for each model introduced.
Bayesian Modeling of Binary and Ordinal data: A survey analysis on COVID-19
The COVID-19 virus causes severe respiratory diseases, and the pandemic that began in 2020 significantly impacted lives around the globe. This research presents a simple analysis of data collected through an online survey in Iran. Initially, we compute the posterior distribution and the values of the hyperparameters. Subsequently, we derive Bayesian credible intervals and conduct Bayesian hypothesis testing. A logistic regression model is fitted to both binary and ordinal responses, followed by the calculation of Bayesian estimates for the model parameters.