Minerva Elements Records

Permanent URI for this collection

Search Results

Now showing 1 - 4 of 4
  • Item
    Thumbnail Image
    Imputation techniques on missing values in breast cancer treatment and fertility data.
    Wu, X ; Akbarzadeh Khorshidi, H ; Aickelin, U ; Edib, Z ; Peate, M (BioMed Central, 2019-12-01)
    Clinical decision support using data mining techniques offers more intelligent way to reduce the decision error in the last few years. However, clinical datasets often suffer from high missingness, which adversely impacts the quality of modelling if handled improperly. Imputing missing values provides an opportunity to resolve the issue. Conventional imputation methods adopt simple statistical analysis, such as mean imputation or discarding missing cases, which have many limitations and thus degrade the performance of learning. This study examines a series of machine learning based imputation methods and suggests an efficient approach to in preparing a good quality breast cancer (BC) dataset, to find the relationship between BC treatment and chemotherapy-related amenorrhoea, where the performance is evaluated with the accuracy of the prediction. To this end, the reliability and robustness of six well-known imputation methods are evaluated. Our results show that imputation leads to a significant boost in the classification performance compared to the model prediction based on listwise deletion. Furthermore, the results reveal that most methods gain strong robustness and discriminant power even the dataset experiences high missing rate (> 50%).
  • Item
    Thumbnail Image
    Multi-objective semi-supervised clustering to identify health service patterns for injured patients
    Khorshidi, HA ; Aickelin, U ; Haffari, G ; Hassani-Mahmooei, B (SPRINGER, 2019-08-29)
    PURPOSE: This study develops a pattern recognition method that identifies patterns based on their similarity and their association with the outcome of interest. The practical purpose of developing this pattern recognition method is to group patients, who are injured in transport accidents, in the early stages post-injury. This grouping is based on distinctive patterns in health service use within the first week post-injury. The groups also provide predictive information towards the total cost of medication process. As a result, the group of patients who have undesirable outcomes are identified as early as possible based health service use patterns. METHODS: We propose a multi-objective optimization model to group patients. An objective function is the cost function of k-medians clustering to recognize the similar patterns. Another objective function is the cross-validated root-mean-square error to examine the association with the total cost. The best grouping is obtained by minimizing both objective functions. As a result, the multi-objective optimization model is a semi-supervised clustering which learns health service use patterns in both unsupervised and supervised ways. We also introduce an evolutionary computation approach includes stochastic gradient descent and Pareto optimal solutions to find the optimal solution. In addition, we use the decision tree method to reproduce the optimal groups using an interpretable classification model. RESULTS: The results show that the proposed multi-objective semi-supervised clustering identifies distinct groups of health service uses and contributes to predict the total cost. The performance of the multi-objective model has been examined using two metrics such as the average silhouette width and the cross-validation error. The examination proves that the multi-objective model outperforms the single-objective ones. In addition, the interpretable classification model shows that imaging and therapeutic services are critical services in the first-week post-injury to group injured patients. CONCLUSION: The proposed multi-objective semi-supervised clustering finds the optimal clusters that not only are well-separated from each other but can provide informative insights regarding the outcome of interest. It also overcomes two drawback of clustering methods such as being sensitive to the initial cluster centers and need for specifying the number of clusters.
  • Item
    Thumbnail Image
    Predictors of Return to Work for Occupational Rehabilitation Users in Work-Related Injury Insurance Claims: Insights from Mental Health
    Khorshidi, HA ; Marembo, M ; Aickelin, U (Springer Verlag, 2019-12-01)
    Purpose This study evaluates the Occupational Rehabilitation (OR) initiatives regarding return to work (RTW) and sustaining at work following work-related injuries. This study also identifies the predictors and predicts the likelihoods of RTW and sustainability for OR users. Methods The study is conducted on the compensation claim data for people who are injured at work in the state of Victoria, Australia. The claims which commenced OR services between the first of July 2012 and the end of June 2015 are included. The claims which used original employer services (OES) have been separated from claims which used new employer services (NES). We investigated a range of predictors categorised into four groups as claimant, injury, and employment characteristics and claim management. The RTW and sustaining at work are outcomes of interest. To evaluate the predictors, we use Chi-squared test and logistic regression modelling. Also, we prioritized the predictors using Akaike Information Criterion (AIC) measure and Cross-validation error. Four predictive models are developed using significant predictors for OES and NES users to predict RTW and sustainability. We examined the multicollinearity of the developed models using Variance Inflation Factor (VIF). Results About 75% and 60% of OES users achieved RTW and have been sustained at work respectively, whilst just approximately 30% of NES users have been placed at a new employer and 25% of them have been sustained at work. The predictors which have the most association with OES and NES outcomes are the use of psychiatric services and age groups respectively. We found that having mental conditions is as an important indicator to allocate injured workers into OES or NES initiatives. Our study shows that injured workers with mental issues do not always have lower RTW rate. They just need special consideration. Conclusion Understanding the predictors of RTW and sustainability helps to develop interventions to ensure sustained RTW. This study will assist decision makers to improve design and implementation of OR services and tailor services according to clients’ needs.
  • Item
    Thumbnail Image
    Early Identification of Undesirable Outcomes for Transport Accident Injured Patients Using Semi-Supervised Clustering.
    Khorshidi, HA ; Haffari, G ; Aickelin, U ; Hassani-Mahmooei, B (IOS Press, 2019-08-08)
    Identifying those patient groups, who have unwanted outcomes, in the early stages is crucial to providing the most appropriate level of care. In this study, we intend to find distinctive patterns in health service use (HSU) of transport accident injured patients within the first week post-injury. Aiming those patterns that are associated with the outcome of interest. To recognize these patterns, we propose a multi-objective optimization model that minimizes the k-medians cost function and regression error simultaneously. Thus, we use a semi-supervised clustering approach to identify patient groups based on HSU patterns and their association with total cost. To solve the optimization problem, we introduce an evolutionary algorithm using stochastic gradient descent and Pareto optimal solutions. As a result, we find the best optimal clusters by minimizing both objective functions. The results show that the proposed semi-supervised approach identifies distinct groups of HSUs and contributes to predict total cost. Also, the experiments prove the performance of the multi-objective approach in comparison with single- objective approaches.