STÓR

BCONDS: Borderline Counterfactual Oversampling with Noise Elimination and Density Scoring

Qureshi, Asifa Mehmood and Kaushik, Abishek and Loughran, Roisin and McCaffery, Fergal (2025) BCONDS: Borderline Counterfactual Oversampling with Noise Elimination and Density Scoring. In: Artificial Intelligence in Healthcare. Second International Conference, AIiH 2025, September 8–10, 2025, Cambridge, UK.


Abstract

Class imbalance in medical datasets can lead to biased Machine Learning models. Several methods are used to balance datasets, but they do not consider the majority class samples while oversampling. In this study, we therefore propose a novel technique called Borderline Counterfactual Oversampling with Noise elimination and Density Scoring (BCONDS). The method uses an isolation forest to remove noisy samples from the majority class. Gower distance is then used to find borderline minority class instances and to extract their corresponding majority class neighbours. These neighbouring samples are used to generate counterfactuals in order to enhance the separability of the classes. An empirical analysis on four benchmark medical datasets indicates that the proposed technique outperforms other state-of-the-art techniques. On average, BCONDS improves AUC by 9.6% and G-mean by 5.9% when compared with the other methods.
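
The borderline-detection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact borderline rule, so the sketch assumes the criterion from Borderline-SMOTE (a minority sample is borderline when at least half, but not all, of its k nearest neighbours belong to the majority class). The function names, the parameter `k`, and the toy data are hypothetical, and the isolation forest, counterfactual generation, and density scoring steps are not covered.

```python
from typing import List, Sequence, Tuple


def gower_distance(a: Sequence, b: Sequence,
                   is_categorical: Sequence[bool],
                   ranges: Sequence[float]) -> float:
    """Gower distance: mean of per-feature dissimilarities in [0, 1]."""
    total = 0.0
    for x, y, cat, rng in zip(a, b, is_categorical, ranges):
        if cat:
            total += 0.0 if x == y else 1.0   # simple mismatch for categories
        elif rng > 0:
            total += abs(x - y) / rng         # range-normalised difference
    return total / len(a)


def numeric_ranges(rows: Sequence[Sequence],
                   is_categorical: Sequence[bool]) -> List[float]:
    """Range (max - min) of each numeric column; 0.0 for categorical ones."""
    ranges = []
    for j, cat in enumerate(is_categorical):
        if cat:
            ranges.append(0.0)
        else:
            col = [row[j] for row in rows]
            ranges.append(max(col) - min(col))
    return ranges


def borderline_minority(rows, labels, minority_label, is_categorical,
                        k: int = 3) -> List[Tuple[int, List[int]]]:
    """Return (minority index, majority neighbour indices) pairs.

    Assumed rule (Borderline-SMOTE style, not taken from the paper):
    borderline if k/2 <= number of majority neighbours < k.
    """
    ranges = numeric_ranges(rows, is_categorical)
    result = []
    for i, (row, label) in enumerate(zip(rows, labels)):
        if label != minority_label:
            continue
        dists = sorted(
            (gower_distance(row, other, is_categorical, ranges), j)
            for j, other in enumerate(rows) if j != i
        )
        neighbours = [j for _, j in dists[:k]]
        majority = [j for j in neighbours if labels[j] != minority_label]
        if k / 2 <= len(majority) < k:
            result.append((i, majority))
    return result


# Toy mixed-type data: one numeric and one categorical feature (made up).
rows = [(0.0, "a"), (1.0, "a"), (2.0, "a"), (3.0, "a"),
        (4.0, "a"), (5.0, "a"), (9.0, "b"), (10.0, "b"),
        (0.5, "a"), (1.5, "a")]
labels = [0, 0, 0, 0, 1, 1, 1, 1, 0, 0]
is_cat = [False, True]

borderline = borderline_minority(rows, labels, minority_label=1,
                                 is_categorical=is_cat, k=3)
print(borderline)  # [(4, [3, 2]), (5, [3, 2])]
```

In this toy example, the two minority samples that sit next to the majority cluster are flagged together with their majority neighbours, which is the kind of input the abstract says is passed on to counterfactual generation. Gower distance is what allows numeric and categorical features, both common in medical records, to be compared on a single scale.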

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Counterfactual; Borderline; Density scoring; Oversampling; Gower distance; Medical data sets.
Subjects: Computer Science
Research Centres: Regulated Software Research Centre
Depositing User: Sean McGreal
Date Deposited: 17 Dec 2025 09:18
Last Modified: 17 Dec 2025 09:18
License: Creative Commons: Attribution-Noncommercial-Share Alike 4.0
URI: https://eprints.dkit.ie/id/eprint/984
