Publicatie

Cost-based quality measures in subgroup discovery

Tijdschriftbijdrage - Tijdschriftartikel

We consider data where examples are not only labeled in the classical sense (positive or negative), but also have costs associated with them. In this sense, each example has two target attributes, and we aim to find clearly defined subsets of the data where the values of these two targets have an unusual distribution. In other words, we are focusing on a Subgroup Discovery task with a somewhat unusual target concept, and investigate quality measures that take into account both the binary and the cost target. In defining such quality measures, we aim to produce an interpretable valuation of a subgroup, such that data analysts can directly appreciate the findings, and relate these to monetary gains or losses. Our work is particularly relevant in the domain of health care fraud detection. In this domain, the binary target identifies the patients of a specific medical practitioner under investigation, and the cost target specifies the money spent on each patient. When looking for differences in claim behavior, we need to take into account both the 'positive' examples (patients of the practitioner) and 'negative' examples (other patients), as well as information about costs of all patients. A typical subgroup will list a number of treatments, and the target practitioner's patients behavioral difference in both treatment prevalence and associated costs. An additional angle is the Local Subgroup Discovery task, where subgroups are judged according to the difference with a local reference group instead of the entire dataset. We show how the cost-based analysis of data specifically fits this local focus.

Tijdschrift: JOURNAL OF INTELLIGENT INFORMATION SYSTEMS

ISSN: 0925-9902

Issue: 3

Volume: 45

Pagina's: 337 - 355

Jaar van publicatie:2015

DOI: https://doi.org/10.1007/s10844-014-0313-8
WoS Id: 000364570700004
Handle: http://hdl.handle.net/1854/LU-7044692

Publicatie

Cost-based quality measures in subgroup discovery

Tijdschriftbijdrage - Tijdschriftartikel

Auteurs/uitgever