What is Concordant – Discordant ratio?

This is again one of the most important metric for any classification predictions problem. To understand this let’s assume we have 3 students who have some likelihood to pass this year. Following are our predictions :

A – 0.9

B – 0.5

C – 0.3

Now picture this. if we were to fetch pairs of two from these three student, how many pairs will we have? We will have 3 pairs : AB , BC, CA. Now, after the year ends we saw that A and C passed this year while B failed. No, we choose all the pairs where we will find one responder and other non-responder. How many such pairs do we have?

We have two pairs AB and BC. Now for each of the 2 pairs, the concordant pair is where the probability of responder was higher than non-responder. Whereas discordant pair is where the vice-versa holds true. In case both the probabilities were equal, we say its a tie. Let’s see what happens in our case :

AB – Concordant

BC – Discordant

Hence, we have 50% of concordant cases in this example. Concordant ratio of more than 60% is considered to be a good model. This metric generally is not used when deciding how many customer to target etc. It is primarily used to access the model’s predictive power. For decisions like how many to target are again taken by KS / Lift charts.

Wanted to give another example of how to calculate concordant ratio:
Consider the following table
image

Step 1: Form the pairs of 0s and 1s of true class. So there are 3 possible pairs, P1-P2, P3-P2 and P4-P2.

Step 2: For each pair identify if the probability corresponding to the true class ‘1’ is higher than that of ‘0’.
P1-P2: Yes
P3-P2: No
P4-P2: Yes

Step 3: Overall concordance is 2/3 as only 2 pairs out of the three possible pairs are concordant.