NMI and AMI use inconsistent definitions of mutual information #10308

Closed

kno10 opened this issue Dec 13, 2017 · 11 comments
Labels: help wanted, Moderate

Comments

@kno10 (Contributor) commented Dec 13, 2017

There exist many definitions of NMI and AMI.

Vinh, N. X., Epps, J., & Bailey, J. (2010). Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance. Journal of Machine Learning Research, 11(Oct), 2837-2854.

mentions five different definitions of NMI and, based on those, gives four different AMIs.

The NMI implemented in sklearn uses sqrt(H(U) * H(V)) for normalization.
The AMI implemented in sklearn uses max(H(U), H(V)) for normalization.

There exists an NMI with the max normalization and an AMI with the sqrt normalization, so the choice in sklearn is inconsistent. Ideally, both would use the same definition by default and allow selecting any of the others via an option.
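
A minimal sketch of the mismatch (the labelings below are made up for illustration; the sqrt normalizer reproduces the NMI behavior described above, and scipy is used only to compute the entropies):

```python
import numpy as np
from scipy.stats import entropy
from sklearn.metrics import mutual_info_score, normalized_mutual_info_score

# Made-up labelings, just to illustrate.
labels_true = [0, 0, 0, 1, 1, 1, 2, 2]
labels_pred = [0, 0, 1, 1, 2, 2, 2, 2]

mi = mutual_info_score(labels_true, labels_pred)  # MI in nats
h_u = entropy(np.bincount(labels_true))           # H(U)
h_v = entropy(np.bincount(labels_pred))           # H(V)

# NMI as currently implemented: geometric-mean (sqrt) normalization.
print(normalized_mutual_info_score(labels_true, labels_pred))
print(mi / np.sqrt(h_u * h_v))  # should print the same value

# AMI, by contrast, normalizes the chance-corrected MI by max(H(U), H(V)):
#     AMI = (MI - E[MI]) / (max(H(U), H(V)) - E[MI])
# so the two scores use different denominators for the same labelings.
```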

@amueller (Member):
This is indeed not very nice.
The first (and possibly easiest) step would be to add a normalization option to both that can be max or sqrt, and possibly also min and sum if we think they might be useful (see the sketch after this comment). We should check what is commonly used. I haven't read the paper, but it looks really nice.

Second step is to decide whether this is worth a deprecation cycle, and if so, what the new default should be. This is probably the hardest part (unless there is a clear consensus in the community, for example if that paper makes a clear recommendation that is widely accepted).

Third step is implementing the deprecation cycle.
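
For step one, a rough sketch of what a shared normalization helper could look like; the name `_generalized_average` and the option strings here are placeholders, not a settled API:

```python
import numpy as np

def _generalized_average(h_u, h_v, average_method):
    """Combine two entropies H(U), H(V) into one normalizer (sketch only)."""
    if average_method == "min":
        return min(h_u, h_v)
    elif average_method == "geometric":  # sqrt: what NMI currently does
        return np.sqrt(h_u * h_v)
    elif average_method == "arithmetic":  # the "sum" variant, (H(U) + H(V)) / 2
        return (h_u + h_v) / 2.0
    elif average_method == "max":  # what AMI currently does
        return max(h_u, h_v)
    raise ValueError("'average_method' must be 'min', 'geometric', "
                     "'arithmetic', or 'max'")
```

Both NMI and AMI could then share the option and its default, which would reduce the deprecation question to picking that default.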

@amueller (Member) commented Dec 14, 2017

Maybe @robertlayton can give us some insight into why these were chosen originally? See #402.

@amueller (Member):
Or maybe I should ask myself #776 ...

@amueller (Member):
OMG I just saw the paper I added by Andrew Rosenberg (who I now know personally) and Julia Hirschberg (the head of the CS department I'm in)... that's.. weird... anyhow, I digress...

I added the same paper you're referencing to the docs, I'm not sure why I did the sqrt... To make it identical to the V-measure? That seems strange.

@jnothman added the Documentation, help wanted, and Moderate labels and removed the Documentation label on Dec 15, 2017
@aryamccarthy (Contributor) commented May 23, 2018

Yang et al. claim not to have observed big differences between the measures. Vinh et al. don't make a recommendation either. Danon et al. were the first to use it for community detection, and they followed a line of work that actually used sum.

It'll come down to whoever decides to implement it. I'm happy to do it and make a note in the docs that the normalization constants are different but will converge in a later version. My vote is sum.

This could also be a good first step toward implementing multiple randomness models, like one-sided AMI. The sum-normalized AMI from Vinh et al. is written out below.
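
Written out (the other variants replace the arithmetic mean in the denominator with the min, the geometric mean, or the max of the two entropies):

```latex
\mathrm{AMI}_{\mathrm{sum}}(U, V) =
  \frac{\mathrm{MI}(U, V) - \mathbb{E}\left[\mathrm{MI}(U, V)\right]}
       {\tfrac{1}{2}\left(H(U) + H(V)\right) - \mathbb{E}\left[\mathrm{MI}(U, V)\right]}
```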

@amueller (Member):
I am against sum because that would require changing both, and it looks like it's less used in the clustering literature. I think I'm leaning toward max, but I really don't care that much ;)

@aryamccarthy (Contributor):
Ah, an instance of different preferences in different fields. We can do max. Do I have permission to implement this?

@amueller (Member):
Yes, please go ahead. The stronger argument for me is that either way we'll need to change the behavior of one of the metrics.

@aryamccarthy (Contributor) commented May 23, 2018 via email

@aryamccarthy (Contributor):
Ooh, a twist. Sum is actually what the V-measure uses, not sqrt. It seems we've covered the entire gamut. I'm going to take that as another argument in favor of sum; a quick check of the V-measure claim is below. << Thought I hit 'Comment' on this some time ago.
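
A hedged sanity check of that claim (same made-up labelings as above; V-measure should equal MI normalized by the arithmetic mean of the two entropies, i.e. 2 * MI / (H(U) + H(V))):

```python
import numpy as np
from scipy.stats import entropy
from sklearn.metrics import mutual_info_score, v_measure_score

labels_true = [0, 0, 0, 1, 1, 1, 2, 2]
labels_pred = [0, 0, 1, 1, 2, 2, 2, 2]

mi = mutual_info_score(labels_true, labels_pred)
h_u = entropy(np.bincount(labels_true))
h_v = entropy(np.bincount(labels_pred))

# V-measure is the harmonic mean of homogeneity (MI / H(U)) and
# completeness (MI / H(V)), which simplifies to 2*MI / (H(U) + H(V)).
print(v_measure_score(labels_true, labels_pred))
print(2 * mi / (h_u + h_v))  # should match up to floating point
```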

@qinhanmin2014 (Member):
I think this one can be closed given #11124
