REL scikit-learn 1.4.1 #28414

glemaitre · 2024-02-13T14:51:41Z

This is the branch preparing the 1.4.1 release.

For this release, we need to

remove the NumPy < 2 pinning that is inside the setup.py
backport FIX handle inconsistence between fill_value and X dtype in SimpleImputer #28365
backport DOC update changelog for 1.4.1 release #28413

TODO list:

github-actions · 2024-02-13T14:53:03Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 24fefa5. Link to the linter CI: here}

…arn#28056)

…28016) Co-authored-by: Lock file bot <noreply@github.com>

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org> Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

…-learn#26881)

…cikit-learn#28133)

scikit-learn#28121)

Co-authored-by: ArturoAmorQ <arturo.amor-quiroz@polytechnique.edu>

…#27916)

…nt wheel installing numpy<2 (scikit-learn#28163)

…core (scikit-learn#28156) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

…ikit-learn#28062) Co-authored-by: Alejandro Martin <alejandro.martingil@tno.nl>

…t-learn#28193)

…-learn#28196) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

scikit-learn#28185) Co-authored-by: Loïc Estève <loic.esteve@ymail.com>

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

…#28214) Co-authored-by: Lock file bot <noreply@github.com>

Co-authored-by: Tim Head <betatim@gmail.com> Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>

scikit-learn#28390) Co-authored-by: Guillaume Lemaitre <guillaume@probabl.ai>

…mpurity` (scikit-learn#28327) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Loïc Estève <loic.esteve@ymail.com>

…#28371)

…cikit-learn#28167) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

…eImputer` (scikit-learn#28365) Co-authored-by: Guillaume Lemaitre <guillaume@probabl.ai> Co-authored-by: Loïc Estève <loic.esteve@ymail.com>

thomasjpfan · 2024-02-14T03:04:19Z

remove the NumPy < 2 pinning that is inside the setup.py

Is it safe to remove the pin now? I thought we had to wait for NumPy 2.0RC before we can remove it.

glemaitre · 2024-02-14T09:55:48Z

Is it safe to remove the pin now? I thought we had to wait for NumPy 2.0RC before we can remove it.

@lesteve and @ogrisel realized the current pin is also problematic. I don't exactly recall but it was an issue with having an old scikit-learn while updating.

adrinjalali · 2024-02-14T10:32:02Z

the issue is that if we pin, the user will end up with an older version of scikit-learn when a numpy=2 is installed, since we haven't been pinning in the old versions.

glemaitre · 2024-02-14T10:56:35Z

Right now, one would get:

A module that was compiled using NumPy 1.x cannot be run in
NumPy 2.0.0.dev0 as it may crash. To support both 1.x and 2.x
versions of NumPy, modules must be compiled against NumPy 2.0.

If you are a user of the module, the easiest solution will be to
either downgrade NumPy or update the failing module (if available).

NOTE: When testing against pre-release versions of NumPy 2.0
or building nightly wheels for it, it is necessary to ensure
the NumPy pre-release is used at build time.
The main way to ensure this is using no build isolation
and installing dependencies manually with NumPy.
For cibuildwheel for example, this may be achieved by using
the flag to pip:
    CIBW_BUILD_FRONTEND: pip; args: --no-build-isolation
installing NumPy with:
    pip install --pre --extra-index-url https://pypi.anaconda.org/scientific-python-nightly-wheels/simple
in the `CIBW_BEFORE_BUILD` step.  Please compare with the
solutions e.g. in astropy or matplotlib for how to make this
conditional for nightly wheel builds using expressions.
If you do not worry about using pre-releases of all
dependencies, you can also use `--pre --extra-index-url` in the
build frontend (instead of build isolation).
This will become unnecessary as soon as NumPy 2.0 is released.

If your dependencies have the issue, check whether they
have nightly wheels build against NumPy 2.0.

Ideally, we should release 1.4.2 that build against the NumPy 2.0.0.rc1 that allow to be compatible with the ABI. With the current status, when the NumPy RC is released, we are going to break all CIs that rely on the RC (pip install --pre) and install scikit-learn.

So we have 2 imperfect situation but I think this is more unlikely to have someone trying pip install numpy==2 scikit-learn than someone doing pip install --pre numpy scikit-learn.

If this is fine, I'll do a new tag 1.4.1-1 (I did not push to PyPI and conda-forge) and pin numpy.

adrinjalali · 2024-02-14T11:14:28Z

Is there a way to have two sets of binaries on pypi, one against numpy=1 and one against numpy=2, and let pip decide which one to pull?

Also,

Right now, one would get:

Where do you get this? We shouldn't have any numpy=2 in our wheel / conda builds, do we?

glemaitre · 2024-02-14T12:25:49Z

Where do you get this? We shouldn't have any numpy=2 in our wheel / conda builds, do we?

This is when installing NumPy dev so this would be the situation when numpy get released.

thomasjpfan · 2024-02-14T13:25:08Z

Can we pin on the release branch 1.4.X but not pin on main?

glemaitre · 2024-02-14T13:26:19Z

Can we pin on the release branch 1.4.X but not pin on main?

It is already what we had indeed.

thomasjpfan · 2024-02-14T13:32:49Z

Looking at numpy/numpy#24300 (comment) , it recommends always setting an upper bound for the release version.

we have 2 imperfect situation but I think this is more unlikely to have someone trying pip install numpy==2 scikit-learn

I see people doing:

# assume NumPy 2.0 is available at this time
pip install numpy

# Installs older version
pip install scikit-learn

I do not see a great solution here.

ogrisel · 2024-02-14T13:55:18Z

Actually, based on experiments, it seems that if we configure scikit-learn 1.4.1 to depend on "numpy<2", then the second command (pip install scikit-learn) will downgrade numpy to the latest version that is compatible.

The only problem is:

pip install scikit-learn numpy==2.0.0

once numpy 2.0.0 is released, then pip will install the last scikit-learn version without an upper bound pin on numpy, which is scikit-learn 1.3.2 in our case. This is what is is being discussed here: numpy/numpy#24300 (comment).

However, arguably this is quite a rare case though.

Note that the command:

pip install scikit-learn numpy

issued when numpy 2.0.0 is released, and assuming scikit-learn 1.4.1 is the last stable released version with a dependency on "numpy<2", would install the last compatible numpy 1.x along with scikit-learn 1.4.1 as expected.

So upper bounding numpy<2 in scikit-1.4.x is not that bad in that respect.

ogrisel · 2024-02-14T13:57:21Z

I don't think we want to retrospectively upper bound all old scikit-learn releases and yank the non upper-bounded releases on pypi.org though. We can just accept that on rare occasions pip will do something weird with old releases.

ogrisel · 2024-02-14T14:04:10Z

However, now we have a problem because 1.4.1 has already been publicly tagged without the upper-bound dep on numpy. Neither the source tarball, wheels nor the conda-forge packages have been uploaded though.

So what should we do?

Deleting the 1.4.1 tag and retagging will annoy people with automated CI that monitor our github repo. EDIT: we did once in the past and some people complained and advised that we should never do this.
Tagging 1.4.1-1 and uploading tarball and wheels with 1.4.1 names without the -1 suffix is weird. Note that the -1 suffix is only possible in wheel filenames, not the tarball filename. EDIT: plus it's very tedious to manually rename the wheel filenames to add the -1 suffix before uploading manually to pypi.org.
Shall we the release 1.4.1 tarball with the numpy<2 upper bound, but that would not match the tag?
Shall we just not release 1.4.1 at all and instead release 1.4.1.post1 (with a matching new tag) straight away with the numpy<2 dependency?
Shall we release 1.4.1 (maybe just the tarball) then yank it, and then release 1.4.1.post1 straight away with the numpy<2 dependency.

EDIT: I am done editing that comment. I think I am in favor of one of the last 2 options.

thomasjpfan · 2024-02-14T14:28:13Z

Shall we just not release 1.4.1 at all and instead release 1.4.1.post1 (with a matching new tag) straight away with the numpy<2 dependency?

If it works, then I would go with this option. (I do not know if we can issue a post release without a 1.4.1 release first.)

Otherwise, releasing 1.4.1 and yanking is okay with me.

glemaitre · 2024-02-14T14:31:08Z

Yep I'm going with the post1 solution. This is the less questioning one and the most straightforward as a release process. Thanks everyone for the input. I'll open a new PR.

lesteve · 2024-02-14T14:32:39Z

Agreed.

For completeness, if I understand #27658 correctly what is annoying for downstream packaging is deleting the tag and and reusing the same tag on a newer commit.

Not sure if deleting the 1.4.1 tag would be considered less a problem.

github-actions bot added the cython label Feb 13, 2024

glemaitre marked this pull request as draft February 13, 2024 14:51

StefanieSenger and others added 27 commits February 13, 2024 16:18

ENH Improved error messages for UnsetMetadataPassedError (scikit-le…

c083854

…arn#28056)

🔒 🤖 CI Update lock files for scipy-dev CI build(s) 🔒 🤖 (scikit-learn#…

3a32b20

…28016) Co-authored-by: Lock file bot <noreply@github.com>

TST Tweak tests to facilitate Meson usage (scikit-learn#28094)

e441ca2

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org> Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

TST Skip test using subprocess in Pyodide (scikit-learn#28116)

a0b8ed5

TST Tweak one more test to facilitate Meson usage (scikit-learn#28112)

dc941a8

DOC Update Mixin classes documentation and examples (scikit-learn#28146)

db79ae3

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

DOC add example in docstring of ridge_regression (scikit-learn#28122)

7bbddaf

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

DOC Add dropdowns to module 9.1 Python specific serialization (scikit…

25926f1

…-learn#26881)

MAINT fix update_environments_and_lock_files for non-posix systems (s…

e3829d0

…cikit-learn#28133)

FIX AffinityPropagation assigning multiple clusters for equal points (

8ebd9ff

scikit-learn#28121)

DOC: Added dropdowns to 4.1 PDPs (scikit-learn#27187)

077cd5f

DOC Fix blank space in dropdown (scikit-learn#28166)

cc6040e

Co-authored-by: ArturoAmorQ <arturo.amor-quiroz@polytechnique.edu>

DOC: Added drop down menus to 1.8 Cross Decomposition (scikit-learn…

8af14e1

…#27916)

Fix prevent infinite loop in KMeans (scikit-learn#28165)

ed4a53a

CI Remove temporary work-around related to scipy and pandas developme…

59988e6

…nt wheel installing numpy<2 (scikit-learn#28163)

DOC Added relation between ROC-AUC and Gini in docstring of roc_auc_s…

5ce24fc

…core (scikit-learn#28156) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

MAINT Update SECURITY.md for 1.4.0 (scikit-learn#28182)

5fae7e8

DOC use list for the ridge_regression docstring (scikit-learn#28168)

3ac7cf7

DOC Fix for roc_auc_score documentation (scikit-learn#28190)

cbb99a7

MNT changed order pre-commits hooks following ruff recommendation (sc…

bc86cfb

…ikit-learn#28062) Co-authored-by: Alejandro Martin <alejandro.martingil@tno.nl>

DOC add docstring example to sklearn.metrics.consensus_score (sciki…

e438382

…t-learn#28193)

DOC add docstring example to sklearn.metrics.coverage_error (scikit…

d3583c9

…-learn#28196) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

ENH Checks pandas and polars directly (scikit-learn#28195)

879aded

FIX _convert_container should be able to convert from sparse to sparse (

6848af1

scikit-learn#28185) Co-authored-by: Loïc Estève <loic.esteve@ymail.com>

DOC Add docstring examples for covariance module (scikit-learn#28192)

8cea6cc

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

DOC Add a docstring examples for utils functions (scikit-learn#28181)

1177450

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

🔒 🤖 CI Update lock files for cirrus-arm CI build(s) 🔒 🤖 (scikit-learn…

f423814

…#28214) Co-authored-by: Lock file bot <noreply@github.com>

ogrisel and others added 5 commits February 13, 2024 16:24

DOC Update the FAQ entry on GPU support (scikit-learn#28328)

4b78abe

Co-authored-by: Tim Head <betatim@gmail.com> Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>

DOC added example for sklearn.feature_extraction.image.grid_to_graph (

677e04f

scikit-learn#28390) Co-authored-by: Guillaume Lemaitre <guillaume@probabl.ai>

FIX handle properly missing value in MSE and Friedman-MSE `children_i…

13e6244

…mpurity` (scikit-learn#28327) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Loïc Estève <loic.esteve@ymail.com>

FIX EmptyRequest.get defaults to Bunch of METHODS (scikit-learn…

504ac9a

…#28371)

MNT Checking function _estimator_has also raises AttributeError (s…

ee527d0

…cikit-learn#28167) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

glemaitre force-pushed the release-1.4.1 branch from be35d8c to ee527d0 Compare February 13, 2024 15:29

glemaitre added the No Changelog Needed label Feb 13, 2024

glemaitre and others added 6 commits February 13, 2024 17:30

remove numpy pin < 2

0b347d2

bump to version 1.4.1

62603a5

[cd build][azure parallel] Trigger CI/CD

0c59797

DOC update changelog for 1.4.1 release (scikit-learn#28413)

381d7d6

FIX handle inconsistence between fill_value and X dtype in `Simpl…

3e069aa

…eImputer` (scikit-learn#28365) Co-authored-by: Guillaume Lemaitre <guillaume@probabl.ai> Co-authored-by: Loïc Estève <loic.esteve@ymail.com>

[cd build][azure parallel] Trigger CI/CD

24fefa5

glemaitre marked this pull request as ready for review February 13, 2024 17:18

glemaitre merged commit 0d00f7b into scikit-learn:1.4.X Feb 13, 2024
46 of 50 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

REL scikit-learn 1.4.1 #28414

REL scikit-learn 1.4.1 #28414

glemaitre commented Feb 13, 2024 •

edited

github-actions bot commented Feb 13, 2024 •

edited

thomasjpfan commented Feb 14, 2024

glemaitre commented Feb 14, 2024

adrinjalali commented Feb 14, 2024

glemaitre commented Feb 14, 2024

adrinjalali commented Feb 14, 2024

glemaitre commented Feb 14, 2024

thomasjpfan commented Feb 14, 2024

glemaitre commented Feb 14, 2024

thomasjpfan commented Feb 14, 2024 •

edited

ogrisel commented Feb 14, 2024 •

edited

ogrisel commented Feb 14, 2024

ogrisel commented Feb 14, 2024 •

edited

thomasjpfan commented Feb 14, 2024

glemaitre commented Feb 14, 2024

lesteve commented Feb 14, 2024 •

edited

REL scikit-learn 1.4.1 #28414

REL scikit-learn 1.4.1 #28414

Conversation

glemaitre commented Feb 13, 2024 • edited

TODO list:

github-actions bot commented Feb 13, 2024 • edited

✔️ Linting Passed

thomasjpfan commented Feb 14, 2024

glemaitre commented Feb 14, 2024

adrinjalali commented Feb 14, 2024

glemaitre commented Feb 14, 2024

adrinjalali commented Feb 14, 2024

glemaitre commented Feb 14, 2024

thomasjpfan commented Feb 14, 2024

glemaitre commented Feb 14, 2024

thomasjpfan commented Feb 14, 2024 • edited

ogrisel commented Feb 14, 2024 • edited

ogrisel commented Feb 14, 2024

ogrisel commented Feb 14, 2024 • edited

thomasjpfan commented Feb 14, 2024

glemaitre commented Feb 14, 2024

lesteve commented Feb 14, 2024 • edited

glemaitre commented Feb 13, 2024 •

edited

github-actions bot commented Feb 13, 2024 •

edited

thomasjpfan commented Feb 14, 2024 •

edited

ogrisel commented Feb 14, 2024 •

edited

ogrisel commented Feb 14, 2024 •

edited

lesteve commented Feb 14, 2024 •

edited