DOC Update rst doctests to be compatible with numpy >= 2 #30495

lesteve · 2024-12-17T04:50:17Z

Similar to #29613 but for doctests inside rst files.

Let's see what the CI has to say about it.

Edit: turns out we were not running the rst doctests in any of our CI build (because the combination numpy<2 and matplotlib and maybe other things to run doctests were not inside any CI build) ... oh well there was only one failure to fix.

github-actions · 2024-12-17T04:52:06Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 51ada5b. Link to the linter CI: here}

lesteve · 2024-12-17T05:42:57Z

CI is green and the rst doctests ran in a few CI builds e.g. this one and this one

lesteve · 2024-12-17T05:56:21Z

doc/developers/develop.rst

@@ -346,7 +346,8 @@ the correct interface more easily.
 And you can check that the above estimator passes all common checks::

    >>> from sklearn.utils.estimator_checks import check_estimator
-    >>> check_estimator(TemplateClassifier())  # passes
+    >>> check_estimator(TemplateClassifier())  # passes            # doctest: +SKIP


I had to skip this one because the estimator is defined interactively and does not pickle ...

____________________________________________________________________________________________________________ [doctest] develop.rst _____________________________________________________________________________________________________________ 341 ... # Input validation 342 ... X = validate_data(self, X, reset=False) 343 ... 344 ... closest = np.argmin(euclidean_distances(X, self.X_), axis=1) 345 ... return self.y_[closest] 346 347 And you can check that the above estimator passes all common checks:: 348 349 >>> from sklearn.utils.estimator_checks import check_estimator 350 >>> check_estimator(TemplateClassifier()) # passes UNEXPECTED EXCEPTION: PicklingError("Can't pickle <class 'TemplateClassifier'>: attribute lookup TemplateClassifier on builtins failed") Traceback (most recent call last): File "/home/lesteve/micromamba/envs/scikit-learn-dev/lib/python3.13/doctest.py", line 1395, in __run exec(compile(example.source, filename, "single", ~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ compileflags, True), test.globs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "<doctest develop.rst[10]>", line 1, in <module> File "/home/lesteve/dev/scikit-learn/sklearn/utils/_param_validation.py", line 216, in wrapper return func(*args, **kwargs) File "/home/lesteve/dev/scikit-learn/sklearn/utils/estimator_checks.py", line 857, in check_estimator check(estimator) ~~~~~^^^^^^^^^^^ File "/home/lesteve/dev/scikit-learn/sklearn/utils/_testing.py", line 147, in wrapper return fn(*args, **kwargs) File "/home/lesteve/dev/scikit-learn/sklearn/utils/estimator_checks.py", line 2338, in check_estimators_pickle pickled_estimator = pickle.dumps(estimator) _pickle.PicklingError: Can't pickle <class 'TemplateClassifier'>: attribute lookup TemplateClassifier on builtins failed /home/lesteve/dev/scikit-learn/doc/developers/develop.rst:350: UnexpectedException =====================================================================================================

lesteve · 2024-12-17T05:57:55Z

doc/developers/develop.rst

@@ -267,6 +267,7 @@ interactions with `pytest`)::
  >>> from sklearn.utils.estimator_checks import check_estimator
  >>> from sklearn.tree import DecisionTreeClassifier
  >>> check_estimator(DecisionTreeClassifier())  # passes
+  [{'estimator': DecisionTreeClassifier(), ...]


Not sure what to show here maybe only ... would be enough?

I ended up using ..., I don't think it is useful to try to show part of the output ...

doc/modules/clustering.rst

thomasjpfan

It's unfortunate that the output is so much more verbose now. But I'm okay with moving into the future with NumPy's new printing style.

jeremiedbb · 2025-01-03T15:01:57Z

Reading #27339, it looks like there's a consensus for always returning python scalars from scorers. So I'm afraid that all the changes here will have to be reverted once this is done. Shouldn't we do the "always return a python scalar" first ?

lesteve · 2025-01-03T15:21:18Z

This PR is a simple one that makes sure that we are actually testing our rst file doctests because we are currently not testing them and we never realized ...

When #27339 happens, I am more than happy to be pinged to fix the doctests 😉.

lesteve · 2025-01-03T15:25:12Z

Side-comment, there is also #30496 that could help because scipy-doctests allows you to ignore the difference between 0.214 and np.float64(0.214), see #30495 (comment).

But same thing I am not very keen on making a "Quick Review" PR depend on a potentially more controversial one 😉

jeremiedbb

LGTM.

beyond the scorers output repr, there are also a few ones from different origins that I find really not user friendly, like

    >>> list(le.classes_)
    [np.str_('amsterdam'), np.str_('paris'), np.str_('tokyo')]

that we might consider dealing with at some point as well.

lesteve · 2025-01-03T16:46:32Z

beyond the scorers output repr, there are also a few ones from different origins that I find really not user friendly

Yep agreed. IIRC sometimes we get this because we do things like list(array) rather than array.tolist() in the doctest code but in other cases this is nested deeper inside the scikit-learn code ...

…n#30495)

DOC Update rst doctests to be compatible with numpy >= 2

8cde279

github-actions bot added the Documentation label Dec 17, 2024

[azure parallel]

7f7d346

[azure parallel] Skip doctest

7043db9

lesteve added the Quick Review For PRs that are quick to review label Dec 17, 2024

Do not skip when possible [azure parallel]

707f249

lesteve commented Dec 17, 2024

View reviewed changes

[azure parallel] simpler doctest result

4e03a05

This was referenced Dec 17, 2024

CI Use scipy-doctest for more convenient doctests #30496

Merged

Allow per-example atol, rtol; support Ellipsis in numeric values scipy/scipy_doctest#147

Open

lesteve added 2 commits December 18, 2024 06:01

Fix doctest

754fcb0

[azure parallel]

51ada5b

thomasjpfan reviewed Dec 18, 2024

View reviewed changes

doc/modules/clustering.rst Show resolved Hide resolved

thomasjpfan approved these changes Dec 20, 2024

View reviewed changes

thomasjpfan added the Waiting for Second Reviewer First reviewer is done, need a second one! label Dec 29, 2024

jeremiedbb approved these changes Jan 3, 2025

View reviewed changes

jeremiedbb merged commit 5cfbe87 into scikit-learn:main Jan 3, 2025
34 checks passed

lesteve deleted the rst-doctests-numpy-2 branch January 3, 2025 16:44

lesteve mentioned this pull request Jan 4, 2025

TST Fix doctest due to GradientBoostingClassifier difference with scipy 1.15 #30583

Merged

jeremiedbb pushed a commit to jeremiedbb/scikit-learn that referenced this pull request Jan 8, 2025

DOC Update rst doctests to be compatible with numpy >= 2 (scikit-lear…

8c64e9c

…n#30495)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC Update rst doctests to be compatible with numpy >= 2 #30495

DOC Update rst doctests to be compatible with numpy >= 2 #30495

lesteve commented Dec 17, 2024 •

edited

Loading

github-actions bot commented Dec 17, 2024 •

edited

Loading

lesteve commented Dec 17, 2024

lesteve Dec 17, 2024 •

edited

Loading

lesteve Dec 17, 2024

lesteve Dec 17, 2024

thomasjpfan left a comment

jeremiedbb commented Jan 3, 2025

lesteve commented Jan 3, 2025 •

edited

Loading

lesteve commented Jan 3, 2025 •

edited

Loading

jeremiedbb left a comment

lesteve commented Jan 3, 2025 •

edited

Loading

DOC Update rst doctests to be compatible with numpy >= 2 #30495

DOC Update rst doctests to be compatible with numpy >= 2 #30495

Conversation

lesteve commented Dec 17, 2024 • edited Loading

github-actions bot commented Dec 17, 2024 • edited Loading

✔️ Linting Passed

lesteve commented Dec 17, 2024

lesteve Dec 17, 2024 • edited Loading

Choose a reason for hiding this comment

lesteve Dec 17, 2024

Choose a reason for hiding this comment

lesteve Dec 17, 2024

Choose a reason for hiding this comment

thomasjpfan left a comment

Choose a reason for hiding this comment

jeremiedbb commented Jan 3, 2025

lesteve commented Jan 3, 2025 • edited Loading

lesteve commented Jan 3, 2025 • edited Loading

jeremiedbb left a comment

Choose a reason for hiding this comment

lesteve commented Jan 3, 2025 • edited Loading

lesteve commented Dec 17, 2024 •

edited

Loading

github-actions bot commented Dec 17, 2024 •

edited

Loading

lesteve Dec 17, 2024 •

edited

Loading

lesteve commented Jan 3, 2025 •

edited

Loading

lesteve commented Jan 3, 2025 •

edited

Loading

lesteve commented Jan 3, 2025 •

edited

Loading