Lazyload submodules python37+ #1007

orsonadams · 2020-02-22T17:54:02Z

Summary of changes

Allow submodule lazy loading for python version3.7+
Addresses #771 using PEP: 562

Pull Request Checklist

Changes have tests
Need help with testing
Authors have been added to AUTHORS.md
News fragment added in changelog.d. See CONTRIBUTING.md for details

This is fairly straightforward given the introduction of __getattr__ for modules. The challenge is testing the import functionality for two subsets of versions: >= 3.7 & < 3.7.

I added what I thought would be reasonable tests if I used tox. Trouble is import dateutil loads the submodules regardless of versions - ignoring __init__.py , Which when I think about it maybe is expected behavior.

How to test this bad boy?

jbrockmendel · 2020-02-22T17:56:33Z

@ParseThis im guessing this speeds up import time if we only import e.g. dateutil.parser? can you ballpark the size of the improvement?

pganssle · 2020-02-22T18:08:30Z

@jbrockmendel That's already how the library is designed:

>>> import dateutil
>>> dateutil.parser
---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
<ipython-input-12-2740e4456f59> in <module>
----> 1 dateutil.parser

AttributeError: module 'dateutil' has no attribute 'parser

The idea here is that in Python 3.7, this will work:

import dateutil
dateutil.parser.parse  # This imports `dateutil.parser` the first time you load it

But import dateutil by itself won't actually get any slower.

orsonadams · 2020-02-22T18:36:05Z

@jachen20 Yeah, its more about expectations. For instance, I would like the below to work.

import dateutil
dateutil.tz

Instead of throwing an AttributeError as it would today. But accomplish this, while still not loading the other submodules.

jbrockmendel · 2020-02-22T18:45:22Z

got it, thanks

pganssle

Import-based tests are some of the trickiest to come up with because it's easy to modify global state by accident and then have your tests rely on the order they were executed in.

I don't know of any way to tell pytest that it should run each of these tests in their own separate process with their own separate sys.modules variable.

I think for the moment we should abandon the concept of thread safety and use a pytest fixture to ensure that each of these import tests gets its own copy of sys.modules. I have not tried this, but here's what I'm thinking:

  @pytest.fixture(scope="function")
def clean_imports():
    """
    Fixture for providing a clean-ish environment for testing import behavior.

    No effort has been made to make this thread safe, as it directly modifies
    the sys.modules dictionary.
    """
    # Stash all the existing dateutil modules for later restoration
    du_modules = {mod_name: mod for mod_name, mod in sys.modules.items()
                  if mod_name.startswith("dateutil")}

    # Keep a list of what was in sys.modules before the test so we can delete
    # stuff outside of dateutil that was imported indirectly
    other_modules = {mod_name for mod_name in sys.modules
                     if mod_name not in du_modules}

    for mod_name in du_modules:
        del sys.modules[mod_name]

    yield

    # Delete anything that wasn't in the original list
    for mod_name in list(sys.modules):
        if mod_name not in other_modules:
            del sys.modules[mod_name]

    # Now restore the original dateutil modules we stashed
    for mod_name, mod in du_modules.items():
        sys.modules[mod_name] = mod

Then your test_lazy_import (and, in a later PR maybe, all the other import tests) just needs to take the clean_imports fixture and it should Just Work.

One other thing: we can do it as a separate PR if you want, but it is usually good to implement both __getattr__ and __dir__, so that dir(dateutil) reflects all the available modules.

I know this is a lot of comments for one fairly simple PR, thanks for doing this!

dateutil/__init__.py

dateutil/test/test_imports.py

dateutil/__init__.py

orsonadams · 2020-02-25T18:20:09Z

Interesting, running tests for py2.7, this happens.

import dateutil, importlib
RuntimeWarning: Parent module 'dateutil.test' not found while handling absolute import
dateutil/test/test_imports.py:37: RuntimeWarning

Which I get, we're removing dateutil.test from submodules.bnBut why not throw this for Python 3.6, 3.7 tests? Although I would argue that since we're not testing the import on test that it shouldn't be a module we stash in the fixture.

This is fixed by checking to make sure the test submodule in deleted from sys.modules in the clean_import fixture.

du_modules = {mod_name: mod for mod_name, mod in sys.modules.items()               
                        if mod_name.startswith('dateutil')                                
                        and not 'dateutil.test' in mod_name}

Instead of,

du_modules = {mod_name: mod for mod_name, mod in sys.modules.items()               
                        if mod_name.startswith('dateutil') }

pganssle · 2020-02-25T18:23:38Z

@ParseThis In this case I think it's fair to call pytest.xfail in the fixture itself.

At some point soon-ish I will restructure the repository to put the tests in their own directory (not a subdirectory of dateutil), and that will solve the problem, and at the moment the only thing the fixture is being used for is a test that was going to xfail on Python 2.7 anyway, so NBD.

pganssle · 2020-02-25T18:30:36Z

Oh wait, I just realized, this is a RuntimeWarning that's causing it to fail. These tests are already xfailing anyway.

I think you can just suppress the warning, either by using pytest.warns and asserting that the number of warnings is 1 for Python 2.7 and 0 for Python 3 or by conditionally defining a decorator like pytest.mark.filterwarnings, see: https://docs.pytest.org/en/latest/warnings.html

In the second case you'd do something like:

# Some comment about why this is OK
if six.PY2:
    filter_import_warning = pytest.mark.filterwarnings("RuntimeWarning")
else:
    def filter_import_warning(f):
        return f

...

@filter_import_warning
@pytest.mark.parametrize(...)
...

dateutil/__init__.py

pganssle · 2020-04-24T13:56:41Z

@ParseThis I see this is still marked as a draft - do you know the current status here?

orsonadams · 2020-04-28T00:37:28Z

@pganssle ready to move to this to a proper PR. WIll have a look at the handling dir in a couple of days. I have an intermediate solution, would like it reviewed.

orsonadams · 2020-04-29T22:30:51Z

@pganssle Curious about those windows-latest failures; seem like some cancellations on the coverage side reporting.

This uses PEP 562 to implement lazy loading of submodules in dateutil (dateutilGH-771).

mariocj89 · 2021-07-16T09:24:20Z

Thanks a lot @orsonadams !

pganssle requested changes Feb 22, 2020

View reviewed changes

pganssle mentioned this pull request Feb 23, 2020

migrate away from unittest and adopt pytest for test_imports #976

Closed

3 tasks

orsonadams force-pushed the lazyload-py37 branch from daa1503 to ffc4be1 Compare February 25, 2020 19:34

cclauss reviewed Feb 26, 2020

View reviewed changes

dateutil/__init__.py Outdated Show resolved Hide resolved

pganssle added this to the 2.9.0 milestone Apr 24, 2020

orsonadams marked this pull request as ready for review April 28, 2020 00:37

pganssle force-pushed the lazyload-py37 branch 2 times, most recently from 7853ec5 to e532459 Compare May 24, 2021 14:26

pganssle approved these changes May 24, 2021

View reviewed changes

pganssle added the enhancement label May 24, 2021

Lazy-load submodules in Python 3.7+

e78c3c7

This uses PEP 562 to implement lazy loading of submodules in dateutil (dateutilGH-771).

mariocj89 force-pushed the lazyload-py37 branch from e532459 to e78c3c7 Compare July 16, 2021 09:11

mariocj89 self-assigned this Jul 16, 2021

mariocj89 changed the title ~~WIP: should lazyload submodules python37+~~ Lazyload submodules python37+ Jul 16, 2021

mariocj89 merged commit 28da62d into dateutil:master Jul 16, 2021

mariocj89 mentioned this pull request Jul 16, 2021

Lazy-load dateutil submodules? #771

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lazyload submodules python37+ #1007

Lazyload submodules python37+ #1007

orsonadams commented Feb 22, 2020 •

edited by mariocj89

jbrockmendel commented Feb 22, 2020

pganssle commented Feb 22, 2020

orsonadams commented Feb 22, 2020 •

edited

jbrockmendel commented Feb 22, 2020

pganssle left a comment

orsonadams commented Feb 25, 2020 •

edited

pganssle commented Feb 25, 2020

pganssle commented Feb 25, 2020

pganssle commented Apr 24, 2020

orsonadams commented Apr 28, 2020 •

edited

orsonadams commented Apr 29, 2020

mariocj89 commented Jul 16, 2021

Lazyload submodules python37+ #1007

Lazyload submodules python37+ #1007

Conversation

orsonadams commented Feb 22, 2020 • edited by mariocj89

Summary of changes

Pull Request Checklist

jbrockmendel commented Feb 22, 2020

pganssle commented Feb 22, 2020

orsonadams commented Feb 22, 2020 • edited

jbrockmendel commented Feb 22, 2020

pganssle left a comment

Choose a reason for hiding this comment

orsonadams commented Feb 25, 2020 • edited

pganssle commented Feb 25, 2020

pganssle commented Feb 25, 2020

pganssle commented Apr 24, 2020

orsonadams commented Apr 28, 2020 • edited

orsonadams commented Apr 29, 2020

mariocj89 commented Jul 16, 2021

orsonadams commented Feb 22, 2020 •

edited by mariocj89

orsonadams commented Feb 22, 2020 •

edited

orsonadams commented Feb 25, 2020 •

edited

orsonadams commented Apr 28, 2020 •

edited