read/write support for csv, eventtxt and csz #3285

trichter · 2023-03-24T09:22:26Z

What does this PR do?

Id adds read and write support for csv, eventtxt and csz. csz is a custom format and basically a collection of csv files bundled in a zip file (similar to numpys npz format) for io of a catalog with picks.

Why was it initiated? Any relevant Issues?

Fixes #1467

PR Checklist

trichter · 2023-03-24T09:51:00Z

Doc link. Some functions are mising its doc string.

trichter · 2023-03-24T09:57:16Z

Do we need an __init__.py file in the tests directory for the tests to be discovered by pytest? Why?

d-chambers · 2023-03-24T12:50:43Z

Do we need an init.py file in the tests directory for the tests to be discovered by pytest? Why?

Do we have another 'test_csv.py' anywhere? If so, that explains it. Pytest's test importer is a bit wonky. Maybe try giving your test file a more unique name?

megies · 2023-03-24T13:26:09Z

Do we have another 'test_csv.py' anywhere?

Doesn't look like it.

megies

Still needs changelog entry, otherwise looks good to me 🚀

megies · 2023-03-24T13:36:50Z

Ah and needs to be added here: misc/docs/source/packages/index.rst

d-chambers

Good addition, I just left a few suggestions.

obspy/io/csv/tests/test_csv.py

d-chambers · 2023-03-24T15:29:58Z

obspy/io/csv/tests/test_csv.py

+        try:
+            import zlib  # noqa: F401
+        except ImportError:
+            pass


consider suppress from context lib here to make it more concise and clear.

I did not change anything here for now

obspy/io/csv/tests/test_csv.py

obspy/io/csv/core.py

d-chambers · 2023-03-24T15:49:36Z

obspy/io/csv/core.py

+    with _open(fname) as f:
+        for _ in range(skipheader):
+            f.readline()
+        if names is not None:
+            kwargs.setdefault('fieldnames', _names_sequence(names))
+        reader = csv.DictReader(f, **kwargs)
+        for row in reader:
+            if 'time' in row:
+                time = UTC(row['time'])
+            else:
+                time = UTC(
+                    '{year}-{mon}-{day} {hour}:{minu}:{sec}'.format(**row))
+            try:
+                if 'depm' in row:
+                    dep = float(row['depm'])
+                else:
+                    dep = float(row['dep']) * 1000
+                if math.isnan(dep):
+                    raise
+            except Exception:
+                dep = None
+            author = _string(row, 'author')
+            contrib = _string(row, 'contrib')
+            if author is not None or contrib is not None:
+                info = evmod.CreationInfo(author=author, agency_id=contrib)
+            else:
+                info = None
+            origin = evmod.Origin(
+                time=time,
+                latitude=row['lat'],
+                longitude=row['lon'],
+                depth=dep,
+                creation_info=info
+            )
+            try:
+                # add zero to eliminate negative zeros in magnitudes
+                mag = float(row['mag']) + 0
+                if math.isnan(mag):
+                    raise
+            except Exception:
+                magnitudes = []
+            else:
+                try:
+                    magtype = row['magtype']
+                    if magtype.lower() in ('', 'none', 'null', 'nan'):
+                        raise
+                except Exception:
+                    magtype = default.get('magtype')
+                magauthor = _string(row, 'magauthor')
+                info = (evmod.CreationInfo(author=magauthor) if magauthor
+                        else None)
+                magnitudes = [evmod.Magnitude(
+                    mag=mag, magnitude_type=magtype, creation_info=info)]
+            try:
+                id_ = evmod.ResourceIdentifier(row['id'].strip())
+            except Exception:
+                id_ = None
+            region = _string(row, 'region')
+            descs = ([evmod.EventDescription(region, 'region name')]
+                     if region else [])
+            event = evmod.Event(
+                magnitudes=magnitudes,
+                origins=[origin],
+                resource_id=id_,
+                event_descriptions=descs
+            )
+            events.append(event)
+            if format_check:
+                return True


Certainly optional, but this is a bit long. Maybe can be refactored into separate helper functions for reading each part?

Let's postpone that

d-chambers · 2023-03-24T15:52:17Z

obspy/io/csv/core.py

+            try:
+                author = origin.creation_info.author
+            except AttributeError:
+                author = ''
+            try:
+                contrib = origin.creation_info.agency_id
+            except AttributeError:
+                contrib = ''


These cant be re-phrased as getattr with default values because both creation_info and creation_info's sub attribute might be missing right?

obspy/io/csv/core.py

megies · 2023-03-28T08:22:43Z

Thanks for the thorough review @d-chambers, definitely some very valid points there 😃

better review showing need for some changes

megies

I collapsed everything that was addressed and made linter happy. Looks like the rest is optional stuff that can be left as is?

d-chambers · 2023-03-30T13:16:01Z

👍 looks good to me.

trichter added 6 commits March 22, 2023 17:43

add entry points of csv plugin

0d67d30

add io.csv module

2997afe

add io.csv tests

9bd8a57

add io.csv test files

c711aa4

add io.csv to codeowners

4a0ab02

add docs of io.csv module

3e9df6f

trichter added the build_docs Docs will be automatically built and deployed in github actions on pushes to the PR label Mar 24, 2023

trichter requested review from megies and d-chambers as code owners March 24, 2023 09:22

trichter added the .io issues generally related to our read/write plugins label Mar 24, 2023

trichter added this to the 1.5.0 milestone Mar 24, 2023

trichter added 6 commits March 24, 2023 10:26

pep8

b3b2b33

do not use bare except

d326d41

flake8: ignore unused import

db17525

flake8: comparison to None

d3bae01

csv tests: correct import

fcb3701

flake8: ignore F401

96f0541

io.csv: add init file in tests directoy

89dfd25

trichter added the ready for review PRs that are ready to be reviewed to get marked ready to merge label Mar 24, 2023

megies previously approved these changes Mar 24, 2023

View reviewed changes

d-chambers reviewed Mar 24, 2023

View reviewed changes

trichter added 2 commits March 30, 2023 11:17

add match argument in pytest.warns

f5426b8

remove obspycsv version string

58201cb

trichter and others added 11 commits March 30, 2023 11:20

doc

ae0b15f

transform to doctest

f75d148

remove pytest mark

9473be7

check for seedid is None

26796f4

do not use assert

98f4fd4

make events2array a private function

7104f1e

handle path objects

0e08ee1

update doc, remove check_compression

c7d56ad

add to doc index

e3ad21e

changelog

fba9810

pep8

932c7f7

megies approved these changes Mar 30, 2023

View reviewed changes

megies merged commit c1ce508 into master Mar 30, 2023

megies deleted the csv branch March 30, 2023 13:53

trichter mentioned this pull request Mar 31, 2023

Improve documentation of io.csv module #3288

Merged

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

read/write support for csv, eventtxt and csz #3285

read/write support for csv, eventtxt and csz #3285

trichter commented Mar 24, 2023 •

edited by megies

trichter commented Mar 24, 2023

trichter commented Mar 24, 2023

d-chambers commented Mar 24, 2023

megies commented Mar 24, 2023

megies left a comment

megies commented Mar 24, 2023

d-chambers left a comment

d-chambers Mar 24, 2023

trichter Mar 30, 2023

d-chambers Mar 24, 2023 •

edited

trichter Mar 30, 2023

d-chambers Mar 24, 2023

trichter Mar 30, 2023

megies commented Mar 28, 2023

megies left a comment

d-chambers commented Mar 30, 2023

read/write support for csv, eventtxt and csz #3285

read/write support for csv, eventtxt and csz #3285

Conversation

trichter commented Mar 24, 2023 • edited by megies

What does this PR do?

Why was it initiated? Any relevant Issues?

PR Checklist

trichter commented Mar 24, 2023

trichter commented Mar 24, 2023

d-chambers commented Mar 24, 2023

megies commented Mar 24, 2023

megies left a comment

Choose a reason for hiding this comment

megies commented Mar 24, 2023

d-chambers left a comment

Choose a reason for hiding this comment

d-chambers Mar 24, 2023

Choose a reason for hiding this comment

trichter Mar 30, 2023

Choose a reason for hiding this comment

d-chambers Mar 24, 2023 • edited

Choose a reason for hiding this comment

trichter Mar 30, 2023

Choose a reason for hiding this comment

d-chambers Mar 24, 2023

Choose a reason for hiding this comment

trichter Mar 30, 2023

Choose a reason for hiding this comment

megies commented Mar 28, 2023

megies left a comment

Choose a reason for hiding this comment

d-chambers commented Mar 30, 2023

trichter commented Mar 24, 2023 •

edited by megies

d-chambers Mar 24, 2023 •

edited