Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add pandas-2.x support in cudf #14916

Merged
merged 246 commits into from Jan 30, 2024
Merged

Add pandas-2.x support in cudf #14916

merged 246 commits into from Jan 30, 2024

Conversation

galipremsagar
Copy link
Contributor

@galipremsagar galipremsagar commented Jan 29, 2024

Description

closes: #12794

This PR introduces pandas-2.x support in cudf. Minimum requirement of pandas is also bumped to 2.0.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

This PR updates value_counts behavior to match pandas-2.x, the result name will be count (or proportion if normalize=True is passed), and the index will be named after the original object name. This PR also fixes two dtype APIs that are breaking changes on pandas side.
This PR drops `inplace` parameters in categorical methods, these are also removed as part of pandas-2.0
…12843)

This PR raises an error when numeric_only=True for rank if the Series is of non-numeric dtype.
This PR drops support for `Series.is_monotonic` & `Index.is_monotonic`. Instead, the alternative will be `.is_monotonic_increasing`.
This PR removes support for `datetime_is_numeric` parameter in `describe`.
…in `MultiIndex.copy` (#12898)

This PR removes `dtype` in Index & `MultiIndex.copy`, and `names` in Index.copy
This PR drops `kind` parameter from `Index.get_slice_bound` to match pandas-2.0 API.
- [x] This PR removes the deprecation of `numeric_only=None` and defaults to `numeric_only=False`.
- [x] Removes `level` parameter from reduction APIs to match pandas-2.0
- [x] Change `axis` defaults to match pandas-2.0 APIs.
This PR removes `DataFrame.append` & `Series.append` to match pandas-2.0 API. Test usages are now replaced with `.concat` API calls.
This PR drops support for `na_sentinel` in factorize APIs, to match with pandas-2.0
This PR adds additional information for the following Index APIs to match with pandas 2.0:

 is_numeric
 is_boolean
 is_integer
 is_floating
 is_object
 is_categorical
 is_interval
@galipremsagar
Copy link
Contributor Author

/okay to test

@vyasr vyasr removed the 5 - DO NOT MERGE Hold off on merging; see PR for details label Jan 30, 2024
@galipremsagar galipremsagar added the 5 - DO NOT MERGE Hold off on merging; see PR for details label Jan 30, 2024
@galipremsagar
Copy link
Contributor Author

Adding don't merge because we don't want to auto-merge this pr using the bot

@galipremsagar galipremsagar added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Jan 30, 2024
@AyodeAwe AyodeAwe merged commit 238a03f into branch-24.04 Jan 30, 2024
67 of 68 checks passed
@bdice bdice mentioned this pull request Jan 31, 2024
3 tasks
rapids-bot bot pushed a commit that referenced this pull request Feb 1, 2024
A few cleanups in test files following #14916.

Authors:
  - Bradley Dice (https://github.com/bdice)

Approvers:
  - GALI PREM SAGAR (https://github.com/galipremsagar)
  - Charles Blackmon-Luca (https://github.com/charlesbluca)

URL: #14941
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
5 - DO NOT MERGE Hold off on merging; see PR for details 5 - Ready to Merge Testing and reviews complete, ready to merge breaking Breaking change conda conda issue cuDF (Python) Affects Python cuDF API. improvement Improvement / enhancement to an existing function
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

[FEA] Add pandas-2.0 support for cudf
8 participants