Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add pandas-2.x support in cudf #14916

Merged
merged 246 commits into from
Jan 30, 2024
Merged

Add pandas-2.x support in cudf #14916

merged 246 commits into from
Jan 30, 2024

Commits on Feb 23, 2023

  1. Update value_counts with new behavior (#12835)

    This PR updates value_counts behavior to match pandas-2.x, the result name will be count (or proportion if normalize=True is passed), and the index will be named after the original object name. This PR also fixes two dtype APIs that are breaking changes on pandas side.
    galipremsagar committed Feb 23, 2023
    Configuration menu
    Copy the full SHA
    14f54ac View commit details
    Browse the repository at this point in the history

Commits on Feb 24, 2023

  1. Configuration menu
    Copy the full SHA
    fc33639 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    395fa58 View commit details
    Browse the repository at this point in the history
  3. Drop inplace parameter in categorical methods (#12846)

    This PR drops `inplace` parameters in categorical methods, these are also removed as part of pandas-2.0
    galipremsagar committed Feb 24, 2023
    Configuration menu
    Copy the full SHA
    7d62d4e View commit details
    Browse the repository at this point in the history
  4. [REVIEW] Raise error when numeric_only=True for non-numeric Series (#…

    …12843)
    
    This PR raises an error when numeric_only=True for rank if the Series is of non-numeric dtype.
    galipremsagar committed Feb 24, 2023
    Configuration menu
    Copy the full SHA
    d1b1ea8 View commit details
    Browse the repository at this point in the history

Commits on Feb 25, 2023

  1. Configuration menu
    Copy the full SHA
    9f2b0c2 View commit details
    Browse the repository at this point in the history

Commits on Feb 27, 2023

  1. Configuration menu
    Copy the full SHA
    9ef1b37 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ae00532 View commit details
    Browse the repository at this point in the history

Commits on Feb 28, 2023

  1. Drop is_monotonic (#12853)

    This PR drops support for `Series.is_monotonic` & `Index.is_monotonic`. Instead, the alternative will be `.is_monotonic_increasing`.
    galipremsagar committed Feb 28, 2023
    Configuration menu
    Copy the full SHA
    6317733 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    fd0c1dd View commit details
    Browse the repository at this point in the history

Commits on Mar 6, 2023

  1. Configuration menu
    Copy the full SHA
    10ea515 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ea2099c View commit details
    Browse the repository at this point in the history

Commits on Mar 7, 2023

  1. merge

    galipremsagar committed Mar 7, 2023
    Configuration menu
    Copy the full SHA
    25b87f0 View commit details
    Browse the repository at this point in the history

Commits on Mar 8, 2023

  1. [REVIEW] Drop datetime_is_numeric parameter from describe (#12890)

    This PR removes support for `datetime_is_numeric` parameter in `describe`.
    galipremsagar committed Mar 8, 2023
    Configuration menu
    Copy the full SHA
    5af0583 View commit details
    Browse the repository at this point in the history
  2. Drop names, dtype in Index.copy and dtype, levels, codes

    …in `MultiIndex.copy` (#12898)
    
    This PR removes `dtype` in Index & `MultiIndex.copy`, and `names` in Index.copy
    galipremsagar committed Mar 8, 2023
    Configuration menu
    Copy the full SHA
    531f52c View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    620e35f View commit details
    Browse the repository at this point in the history
  4. Drop kind parameter from Index.get_slice_bound (#12856)

    This PR drops `kind` parameter from `Index.get_slice_bound` to match pandas-2.0 API.
    galipremsagar committed Mar 8, 2023
    Configuration menu
    Copy the full SHA
    7ec76b7 View commit details
    Browse the repository at this point in the history

Commits on Mar 9, 2023

  1. Configuration menu
    Copy the full SHA
    cecf651 View commit details
    Browse the repository at this point in the history

Commits on Mar 10, 2023

  1. [REVIEW] Update numeric_only behavior in reduction APIs (#12847)

    - [x] This PR removes the deprecation of `numeric_only=None` and defaults to `numeric_only=False`.
    - [x] Removes `level` parameter from reduction APIs to match pandas-2.0
    - [x] Change `axis` defaults to match pandas-2.0 APIs.
    galipremsagar committed Mar 10, 2023
    Configuration menu
    Copy the full SHA
    58b9acb View commit details
    Browse the repository at this point in the history
  2. merge

    galipremsagar committed Mar 10, 2023
    Configuration menu
    Copy the full SHA
    99a4148 View commit details
    Browse the repository at this point in the history
  3. [REVIEW] Drop DataFrame.append and Series.append (#12839)

    This PR removes `DataFrame.append` & `Series.append` to match pandas-2.0 API. Test usages are now replaced with `.concat` API calls.
    galipremsagar committed Mar 10, 2023
    Configuration menu
    Copy the full SHA
    e115ba5 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    620483b View commit details
    Browse the repository at this point in the history

Commits on Mar 13, 2023

  1. Configuration menu
    Copy the full SHA
    dc1b813 View commit details
    Browse the repository at this point in the history
  2. Drop na_sentinel from factorize (#12924)

    This PR drops support for `na_sentinel` in factorize APIs, to match with pandas-2.0
    galipremsagar committed Mar 13, 2023
    Configuration menu
    Copy the full SHA
    4a87cbd View commit details
    Browse the repository at this point in the history
  3. Merge remote-tracking branch 'upstream/pandas_2.0_feature_branch' int…

    …o pandas_2.0_feature_branch
    galipremsagar committed Mar 13, 2023
    Configuration menu
    Copy the full SHA
    c0ab786 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    61843ed View commit details
    Browse the repository at this point in the history
  5. Add information about Index.is_* method deprecation (#12909)

    This PR adds additional information for the following Index APIs to match with pandas 2.0:
    
     is_numeric
     is_boolean
     is_integer
     is_floating
     is_object
     is_categorical
     is_interval
    galipremsagar committed Mar 13, 2023
    Configuration menu
    Copy the full SHA
    d1377a5 View commit details
    Browse the repository at this point in the history

Commits on Mar 16, 2023

  1. Configuration menu
    Copy the full SHA
    55de5a4 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    30f6f8e View commit details
    Browse the repository at this point in the history

Commits on Mar 28, 2023

  1. Configuration menu
    Copy the full SHA
    e42619b View commit details
    Browse the repository at this point in the history

Commits on Mar 29, 2023

  1. Configuration menu
    Copy the full SHA
    0f3172f View commit details
    Browse the repository at this point in the history

Commits on Mar 31, 2023

  1. [REVIEW] Miscellaneous pytest fixes for pandas-2.0 (#12962)

    This PR contains miscellaneous fixes in pytests. The changes in pytests are due to suttle changes in behaviors from the pandas-2.0 side.
    galipremsagar committed Mar 31, 2023
    Configuration menu
    Copy the full SHA
    48c1016 View commit details
    Browse the repository at this point in the history

Commits on Apr 3, 2023

  1. Configuration menu
    Copy the full SHA
    ddf8996 View commit details
    Browse the repository at this point in the history

Commits on Apr 4, 2023

  1. Configuration menu
    Copy the full SHA
    6bbcc23 View commit details
    Browse the repository at this point in the history

Commits on Apr 7, 2023

  1. Add get_indexer

    galipremsagar committed Apr 7, 2023
    Configuration menu
    Copy the full SHA
    dd15a19 View commit details
    Browse the repository at this point in the history
  2. Fix ufunc tests (#13083)

    Pandas 2.0 introduced support for ufuncs when the two columns are indexed. This PR updates the pytests accordingly.
    galipremsagar committed Apr 7, 2023
    Configuration menu
    Copy the full SHA
    6dce4ef View commit details
    Browse the repository at this point in the history
  3. [REVIEW] datetime and timedelta improvements (#12934)

    This PR fixes 1046 pytest failures that are related to `datetime64` & `timedelta64` types.
    
    This PR(`time_2.0`):
    ```bash
    = 990 failed, 86109 passed, 2034 skipped, 995 xfailed, 165 xpassed in 546.70s (0:09:06) =
    ```
    
    on `pandas_2.0_feature_branch`:
    ```bash
    == 2036 failed, 85423 passed, 2034 skipped, 860 xfailed in 720.53s (0:12:00) ===
    ```
    galipremsagar committed Apr 7, 2023
    Configuration menu
    Copy the full SHA
    192e204 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    1621fcb View commit details
    Browse the repository at this point in the history
  5. Fix MultiIndex construction in pandas 2.0 (#13092)

    This PR removes a `MultiIndex` construction workaround that retains correct dtypes of each level. Thus fixing 19 pytests:
    
    ```bash
    = 907 failed, 86196 passed, 2034 skipped, 992 xfailed, 165 xpassed in 536.13s (0:08:56) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```bash
    = 926 failed, 86177 passed, 2034 skipped, 992 xfailed, 165 xpassed in 545.17s (0:09:05) =
    ```
    galipremsagar committed Apr 7, 2023
    Configuration menu
    Copy the full SHA
    60c257a View commit details
    Browse the repository at this point in the history

Commits on Apr 11, 2023

  1. Configuration menu
    Copy the full SHA
    f472c0d View commit details
    Browse the repository at this point in the history

Commits on Apr 14, 2023

  1. [REVIEW] Enable numeric_only for row-wise ops (#13090)

    This PR enables numeric_only for row-wise ops to be on parity with pandas-2.0.
    galipremsagar committed Apr 14, 2023
    Configuration menu
    Copy the full SHA
    be19968 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    095f17b View commit details
    Browse the repository at this point in the history

Commits on Apr 15, 2023

  1. [REVIEW] Fix DataFrame.__getitem__ to work with pandas-2.0 (#13139)

    This PR updates `DataFrame.__getitem__` to be able to work with pandas-2.0. For which, we conditionally pass `dtype` to `pandas.Series` constructor so that we don't get a warning in `<2.0` versions.
    
    This PR also fixes 76 pytests:
    ```
    = 907 failed, 86353 passed, 2034 skipped, 992 xfailed, 165 xpassed in 504.93s (0:08:24) =
    ```
    on `pandas_2.0_feature_branch`:
    ```
    = 983 failed, 86277 passed, 2034 skipped, 992 xfailed, 165 xpassed in 515.47s (0:08:35) =
    ```
    galipremsagar committed Apr 15, 2023
    Configuration menu
    Copy the full SHA
    8ff4861 View commit details
    Browse the repository at this point in the history

Commits on Apr 17, 2023

  1. Configuration menu
    Copy the full SHA
    8a41a7f View commit details
    Browse the repository at this point in the history

Commits on Apr 18, 2023

  1. Drop backfill and pad in GroupBy (#13156)

    This PR drops support for `pad` and `backfill` in `GroupBy`.
    
    This PR:
    ```
    = 881 failed, 86383 passed, 2034 skipped, 956 xfailed, 165 xpassed in 522.05s (0:08:42) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 911 failed, 86389 passed, 2034 skipped, 956 xfailed, 165 xpassed in 521.12s (0:08:41) =
    ```
    galipremsagar committed Apr 18, 2023
    Configuration menu
    Copy the full SHA
    bd38d70 View commit details
    Browse the repository at this point in the history
  2. [REVIEW] Add no_default and adapt Series.reset_index to different…

    …iate `None` for `name` parameter (#13152)
    
    In `pandas-2.0` the behavior for `name` parameter has changed to actually name a column `0` if no value is passed to `name`. But if `name=None`, the column will be named `None` too:
    
    ```python
    In [1]: import pandas as pd
    
    In [2]: s = pd.Series([10, 11, 23], index=[2, 3, 5])
    
    In [3]: s
    Out[3]: 
    2    10
    3    11
    5    23
    dtype: int64
    
    In [4]: s.reset_index()
    Out[4]: 
       index   0
    0      2  10
    1      3  11
    2      5  23
    
    In [5]: s.reset_index(name=None)
    Out[5]: 
       index  None
    0      2    10
    1      3    11
    2      5    23
    ```
    
    To achieve the same behavior in `cudf`, we had to introduce `no_default` value(which is same as pandas's `no_default` value). 
    
    This also fixes 18 pytests:
    ```
    = 965 failed, 86325 passed, 2044 skipped, 992 xfailed, 165 xpassed in 508.32s (0:08:28) =
    ```
    On `pandas_2.0_feature_branch`:
    
    ```
    = 983 failed, 86277 passed, 2034 skipped, 992 xfailed, 165 xpassed in 541.87s (0:09:01) =
    ```
    galipremsagar committed Apr 18, 2023
    Configuration menu
    Copy the full SHA
    81565cf View commit details
    Browse the repository at this point in the history
  3. Fix is_string_dtype to adapt to pandas-2.0 changes (#13141)

    With `pandas-2.0`, `pd.api.types.is_string_dtype(obj)` is going to perform a data-introspection to determine the true dtype of the underlying data. This path won't work for gpu objects, hence this PR adds special handling for GPU objects before we hit `pd.api.types.is_string_dtype(obj)` API.
    
    This PR fixes 56 pytests:
    ```
    = 927 failed, 86333 passed, 2034 skipped, 992 xfailed, 165 xpassed in 506.69s (0:08:26) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 983 failed, 86277 passed, 2034 skipped, 992 xfailed, 165 xpassed in 557.07s (0:09:17) =
    ```
    galipremsagar committed Apr 18, 2023
    Configuration menu
    Copy the full SHA
    199787d View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    27d2a75 View commit details
    Browse the repository at this point in the history

Commits on Apr 19, 2023

  1. Handle pandas warnings for pad and backfill (#13168)

    This PR adds pytest handling for wanring incase of pad and backfill.
    galipremsagar committed Apr 19, 2023
    Configuration menu
    Copy the full SHA
    47492da View commit details
    Browse the repository at this point in the history
  2. [REVIEW] Fix datetime pytests & raise errors for timezone un-aware ty…

    …pecasting (#13164)
    
    
    This PR fixes some of the `to_datetime` related pytests and also raises error while constructing a time-zone un-aware type to datetime types.
    
    This PR fixes 62 pytests:
    ```
    = 745 failed, 87877 passed, 2044 skipped, 956 xfailed, 165 xpassed in 492.06s (0:08:12) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 807 failed, 87819 passed, 2044 skipped, 956 xfailed, 165 xpassed in 488.43s (0:08:08) =
    ```
    galipremsagar committed Apr 19, 2023
    Configuration menu
    Copy the full SHA
    fbe1848 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    352753a View commit details
    Browse the repository at this point in the history
  4. Merge remote-tracking branch 'upstream/pandas_2.0_feature_branch' int…

    …o pandas_2.0_feature_branch
    galipremsagar committed Apr 19, 2023
    Configuration menu
    Copy the full SHA
    a31d62c View commit details
    Browse the repository at this point in the history
  5. [REVIEW] Fix pytests where empty column indexes are compared (#13166)

    This PR fixes pytests where empty column object comparisons fail, this is because of the following inconsistency between pandas & cudf:
    ```python
    In [1]: import pandas as pd
    
    In [2]: import cudf
    
    In [3]: pd.DataFrame().columns
    Out[3]: RangeIndex(start=0, stop=0, step=1)
    
    In [4]: cudf.DataFrame().columns
    Out[4]: Index([], dtype='object')
    
    In [5]: pd.DataFrame().columns.dtype
    Out[5]: dtype('int64')
    
    In [6]: cudf.DataFrame().columns.dtype
    Out[6]: dtype('O')
    
    ```
    
    This PR fixes 28 failures:
    ```
    = 779 failed, 87847 passed, 2044 skipped, 956 xfailed, 165 xpassed in 483.17s (0:08:03) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 807 failed, 87819 passed, 2044 skipped, 956 xfailed, 165 xpassed in 488.43s (0:08:08) =
    ```
    galipremsagar committed Apr 19, 2023
    Configuration menu
    Copy the full SHA
    615828d View commit details
    Browse the repository at this point in the history
  6. [REVIEW] Raise error when there is a binary operation between certain…

    … `DataFrame` and `Series` objects (#13138)
    
    This PR raises an error when there is a binary operation performed between `DataFrame` & `Series` with unequal `columns` and `index` respectively.
    
    This PR fixes 120 pytests:
    ```
    = 833 failed, 86451 passed, 2034 skipped, 968 xfailed, 165 xpassed in 490.86s (0:08:10) =
    ```
    on `pandas_2.0_feature_branch`:
    ```
    = 953 failed, 86307 passed, 2034 skipped, 992 xfailed, 165 xpassed in 511.09s (0:08:31) =
    ```
    galipremsagar committed Apr 19, 2023
    Configuration menu
    Copy the full SHA
    8e8a1ea View commit details
    Browse the repository at this point in the history
  7. Merge remote-tracking branch 'upstream/pandas_2.0_feature_branch' int…

    …o pandas_2.0_feature_branch
    galipremsagar committed Apr 19, 2023
    Configuration menu
    Copy the full SHA
    69af242 View commit details
    Browse the repository at this point in the history

Commits on Apr 20, 2023

  1. Fix datetime64 related inconsistencies in pytests (#13175)

    This PR fixes `datetime64` related pytest failures where pandas returns `ns` time resolutions for quite a lot of cases, i.e., mostly on the IO APIs side.
    
    Fixes 72 pytests:
    ```
    = 484 failed, 88162 passed, 2044 skipped, 932 xfailed, 165 xpassed in 444.53s (0:07:24) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 556 failed, 88090 passed, 2044 skipped, 932 xfailed, 165 xpassed in 456.49s (0:07:36) =
    ```
    galipremsagar committed Apr 20, 2023
    Configuration menu
    Copy the full SHA
    901a971 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e7ddc69 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    8a13810 View commit details
    Browse the repository at this point in the history
  4. Fix DataFrame.describe pytests (#13191)

    #12890 dropped support for `datetime_is_numeric` from `describe` API. This PR cleans-up a remaining pytest that was using this parameter.
    
    This PR fixes 20 pytests:
    ```
    = 464 failed, 88182 passed, 2044 skipped, 932 xfailed, 165 xpassed in 440.68s (0:07:20) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 484 failed, 88162 passed, 2044 skipped, 932 xfailed, 165 xpassed in 457.87s (0:07:37) =
    ```
    galipremsagar committed Apr 20, 2023
    Configuration menu
    Copy the full SHA
    31e08c9 View commit details
    Browse the repository at this point in the history

Commits on Apr 21, 2023

  1. Configuration menu
    Copy the full SHA
    b772017 View commit details
    Browse the repository at this point in the history

Commits on Apr 22, 2023

  1. Change default dtype for get_dummies to bool (#13174)

    This PR changes the default dtype for get_dummies to bool from uint8 to match pandas-2.0: pandas-dev/pandas#48022
    galipremsagar committed Apr 22, 2023
    Configuration menu
    Copy the full SHA
    27e18c8 View commit details
    Browse the repository at this point in the history
  2. [REVIEW] Update parameter ordering in DataFrame.pivot (#13190)

    This PR updates parameter ordering in `DataFrame.pivot` to match pandas-2.0.
    
    This PR fixes 7 related pytests:
    ```
    = 477 failed, 88169 passed, 2044 skipped, 932 xfailed, 165 xpassed in 438.55s (0:07:18) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 484 failed, 88162 passed, 2044 skipped, 932 xfailed, 165 xpassed in 457.87s (0:07:37) =
    ```
    galipremsagar committed Apr 22, 2023
    Configuration menu
    Copy the full SHA
    6a86385 View commit details
    Browse the repository at this point in the history

Commits on Apr 24, 2023

  1. Configuration menu
    Copy the full SHA
    00f61cd View commit details
    Browse the repository at this point in the history

Commits on Apr 25, 2023

  1. Configuration menu
    Copy the full SHA
    e4500a6 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4fe23bb View commit details
    Browse the repository at this point in the history

Commits on Apr 26, 2023

  1. Fix ceil, floor and round pytests (#13218)

    A fix for pandas-dev/pandas#52761 has been merged by @mroeschke , this PR xfails the pytests conditionally for `2.0.0` and passes for rest of the versions.
    
    This PR fixes 27 pytests: 
    ```
    = 404 failed, 88221 passed, 2044 skipped, 959 xfailed, 165 xpassed in 442.21s (0:07:22) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 431 failed, 88221 passed, 2044 skipped, 932 xfailed, 165 xpassed in 456.25s (0:07:36) =
    ```
    galipremsagar committed Apr 26, 2023
    Configuration menu
    Copy the full SHA
    ea7d18c View commit details
    Browse the repository at this point in the history

Commits on Apr 27, 2023

  1. Configuration menu
    Copy the full SHA
    e355ba4 View commit details
    Browse the repository at this point in the history
  2. Fix kurtosis pytests to support numeric_only parameter (#13217)

    #12847 introduced support for `numeric_only`, this PR cleans up a `kurt` related pytest that was relying on the old behavior.
    
    This PR fixes 18 pytests :
    ```
    = 413 failed, 88257 passed, 2044 skipped, 932 xfailed, 165 xpassed in 463.03s (0:07:43) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 431 failed, 88221 passed, 2044 skipped, 932 xfailed, 165 xpassed in 456.25s (0:07:36) =
    ```
    galipremsagar committed Apr 27, 2023
    Configuration menu
    Copy the full SHA
    569b3e7 View commit details
    Browse the repository at this point in the history
  3. Fix parquet pytests errors with pandas-2.0 (#13216)

    Pandas-2.0 fixed the following issue where dtype of the column was being changed even when `np.nan` was not being written to it:
    ```python
    
    In [1]: import pandas as pd
    
    In [2]: df = pd.DataFrame({'a':[1, 2, 3]})
    
    In [3]: df.dtypes
    Out[3]: 
    a    int64
    dtype: object
    
    In [4]: df
    Out[4]: 
       a
    0  1
    1  2
    2  3
    
    In [7]: df[[False]*3] = np.nan
    
    In [8]: df
    Out[8]: 
       a
    0  1
    1  2
    2  3
    
    In [9]: df.dtypes
    Out[9]: 
    a    int64
    dtype: object
    ```
    
    Bug in pre-2.0:
    ```python
    
    In [1]: import pandas as pd
    
    In [2]: df = pd.DataFrame({'a':[1, 2, 3]})
    
    In [3]: df.dtypes
    Out[3]: 
    a    int64
    dtype: object
    
    In [4]: df
    Out[4]: 
       a
    0  1
    1  2
    2  3
    
    In [7]: df[[False]*3] = np.nan
    
    In [8]: df
    Out[8]: 
         a
    0  1.0
    1  2.0
    2  3.0
    
    In [9]: df.dtypes
    Out[9]: 
    a    float64
    dtype: object
    ```
    
    `make_pdf` was basically operating correctly with the help of this bug, this PR makes some fixes to the method and the callers to preserve the pytest behaviors.
    
    This PR fixes 6 pytests:
    ```
    = 425 failed, 88227 passed, 2044 skipped, 932 xfailed, 165 xpassed in 471.32s (0:07:51) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 431 failed, 88221 passed, 2044 skipped, 932 xfailed, 165 xpassed in 456.25s (0:07:36) =
    ```
    galipremsagar committed Apr 27, 2023
    Configuration menu
    Copy the full SHA
    bbc84f6 View commit details
    Browse the repository at this point in the history
  4. merge

    galipremsagar committed Apr 27, 2023
    Configuration menu
    Copy the full SHA
    34aa2c5 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    38fbdf5 View commit details
    Browse the repository at this point in the history

Commits on May 15, 2023

  1. Merge

    galipremsagar committed May 15, 2023
    Configuration menu
    Copy the full SHA
    71a5a88 View commit details
    Browse the repository at this point in the history

Commits on May 16, 2023

  1. merge

    galipremsagar committed May 16, 2023
    Configuration menu
    Copy the full SHA
    3b679e5 View commit details
    Browse the repository at this point in the history

Commits on May 17, 2023

  1. merge

    galipremsagar committed May 17, 2023
    Configuration menu
    Copy the full SHA
    b057436 View commit details
    Browse the repository at this point in the history
  2. Merge

    galipremsagar committed May 17, 2023
    Configuration menu
    Copy the full SHA
    97b1642 View commit details
    Browse the repository at this point in the history

Commits on May 22, 2023

  1. Configuration menu
    Copy the full SHA
    e47e5c0 View commit details
    Browse the repository at this point in the history

Commits on May 23, 2023

  1. Configuration menu
    Copy the full SHA
    97c0eee View commit details
    Browse the repository at this point in the history
  2. Fix csv reader pytest & MultiIndex docstring (#13417)

    Pandas-2.0 moved to a very strict & consistent date format inference, we should plan to move similarly but meanwhile for the pytest to pass, we will need to pass date_format='mixed'.
    This PR also fixes a miscellaneous issue with MultiIndex.copy docstring.
    galipremsagar committed May 23, 2023
    Configuration menu
    Copy the full SHA
    3a85f64 View commit details
    Browse the repository at this point in the history
  3. Merge

    galipremsagar committed May 23, 2023
    Configuration menu
    Copy the full SHA
    6bfbfe3 View commit details
    Browse the repository at this point in the history

Commits on May 24, 2023

  1. Configuration menu
    Copy the full SHA
    1fd2c91 View commit details
    Browse the repository at this point in the history

Commits on May 25, 2023

  1. Merge

    galipremsagar committed May 25, 2023
    Configuration menu
    Copy the full SHA
    7b13714 View commit details
    Browse the repository at this point in the history

Commits on May 26, 2023

  1. Deprecate Groupby.dtypes (#13453)

    This PR deprecates `Groupby.dtypes` since it is deprecated in `pandas-2.1`
    
    This PR fixes 5 pytests:
    ```
    = 474 failed, 95510 passed, 2044 skipped, 763 xfailed, 300 xpassed in 459.93s (0:07:39) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 479 failed, 95505 passed, 2044 skipped, 763 xfailed, 300 xpassed in 471.66s (0:07:51) =
    ```
    galipremsagar committed May 26, 2023
    Configuration menu
    Copy the full SHA
    c1e78b9 View commit details
    Browse the repository at this point in the history
  2. Enforce Groupby.__iter__ deprecation and miscellaneous pytest fixes (#…

    …13423)
    
    This PR:
    
    - [x] Enforces deprecation in `GroupBy.__iter__` 
    - [x] Fixes miscellaneous pytest failures due to already existing differences in cudf vs pandas & nuly introduced `inferred_type` in Index.
    galipremsagar committed May 26, 2023
    Configuration menu
    Copy the full SHA
    2dafcfc View commit details
    Browse the repository at this point in the history

Commits on May 30, 2023

  1. Preserve Index and grouped columns in Groupby.nth (#13442)

    In pandas-2.0 `groupby.nth` behavior has changed: https://pandas.pydata.org/docs/whatsnew/v2.0.0.html#dataframegroupby-nth-and-seriesgroupby-nth-now-behave-as-filtrations
    
    This PR enables preserving the callers index in the end result and returns grouping columns as part of the result.
    
    This PR fixes all 12 pytests in `python/cudf/cudf/tests/test_groupby.py::test_groupby_nth`
    galipremsagar committed May 30, 2023
    Configuration menu
    Copy the full SHA
    16c987e View commit details
    Browse the repository at this point in the history
  2. Index class deprecation enforcements (#13204)

    This PR:
    
    - [x] Enforces `Index` related deprecations by removing `Float32Index`, `Float64Index`, `GenericIndex`, `Int8Index`, `Int16Index`, `Int32Index`, `Int64Index`, `StringIndex`, `UInt8Index`, `UInt16Index`, `UInt32Index`, `UInt64Index`.
    - [x] Cleans up the repr logic to more closely align with pandas for `<NA>` value representation incase of `string` dtype.
    - [x] Fixes docstring and pytests to support the removals of the above classes.
    
    This PR also fixes 202 pytests:
    ```bash
    = 267 failed, 95670 passed, 2044 skipped, 763 xfailed, 300 xpassed in 442.18s (0:07:22) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```bash
    = 469 failed, 95464 passed, 2044 skipped, 763 xfailed, 300 xpassed in 469.26s (0:07:49) =
    ```
    galipremsagar committed May 30, 2023
    Configuration menu
    Copy the full SHA
    258bf3d View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    bb1c8d5 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    72a663e View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    8791749 View commit details
    Browse the repository at this point in the history
  6. Update docs

    galipremsagar committed May 30, 2023
    Configuration menu
    Copy the full SHA
    ac39341 View commit details
    Browse the repository at this point in the history

Commits on May 31, 2023

  1. Fix parquet paritioning pytest failures (#13474)

    This PR fixes parquet pytest failures, mostly working around two upstream issues:
    
    1. pandas-dev/pandas#53345
    2. apache/arrow#33321
    
    Thus fixes 20 pytest failure:
    This PR:
    ```
    = 231 failed, 95767 passed, 2045 skipped, 764 xfailed, 300 xpassed in 426.65s (0:07:06) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 251 failed, 95747 passed, 2045 skipped, 764 xfailed, 300 xpassed in 433.50s (0:07:13) =
    ```
    galipremsagar committed May 31, 2023
    Configuration menu
    Copy the full SHA
    a92ad86 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0b81bd6 View commit details
    Browse the repository at this point in the history

Commits on Jun 2, 2023

  1. Configuration menu
    Copy the full SHA
    f56ea26 View commit details
    Browse the repository at this point in the history

Commits on Jun 3, 2023

  1. Enforce merge validation deprecation (#13499)

    This PR raises an error when a merge is being performed between data consisting of different levels.
    galipremsagar committed Jun 3, 2023
    Configuration menu
    Copy the full SHA
    63b8fb1 View commit details
    Browse the repository at this point in the history
  2. Enable sort=True for Index.union, Index.difference and `Index.i…

    …ntersection` (#13497)
    
    This PR enables `sort=True` for `union`, `difference`, and `intersection` APIs in `Index`. 
    
    This also fixes 1 pytest failure and adds 77 pytests:
    On `Index_sort_2.0`:
    ```
    = 230 failed, 95836 passed, 2045 skipped, 768 xfailed, 308 xpassed in 438.88s (0:07:18) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 231 failed, 95767 passed, 2045 skipped, 764 xfailed, 300 xpassed in 432.59s (0:07:12) =
    ```
    
    xref: pandas-dev/pandas#25151
    galipremsagar committed Jun 3, 2023
    Configuration menu
    Copy the full SHA
    139e32d View commit details
    Browse the repository at this point in the history
  3. Fix a groupby pytest related to numeric_only (#13496)

    This PR fixes a groupby pytest by performing a special version based handling, we will need this handling because of no support for numeric_only in groupby.agg yet.
    galipremsagar committed Jun 3, 2023
    Configuration menu
    Copy the full SHA
    a6869e8 View commit details
    Browse the repository at this point in the history
  4. Drop special handling of min_periods for Rolling.count (#13483)

    This PR drops a special handling for `Rolling.count` where we always default to `min_periods=0`, this is an inconsistency that pandas-2.0 resolves in: pandas-dev/pandas#48839
    
    This PR fixes 2 pytest failures:
    ```
    = 229 failed, 95769 passed, 2045 skipped, 764 xfailed, 300 xpassed in 458.04s (0:07:38) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 231 failed, 95767 passed, 2045 skipped, 764 xfailed, 300 xpassed in 457.58s (0:07:37) =
    ```
    galipremsagar committed Jun 3, 2023
    Configuration menu
    Copy the full SHA
    6001bbf View commit details
    Browse the repository at this point in the history
  5. Fix JSON pytests (#13476)

    This PR fixes 3 json reader pytests:
    
    This PR:
    ```
    = 228 failed, 95770 passed, 2045 skipped, 764 xfailed, 300 xpassed in 473.29s (0:07:53) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 231 failed, 95767 passed, 2045 skipped, 764 xfailed, 300 xpassed in 445.90s (0:07:25) =
    ```
    galipremsagar committed Jun 3, 2023
    Configuration menu
    Copy the full SHA
    4416a24 View commit details
    Browse the repository at this point in the history

Commits on Jun 6, 2023

  1. Fixed strings

    galipremsagar committed Jun 6, 2023
    Configuration menu
    Copy the full SHA
    d6324d1 View commit details
    Browse the repository at this point in the history
  2. Fix DataFrame.mode pytest (#13500)

    This PR xfails a condition that is failing due to a pandas bug: pandas-dev/pandas#53497
    galipremsagar committed Jun 6, 2023
    Configuration menu
    Copy the full SHA
    361e96e View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    8e4b448 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    8bf7b04 View commit details
    Browse the repository at this point in the history

Commits on Jun 7, 2023

  1. annotate

    galipremsagar committed Jun 7, 2023
    Configuration menu
    Copy the full SHA
    0dc0a3d View commit details
    Browse the repository at this point in the history
  2. Fix issues

    galipremsagar committed Jun 7, 2023
    Configuration menu
    Copy the full SHA
    261f594 View commit details
    Browse the repository at this point in the history
  3. Switch to outer inner

    galipremsagar committed Jun 7, 2023
    Configuration menu
    Copy the full SHA
    dc08ef0 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    fbd6d12 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    9760d57 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    2f30179 View commit details
    Browse the repository at this point in the history

Commits on Jun 8, 2023

  1. Configuration menu
    Copy the full SHA
    20ed443 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b47ddc5 View commit details
    Browse the repository at this point in the history

Commits on Jun 9, 2023

  1. Configuration menu
    Copy the full SHA
    41d2c6c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6d0c3a4 View commit details
    Browse the repository at this point in the history

Commits on Jun 12, 2023

  1. Configuration menu
    Copy the full SHA
    d3d1780 View commit details
    Browse the repository at this point in the history

Commits on Jun 13, 2023

  1. Fix dask_cudf pytest failures for pandas-2.0 upgrade (#13548)

    This PR fixes all `dask_cudf` side failures that happen due to `pandas-2.0` upgrade. The fixes are trivial to be broken down into separate PRs. 
    
    - [x] `check_less_precise` is removed in `pandas-2.0`, since it is a parameter that we never supported and just had it for the sake of matching signature I removed it from all the methods.
    - [x] Due to the removal of `StringIndex`, we had to perform some re-ordering of `if/elif` logic in `_nonempty_index`.
    - [x] `dask_cudf.DataFrame.var` got `numeric_only` support.
    - [x] `Series.count` doesn't have `skipna` support. Hence removed it from the call.
    
    
    This PR fixes 56 pytest failures:
    
    ```
    == 1100 passed, 13 skipped, 8 xfailed, 5 xpassed, 114 warnings in 57.10s ==
    ```
    
    On `pandas_2.0_feature_branch`:
    
    ```
    == 56 failed, 1044 passed, 13 skipped, 8 xfailed, 5 xpassed, 114 warnings in 73.73s (0:01:13) ==
    ```
    galipremsagar committed Jun 13, 2023
    Configuration menu
    Copy the full SHA
    4289ef4 View commit details
    Browse the repository at this point in the history
  2. simplify

    galipremsagar committed Jun 13, 2023
    Configuration menu
    Copy the full SHA
    e7eb1d3 View commit details
    Browse the repository at this point in the history

Commits on Jun 14, 2023

  1. Enable writing column names with mixed dtype in parquet writer when `…

    …mode.pandas_compatible=True` (#13505)
    
    This PR enables writing a dataframe that has column names that are of mixed types to a parquet file when pandas compatibility mode is enabled(`mode.pandas_compatible=True`).
    
    ---------
    
    Co-authored-by: Bradley Dice <bdice@bradleydice.com>
    galipremsagar and bdice committed Jun 14, 2023
    Configuration menu
    Copy the full SHA
    fb99b0a View commit details
    Browse the repository at this point in the history

Commits on Jun 20, 2023

  1. Configuration menu
    Copy the full SHA
    b69fb13 View commit details
    Browse the repository at this point in the history
  2. address reviews

    galipremsagar committed Jun 20, 2023
    Configuration menu
    Copy the full SHA
    2488d91 View commit details
    Browse the repository at this point in the history
  3. fix

    galipremsagar committed Jun 20, 2023
    Configuration menu
    Copy the full SHA
    7f216cf View commit details
    Browse the repository at this point in the history
  4. simplify

    galipremsagar committed Jun 20, 2023
    Configuration menu
    Copy the full SHA
    13d62c5 View commit details
    Browse the repository at this point in the history

Commits on Jun 21, 2023

  1. Merge pull request #13234 from galipremsagar/get_indexer_2.0

    Resolves: #12312
    This PR:
    - Enforces Index.get_loc deprecation by removing method and tolerance parameter support.
    - Introduces Index.get_indexer to match pandas: https://pandas.pydata.org/docs/reference/api/pandas.Index.get_indexer.html
    
    This PR fixes 219 pytests & adds 100 pytests:
    ```
    = 32 failed, 96054 passed, 2045 skipped, 766 xfailed, 300 xpassed in 409.82s (0:06:49) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 251 failed, 95747 passed, 2045 skipped, 764 xfailed, 300 xpassed in 433.50s (0:07:13) =
    ```
    wence- committed Jun 21, 2023
    Configuration menu
    Copy the full SHA
    a1f5581 View commit details
    Browse the repository at this point in the history
  2. merge

    galipremsagar committed Jun 21, 2023
    Configuration menu
    Copy the full SHA
    fdac177 View commit details
    Browse the repository at this point in the history

Commits on Jul 11, 2023

  1. Merge

    galipremsagar committed Jul 11, 2023
    Configuration menu
    Copy the full SHA
    fa2f0da View commit details
    Browse the repository at this point in the history

Commits on Jul 14, 2023

  1. Configuration menu
    Copy the full SHA
    85846be View commit details
    Browse the repository at this point in the history

Commits on Jul 18, 2023

  1. Merge

    galipremsagar committed Jul 18, 2023
    Configuration menu
    Copy the full SHA
    bc9b7b2 View commit details
    Browse the repository at this point in the history

Commits on Jul 31, 2023

  1. merge

    galipremsagar committed Jul 31, 2023
    Configuration menu
    Copy the full SHA
    ca463a7 View commit details
    Browse the repository at this point in the history
  2. Fix default behavior of index metaclass instance and subclass checks #…

    …13787
    
    The current implementation of IndexMeta gives the wrong result for subclasses of Index: for instance, DatetimeIndex.__instancecheck__(DatetimeIndex(), DatetimeIndex) or DatetimeIndex.__subclasscheck__(DatetimeIndex, DatetimeIndex) would return False. In the case of isinstance, however, we have been saved by https://bugs.python.org/issue35083, wherein Python is silently injecting a if type(instance) is self: return True short-circuit. issubclass currently does have the wrong behavior, though. The fix is to fall back to the original behavior of isinstance/issubclass rather than hardcoding a boolean result. That will also ensure that we get the correct behavior if e.g. a subclass of Index is itself subclassed, e.g. if we introduced a class SpecialDatetimeIndex(DatetimeIndex).
    vyasr committed Jul 31, 2023
    Configuration menu
    Copy the full SHA
    273945b View commit details
    Browse the repository at this point in the history

Commits on Aug 17, 2023

  1. merge

    galipremsagar committed Aug 17, 2023
    Configuration menu
    Copy the full SHA
    10e5459 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    22db02f View commit details
    Browse the repository at this point in the history

Commits on Oct 11, 2023

  1. Merge

    galipremsagar committed Oct 11, 2023
    Configuration menu
    Copy the full SHA
    de76328 View commit details
    Browse the repository at this point in the history
  2. merge fix

    galipremsagar committed Oct 11, 2023
    Configuration menu
    Copy the full SHA
    db92536 View commit details
    Browse the repository at this point in the history

Commits on Oct 12, 2023

  1. Configuration menu
    Copy the full SHA
    fc6a30f View commit details
    Browse the repository at this point in the history
  2. Deprecate is_categorical_dtype (#14274)

    This PR deprecates `is_categorical_dtype` to match pandas-dev/pandas#52527 which was introduced in `pandas-2.x`. This PR internalizes the public API since this is a needed utility in our code base.
    
    This PR:
    ```
    = 23835 failed, 5698 passed, 1613 skipped, 288 xfailed, 423 errors in 1976.84s (0:32:56) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 24297 failed, 5115 passed, 1613 skipped, 288 xfailed, 480 errors in 1980.46s (0:33:00) =
    ```
    galipremsagar committed Oct 12, 2023
    Configuration menu
    Copy the full SHA
    ad3ae65 View commit details
    Browse the repository at this point in the history
  3. Deprecate is_interval_dtype and is_datetime64tz_dtype (#14275)

    This PR deprecates `is_datetime64tz_dtype` and `is_interval_dtype` to have parity with pandas-2.x: pandas-dev/pandas#52607, alternatively this PR internalizes these utilities.
    
    This PR:
    ```
    = 1584 failed, 98570 passed, 2069 skipped, 776 xfailed, 312 xpassed, 20 errors in 737.24s (0:12:17) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 23835 failed, 5698 passed, 1613 skipped, 288 xfailed, 423 errors in 1976.84s (0:32:56) =
    ```
    galipremsagar committed Oct 12, 2023
    Configuration menu
    Copy the full SHA
    7c6d8f2 View commit details
    Browse the repository at this point in the history

Commits on Oct 13, 2023

  1. Deprecate method in fillna API (#14278)

    This PR deprecates `method` parameter in all public `fillna` APIs to match pandas: pandas-dev/pandas#53496
    
    This PR:
    ```
    = 1056 failed, 99098 passed, 2069 skipped, 776 xfailed, 312 xpassed, 20 errors in 670.87s (0:11:10) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 1584 failed, 98570 passed, 2069 skipped, 776 xfailed, 312 xpassed, 20 errors in 737.24s (0:12:17) =
    ```
    galipremsagar committed Oct 13, 2023
    Configuration menu
    Copy the full SHA
    2461315 View commit details
    Browse the repository at this point in the history

Commits on Nov 9, 2023

  1. Merge

    galipremsagar committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    88e1978 View commit details
    Browse the repository at this point in the history

Commits on Nov 29, 2023

  1. Deprecate fill_method and limit in pct_change APIs (#14277)

    This PR deprecated `fill_method` and `limit` in `Series.pct_change`, `DataFrame.pct_change` and `Groupby.pct_change`
    
    This PR:
    ```
    = 1263 failed, 98996 passed, 2069 skipped, 776 xfailed, 312 xpassed, 20 errors in 701.08s (0:11:41) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 1584 failed, 98570 passed, 2069 skipped, 776 xfailed, 312 xpassed, 20 errors in 737.24s (0:12:17) =
    ```
    galipremsagar committed Nov 29, 2023
    Configuration menu
    Copy the full SHA
    90788f2 View commit details
    Browse the repository at this point in the history
  2. Merge

    galipremsagar committed Nov 29, 2023
    Configuration menu
    Copy the full SHA
    b15e438 View commit details
    Browse the repository at this point in the history

Commits on Dec 2, 2023

  1. Merge

    galipremsagar committed Dec 2, 2023
    Configuration menu
    Copy the full SHA
    c01a8db View commit details
    Browse the repository at this point in the history

Commits on Dec 5, 2023

  1. Replace PandasArray with NumpyExtensionArray (#14549)

    This PR replaces usages of `PandasArray` with `NumpyExtensionArray` to not have warnings during runtime.
    
    On `pandas_2.0_feature_branch`:
    ```
    = 15895 failed, 61649 passed, 1840 skipped, 735 xfailed, 312 xpassed, 371 errors in 4361.35s (1:12:41) =
    ```
    
    On this PR:
    ```
    = 923 failed, 100684 passed, 2071 skipped, 786 xfailed, 312 xpassed, 20 errors in 1216.98s (0:20:16) =
    ```
    galipremsagar committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    c51444f View commit details
    Browse the repository at this point in the history
  2. Fix copy creation of a columnAccessor (#14551)

    This PR fixes a copy creation in ColumnAccessor by properly passing the rangeindex and label_dtype to it's newly constructed object.
    galipremsagar committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    e04b88b View commit details
    Browse the repository at this point in the history
  3. Fix to_pandas calls (#14552)

    This PR removes nullable=True in two pytests as we error when nullable is passed when there is decimal / list / struct data.
    galipremsagar committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    29b3ac8 View commit details
    Browse the repository at this point in the history
  4. Add missing is_categorical_dtype to cudf.api.types namespace (#14555

    )
    
    This PR adds back cudf.api.types.is_categorical that was missing due to a bad merge.
    galipremsagar committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    19952eb View commit details
    Browse the repository at this point in the history
  5. Fix name in Index.difference (#14556)

    This PR fixes result names for Index.difference in some early exit scenarios.
    galipremsagar committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    ac07b3d View commit details
    Browse the repository at this point in the history
  6. Filter deprecation warning in ffill and bfill APIs (#14554)

    This PR doesn't let the fillna warnings propagate to the user when ffill and bfill APIs are invoked.
    galipremsagar committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    2bdd8b8 View commit details
    Browse the repository at this point in the history
  7. Fix typo in value_counts (#14550)

    This PR fixes the return type of Series.value_counts to return int64, correcting a typo that was int34.
    galipremsagar committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    a068b10 View commit details
    Browse the repository at this point in the history
  8. Enforce Index.to_frame deprecations (#14553)

    This PR enforces deprecations of Index.to_frame and updates pytests related to this API.
    galipremsagar committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    ccfbe71 View commit details
    Browse the repository at this point in the history

Commits on Dec 6, 2023

  1. Deprecate DataFrame.applymap and use map instead (#14579)

    Pandas 2.1.0 deprecated DataFrame.applymap, This PR deprecated applymap and introduces map to be used as the new alternative API.
    galipremsagar committed Dec 6, 2023
    Configuration menu
    Copy the full SHA
    9b478b0 View commit details
    Browse the repository at this point in the history

Commits on Dec 7, 2023

  1. Deprecate first and last (#14583)

    This PR deprecates first and last APIs to bring parity with pandas, where these APIs were deprecated starting 2.1.0
    galipremsagar committed Dec 7, 2023
    Configuration menu
    Copy the full SHA
    0e83e20 View commit details
    Browse the repository at this point in the history

Commits on Dec 13, 2023

  1. Fix CategoricalDtype docstring (#14622)

    This PR fixes the docstring in CategoricalDtype where the repr has added a new field (categories_dtype).
    
    This PR fixes 2 doctest failures.
    galipremsagar committed Dec 13, 2023
    Configuration menu
    Copy the full SHA
    5f3ecd6 View commit details
    Browse the repository at this point in the history
  2. Fix DataFrame.sort_index when a index is a MultiIndex (#14621)

    This PR fixes sorting of a MultiIndex by removing an existing hard-coded na_position value that was based on ascending flag, essentially ignoring the user-passed parameter.
    
    On pandas_2.0_feature_branch:
    
    = 501 failed, 101106 passed, 2071 skipped, 786 xfailed, 312 xpassed, 20 errors in 1234.91s (0:20:34) =
    This PR:
    
    = 405 failed, 101034 passed, 2071 skipped, 954 xfailed, 312 xpassed, 20 errors in 1124.69s (0:18:44) =
    galipremsagar committed Dec 13, 2023
    Configuration menu
    Copy the full SHA
    72221b3 View commit details
    Browse the repository at this point in the history
  3. Deprecate reading literal string in cudf.read_json (#14619)

    This PR deprecates reading literal strings in read_json, instead users will need to pass StringIO for these cases to silence the warning.
    
    This change is to match: pandas-dev/pandas#53409
    
    On pandas_2.0_feature_branch:
    
    = 501 failed, 101106 passed, 2071 skipped, 786 xfailed, 312 xpassed, 20 errors in 1234.91s (0:20:34) =
    This PR:
    
    = 426 failed, 101181 passed, 2091 skipped, 786 xfailed, 312 xpassed in 1126.93s (0:18:46) =
    galipremsagar committed Dec 13, 2023
    Configuration menu
    Copy the full SHA
    d7dc16e View commit details
    Browse the repository at this point in the history

Commits on Dec 16, 2023

  1. Preserve column ordering in DataFrame.stack (#14626)

    This PR preserves original column ordering in DataFrame.stack
    
    On pandas_2.0_feature_branch:
    
    = 328 failed, 101111 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1113.40s (0:18:33) =
    This PR:
    
    = 316 failed, 101123 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1123.65s (0:18:43) =
    galipremsagar committed Dec 16, 2023
    Configuration menu
    Copy the full SHA
    eea5f10 View commit details
    Browse the repository at this point in the history

Commits on Dec 18, 2023

  1. Change is_.._dtype deprecations to DeprecationWarning instead of …

    …`FutureWarning` (#14617)
    
    This PR changes all FutureWarning's to DeprecationWarning's to match with pandas: pandas-dev/pandas#55703
    
    On pandas_2.0_feature_branch:
    
    = 501 failed, 101106 passed, 2071 skipped, 786 xfailed, 312 xpassed, 20 errors in 1234.91s (0:20:34) =
    This PR:
    
    = 445 failed, 101162 passed, 2071 skipped, 786 xfailed, 312 xpassed, 20 errors in 1216.79s (0:20:16) =
    galipremsagar committed Dec 18, 2023
    Configuration menu
    Copy the full SHA
    bc5584b View commit details
    Browse the repository at this point in the history

Commits on Dec 19, 2023

  1. Version dataframe.mode pytest (#14650)

    This PR versions the xfail properly to not fail in version of pandas where this bug is fixed.
    galipremsagar committed Dec 19, 2023
    Configuration menu
    Copy the full SHA
    194e487 View commit details
    Browse the repository at this point in the history
  2. Filter ufunc related warnings in pytests (#14652)

    This PR ignores ufunc runtime warnings that show up in eval API and setitem deprecation warnings.
    
    On pandas_2.0_feature_branch:
    
    = 260 failed, 101179 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1104.58s (0:18:24) =
    This PR:
    
    = 211 failed, 101228 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1095.49s (0:18:15) =
    galipremsagar committed Dec 19, 2023
    Configuration menu
    Copy the full SHA
    f736d72 View commit details
    Browse the repository at this point in the history
  3. Deprecate positional access for label based indexes in Series.__getit…

    …em__ (#14654)
    
    This PR deprecates positional access in `Series.__getitem__` when a label-based index is present.
    
    xref: pandas-dev/pandas#53201
    
    On `pandas_2.0_feature_branch`:
    
    ```
    = 260 failed, 101179 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1104.58s (0:18:24) =
    ```
    
    This PR:
    ```
    = 248 failed, 101190 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1105.78s (0:18:25) =
    ```
    galipremsagar committed Dec 19, 2023
    Configuration menu
    Copy the full SHA
    4539f4f View commit details
    Browse the repository at this point in the history

Commits on Dec 27, 2023

  1. Deprecate method in interpolate and calculation on object dtype (

    …#14667)
    
    This PR:
    
    - [x] Deprecates `method` in `interpolate`.
    - [x] Deprecates performing `interpolate` on string columns.
    
    On `pandas_2.0_feature_branch`:
    ```
    = 198 failed, 101241 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1098.81s (0:18:18) =
    ```
    
    This PR:
    ```
    = 187 failed, 101252 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1090.48s (0:18:10) =
    ```
    galipremsagar committed Dec 27, 2023
    Configuration menu
    Copy the full SHA
    c1411b6 View commit details
    Browse the repository at this point in the history
  2. Add more validation to MultiIndex.to_frame (#14671)

    This PR adds validation to `name` inputs in `MultiIndex.to_frame` API.
    
    On `pandas_2.0_feature_branch`:
    
    ```
    = 198 failed, 101241 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1098.81s (0:18:18) =
    ```
    
    This PR:
    ```
    = 180 failed, 101247 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1025.07s (0:17:05) =
    ```
    galipremsagar committed Dec 27, 2023
    Configuration menu
    Copy the full SHA
    2b9ab53 View commit details
    Browse the repository at this point in the history
  3. Deprecate ignoring empty objects in concat (#14672)

    This PR deprecates ignoring `empty` objects for dtype calculation in `concat`.
    
    On `pandas_2.0_feature_branch`:
    ```
    = 198 failed, 101241 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1098.81s (0:18:18) =
    ```
    
    This PR:
    ```
    = 179 failed, 101260 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1225.23s (0:20:25) =
    ```
    
    ## Checklist
    - [x] I am familiar with the [Contributing Guidelines](https://github.com/rapidsai/cudf/blob/HEAD/CONTRIBUTING.md).
    - [x] New or existing tests cover these changes.
    - [x] The documentation is up to date with these changes.
    galipremsagar committed Dec 27, 2023
    Configuration menu
    Copy the full SHA
    46ef148 View commit details
    Browse the repository at this point in the history
  4. Deprecate setting of incompatible dtypes to an existing column (#14668)

    This PR deprecates the setting of a value that is not of same dtype as that of a column.
    
    
    On `pandas_2.0_feature_branch`:
    ```
    = 198 failed, 101241 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1098.81s (0:18:18) =
    ```
    
    This PR:
    ```
    = 176 failed, 101263 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1096.08s (0:18:16) =
    ```
    galipremsagar committed Dec 27, 2023
    Configuration menu
    Copy the full SHA
    e218f5c View commit details
    Browse the repository at this point in the history
  5. Fix datetime related assertions and warnings in pytests (#14673)

    This PR fixes all `datetime` related pytests by properly handling their assertions with bug-fixes made in pandas-2.x and filtering newly introduced warnings where not necessary to propagate to the end-user.
    
    On `pandas_2.0_feature_branch`:
    ```
    = 198 failed, 101241 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1098.81s (0:18:18) =
    ```
    
    This PR:
    ```
    = 161 failed, 101280 passed, 2091 skipped, 952 xfailed, 312 xpassed in 1106.29s (0:18:26) =
    ```
    galipremsagar committed Dec 27, 2023
    Configuration menu
    Copy the full SHA
    fd1f986 View commit details
    Browse the repository at this point in the history

Commits on Dec 29, 2023

  1. Fix pytest condition to include more warning scenarios (#14680)

    This PR fixes calculation of cond variable in test_corr1d which will include more cases for warnings. This change fixes, 9 pytest failures.
    galipremsagar committed Dec 29, 2023
    Configuration menu
    Copy the full SHA
    cb09a39 View commit details
    Browse the repository at this point in the history
  2. Sort Index.difference & union results for early exit scenarios (#…

    …14681)
    
    This PR sorts results in `Index.difference` & `union` in the early exit scenarios similar to: pandas-dev/pandas#51346
    
    On `pandas_2.0_feature_branch`:
    ```
    = 110 failed, 101331 passed, 2091 skipped, 952 xfailed, 312 xpassed in 1064.30s (0:17:44) =
    ```
    
    This PR:
    ```
    = 87 failed, 101354 passed, 2091 skipped, 952 xfailed, 312 xpassed in 1004.34s (0:16:44) =
    ```
    galipremsagar committed Dec 29, 2023
    Configuration menu
    Copy the full SHA
    1c54354 View commit details
    Browse the repository at this point in the history

Commits on Dec 30, 2023

  1. Fix column parameter handling in read_orc (#14666)

    When `columns=[]` for `read_orc`, pandas actually only drops the column and preserves the `index` while reading an orc file. Fixing the `cudf` behavior to match the same.
    
    On `pandas_2.0_feature_branch`:
    ```
    = 198 failed, 101241 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1098.81s (0:18:18) =
    ```
    
    This PR:
    ```
    = 185 failed, 101254 passed, 2091 skipped, 954 xfailed, 312 xpassed in 1088.47s (0:18:08) =
    ```
    galipremsagar committed Dec 30, 2023
    Configuration menu
    Copy the full SHA
    8a8b627 View commit details
    Browse the repository at this point in the history
  2. Handle missing warning assertions for concat pytests (#14682)

    This PR adds warning assertions that were missed in #14672
    
    On `pandas_2.0_feature_branch`:
    ```
    = 110 failed, 101331 passed, 2091 skipped, 952 xfailed, 312 xpassed in 1064.30s (0:17:44) =
    ```
    This PR:
    ```
    = 105 failed, 101336 passed, 2091 skipped, 952 xfailed, 312 xpassed in 1068.90s (0:17:48) =
    ```
    galipremsagar committed Dec 30, 2023
    Configuration menu
    Copy the full SHA
    3344377 View commit details
    Browse the repository at this point in the history
  3. Fix a typo error in where pytest (#14683)

    This PR fixes a typo in isinstance check, thus fixing 6 pytest failures.
    galipremsagar committed Dec 30, 2023
    Configuration menu
    Copy the full SHA
    eabba98 View commit details
    Browse the repository at this point in the history

Commits on Jan 8, 2024

  1. Change empty column dtype to string from float64 (#14691)

    This PR enforces deprecation where an empty column now defaults to `str` dtype rather than `float64` dtype.
    On `pandas_2.0_feature_branch`:
    ```
    = 68 failed, 101373 passed, 2091 skipped, 952 xfailed, 312 xpassed in 1000.21s (0:16:40) =
    ```
    
    This PR:
    ```
    = 65 failed, 101364 passed, 2091 skipped, 964 xfailed, 312 xpassed in 1054.55s (0:17:34) =
    ```
    galipremsagar committed Jan 8, 2024
    Configuration menu
    Copy the full SHA
    bcdeb19 View commit details
    Browse the repository at this point in the history

Commits on Jan 20, 2024

  1. Fix merge

    galipremsagar committed Jan 20, 2024
    Configuration menu
    Copy the full SHA
    50cf007 View commit details
    Browse the repository at this point in the history

Commits on Jan 23, 2024

  1. Preserve empty index types in parquet reader (#14818)

    This PR preserves types of empty column index objects whose metadata is already present in the parquet file.
    
    This PR:
    
    = 107 failed, 101869 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1265.57s (0:21:05) =
    On pandas_2.0_feature_branch:
    
    = 111 failed, 101865 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1292.26s (0:21:32) =
    galipremsagar committed Jan 23, 2024
    Configuration menu
    Copy the full SHA
    6bcaf44 View commit details
    Browse the repository at this point in the history

Commits on Jan 24, 2024

  1. Fix Dataframe.agg to not return incorrect dtypes (#14851)

    This PR fixes `DataFrame.agg` API where the actual dataframe was being casted to incorrect dtype (object dtype) and then the operations were being performed. This PR adds strict checks at two places.
    
    This PR:
    ```
    = 95 failed, 101829 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1479.83s (0:24:39) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 111 failed, 101865 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1265.57s (0:21:05) =
    ```
    galipremsagar committed Jan 24, 2024
    Configuration menu
    Copy the full SHA
    bdbf0bc View commit details
    Browse the repository at this point in the history
  2. Catch warnings in reductions (#14852)

    This PR validates the warnings generated by certain reduction ops.
    galipremsagar committed Jan 24, 2024
    Configuration menu
    Copy the full SHA
    28b1814 View commit details
    Browse the repository at this point in the history
  3. Catch groupby jit apply warnings (#14858)

    This PR catches `RuntimeWarning`'s in jit groupby pytests.
    
    This PR:
    ```
    = 61 failed, 101866 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1446.19s (0:24:06) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 91 failed, 101836 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1348.36s (0:22:28) =
    ```
    galipremsagar committed Jan 24, 2024
    Configuration menu
    Copy the full SHA
    df5c78b View commit details
    Browse the repository at this point in the history
  4. Fix all reduction pytest failures (#14869)

    This PR fixes all the remaining one-off reduction pytest failures.
    
    This PR: 
    ```
    = 54 failed, 101872 passed, 2091 skipped, 977 xfailed, 312 xpassed in 1432.99s (0:23:52) =
    ```
    
    On `pandas_2.0_feature_branch`:
    ```
    = 61 failed, 101866 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1446.19s (0:24:06) =
    ```
    galipremsagar committed Jan 24, 2024
    Configuration menu
    Copy the full SHA
    8784551 View commit details
    Browse the repository at this point in the history

Commits on Jan 25, 2024

  1. Fix empty groupby return types (#14871)

    Closes #14862
    
    This PR fixes the errors in #14862 by ensuring we match the pandas return type when doing grouped count, size, idxmax, idxmin.
    
    ---------
    
    Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>
    shwina and shwina committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    d7f9688 View commit details
    Browse the repository at this point in the history
  2. Support kurt/skew(axis=None) for multi columns/low row count (#14874)

    closes #14866
    
    @galipremsagar it appears the linked failing test in the issue test_reductions_axis_none_warning expected FutureWarning from these calls. Should they be expected for kurt/skew too?
    mroeschke committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    8a25f70 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    7bf4376 View commit details
    Browse the repository at this point in the history
  4. Preserve columns dtype in dataframe constructor (#14878)

    This PR preserves columns dtype in DataFrame constructor.
    
    This PR:
    
    = 52 failed, 101872 passed, 2091 skipped, 977 xfailed, 312 xpassed in 1188.72s (0:19:48) =
    On pandas_2.0_feature_branch:
    
    = 61 failed, 101866 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1446.19s (0:24:06) =
    galipremsagar committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    d83f12e View commit details
    Browse the repository at this point in the history
  5. Disable style check

    vyasr committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    8db3b70 View commit details
    Browse the repository at this point in the history
  6. Pin pandas

    vyasr committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    4b5b8af View commit details
    Browse the repository at this point in the history
  7. Disable some more jobs

    vyasr committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    d2cc4db View commit details
    Browse the repository at this point in the history
  8. Actually remove the jobs

    vyasr committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    32e0982 View commit details
    Browse the repository at this point in the history

Commits on Jan 26, 2024

  1. Unpin numpy<1.25

    mroeschke committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    302c876 View commit details
    Browse the repository at this point in the history
  2. Merge pull request #14890 from vyasr/feat/enable_ci

    Enable CI in pandas 2.0 branch
    vyasr committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    0b79d70 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    2b07cd1 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    80090d7 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    481ea9c View commit details
    Browse the repository at this point in the history
  6. FIx more miscellaneous pytests failures (#14895)

    This PR fixes multiple issues:
    
     Enables corr and cov for Datetime and Timedelta types.
     Properly disables all and any for StringColumn.
     Preserves groupby categorical index ordering.
     Catches FutureWarnings in pytests.
    galipremsagar committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    4444909 View commit details
    Browse the repository at this point in the history
  7. Fix some pytests (#14894)

    np.product call I think will be redundant with the existing params, np.var call adjusted to what was tested before
    matmul failure existed upstream in pandas
    Snuck in a clean up files leftover by a parquet test (found these leftover when running the test suite locally)
    mroeschke committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    23d189b View commit details
    Browse the repository at this point in the history
  8. Align datetimeindex slicing behaviour with Pandas 2.x (#14887)

    * Align with pandas slicing behaviour for non-monotonic datetime index
    
    * Not a TODO
    
    ---------
    
    Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>
    shwina and shwina committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    7df96e7 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    6368c47 View commit details
    Browse the repository at this point in the history
  10. Deprecations in replace

    galipremsagar committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    87a4d12 View commit details
    Browse the repository at this point in the history
  11. Parquet Writer: Write non-string columns pandas-compatibility mode …

    …only (#14899)
    
    This PR enables writing of non-string columns in parquet writer only in pandas-compatibility mode.
    
    This PR:
    ```
    = 8 failed, 102249 passed, 2090 skipped, 976 xfailed, 312 xpassed in 1363.59s (0:22:43) =
    ```
    On `pandas_2.0_feature_branch`:
    ```
    = 9 failed, 102247 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1336.47s (0:22:16) =
    ```
    
    
    Co-authored-by: Lawrence Mitchell <wence@gmx.li>
    
    ---------
    
    Co-authored-by: Lawrence Mitchell <wence@gmx.li>
    galipremsagar and wence- committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    7d3e72a View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    b662093 View commit details
    Browse the repository at this point in the history
  13. Use sets for argument checking.

    bdice committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    b61b39d View commit details
    Browse the repository at this point in the history
  14. Merge pull request #14900 from galipremsagar/replace_deprecations

    Deprecate `method` in `replace` and warn for a specific combination of `value` and `to_replace`
    vyasr committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    1256825 View commit details
    Browse the repository at this point in the history
  15. Fix usage

    vyasr committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    78eff48 View commit details
    Browse the repository at this point in the history
  16. Merge pull request #14892 from mroeschke/deps/numpy/unpin

    Unpin numpy<1.25
    mroeschke committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    2ff132e View commit details
    Browse the repository at this point in the history
  17. Merge pull request #14903 from vyasr/fix/revert_incorrect_set_usage

    Fix incorrect set usage
    vyasr committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    cb1889b View commit details
    Browse the repository at this point in the history

Commits on Jan 27, 2024

  1. Remove pandas Index subclasses in cudf pandas (#14902)

    We won't have to proxy these types anymore since they are removed in pandas 2.0
    
    Also removed references to the cudf Index subclasses that are removed in this branch
    
    Authors:
      - Matthew Roeschke (https://github.com/mroeschke)
    
    Approvers:
      - Ashwin Srinath (https://github.com/shwina)
    
    URL: #14902
    mroeschke committed Jan 27, 2024
    Configuration menu
    Copy the full SHA
    5618d3d View commit details
    Browse the repository at this point in the history

Commits on Jan 28, 2024

  1. Allow any and all only for all-NA and empty string columns (#14898

    )
    
    This PR allows any and all for all-NA string columns and string columns that have size 0. This is an essential workaround for time-being because any and all aren't natively supported for string types in libcudf and without these workarounds, multiple places in the reduction APIs will need if/elif checks which will make it harder to maintain.
    
    This PR:
    
    = 5 failed, 102252 passed, 2090 skipped, 976 xfailed, 312 xpassed in 1375.59s (0:22:55) =
    On pandas_2.0_feature_branch:
    
    = 9 failed, 102247 passed, 2091 skipped, 976 xfailed, 312 xpassed in 1336.47s (0:22:16) =
    galipremsagar committed Jan 28, 2024
    Configuration menu
    Copy the full SHA
    d8df8e4 View commit details
    Browse the repository at this point in the history

Commits on Jan 29, 2024

  1. Prevent converting strings to arrow strings in dask_cudf pytests (#…

    …14914)
    
    dask is natively converting all object types to arrow[string] types if proper pyarrow dependency is detected. This is being done in assert_eq API. We will need a change in cudf and dask upstream to be able to support this kind of conversion. I'm coming up with a solution in 24.04 dev cycle, but in the interest of shipping pandas-2.x I'm feeling confident to disable this auto-conversion by setting the dataframe.convert-string dask config to False where necessary.
    galipremsagar committed Jan 29, 2024
    Configuration menu
    Copy the full SHA
    9fa9dc5 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f7b0bf6 View commit details
    Browse the repository at this point in the history
  3. Enable full CI

    galipremsagar committed Jan 29, 2024
    Configuration menu
    Copy the full SHA
    784fe95 View commit details
    Browse the repository at this point in the history
  4. Fix spacings

    galipremsagar committed Jan 29, 2024
    Configuration menu
    Copy the full SHA
    51e42c1 View commit details
    Browse the repository at this point in the history
  5. Update pr.yaml

    galipremsagar committed Jan 29, 2024
    Configuration menu
    Copy the full SHA
    eae873e View commit details
    Browse the repository at this point in the history
  6. Fix style issues in 2.0 feature branch (#14918)

    This PR fixes all style issues in pandas-2.0 feature branch
    galipremsagar committed Jan 29, 2024
    Configuration menu
    Copy the full SHA
    dbf08cb View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    bf49a66 View commit details
    Browse the repository at this point in the history
  8. Remove gated xfails (#14905)

    This removes xpassing tests from the test output.
    vyasr committed Jan 29, 2024
    Configuration menu
    Copy the full SHA
    e74fe0a View commit details
    Browse the repository at this point in the history
  9. Add Groupby.indices property and deprecate obj in get_group (#1…

    …4912)
    
    This PR:
    
     Introduces Groupby.indices property.
     Deprecates obj in Groupby.get_group
    galipremsagar committed Jan 29, 2024
    Configuration menu
    Copy the full SHA
    f69ae1d View commit details
    Browse the repository at this point in the history
  10. Change pandas version range (#14919)

    This PR pins pandas version range from 2.0 through 2.1.4
    galipremsagar committed Jan 29, 2024
    Configuration menu
    Copy the full SHA
    fc790ab View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    2a2c9c8 View commit details
    Browse the repository at this point in the history
  12. Fix custreamz pytests to test on float64 types (#14925)

    This PR passes types to empty dataframe construction because reductions were being performed on float64 types and now empty column default type is object.
    galipremsagar committed Jan 29, 2024
    Configuration menu
    Copy the full SHA
    5abe6b5 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    eb957d9 View commit details
    Browse the repository at this point in the history

Commits on Jan 30, 2024

  1. Configuration menu
    Copy the full SHA
    7f7e237 View commit details
    Browse the repository at this point in the history
  2. Remove pandas 1.3, 1.4 checks (#14927)

    Removes pandas 1.3, 1.4 checks in unit tests
    
    Authors:
      - Matthew Roeschke (https://github.com/mroeschke)
    
    Approvers:
      - GALI PREM SAGAR (https://github.com/galipremsagar)
      - Bradley Dice (https://github.com/bdice)
    
    URL: #14927
    mroeschke committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    c635335 View commit details
    Browse the repository at this point in the history
  3. Apply suggestions from code review

    Co-authored-by: Bradley Dice <bdice@bradleydice.com>
    Co-authored-by: Vyas Ramasubramani <vyasr@nvidia.com>
    3 people committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    adcd7e9 View commit details
    Browse the repository at this point in the history
  4. Allow hash_array to be findable in pandas 2.0; add workaround for tes…

    …t_resample for cudf.pandas (#14908)
    
    Fixes new failure in test_hash_array. Open to feedback on a better approach.
    
    The main issue is that some public methods are defined under __getattr__ with no __dir__ to find them (which we rely on for module population)
    mroeschke committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    86a4068 View commit details
    Browse the repository at this point in the history
  5. Remove pandas 1.5 checks (#14928)

    Remove pandas 1.5 checks mostly in unit tests.
    mroeschke committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    92b6472 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    601ce8f View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    132978f View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    30f873d View commit details
    Browse the repository at this point in the history
  9. Fix custreamz pytests to test on float64 types (#14934)

    This PR passes types to empty dataframe construction because reductions were being performed on float64 types and now empty column default type is object.
    galipremsagar committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    2b05b59 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    2e30753 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    1937252 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    6d07cc2 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    71d87d5 View commit details
    Browse the repository at this point in the history
  14. Revert "Add back reftarget change for cudf.Index"

    This reverts commit 71d87d5.
    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    3438af0 View commit details
    Browse the repository at this point in the history
  15. Move abs to IndexedFrame

    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    ffa473e View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    abcd15d View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    50d287f View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    0013faa View commit details
    Browse the repository at this point in the history
  19. Move mask to IndexedFrame

    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    11ab9e8 View commit details
    Browse the repository at this point in the history
  20. Configuration menu
    Copy the full SHA
    2563b90 View commit details
    Browse the repository at this point in the history
  21. Configuration menu
    Copy the full SHA
    9716f52 View commit details
    Browse the repository at this point in the history
  22. Move rolling to IndexedFrame

    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    fdf31e3 View commit details
    Browse the repository at this point in the history
  23. Configuration menu
    Copy the full SHA
    0bcdb2d View commit details
    Browse the repository at this point in the history
  24. Move pipe to IndexedFrame

    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    ea7ebfb View commit details
    Browse the repository at this point in the history
  25. Move conversion functions

    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    7b0bcde View commit details
    Browse the repository at this point in the history
  26. Add missing methods to the docs

    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    28548f6 View commit details
    Browse the repository at this point in the history
  27. Configuration menu
    Copy the full SHA
    59af57d View commit details
    Browse the repository at this point in the history
  28. Revert "Move isnull (alias) to IndexedFrame"

    This reverts commit 50d287f.
    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    c6f5392 View commit details
    Browse the repository at this point in the history
  29. Revert "Move notnull (alias) to IndexedFrame"

    This reverts commit 0bcdb2d.
    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    6301538 View commit details
    Browse the repository at this point in the history
  30. Configuration menu
    Copy the full SHA
    a95bc6a View commit details
    Browse the repository at this point in the history
  31. Configuration menu
    Copy the full SHA
    4f0563d View commit details
    Browse the repository at this point in the history
  32. Address feedback

    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    07e9872 View commit details
    Browse the repository at this point in the history
  33. Merge pull request #14937 from vyasr/fix/doc_errors

    Fix documentation builds with pandas 2 changes
    vyasr committed Jan 30, 2024
    Configuration menu
    Copy the full SHA
    f281b90 View commit details
    Browse the repository at this point in the history