Removing Values That Repeat More Than 5 Times In Pandas Dataframe

December 02, 2023 Post a Comment

I am using pandas to work with csv files. I need to remove a few repeated values if they occur consecutively. I understand there is a duplicate function that removes any value th

Solution 1:

This should do it:

>> df = pd.Series([1,1,3,1,1,1,1,1,2])
>> df.groupby((df.shift() != df).cumsum())\
     .filter(lambdax: len(x) < 5)
01112382

Solution 2:

Showing how answer by elyase also works for DataFrame (not Series).

>> df = pd.DataFrame(np.array([[1,1,3,1,1,1,1,1,2]]).transpose(),columns = ["col"])
>> df.groupby((df["col"].shift() != df["col"]).cumsum()).filter(lambda x: len(x) < 5)
    col
 01112382

Baca Juga

Uneven Line Diagram Plotting And Change Of Date Format
Pandas Read Sql Integer Became Float
How To Use Shift() Within Apply() In Dataframe And Still Access Full Series?

Learn Python Tutorials

Removing Values That Repeat More Than 5 Times In Pandas Dataframe

Solution 1:

Solution 2:

Post a Comment for "Removing Values That Repeat More Than 5 Times In Pandas Dataframe"