In this video, I cover strategies for aggregating and merging similar or near-duplicate rows in a Python pandas DataFrame into a single row. This helps when the information fits comfortably in one row and you want to keep it while reducing the number of rows. I used this recently when working with a set of Twitter data from Kaggle. I also cover how to drop true duplicates, or drop similar rows outright if that suits your case better.
Written tutorial and source code: syntaxbytetutorials.com/how-t...
Chapters:
0:00 Introduction and Drop True Duplicates
2:45 Simple Aggregate Example
5:15 Complex Aggregate Example
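
For a quick feel of the three approaches mentioned above (dropping true duplicates, dropping similar rows, and merging similar rows into one), here is a minimal sketch. The data and the column names (user, text, likes) are made up for illustration and are not taken from the video or the Kaggle dataset; the aggregation choices (joining text, summing likes) are just one reasonable assumption.

```python
import pandas as pd

# Hypothetical tweet-like data; column names are illustrative only.
df = pd.DataFrame({
    "user": ["ann", "ann", "ann", "bob", "bob"],
    "text": ["hello", "hello", "hello again", "hi", "hi there"],
    "likes": [3, 3, 5, 1, 2],
})

# Drop true duplicates: rows that match in every column.
deduped = df.drop_duplicates()

# Drop similar rows: keep only the first row per user.
first_per_user = deduped.drop_duplicates(subset="user", keep="first")

# Merge similar rows: one row per user, joining the text and summing the likes.
merged = deduped.groupby("user", as_index=False).agg(
    text=("text", lambda s: " | ".join(s)),
    likes=("likes", "sum"),
)

print(merged)
```

Which aggregation fits each column depends on the data: sums or means for counts, joins or lists for text, first or last for values that should be identical anyway.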