Friday, November 18, 2022

How to Sort a Dataframe in PySpark

 Below are different ways to sort a dataframe:

  1. sort
    1. df.sort(df.ColumnName)
    2. df.sort(df.ColumnName.desc())
    3. df.sort(df.ColumnName.desc(), df.ColumnName2)
    4. df.sort(col("ColumnName"))
  2. orderBy
    1. df.orderBy(df.ColumnName)
    2. df.orderBy(df.ColumnName.desc())
    3. df.orderBy(df.ColumnName1.desc(), df.ColumnName2)
    4. df.orderBy(col("ColumnName"))

No comments:

Post a Comment