To find the mean absolute deviation of a series or a column in a DataFrame in pandas, the easiest way is to use the pandas mad() function.

df["Column1"].mad()

When doing data analysis, the ability to compute different summary statistics, such as the mean or standard deviation of a variable, is very useful to help us understand the data. One such summary statistic which can be useful is the mean absolute deviation of a variable.

The mean absolute deviation of a variable is computed as the mean of absolute deviation of data points from their mean.

Finding the mean absolute deviation of columns or a Series using pandas is easy. We can use the pandas mad() function to find the mean absolute deviation of a column of numbers.

Let’s say we have the following DataFrame.

df = pd.DataFrame({'Name': ['Jim', 'Sally', 'Bob', 'Sue', 'Jill', 'Larry'],
                   'Weight': [160.20, 160.20, 209.45, 150.35, 187.52, 187.52],
                   'Height': [50.10, 68.94, 71.42, 48.56, 59.37, 63.42] })

print(df)
# Output: 
    Name  Weight  Height
0    Jim  160.20   50.10
1  Sally  160.20   68.94
2    Bob  209.45   71.42
3    Sue  150.35   48.56
4   Jill  187.52   59.37
5  Larry  187.52   63.42

To get the mean absolute deviation of all columns in our DataFrame, we can use the pandas mad() function on the DataFrame in the following Python code:

print(df.mad())

# Output:
Weight    18.956667
Height     7.625000
dtype: float64

If we only want to get the mean absolute deviation of the column “Height”, we can do so easily like in the following Python code:

print(df["Height"].mad())

# Output:
7.625

Hopefully this article has been helpful for you to understand how to find the mean absolute deviation of a variable within a column or Series using the pandas mad() function in Python.

Categorized in:

Python,

Last Update: February 26, 2024