The “fillna” function in Pandas not only can replace missing values with a given constant value, like in this example:
import pandas as pd
import numpy as np
df = pd.DataFrame([[np.nan], [2], [np.nan], [0]])
df
data:image/s3,"s3://crabby-images/8adf3/8adf3aa9c250d28b0a93e378eef590a757cad546" alt="A dataframe with missing values"
df.fillna(47)
data:image/s3,"s3://crabby-images/eac2c/eac2c1b06075438c440c680ebb3fda90f8f90161" alt="Missing values replaced with a constant"
You can also replace a missing value with the next (or previous) value in the data frame!
df.fillna(method = "ffill")
data:image/s3,"s3://crabby-images/1a962/1a962bd5f439cd3e79beeaa029c4ebd44aeebb51" alt="Missing values filled with the previous existing value."
Note that the first value cannot be replaced because nothing is preceding it.
Want to build AI systems that actually work?
Download my expert-crafted GenAI Transformation Guide for Data Teams and discover how to properly measure AI performance, set up guardrails, and continuously improve your AI solutions like the pros.
You can also use the value of the next row to fill a missing value.
df.fillna(method = "bfill")
data:image/s3,"s3://crabby-images/c8ecd/c8ecd4daee6298a35aafe19ef9f70897796c474a" alt="Missing values filled with the next existing value."