DataFrame.dropna

DataFrame.dropna(axis=0, how=’any’, thresh=None, subset=None, inplace=False)
于删除缺失数据。

Parameters

axis 0或index:删除包含缺失的行;1或columns:删除包含缺失值的列
how any:如果存在NA值,删除该行或列;all:如果所有值都是NA,删除该行或列
thresh 需要的非NA值个数
subset 删除行,需要包括的列的列表;删除列,需要包括的行的列表
inplace False:返回副本;True:就地执行操作并返回None

Example

  1. import pandas as pd
  2. df = pd.DataFrame({'site':['google', 'baidu', 'wiki'],
  3. 'age':[18, 39, 22],
  4. 'price': [None, 2.0, 3.0],
  5. 'color': [None, 'black', None]})
  6. df.dropna()
  7. -------------------------------------
  8. site age price color
  9. 1 baidu 39 2.0 black

Example

  1. import pandas as pd
  2. df = pd.DataFrame({'site':['google', 'baidu', 'wiki'],
  3. 'age':[18, 39, 22],
  4. 'price': [None, 2.0, 3.0],
  5. 'color': [None, 'black', None]})
  6. df.dropna(axis='columns')
  7. ---------------------------------------
  8. site age
  9. 0 google 18
  10. 1 baidu 39
  11. 2 wiki 22

Example

  1. import pandas as pd
  2. df = pd.DataFrame({'site':[None, 'baidu', 'wiki'],
  3. 'age':[None, 39, 22],
  4. 'price': [None, 2.0, 3.0],
  5. 'color': [None, 'black', None]})
  6. df.dropna(how='all')
  7. ---------------------------------------
  8. site age price color
  9. 1 baidu 39.0 2.0 black
  10. 2 wiki 22.0 3.0 None

Example

  1. import pandas as pd
  2. df = pd.DataFrame({'site':['google', 'baidu', 'wiki'],
  3. 'age':[18, 39, 22],
  4. 'price': [None, 2.0, 3.0],
  5. 'color': [None, 'black', None]})
  6. df.dropna(thresh=3)
  7. -------------------------------------
  8. site age price color
  9. 1 baidu 39 2.0 black
  10. 2 wiki 22 3.0 None

Example

  1. import pandas as pd
  2. df = pd.DataFrame({'site':['google', 'baidu', 'wiki'],
  3. 'age':[18, 39, 22],
  4. 'price': [None, 2.0, 3.0],
  5. 'color': [None, 'black', None]})
  6. df.dropna(subset=['age', 'price'])
  7. --------------------------------------------
  8. site age price color
  9. 1 baidu 39 2.0 black
  10. 2 wiki 22 3.0 None