DataFrame.value_counts

DataFrame.value_counts(subset=None, normalize=False, sort=True, ascending=False)
返回一个包含DataFrame中唯一行个数的Series。

Parameters

subset 计算唯一组合时要使用的列
normalize True:返回比例;False:返回频率
sort 是否按频率排序
ascending 是否按升序排序
dropna 是否去掉包含NA值的行

Example

  1. import pandas as pd
  2. df = pd.DataFrame({'site':['google', 'baidu', 'wiki', 'google'],
  3. 'age':[18, 39, 22, 18],
  4. 'price': [1.0, 2.0, 3.0, 1.0],
  5. 'color': ['red', 'black', None, 'red']})
  6. df.value_counts(normalize=True)
  7. ------------------------------------------------------------------
  8. site age price color
  9. google 18 1.0 red 0.666667
  10. baidu 39 2.0 black 0.333333
  11. dtype: float64

Example

  1. import pandas as pd
  2. df = pd.DataFrame({'site':['google', 'baidu', 'wiki', 'google'],
  3. 'age':[18, 39, 22, 18],
  4. 'price': [1.0, 2.0, 3.0, 1.0],
  5. 'color': ['red', 'black', None, 'red']})
  6. df.value_counts(normalize=False)
  7. ---------------------------------------------------------------------
  8. site age price color
  9. google 18 1.0 red 2
  10. baidu 39 2.0 black 1
  11. dtype: int64

Example

  1. import pandas as pd
  2. df = pd.DataFrame({'site':['google', 'baidu', 'wiki', 'google'],
  3. 'age':[18, 39, 22, 18],
  4. 'price': [1.0, 2.0, 3.0, 1.0],
  5. 'color': ['red', 'black', None, 'red']})
  6. df.value_counts('site')
  7. ----------------------------------------------------------------
  8. site
  9. google 2
  10. wiki 1
  11. baidu 1
  12. dtype: int64