统计函数

size()和shape () 是numpy模块中才有的函数,

size(),计算数组和矩阵所有数据的个数 <br />      shape： 读取矩阵的长度

info: info() 是pandas中DataFrame的方法
index:s.index(sub: str, start?: int, end?: int): int#返回索引
columns:
计算：+-*/
value_counts:是一种查看表格某列中有多少个不同值的快捷方法，并计算每个不同值有在该列中个数, pandas下面的顶层函数，也可以作用在Series、DataFrame下

pd.value_counts(

 values,<br />       sort=True,        #是否排序，默认是要排序<br />       ascending=False,     #默认降序排列<br />       normalize=False,     #标准化、转化成百分比形式<br />       bins=None,    #可以自定义分组区间，默认是没有，但也可以自定义区间<br />       dropna=True,   #是否删除nan，默认删除

str.count(sub, start= 0,end=len(string))

Args	Annotations
sub	搜索的子字符串
start	字符串开始搜索的位置。默认为第一个字符,第一个字符索引值为0。
end	字符串中结束搜索的位置。字符中第一个字符的索引为 0。默认为字符串的最后一个位置。

list.count(obj)

Args	Annotations
obj	搜索的list

tali/head
dtypes

unique

for col in data.columns:<br />      print('{} unique element : {}'.format(col,data[col].unique()))

nunique
format
describe(include = “all”)
drop_duplicates

startswith()方法语法：

str.startswith(str, beg=0,end=len(string));<br />      str -- 检测的字符串。<br />      strbeg -- 可选参数用于设置字符串检测的起始位置。<br />      strend -- 可选参数用于设置字符串检测的结束位置。

map/apply/applymap

先看数据结构是DataFrame还是Series,Seriesj结构直接用map(),DataFrame结构的话再看是要按行还是按列进行操作来选择对应的函数即可~

capitalize（）首字母变大写，其他字母变小写
set_index( )
reset_index()
del
merge():merge(df_2,df_1,how=’inner’,on=’userid’).

https://mp.weixin.qq.com/ssrc=11&timestamp=1583917168&ver=2209&signature=mxKK6pCenQiZVvWR11pIy3wQ3CdGjzc7E8hdojLXwB4zSFhNXViZKzrhCKcSOVGRV6JQ0FwLRZ4fq6ngA8iOK9UA3UN4JZYGCADCXeAzGGntFdjNwMZwLmgG5FRBUWP&new=1
rename
to_frame()
str.contains
read()/readlines()/readline()
with语句

统计函数

skew（）偏度
str（）
agg（）
div除

Python 常见问题及方案

python常用函数

统计函数