First, import the data
Students.xlsx
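The loading step itself is not shown here, so the following is only a minimal sketch of how `data` might be created. It assumes that Students.xlsx sits in the working directory and has the columns ID, Name, Age and Score. The helper name `load_students` and the stand-in values for Age and Score are hypothetical; the fallback exists only so that the snippet runs without the workbook.

```python
from pathlib import Path

import pandas as pd


def load_students(path="Students.xlsx"):
    """Read the workbook if it exists; otherwise build a stand-in frame."""
    p = Path(path)
    if p.exists():
        # pd.read_excel needs an Excel engine such as openpyxl to be installed
        return pd.read_excel(p)
    # Hypothetical stand-in: same column names, made-up Age and Score values
    n = 20
    return pd.DataFrame({
        "ID": range(1, n + 1),
        "Name": [f"Student_{i:03d}" for i in range(1, n + 1)],
        "Age": [16 + i % 20 for i in range(n)],
        "Score": [60 + (i * 7) % 41 for i in range(n)],
    })


data = load_students()
print(type(data))
```

With the real file in place, the single call `pd.read_excel("Students.xlsx")` would be enough.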
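1. How to view a DataFrame's index, data types, and memory usage
data.info()  # view the DataFrame's index, data types, and memory usage
"""
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 20 entries, 0 to 19
Data columns (total 4 columns):
ID       20 non-null int64
Name     20 non-null object
Age      20 non-null int64
Score    20 non-null int64
dtypes: int64(3), object(1)
memory usage: 600.0+ bytes
"""
# This output tells us a lot: the DataFrame has 20 entries and 4 data columns, and we can see the data type of each column, the data types in use overall, and the memory usage.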
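Each piece of information that `info()` prints can also be read on its own. The sketch below uses a small stand-in frame built from the first three rows of the Students data, so it runs without the workbook. The trailing `+` in the `memory usage` line means the figure is a lower bound, because the contents of object columns are not counted by default.

```python
import pandas as pd

# Stand-in frame: the first three rows of the Students data
data = pd.DataFrame({
    "ID": [1, 2, 3],
    "Name": ["Student_001", "Student_002", "Student_003"],
    "Age": [16, 26, 33],
    "Score": [87, 92, 100],
})

print(data.index)                     # the row index
print(data.dtypes)                    # data type of each column
print(data.memory_usage(deep=True))   # bytes per column, counting string contents

# Ask info() to measure object columns fully instead of reporting a lower bound
data.info(memory_usage="deep")
```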
2. Viewing the contents of a DataFrame
- View the number of rows and columns
print(data.shape)
"""
(20, 4)
means the DataFrame has 20 rows and 4 columns
"""
- View the first n rows
print(data.head())  # view the first n rows; the default is the first five
"""
   ID         Name  Age  Score
0   1  Student_001   16     87
1   2  Student_002   26     92
2   3  Student_003   33    100
3   4  Student_004   27     73
4   5  Student_005   32     63
"""
- View the last n rows
print(data.tail())  # view the last n rows; the default is the last five
"""
    ID         Name  Age  Score
15  16  Student_016   30     96
16  17  Student_017   16     77
17  18  Student_018   35     87
18  19  Student_019   19     86
19  20  Student_020   20     94
"""
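Both `head` and `tail` take the number of rows as an argument. A small sketch, using a hypothetical 20-row stand-in frame with only the ID and Name columns:

```python
import pandas as pd

# Hypothetical stand-in: 20 rows with IDs 1..20
data = pd.DataFrame({
    "ID": range(1, 21),
    "Name": [f"Student_{i:03d}" for i in range(1, 21)],
})

first_three = data.head(3)   # rows at positions 0, 1, 2
last_two = data.tail(2)      # rows at positions 18, 19

print(first_three)
print(last_two)
```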
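- How to view rows in the middle
# data.iloc[a:b, c:d]  # select rows a through b-1 and columns c through d-1 (the end of each slice is excluded)
print(data.iloc[:, :2])  # select all rows and the first two columns (positions 0 and 1)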
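To make the slice boundaries concrete, here is a sketch that takes a block out of the middle of a 20-row frame. The frame is a hypothetical stand-in with made-up Age and Score values. Note that `iloc` works on positions and leaves out the end of each slice, like ordinary Python slicing.

```python
import pandas as pd

# Hypothetical stand-in: 20 rows, 4 columns, made-up Age and Score values
n = 20
data = pd.DataFrame({
    "ID": range(1, n + 1),
    "Name": [f"Student_{i:03d}" for i in range(1, n + 1)],
    "Age": [16 + i % 20 for i in range(n)],
    "Score": [60 + (i * 7) % 41 for i in range(n)],
})

# Rows at positions 8, 9, 10, 11 and columns at positions 0, 1
middle = data.iloc[8:12, :2]
print(middle)
```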
3. How to read the data correctly when the Excel sheet has empty rows and empty columns
import pandas as pd

# skiprows=3 skips the first three rows; usecols='C:F' reads only columns C through F
# A forward slash is used in the path so that the backslash is not treated as an escape character
books = pd.read_excel('tmp/Books.xlsx', skiprows=3, usecols='C:F')
print(books)
"""
     ID      Name  InStore  Date
0   NaN  Book_001      NaN   NaN
1   NaN  Book_002      NaN   NaN
2   NaN  Book_003      NaN   NaN
3   NaN  Book_004      NaN   NaN
4   NaN  Book_005      NaN   NaN
5   NaN  Book_006      NaN   NaN
6   NaN  Book_007      NaN   NaN
7   NaN  Book_008      NaN   NaN
8   NaN  Book_009      NaN   NaN
9   NaN  Book_010      NaN   NaN
10  NaN  Book_011      NaN   NaN
11  NaN  Book_012      NaN   NaN
12  NaN  Book_013      NaN   NaN
13  NaN  Book_014      NaN   NaN
14  NaN  Book_015      NaN   NaN
15  NaN  Book_016      NaN   NaN
16  NaN  Book_017      NaN   NaN
17  NaN  Book_018      NaN   NaN
18  NaN  Book_019      NaN   NaN
19  NaN  Book_020      NaN   NaN
"""
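To show what `skiprows=3` and `usecols='C:F'` select, the sketch below rebuilds a similar layout in memory and trims it by hand. It does not call `read_excel`, so it needs neither the workbook nor an Excel engine. The grid layout (three empty rows on top, two empty columns A and B on the left) is an assumption inferred from the arguments above, and only three book rows are included.

```python
import numpy as np
import pandas as pd

# Assumed raw sheet layout: 3 empty rows, then a header row and data in columns C..F
header = ["ID", "Name", "InStore", "Date"]
rows = [[np.nan, f"Book_{i:03d}", np.nan, np.nan] for i in range(1, 4)]

grid = [[np.nan] * 6 for _ in range(3)]          # three empty rows
grid.append([np.nan, np.nan] + header)           # header row, columns A and B empty
grid.extend([np.nan, np.nan] + r for r in rows)  # data rows
raw = pd.DataFrame(grid)

# Drop the first 3 rows (like skiprows=3) and keep positions 2..5 (like usecols='C:F')
body = raw.iloc[3:, 2:6]

# The first remaining row becomes the header, the rest is the data
books = body.iloc[1:].reset_index(drop=True)
books.columns = body.iloc[0].tolist()
print(books)
```

In the hand-trimmed frame, just as in the `read_excel` output, the columns that are empty in the sheet come back as NaN.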
