An Introduction to Series in the Pandas Library

Author: 早睡覺多運動233. Posted under: Calligraphy. Date: 2020-08-04

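The Pandas library

pandas is a data analysis package built on NumPy. Its most commonly used data structures are Series (one-dimensional data) and DataFrame (two-dimensional data).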

import pandas as pd

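Series in Pandas

A Series is a labeled one-dimensional array that can hold integers, floats, strings, and arbitrary Python objects. The axis labels are collectively called the index.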

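Creating a Series

Series attributes

Adding, deleting, and modifying elements

Slicing

Indexing

Boolean indexing

Modifying the index

Data type conversion

Converting between data structures

Statistical functions

Counting functions

Sorting

Checking for missing data

Filling missing values

Duplicate data

Element-wise operations

Plotting example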

Creating a Series

Call the pd.Series function to create a Series:

pd.Series(data, index)

data supports three kinds of input:

1. A dictionary

pd.Series({'a': …, 'b': …})

2. A list

pd.Series([1, 2, 3], index=['a', 'b', 'c'])
a    1
b    2
c    3

3. A scalar

pd.Series(1)
0    1
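The three input kinds above can be sketched side by side. The values and the variable names (from_dict, from_list, from_scalar) are hypothetical and chosen only for illustration; the scalar case also passes an index to show that the scalar is repeated for every label.

```python
import pandas as pd

# Hypothetical values, chosen only for illustration.
from_dict = pd.Series({"a": 1, "b": 2})                  # dict keys become the index
from_list = pd.Series([1, 2, 3], index=["a", "b", "c"])  # explicit labels for a list
from_scalar = pd.Series(1, index=["a", "b"])             # the scalar is repeated per label

print(from_dict.index.tolist())   # ['a', 'b']
print(from_list["b"])             # 2
print(from_scalar.tolist())       # [1, 1]
```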

Series attributes

The values of a Series are stored as a NumPy array.

x = pd.Series([…, …])
x.dtype
dtype('int64')
x.index  # the index starts from 0 by default
RangeIndex(start=…, stop=…, step=…)
x.values
array([…, …], dtype=int64)
x.size
…
pd.Series([…, …]).shape
(…,)
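A minimal sketch of the attributes listed above, assuming a small Series of three hypothetical integers:

```python
import pandas as pd

x = pd.Series([10, 20, 30])  # hypothetical values

print(x.dtype)                   # int64
print(x.index)                   # RangeIndex(start=0, stop=3, step=1)
print(type(x.values).__name__)   # ndarray
print(x.size)                    # 3
print(x.shape)                   # (3,)
```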

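Adding, deleting, and modifying elements

x = pd.Series([…, …])
x
…
dtype: int64

1. Adding an element

x[…] = …
x
…
dtype: int64

2. Deleting an element

x.drop(…)  # does not change the original x; assign the result to keep it
…
dtype: int64

3. Modifying an element

x[…] = …
x
…
dtype: int64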
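The three operations can be sketched as follows. The data, the string labels, and the name y are assumptions made for this example; .loc is used here so that the assignments are unambiguously label-based.

```python
import pandas as pd

x = pd.Series([1, 2, 3], index=["a", "b", "c"])  # hypothetical data

x.loc["d"] = 4     # add: assigning to a new label appends an element
y = x.drop("a")    # delete: returns a new Series, x itself is unchanged
x.loc["b"] = 20    # modify: assigning to an existing label overwrites it

print(x.tolist())         # [1, 20, 3, 4]
print(y.index.tolist())   # ['b', 'c', 'd']
```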

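Slicing

Slicing a Series selects a range of the index and returns the corresponding values.

1. With an integer slice, the right endpoint is excluded, as in ordinary Python slicing

x = pd.Series(range(…, …))
x
…
dtype: int64
x[…:…]
…
dtype: int64

2. When the index consists of strings, the slice includes the right endpoint

x = pd.Series([…, …], index=['b', 'd', 'c', 'a'])
x['d':'a']
…
dtype: int64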
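The difference between the two endpoint rules can be sketched with explicit .iloc (position) and .loc (label) slices. The values are hypothetical; the labels follow the example above.

```python
import pandas as pd

x = pd.Series([10, 20, 30, 40], index=["b", "d", "c", "a"])  # hypothetical values

by_position = x.iloc[1:3]    # positions 1 and 2; position 3 is excluded
by_label = x.loc["d":"a"]    # labels 'd' through 'a'; 'a' is included

print(by_position.tolist())  # [20, 30]
print(by_label.tolist())     # [20, 30, 40]
```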

Indexing

x = pd.Series([…, …], index=['b', 'd', 'c', 'a', 'e'])
x
…
dtype: int64
x[['a', 'c']]
…
dtype: int64
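As a small illustration of label-based indexing, assuming hypothetical values on the same five labels: a single label returns one element, while a list of labels returns a Series in the requested order.

```python
import pandas as pd

x = pd.Series([1, 2, 3, 4, 5], index=["b", "d", "c", "a", "e"])  # hypothetical values

single = x["a"]            # one label returns a single element
several = x[["a", "c"]]    # a list of labels returns a Series, in the order requested

print(single)                   # 4
print(several.index.tolist())   # ['a', 'c']
print(several.tolist())         # [4, 3]
```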

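Boolean indexing

x = pd.Series([1, 2, 3, np.nan, 5], index=['b', 'd', 'c', 'a', 'e'])
print(x)
b    1.0
d    2.0
c    3.0
a    NaN
e    5.0
dtype: float64
print(x > 2)  # the comparison x > 2 is applied to the values
b    False
d    False
c     True
a    False
e     True
dtype: bool
print(x[x > 2])  # select by value
c    3.0
e    5.0
dtype: float64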

Modifying the index

x = pd.Series([… for … in range(…)], index=['a', 'b', 'c'])
print(x)
…
dtype: int64
x.index = ['c', 'd', 'e']
print(x)
…
dtype: int64
x = x.reset_index(drop=True)  # drop=True discards the old index instead of keeping it as a column
print(x)
…
dtype: int64
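A short sketch of both ways of changing the index, using hypothetical values. The names y and kept are introduced only for this example; the last line shows what happens without drop=True.

```python
import pandas as pd

x = pd.Series([0, 1, 2], index=["a", "b", "c"])  # hypothetical values

x.index = ["c", "d", "e"]        # relabel in place; the new list must have the same length
y = x.reset_index(drop=True)     # back to 0, 1, 2; the old labels are discarded
kept = x.reset_index()           # without drop=True the old labels become a column

print(x.index.tolist())          # ['c', 'd', 'e']
print(y.index.tolist())          # [0, 1, 2]
print(type(kept).__name__)       # DataFrame
```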

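Data type conversion

astype

x = pd.Series([…, …])
print(x)
…
dtype: int64
x = x.astype(float)  # written as np.float in older code; that alias was removed in NumPy 1.24
print(x)
1.0
2.0
3.0
dtype: float64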
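A minimal runnable sketch of astype with hypothetical values. It passes the built-in float, which pandas treats as float64, and also shows a conversion to strings.

```python
import pandas as pd

x = pd.Series([1, 2, 3])      # hypothetical values

as_float = x.astype(float)    # the built-in float maps to float64
as_text = x.astype(str)       # every element becomes a string

print(as_float.dtype)         # float64
print(as_float.tolist())      # [1.0, 2.0, 3.0]
print(as_text.tolist())       # ['1', '2', '3']
```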

Converting between data structures

Series -> ndarray/list/dict/DataFrame

x = pd.Series([…, …])
print(x.values)
[…]
print(x.to_numpy())
[…]
print(x.to_list())
[…, …]
print(x.to_dict())
{…: …, …: …, …: …}
print(x.to_frame())
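The conversions above can be sketched on a small labeled Series. The values, the labels, and the column name "value" are hypothetical.

```python
import pandas as pd

x = pd.Series([1, 2, 3], index=["a", "b", "c"])  # hypothetical data

arr = x.to_numpy()                # NumPy array of the values
lst = x.to_list()                 # plain Python list
dct = x.to_dict()                 # index labels become the dict keys
frame = x.to_frame(name="value")  # one-column DataFrame; "value" is an arbitrary column name

print(type(arr).__name__)   # ndarray
print(lst)                  # [1, 2, 3]
print(list(dct))            # ['a', 'b', 'c']
print(frame.shape)          # (3, 1)
```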

Statistical functions

x = pd.Series([…, …])
print(x.mode())  # the most frequent value; there can be several, so the result is a Series
…
dtype: int64
print(x.max())
…
print(x.mean())
2.5
print(x.median())
1.5
print(x.std())
1.0488088481701516
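As an illustration of these functions, here is a sketch on hypothetical data (not necessarily the values used above). Note that std() computes the sample standard deviation (ddof=1) by default.

```python
import pandas as pd

x = pd.Series([1, 2, 2, 3, 3, 4])  # hypothetical values

modes = x.mode()          # both 2 and 3 occur twice, so two modes are returned

print(modes.tolist())     # [2, 3]
print(x.max())            # 4
print(x.mean())           # 2.5
print(x.median())         # 2.5
print(round(x.std(), 4))  # 1.0488
```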

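Counting functions

x = pd.Series([…, …])
print(x.count())  # number of non-null elements
…
print(x.value_counts())  # occurrences of each value, in descending order by default; pass ascending=True for ascending order
…
dtype: int64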
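A small sketch of both counting functions, assuming hypothetical data that contains one missing value so that the effect on count() is visible:

```python
import numpy as np
import pandas as pd

x = pd.Series([1, 1, 2, np.nan])  # hypothetical values, one of them missing

counts = x.value_counts()                   # most frequent value first, NaN ignored
ascending = x.value_counts(ascending=True)  # least frequent value first

print(x.count())                # 3
print(counts.index.tolist())    # [1.0, 2.0]
print(counts.tolist())          # [2, 1]
print(ascending.tolist())       # [1, 2]
```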

Sorting

x = pd.Series([…, …], index=['b', 'a', 'c', 'c'])
print(x)
…
dtype: int64
print(x.sort_index(ascending=False))
…
dtype: int64
print(x.sort_values())
…
dtype: int64
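Sorting by label and by value can be sketched as below. The data is hypothetical and uses unique labels so that the resulting order is unambiguous; both methods return a new Series and leave x as it was.

```python
import pandas as pd

x = pd.Series([3, 1, 2, 4], index=["b", "a", "c", "d"])  # hypothetical data

by_index = x.sort_index(ascending=False)  # labels in descending order
by_value = x.sort_values()                # values in ascending order (the default)

print(by_index.index.tolist())  # ['d', 'c', 'b', 'a']
print(by_value.tolist())        # [1, 2, 3, 4]
print(x.tolist())               # [3, 1, 2, 4]
```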

Checking for missing data

x = pd.Series([…, …, np.nan, …])
print(x[x.isnull()])
…    NaN
dtype: float64

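Filling missing values

1. method=None: fill missing values with a specified value (this is the default)

x = pd.Series([1, 2, np.nan, 3])
print(x.fillna(9))
0    1.0
1    2.0
2    9.0
3    3.0
dtype: float64

2. method='pad'/'ffill': fill each missing value with the previous non-missing value

print(x.fillna(method='ffill'))
0    1.0
1    2.0
2    2.0
3    3.0
dtype: float64

3. method='backfill'/'bfill': fill each missing value with the next non-missing value

print(x.fillna(method='backfill'))
0    1.0
1    2.0
2    3.0
3    3.0
dtype: float64

In pandas 2.1 and later the method argument of fillna is deprecated; x.ffill() and x.bfill() give the same results.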
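The detection and filling steps can be combined into one runnable sketch. It uses the ffill() and bfill() methods rather than the method argument of fillna, since that argument is deprecated in recent pandas releases; the data is a small assumed Series with a single gap.

```python
import numpy as np
import pandas as pd

x = pd.Series([1, 2, np.nan, 3])  # small assumed example with one gap

print(x.isnull().tolist())    # [False, False, True, False]
print(x.fillna(9).tolist())   # [1.0, 2.0, 9.0, 3.0]
print(x.ffill().tolist())     # [1.0, 2.0, 2.0, 3.0]
print(x.bfill().tolist())     # [1.0, 2.0, 3.0, 3.0]
```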

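Duplicate data

x = pd.Series([…, …])
print(x.unique())
[…]
print(x[x.duplicated()])
…
dtype: int64
print(x[x.duplicated()])
…
dtype: int64
print(x.drop_duplicates(keep=False, inplace=False))
# keep='first' (the default) keeps the first occurrence of each duplicated value.
# keep='last' keeps the last occurrence, and keep=False removes every duplicated value.
# inplace=True removes the duplicates from the original object directly; the default, False, returns a copy.
…
dtype: int64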
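The effect of each keep option is easier to see on concrete data. The following sketch uses hypothetical values in which 2 appears twice and 3 appears three times.

```python
import pandas as pd

x = pd.Series([1, 2, 2, 3, 3, 3])  # hypothetical values

print(x.unique().tolist())                            # [1, 2, 3]
print(x[x.duplicated()].tolist())                     # [2, 3, 3]
print(x.drop_duplicates().tolist())                   # [1, 2, 3]
print(x.drop_duplicates(keep="last").index.tolist())  # [0, 2, 5]
print(x.drop_duplicates(keep=False).tolist())         # [1]
```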

Element-wise operations

apply(func, convert_dtype=True, args=(), **kwds)

x = pd.Series([…, …])
def num_nap(…, bias):
    return … bias
print(x.apply(num_nap, args=(…,)))
…
dtype: int64
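A minimal sketch of apply with an extra argument. The function add_bias, its parameter names, and the data are hypothetical stand-ins; extra positional arguments go through args, and keyword arguments are forwarded to the function as well.

```python
import pandas as pd

x = pd.Series([1, 2, 3])  # hypothetical values


def add_bias(value, bias):
    """Hypothetical helper: shift one element by a constant."""
    return value + bias


shifted = x.apply(add_bias, args=(10,))   # bias passed positionally via args
shifted_kw = x.apply(add_bias, bias=10)   # the same call using a keyword argument

print(shifted.tolist())      # [11, 12, 13]
print(shifted_kw.tolist())   # [11, 12, 13]
```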

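Plotting example

pandas can call matplotlib to draw plots. Compared with using matplotlib directly, plotting through pandas is quick and needs few arguments, so you can stay focused on the data analysis.

import pandas as pd
import numpy as np
from matplotlib import pyplot as plt
x = pd.Series([…, …])
counts = x.value_counts()
print(counts)
…
dtype: int64
counts.plot.bar()
plt.show()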
