bins

您當前的位置：首頁 > 標簽>bins

資料分箱之pd.cut()
cut（ x， bins， right=True， labels=None， retbins=False， precision=3， include_lowest=False，
2020-05-26標簽：分箱 Pd Cut bins df
閱讀更多
技巧篇：分箱方法（等距、等頻、聚類）
count（）［‘total_point’］等頻分箱df［‘point_bins_f’］=pd
2022-07-26標簽： df point 聚類 bins kmodel
閱讀更多
IEEE-CIS Fraud Detection 覆盤——嘗試對原始資料負取樣後進行細緻的特徵工程
append（PSI_cal（5，X，y，cats））可以看到，聚類分箱的方法相對於有監督分箱的結果要差很多，但是穩定性確實很高，並且聚類的蔟越多，iv值越高，這個後續有空考慮放進去試試quantile 等寬mergeiv=［］PSIs=［
2019-10-29標簽： TransactionAmt bins train 特徵分箱
閱讀更多
如何在 Matplotlib 中繪製資料列表的直方圖？
hist（x，bins=None，range=None，density=False，weights=None，cumulative=False，bottom=None，histtype=‘bar’，align=‘mid’，orientati
2021-12-23標簽： plt bins 直方圖 hist none
閱讀更多
【風控建模】基於邏輯迴歸的評分卡開發（I）
plot_roc（vali_y， vali_proba_df，plot_micro=False，figsize=（6，6），plot_macro=False）def plot_model_ks（y_label， y_pred）：“”“繪製k
2020-03-29標簽： bins df num list 分箱
閱讀更多
使用hive和python多種方式實現PSI的計算
strftime（“%Y-%m-%d”））dt_train = df［（df［split_date］ >= strat_day）&（df［split_date］ < apply_day）］df_test = df［（df
2020-07-19標簽： actual Predict df PSI bins
閱讀更多
連續資料離散化
分段：KBinsDiscretizer（n_bins=5， encode=‘onehot’， strategy=‘quantile’）需要三個引數n_bins， encode， strategyn_bins:分段的數量encode:編碼的方
2020-08-24標簽： bins encode est quantile xv
閱讀更多
pandas的cut，qcut函式的使用和區別
cut（d_cut［‘number’］， 4， labels=False）d_cut我們可以看到，上面的cut_group的標籤由開閉區間改變成了數字
2019-06-06標簽： Cut qcut 分組 GROUP bins
閱讀更多

資料分箱之pd.cut()

技巧篇：分箱方法（等距、等頻、聚類）

IEEE-CIS Fraud Detection 覆盤——嘗試對原始資料負取樣後進行細緻的特徵工程

如何在 Matplotlib 中繪製資料列表的直方圖？

【風控建模】基於邏輯迴歸的評分卡開發（I）

使用hive和python多種方式實現PSI的計算

連續資料離散化

pandas的cut，qcut函式的使用和區別