如何使用海运为我的DataFrame创建堆叠条形图

2022-02-27 00:00:00 python pandas matplotlib seaborn

问题描述

我有一个DataFramedf

df = pd.DataFrame(columns=["App","Feature1", "Feature2","Feature3", "Feature4","Feature5", "Feature6","Feature7","Feature8"], data=[['SHA', 0, 0, 1, 1, 1, 0, 1, 0], ['LHA', 1, 0, 1, 1, 0, 1, 1, 0], ['DRA', 0, 0, 0, 0, 0, 0, 1, 0], ['FRA', 1, 0, 1, 1, 1, 0, 1, 1], ['BRU', 0, 0, 1, 0, 1, 0, 0, 0], ['PAR', 0, 1, 1, 1, 1, 0, 1, 0], ['AER', 0, 0, 1, 1, 0, 1, 1, 0], ['SHE', 0, 0, 0, 1, 0, 0, 1, 0]])

# display(df)
   App  Feature1  Feature2  Feature3  Feature4  Feature5  Feature6  Feature7  Feature8
0  SHA         0         0         1         1         1         0         1         0
1  LHA         1         0         1         1         0         1         1         0
2  DRA         0         0         0         0         0         0         1         0
3  FRA         1         0         1         1         1         0         1         1
4  BRU         0         0         1         0         1         0         0         0
5  PAR         0         1         1         1         1         0         1         0
6  AER         0         0         1         1         0         1         1         0
7  SHE         0         0         0         1         0         0         1         0

我要创建堆叠条形图,以便每个堆栈与App相对应,而Y轴将包含1值的计数,而X轴将为Feature

它应该类似于此条形图,唯一的区别是现在我希望看到堆叠条形图和带颜色的图例:

df_c = df.iloc[:, 1:].eq(1).sum().rename_axis('Feature').reset_index(name='Cou‌nt')
df_c = df_c.sort_values('Cou‌nt')
plt.figure(figsize=(12,8))
ax = sns.barplot(x="Feature", y='Cou‌nt', data=df_c, palette=sns.color_palette("GnBu", 10))
plt.xticks(rotation='vertical')
ax.grid(b=True, which='major', color='#d3d3d3', linewidth=1.0)
ax.grid(b=True, which='minor', color='#d3d3d3', linewidth=0.5)
plt.show()


解决方案

您可以按照@bharath的建议使用 pandas 图:

import seaborn as sns
sns.set()
df.set_index('App').T.plot(kind='bar', stacked=True)

输出:

更新:

从matplotlib导入ListedColormap df.set_index(‘App’) .reindex_axis(df.set_index(‘App’).sum().sort_values().index,轴=1) .T.Plot(KIND=‘BAR’,STACKED=True, colormap=ListedColormap(sns.color_palette("GnBu",10)), FigSize=(12,6))

更新的 pandas 0.21.0+reindex_axis已弃用,请使用reindex

from matplotlib.colors import ListedColormap

df.set_index('App')
  .reindex(df.set_index('App').sum().sort_values().index, axis=1)
  .T.plot(kind='bar', stacked=True,
          colormap=ListedColormap(sns.color_palette("GnBu", 10)), 
          figsize=(12,6))

输出:

相关文章