用 pandas 为excel中的单元格着色
问题描述
我需要一些帮助.所以我有这样的东西
I need some help here. So i have something like this
import pandas as pd
path = '/Users/arronteb/Desktop/excel/ejemplo.xlsx'
xlsx = pd.ExcelFile(path)
df = pd.read_excel(xlsx,'Sheet1')
df['is_duplicated'] = df.duplicated('#CSR')
df_nodup = df.loc[df['is_duplicated'] == False]
df_nodup.to_excel('ejemplo.xlsx', encoding='utf-8')
所以基本上这个程序将 ejemlo.xlsx
(ejemlo 是西班牙语的例子,只是文件名)加载到 df
(一个 DataFrame
),然后检查特定列中的重复值 .它会删除重复项并再次保存文件.那部分工作正常.问题是,我需要用不同的颜色(如黄色)突出显示包含重复项的单元格,而不是删除重复项.
So basically this program load the ejemplo.xlsx
(ejemplo is example in Spanish, just the name of the file) into df
(a DataFrame
), then checks for duplicate values in a specific column. It deletes the duplicates and saves the file again. That part works correctly. The problem is that instead of removing duplicates, I need highlight the cells containing them with a different color, like yellow.
解决方案
你可以创建一个函数来做高亮...
You can create a function to do the highlighting...
def highlight_cells():
# provide your criteria for highlighting the cells here
return ['background-color: yellow']
然后将突出显示功能应用于您的数据框...
And then apply your highlighting function to your dataframe...
df.style.apply(highlight_cells)
相关文章