在python中打开一个受保护的pdf文件

2022-01-22 00:00:00 python pdf passwords

问题描述

我写了一个pdf破解,找到了被保护的pdf文件的密码.我想用 Python 编写一个程序,可以在没有密码的情况下在屏幕上显示该 pdf 文件.我使用 PyPDF 库.我知道如何在没有密码的情况下打开文件,但不知道受保护的文件.知道吗?谢谢

I write a pdf cracking and found the password of the protected pdf file. I want to write a program in Python that can display that pdf file on the screen without password.I use the PyPDF library. I know how to open a file without the password, but can't figure out the protected one.Any idea? Thanks

filePath = raw_input()
password = 'abc'
if sys.platform.startswith('linux'):
       subprocess.call(["xdg-open", filePath])


解决方案

KL84给出的方法基本可以,但是代码不正确(每页都写输出文件).清理后的版本在这里:

The approach shown by KL84 basically works, but the code is not correct (it writes the output file for each page). A cleaned up version is here:

https://gist.github.com/bzamecnik/1abb64affb21322256f1c4ebbb59a364

# Decrypt password-protected PDF in Python.
# 
# Requirements:
# pip install PyPDF2

from PyPDF2 import PdfFileReader, PdfFileWriter

def decrypt_pdf(input_path, output_path, password):
  with open(input_path, 'rb') as input_file, 
    open(output_path, 'wb') as output_file:
    reader = PdfFileReader(input_file)
    reader.decrypt(password)

    writer = PdfFileWriter()

    for i in range(reader.getNumPages()):
      writer.addPage(reader.getPage(i))

    writer.write(output_file)

if __name__ == '__main__':
  # example usage:
  decrypt_pdf('encrypted.pdf', 'decrypted.pdf', 'secret_password')

相关文章