PHP - 从上传的 Microsoft Word 文档中获取字数

2021-12-31 00:00:00 ms-word upload php .doc

我正在尝试从上传的 word doc (.doc, .docx, .rtf) 中获取字数,但它总是带有烦人的 Word 格式.

I am trying to grab a word count from an uploaded word doc (.doc, .docx, .rtf) but it always carries through the annoying Word formatting.

有人之前解决过这个问题并知道如何解决吗?谢谢:)

Anybody tackled this issue before and know how to solve it? Thanks :)

推荐答案

您需要:

  1. 区分文件类型

  1. Distinguish the file type

$file_name = $_FILES['image']['name'];
$file_extn = end(explode(".", strtolower($_FILES['image']['name'])));

if($file_extn == "doc" || $file_extn == "docx"){
    docx2text();
}elseif($file_extn == "rtf"){
    rtf2text();
}

  • 将文档转换为文本

  • Convert the document to text

    https://stackoverflow.com/a/7371315/2512934 用于 doc 或 docxhttp://webcheatsheet.com/php/reading_the_clean_text_from_rtf.php 用于 rtf

    https://stackoverflow.com/a/7371315/2512934 for doc or docx http://webcheatsheet.com/php/reading_the_clean_text_from_rtf.php for rtf

    数词http://php.net/manual/en/function.str-word-count.php

  • 相关文章