当前位置：首页 > news >正文

世界网站百度信息流开户多少钱

news 2026/4/13 9:49:37

世界网站,百度信息流开户多少钱,装饰网站建设多少钱,东莞免费建站模板Python使用总结之py-docx将word文件中的图片保存，并将内容返回使用py-docx读取word文档的内容，其中包含标题、文本和图片等信息。该方法将标题和内容返回，并将文件中的图片保存到指定的文件夹中。实现步骤加载文件内容读取文件的段落对文…

Python使用总结之py-docx将word文件中的图片保存，并将内容返回

使用py-docx读取word文档的内容，其中包含标题、文本和图片等信息。该方法将标题和内容返回，并将文件中的图片保存到指定的文件夹中。

实现步骤

加载文件内容
读取文件的段落
对文件段落做判断
根据判断结果进行数据保存或者文件保存

代码部分

from docx import Document
import os
import redef extract_images_and_text(doc_path, output_folder):# 判断文件是否存在os.makedirs(output_folder, exist_ok=True)# 获取文件内容doc = Document(doc_path)# 创建数据保存字典content_dict = {}# 保存文件标题title = doc.paragraphs[0].text.strip() if doc.paragraphs else "Untitled"# 导出文件内容和图片full_text = ""img_count = 0for para in doc.paragraphs:full_text += para.text + "\n"for rel in doc.part.rels:rel = doc.part.rels[rel]if "image" in rel.target_ref:img_count += 1img_data = rel.target_part.blobimg_filename = f"image_{img_count}.png"img_path = os.path.join(output_folder, img_filename)with open(img_path, "wb") as img_file:img_file.write(img_data)# 清楚特殊符号content_dict[title] = re.sub(r'\n\s*\n', '\n', full_text.strip())return content_dict# Example usage:
doc_path = "path/to/your/document.docx"
output_folder = "path/to/output/folder"
result = extract_images_and_text(doc_path, output_folder)
print(result)