Reading PDF Content with Python

Author: 机器笔尖舞者 | 2024-01-31 19:08:48


Reading PDF text and tables in Python

import pandas as pd
import pdfplumber

with pdfplumber.open(r'C:\Users\2023\02\开发.pdf') as pdf:
    for page in pdf.pages:
        # extract_text() returns the page text as a str; print its type
        print(type(page.extract_text()))
        # Extract every table on the page; each table is a list of rows
        tables = page.extract_tables()
        for table in tables:
            # Use the first row as the header, build a pandas DataFrame
            # from the remaining rows, then convert it to a list of lists
            print(pd.DataFrame(table[1:], columns=table[0]).values.tolist())
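When pandas is not needed, the same "first row is the header" idea can be sketched in plain Python. The helper name `table_to_records` and the sample data below are hypothetical and only for illustration. The sketch assumes each table has the list-of-rows shape that `page.extract_tables()` yields, and that empty cells may come back as `None`.

```python
def table_to_records(table):
    """Convert one extracted table (a list of rows) into a list of dicts.

    Assumes the first row holds the column titles (hypothetical helper).
    """
    if not table:
        return []
    # Empty cells may be None; normalise them to empty strings.
    header = [(cell or "").strip() for cell in table[0]]
    records = []
    for row in table[1:]:
        cells = [(cell or "").strip() for cell in row]
        # Pair each cell with its column title.
        records.append(dict(zip(header, cells)))
    return records


# Hand-written sample in the same shape as one entry of page.extract_tables()
sample = [["name", "qty"], ["apple", "3"], ["pear", None]]
records = table_to_records(sample)
print(records)
```

Inside the page loop, this could be called as `table_to_records(table)` for each table. Note that `zip` silently drops cells beyond the header length, so ragged rows would need extra handling.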

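Disclaimer: This article was contributed by a user and does not represent the position of 【wpsshop博客】. Copyright belongs to the original author, and this site assumes no legal liability. If you find infringing content, please contact us. When reposting, please cite the source: https://www.wpsshop.cn/article/detail/51530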