Commit Graph

22 Commits

Author SHA1 Message Date
CaptainB 57b15a8a7f feat: Knowledge base supports uploading CSV and Excel
--story=1016154 --user=刘瑞斌 [Knowledge base] Support uploading spreadsheet documents (Excel/CSV), segmented by row https://www.tapd.cn/57709429/s/1567910
2024-08-30 15:46:20 +08:00
shaohuzhang1 a9443a638c fix: Fix uploaded documents with the suffix PDF not being recognized 2024-08-27 14:16:03 +08:00
CaptainB 2a87af6172 chore: Output the error reason when parsing fails 2024-08-22 10:43:48 +08:00
shaohuzhang1 00af530d27
chore: Output the error reason when parsing fails (#996)
Co-authored-by: CaptainB <bin@fit2cloud.com>
2024-08-20 22:03:58 +08:00
CaptainB 17af603397 refactor: Optimize PDF loading and fix garbled Chinese text in some PDFs
2024-08-20 16:58:04 +08:00
CaptainB 01d8204cb5 refactor: Load PDFs page by page; save images as separate files for loading 2024-08-16 15:08:22 +08:00
CaptainB 0d59ab2be9 refactor: Load PDFs using lazy_load
2024-08-16 10:43:20 +08:00
CaptainB e266dd9d99 refactor: Support parsing images in PDFs
2024-08-15 20:53:44 +08:00
shaohuzhang1 b3c7120372
fix: Fix QA file parsing failure (#933) 2024-08-06 14:47:28 +08:00
shaohuzhang1 22e192ed11
fix: Fix document import parsing error (#570)
2024-05-28 17:32:29 +08:00
shaohuzhang1 efe5a2b021
fix: Fix Excel import failure (#554) 2024-05-27 16:06:14 +08:00
shaohuzhang1 e9a05b1255
fix: Fix QA knowledge base import failure (#536) 2024-05-24 17:59:02 +08:00
shaohuzhang1 28938104c0
* feat: Support uploading Excel/CSV question-answer pairs (#430)
2024-05-23 18:57:49 +08:00
shaohuzhang1 86f500208f
feat: Support uploading HTML documents #364 (#518) 2024-05-23 14:19:18 +08:00
shaohuzhang1 1f916a5c3e
feat: [Knowledge base] Support image upload in docx #69 (#267) 2024-04-26 18:03:02 +08:00
shaohuzhang1 8b31fd6b36 fix: Error when segmenting files of unsupported types 2024-04-10 17:05:46 +08:00
shaohuzhang1 bd3f6e4a9b fix: Support table data in Word document segmentation 2024-04-10 10:38:17 +08:00
shaohuzhang1 765c79ed9d fix: Modify the segmentation regex and optimize the segmentation logic 2024-04-09 18:05:50 +08:00
shaohuzhang1 b038f12a52 fix: Increase the document upload size limit to 100MB 2024-04-09 15:21:40 +08:00
shaohuzhang1 11d8c6f174
fix: Fix known bugs (#30)
* fix: Client statistics reset after refreshing the public access link

* fix: Export uncommitted SQL files

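* fix: A knowledge base created from the MaxKB online documentation only fetches data from the root address; data from sub-addresses cannot be fetched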
2024-04-02 19:32:04 +08:00
shaohuzhang1 16ab1f0eae
Pr@main@fix bugs (#27)
* fix: Optimize Word segmentation rules

* fix: Remove special characters from titles

* fix: Conversation regeneration issue

---------

Co-authored-by: wangdan-fit2cloud <dan.wang@fit2cloud.com>
2024-04-01 14:39:56 +08:00
shaohuzhang1 c55bb3f6e5
Pr@main@pdf (#23)
* feat: Segmentation API supports Word and PDF

* fix: General-purpose knowledge base supports uploading PDF/DOC documents #19

---------

Co-authored-by: wangdan-fit2cloud <dan.wang@fit2cloud.com>
2024-03-29 18:28:05 +08:00