CaptainB
|
9b1a497925
|
fix: handle incomplete import of some PDFs that do not contain a table of contents or internal links
(cherry picked from commit fb8b96779c)
|
2024-12-09 14:10:26 +08:00 |
|
CaptainB
|
7346ef6a2c
|
fix: filter out empty sheets
--bug=1049943 --user=刘瑞斌 [Document content extraction] Error when an uploaded Excel file contains an empty sheet https://www.tapd.cn/57709429/s/1625062
|
2024-12-04 16:30:43 +08:00 |
|
shaohuzhang1
|
6b4cee1412
|
fix: chat returned no response data when called through the API (#1755)
|
2024-12-04 14:19:37 +08:00 |
|
shaohuzhang1
|
b6c65154c5
|
fix: sub-application form calls could not be invoked (#1741)
|
2024-12-03 15:23:53 +08:00 |
|
shaohuzhang1
|
b8aa4756c5
|
fix: workflow node output and related issues (#1716)
|
2024-11-29 19:26:16 +08:00 |
|
CaptainB
|
78cd949f43
|
fix: images in uploaded xlsx files were not shown in document extraction
|
2024-11-29 16:28:34 +08:00 |
|
CaptainB
|
2a07a50a60
|
fix: images in doc files were not saved or displayed during document extraction
|
2024-11-28 16:17:23 +08:00 |
|
CaptainB
|
f638abdea2
|
fix: images in doc files were not saved or displayed during document extraction
|
2024-11-28 15:07:21 +08:00 |
|
CaptainB
|
59f5c8ac76
|
fix: document extraction errors were not displayed
|
2024-11-27 12:20:16 +08:00 |
|
CaptainB
|
64e8f4dc9f
|
chore: output an error message when document content cannot be extracted
|
2024-11-22 17:56:07 +08:00 |
|
CaptainB
|
e1df4b2857
|
fix: handle the "Null characters are not allowed" error raised when a PDF contains \0 characters
--bug=1048190 --user=刘瑞斌 [Knowledge base] Error when uploading a PDF document, related issue #1468 https://www.tapd.cn/57709429/s/1611070
|
2024-11-18 12:46:37 +08:00 |
|
CaptainB
|
10e53f08e2
|
feat: support file upload in advanced orchestration (WIP)
|
2024-11-14 14:24:36 +08:00 |
|
CaptainB
|
b57a619bdb
|
feat: support file upload in advanced orchestration (WIP)
|
2024-11-14 13:36:16 +08:00 |
|
shaohuzhang1
|
22d9fdc42f
|
fix: images in legacy Word documents were not recognized correctly #1533
|
2024-11-06 14:20:10 +08:00 |
|
CaptainB
|
834ccaa35b
|
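refactor: enforce the character limit when segmenting PDFs
--bug=1047568 --user=刘瑞斌 [github#1363] Advanced segmentation of PDF files defaults to a segment length of 500, but the generated segments exceed 29000 characters https://www.tapd.cn/57709429/s/1600183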
|
2024-10-29 11:44:37 +08:00 |
|
shaohuzhang1
|
83d97439e4
|
fix: some images failed to import when importing Word documents
|
2024-10-28 17:44:11 +08:00 |
|
CaptainB
|
76f63642e5
|
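fix: blank rows were not filtered out when importing CSV files
--bug=1047841 --user=刘瑞斌 [Knowledge base] After uploading a CSV table template, the first (header) row is shown incompletely in the imported segments https://www.tapd.cn/57709429/s/1597113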
|
2024-10-24 11:13:26 +08:00 |
|
wxg0103
|
d5bbf48d01
|
style: improve styles
|
2024-10-18 15:51:03 +08:00 |
|
Henry-Shaw
|
33d63c8efe
|
fix: images were not recognized and imported correctly after uploading legacy docx files to the knowledge base (#1382)
|
2024-10-16 14:39:52 +08:00 |
|
CaptainB
|
e16e827028
|
fix: handle leading and trailing whitespace in text
|
2024-09-25 16:00:30 +08:00 |
|
CaptainB
|
6cacb5be71
|
fix: handle non-standard PDFs whose preface is not listed in the table of contents and so was not recognized correctly
|
2024-09-24 12:06:51 +08:00 |
|
shaohuzhang1
|
885ab5410a
|
fix: [Knowledge base] Word documents exported from Yuque were blank after being imported into the knowledge base #1148
|
2024-09-20 19:37:22 +08:00 |
|
shaohuzhang1
|
49efb185e0
|
fix: [Model settings] error when creating a model with an application base URL
|
2024-09-20 18:48:10 +08:00 |
|
shaohuzhang1
|
fda0bcb5d6
|
fix: some content was lost when re-importing an exported knowledge base
|
2024-09-20 16:20:08 +08:00 |
|
CaptainB
|
3e3b77e34d
|
refactor: handle vertically merged cells
|
2024-09-18 12:37:33 +08:00 |
|
CaptainB
|
746f587698
|
fix: distinguish between xls and xlsx for table data
|
2024-09-12 10:49:31 +08:00 |
|
shaohuzhang1
|
b924958176
|
feat: support cell images in xlsx files for uploaded table documents
|
2024-09-11 18:27:44 +08:00 |
|
shaohuzhang1
|
37445762b2
|
feat: support cell images in xlsx files for uploaded QA pair documents
|
2024-09-11 15:55:29 +08:00 |
|
shaohuzhang1
|
f9a76d7948
|
feat: support the OpenAI API #353 (#1128)
|
2024-09-09 14:47:25 +08:00 |
|
CaptainB
|
70f44b990c
|
refactor: segment well-formatted PDFs by their table of contents
|
2024-09-06 10:56:27 +08:00 |
|
CaptainB
|
57b15a8a7f
|
feat: support uploading CSV and Excel files to the knowledge base
--story=1016154 --user=刘瑞斌 [Knowledge base] Support uploading spreadsheet documents (Excel/CSV) segmented by row https://www.tapd.cn/57709429/s/1567910
|
2024-08-30 15:46:20 +08:00 |
|
shaohuzhang1
|
a9443a638c
|
fix: uploaded documents with the extension PDF were not recognized
|
2024-08-27 14:16:03 +08:00 |
|
CaptainB
|
2a87af6172
|
chore: output the cause of the error when parsing fails
|
2024-08-22 10:43:48 +08:00 |
|
shaohuzhang1
|
00af530d27
|
chore: output the cause of the error when parsing fails (#996)
Co-authored-by: CaptainB <bin@fit2cloud.com>
|
2024-08-20 22:03:58 +08:00 |
|
CaptainB
|
17af603397
|
refactor: optimize PDF loading and fix garbled Chinese text in some PDFs
|
2024-08-20 16:58:04 +08:00 |
|
CaptainB
|
01d8204cb5
|
refactor: load PDFs page by page; save image content as separate files for loading
|
2024-08-16 15:08:22 +08:00 |
|
CaptainB
|
0d59ab2be9
|
refactor: load PDFs using lazy_load
|
2024-08-16 10:43:20 +08:00 |
|
CaptainB
|
e266dd9d99
|
refactor: support parsing images in PDFs
|
2024-08-15 20:53:44 +08:00 |
|
shaohuzhang1
|
b3c7120372
|
fix: QA file parsing failure (#933)
|
2024-08-06 14:47:28 +08:00 |
|
RatishT
|
6c67b65e6a
|
Fix pylint warning `no-else-return` (#678)
|
2024-07-02 10:55:26 +08:00 |
|
shaohuzhang1
|
22e192ed11
|
fix: parsing error on document import (#570)
|
2024-05-28 17:32:29 +08:00 |
|
shaohuzhang1
|
efe5a2b021
|
fix: Excel import failure (#554)
|
2024-05-27 16:06:14 +08:00 |
|
shaohuzhang1
|
e9a05b1255
|
fix: QA knowledge base import failure (#536)
|
2024-05-24 17:59:02 +08:00 |
|
shaohuzhang1
|
7953706895
|
perf: improve error messages (#533)
|
2024-05-24 16:18:33 +08:00 |
|
shaohuzhang1
|
28938104c0
|
* feat: support uploading QA pairs in Excel/CSV format (#430)
|
2024-05-23 18:57:49 +08:00 |
|
shaohuzhang1
|
86f500208f
|
feat: support uploading HTML documents #364 (#518)
|
2024-05-23 14:19:18 +08:00 |
|
shaohuzhang1
|
1f916a5c3e
|
feat: [Knowledge base] support image upload for docx #69 (#267)
|
2024-04-26 18:03:02 +08:00 |
|
shaohuzhang1
|
8b31fd6b36
|
fix: error when segmenting files of unsupported types
|
2024-04-10 17:05:46 +08:00 |
|
shaohuzhang1
|
bd3f6e4a9b
|
fix: support table data in Word segmentation
|
2024-04-10 10:38:17 +08:00 |
|
shaohuzhang1
|
765c79ed9d
|
fix: modify the segmentation regex and optimize the segmentation logic
|
2024-04-09 18:05:50 +08:00 |
|