Commit Graph

25 Commits

Author SHA1 Message Date
shaohuzhang1 db772b1d1c
fix: Segmented filtering of paragraphs with empty parent title content (#2693)
Some checks are pending
sync2gitee / repo-sync (push) Waiting to run
Typos Check / Spell Check with Typos (push) Waiting to run
2025-03-26 16:02:26 +08:00
shaohuzhang1 5ec94860b2
perf: Enhance Word parsing (#2612) 2025-03-19 12:04:43 +08:00
shaohuzhang1 df172b530c fix: 修复上传PDF文件智能分段时提示 分段内容不能超过102400个字符 #998 2024-08-26 14:17:25 +08:00
shaohuzhang1 0ad5a76598
fix: 修复分段时,特殊情况会丢失数据 #938 (#946)
Some checks are pending
sync2gitee / repo-sync (push) Waiting to run
Typos Check / Spell Check with Typos (push) Waiting to run
2024-08-07 19:52:05 +08:00
shaohuzhang1 d935e9a836
fix: 修复上传文档,高级分段设置分段长度为10w字符,生成预览还是4096个字符一段 (#884) 2024-07-29 14:08:40 +08:00
gcalgoz 4d8ac28674
fix: 优化文档分割处理 (#814) 2024-07-19 17:33:18 +08:00
shaohuzhang1 00f2a8bbd4
fix: 修复【知识库】高级分段 自动清洗,把所有换行被去除 (#684) 2024-07-02 11:32:20 +08:00
shaohuzhang1 7a5bfa2673
fix: 修复导入带有表格样式的md文件分段后表格格式丢失 #615 (#630)
Some checks failed
sync2gitee / repo-sync (push) Has been cancelled
Typos Check / Spell Check with Typos (push) Has been cancelled
2024-06-13 10:41:52 +08:00
shaohuzhang1 ed7ddfbc59
fix: 修复分段超过分段长度限制 (#577) 2024-05-29 12:00:56 +08:00
shaohuzhang1 5e499e6afa
fix: PDF上传知识库开始导入接口报错 #122 (#125) 2024-04-16 20:59:27 +08:00
shaohuzhang1 fb7abb432f
Pr@main@fix bugs (#41)
* fix: 修复提示问题

* fix: 上传文档限制

* feat: 问题管理

* fix: 修改分段正则,优化分段逻辑

* feat: 问题管理

* fix: word分段支持表格数据

* fix: 问题批量插入去重

* fix: 修复文档问题

* feat: 文档分页优化

* fix: 优化关联问题

* fix: 嵌入样式
2024-04-10 14:16:56 +08:00
shaohuzhang1 16ab1f0eae
Pr@main@fix bugs (#27)
* fix: 优化word分段规则

* fix: 去除标题特殊字符

* fix: 对话重新生成问题

---------

Co-authored-by: wangdan-fit2cloud <dan.wang@fit2cloud.com>
2024-04-01 14:39:56 +08:00
shaohuzhang1 d732a46f89 fix: 【知识库】导入非utf8 编码的txt文件,分段内容是空白 2024-03-25 11:05:15 +08:00
zhangshaohu 53d45e069d fix: 智能分段文本分段丢失数据 2024-03-23 19:46:20 +08:00
shaohuzhang1 da3015fa36 fix: 分段正则修改 2024-03-04 18:34:47 +08:00
shaohuzhang1 cc62c35995 fix: 分段时 title超过256字符将超出部分拼接给content 2024-02-29 18:48:10 +08:00
shaohuzhang1 7eb18fbf30 fix: 上传的文档中 未智能处理空白段落 2024-02-19 11:29:24 +08:00
shaohuzhang1 edbc8561c7 fix: 分段错误,会话模板修改 2024-01-02 15:34:19 +08:00
shaohuzhang1 64c8cc6b39 feat: web数据集 2023-12-29 18:02:23 +08:00
shaohuzhang1 ce1c2271eb fix: 分段换行被替换 2023-12-21 17:46:43 +08:00
shaohuzhang1 b61da14fdd fix: 去除分段日志打印 2023-12-20 14:39:35 +08:00
shaohuzhang1 47cb83ce0c feat: 分段错误 2023-12-15 11:53:43 +08:00
shaohuzhang1 78a9697f50 feat: 批量添加团队成员,文档分段高级分段标识 2023-11-20 18:53:18 +08:00
shaohuzhang1 a2de9691fb feat: 数据集,文档,段落,问题,向量化接口 2023-10-24 20:24:32 +08:00
shaohuzhang1 dbe8e519a9 feat(init): 初始化项目 2023-09-15 17:40:35 +08:00