Commit Graph

16 Commits

Author SHA1 Message Date
shaohuzhang1 621fbd3bd6
fix: The image uploaded from the workflow knowledge base zip file cannot be parsed (#4505) 2025-12-12 14:01:29 +08:00
shaohuzhang1 e529514842
fix: The image uploaded from the workflow knowledge base zip file cannot be parsed (#4503) 2025-12-12 11:57:38 +08:00
CaptainB f6102b1457 chore: enhance zip content extraction with posixpath for path normalization 2025-12-11 15:10:49 +08:00
CaptainB dc0d9e29ce feat: enhance image path handling in markdown processing
--bug=1064136 --user=刘瑞斌 【工作流知识库】上传zip文件,文件中的图片内容没有写入知识库 https://www.tapd.cn/62980211/s/1806941
2025-12-08 11:16:38 +08:00
CaptainB a197cfc69d feat: implement CSV, XLS, and XLSX content extraction with Markdown formatting 2025-12-04 14:33:19 +08:00
shaohuzhang1 a8d0729e65
perf: Memory optimization (#4318) 2025-11-05 19:05:26 +08:00
CaptainB 4c9756839a chore: normalize with_filter parameter to boolean in split handle files
Some checks are pending
sync2gitee / repo-sync (push) Waiting to run
--bug=1057879 --user=刘瑞斌 【知识库】高级分段中自动清洗功能未生效 https://www.tapd.cn/62980211/s/1727744
2025-07-10 15:06:19 +08:00
CaptainB aa901c7fc7 fix: update file URL paths to use relative references 2025-07-02 22:45:11 +08:00
CaptainB 82a2203be6 fix: handle string type for limit and improve error logging in pdf_split_handle
--bug=1057493 --user=刘瑞斌 【知识库】上传文档,使用高级分段报错 https://www.tapd.cn/62980211/s/1720110
2025-06-30 12:47:47 +08:00
CaptainB d49f448a5f fix: correct image path replacement logic in zip_split_handle 2025-06-26 17:02:34 +08:00
CaptainB 37ac79dc5a feat: import File model in zip_split_handle for enhanced functionality
--bug=1057478 --user=刘瑞斌 【知识库】通用知识库上传ZIP文件,分段失败 https://www.tapd.cn/62980211/s/1719181
2025-06-26 16:56:28 +08:00
CaptainB 45908b91ff refactor: update dataset_id to knowledge_id in zip_split_handle.py and tools.py 2025-06-18 21:28:33 +08:00
CaptainB c0b770f41e refactor: update dataset_id to knowledge_id in zip_split_handle.py and tools.py 2025-06-18 21:15:53 +08:00
wxg0103 b8b14884bd refactor: add application settings 2025-06-07 17:57:11 +08:00
CaptainB c3581be9bd fix: rename image_name to file_name in zip_split_handle and remove workspace_id assignment in document 2025-05-13 12:47:59 +08:00
CaptainB 43bef216d5 refactor: reorganize file handling imports into a structured directory 2025-04-30 16:08:17 +08:00