CaptainB
|
9255089d8b
|
feat: enhance PDF content extraction with font size analysis
sync2gitee / repo-sync (push) Waiting to run
Typos Check / Spell Check with Typos (push) Waiting to run
|
2025-12-02 19:22:56 +08:00 |
|
CaptainB
|
d147b794ce
|
chore: replace split_text with smart_split_paragraph in pdf_split_handle.py
|
2025-10-27 14:23:42 +08:00 |
|
CaptainB
|
4c9756839a
|
chore: normalize with_filter parameter to boolean in split handle files
sync2gitee / repo-sync (push) Waiting to run
--bug=1057879 --user=刘瑞斌 【知识库】高级分段中自动清洗功能未生效 https://www.tapd.cn/62980211/s/1727744
|
2025-07-10 15:06:19 +08:00 |
|
CaptainB
|
82a2203be6
|
fix: handle string type for limit and improve error logging in pdf_split_handle
--bug=1057493 --user=刘瑞斌 【知识库】上传文档,使用高级分段报错 https://www.tapd.cn/62980211/s/1720110
|
2025-06-30 12:47:47 +08:00 |
|
CaptainB
|
a73e0b10f9
|
refactor: replace logging with maxkb_logger for consistent logging across modules
|
2025-06-25 17:00:18 +08:00 |
|
CaptainB
|
fe8f87834d
|
refactor: replace logging with maxkb_logger for consistent logging across modules
|
2025-06-25 16:46:50 +08:00 |
|
CaptainB
|
43bef216d5
|
refactor: reorganize file handling imports into a structured directory
|
2025-04-30 16:08:17 +08:00 |
|