ExtractThinker

流程：doclingOCR文件生成Markdown文本——llm进行识别并判断是自定义类型中的哪一种——llm进行提取相应类型所需的数据——存入数据库
参考、
- https://pub.towardsai.net/building-an-on-premise-document-intelligence-stack-with-docling-ollama-phi-4-extractthinker-6ab60b495751
- https://github.com/enoch3712/ExtractThinker
效果如下

alt text

alt text

ExtractThinker

https://tolsz.me/2025/02/06/ExtractThinker/

作者

wbj_Lsz

发布于

2025年2月6日

许可协议

Ollama-minicpm-v 上一篇

极空间探索下一篇