GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture. It introduces Multi-Token Prediction (MTP) loss and stable full-task ...
[INFO] glmocr.pipeline.pipeline: Starting Pipeline... [DEBUG] glmocr.layout.layout_detector: Initializing PP-DocLayoutV3... '[Errno 101] Network is unreachable ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results