STACKQUADRANT

PaddlePaddle/PaddleOCR

RAG Libraries

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

8.6
GitHub Metrics
Stars
79.4k
Forks
10.6k
Open Issues
206
Watchers
541
Contributors
295
Weekly Commits
0
Language
Python
License
Apache-2.0
Last Commit
Jun 3, 2026
Created
May 8, 2020
Latest Release
v3.6.0
Release Date
May 28, 2026
Synced: Jun 3, 2026
Quality Scores
Documentation Qualityw: 20%
8.4

Has docs site (https://www.paddleocr.com). Description: 176 chars. Stars signal: 79,431. Contributors: 295. Score: 8.4/10

Community Healthw: 20%
9.7

Stars: 79,431. Contributors: 295. Watchers: 541. Forks: 10,566. Issue ratio: 0.3%. Score: 9.7/10

Maintenance Velocityw: 15%
7.7

Last commit: 0d ago. Weekly commits: 0. Latest release: v3.6.0. Maturity bonus: 6.1y old. Score: 7.7/10

API Design & DXw: 20%
8.4

Stars/issues ratio: 386. Dynamic language: Python. Has documentation site. Permissive license: Apache-2.0. Popularity signal: 79,431 stars. Score: 8.4/10

Production Readinessw: 15%
8.6

Battle-tested: 79,431 stars. Peer review: 295 contributors. Versioned: v3.6.0. Licensed: Apache-2.0. Age: 6.1 years. Maintenance: last commit 0d ago. Score: 8.6/10

Ecosystem Integrationw: 10%
8.9

Fork interest: 10,566. Major ecosystem: Python. Integration-friendly: Apache-2.0. Adoption: 79,431 stars. Has web presence. Score: 8.9/10

Tags
ai4sciencechineseocrdocument-parsingdocument-translationkieocrpaddleocr-vlpdf-extractor-ragpdf-parserpdf2markdown
Radar
Documentation Quality
Community Health
Maintenance Velocity
API Design & DX
Production Readiness
Ecosystem Integration