QuickTable, an ultra lightweight application to extract tables in PDF

This research proposes a new pipeline to extract tables of interest in PDF files, and develops an ultra lightweight application named QuickTable accordingly. Most of the previous research only focused on one or two tasks of table recognition and there is little research on finding tables of interest. The developed QuickTable uses the proposed pipeline based on PP-Picodet, SLANet, PPOCRv3, Text Segmentation and Cosine Similarity Analysis, which allows users to upload PDF files from mobile devices and enter keywords to get tables of interest. In addition, we have trained models in both Chinese and English so that users can upload files in different languages. Experiments show that the proposed pipeline is lightweight and outperforms previous approaches, demonstrating the effectiveness of our method. The code is open-sourced.

Xiang Fei
Xiang Fei
Master of Science in Robotics (MSR)

My current research interests include SLAM, 3D Reconstruction, and Deep Learning.