dots.ocr is a powerful, multilingual document parser that unifies layout detection and content recognition within a single vision-language model while maintaining good reading order. Despite its ...
Abstract: In this article, a random triangle model (RTM) is proposed for tracking an extended object (EO) (e.g., an aircraft or ship) that can be modeled by one or more triangles. In the RTM, a ...
Abstract: This paper proposes SLOT-MPC, a hierarchical model predictive control framework for a system of multirotor Unmmaned Aerial Vehicle (UAV), which aims to minimize uncertainty in estimating ...
A common misconception in automated software testing is that the document object model (DOM) is still the best way to ...
Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the intersection of textual and spatial modalities. The visual cues ...
IIIF provides researchers rich metadata and media viewing options for comparison of works across cultural heritage collections. Visit the IIIF page to learn more.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results