[B-tier conference] Improving OCR Accuracy on Early Printed Books by Utilizing Cross Fold Training and Voting. DAS 2018

2018-12-26

Christian Reul, Uwe Springmann, Christoph Wick, Frank Puppe: Improving OCR Accuracy on Early Printed Books by Utilizing Cross Fold Training and Voting. DAS 2018: 423-428

Our method shows considerable differences compared to the work presented above. Not only is it applicable to some of the earliest printed books, but it also works with only a single open source OCR engine. Furthermore, it can be easily adapted to practically any given book using even a small amount of GT without the need for excessive data to train on (60 to 150 lines of GT corresponding to just a few pages will suffice for most cases).
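
To make the idea in the paper's title concrete, here is a minimal sketch of cross fold training data splits and output voting. It is not the authors' implementation: the function names `make_folds` and `vote_line`, the default of `k=5`, and the whole-line majority vote are assumptions chosen for illustration. The paper's actual voting scheme may work at a finer granularity than whole lines.

```python
from collections import Counter


def make_folds(gt_lines, k=5):
    """Split ground-truth (GT) lines into k folds (hypothetical helper).

    Returns k (train, validation) pairs. Each pair would be used to
    train one model with the same OCR engine, so k models result.
    """
    folds = [gt_lines[i::k] for i in range(k)]
    splits = []
    for i in range(k):
        validation = folds[i]
        train = [line for j, fold in enumerate(folds) if j != i for line in fold]
        splits.append((train, validation))
    return splits


def vote_line(candidates):
    """Pick the most frequent transcription among the models' outputs.

    Simplified whole-line majority vote (an assumption); ties go to
    the candidate that appears first.
    """
    return Counter(candidates).most_common(1)[0][0]


if __name__ == "__main__":
    gt = [f"line{i}" for i in range(100)]  # stand-in for ~100 GT lines
    for train, validation in make_folds(gt):
        print(len(train), len(validation))
    print(vote_line(["abc", "abd", "abc"]))
```

With a GT set of this size, each model is trained on most of the lines and validated on the rest, and the models' outputs for a new line are combined by `vote_line`.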

stone