AIMultiple aims to help enterprises identify the right OCR for their business. These enterprises should expect to process high volumes of documents and images (ie at least tens of thousands of pages per month).
What will be the guiding principles?
AIMultiple’s benchmark methodology explains the participation requirements and principles.
Will it be benchmarked?
Extracting text in English from documents and images.
The dataset is expected to contain 500 pages:
- Long format PDF documents (such as technical manuals, whitepapers, contracts) of up to 300 pages that contain text in image form. PDFs of varying legibility will be used. PDF will be collected online.
- 100 pages of transactional documents (such as invoices and receipts). They will be collected online and selected from the documents of AIMultiple and its partners.
- 100 pages of handwritten documents (such as receipts, insurance claim forms). They will be collected online and selected from the documents of AIMultiple and its partners.
In some documents, parts of the document will be digitally altered to protect PII.
How will AIMultiple benchmark perform?
AIMultiple’s OCR benchmarks aim to closely match the preferences of OCR buyers. They want a flexible, cost-effective solution. Therefore, AIMultiple will measure these metrics:
accuracy
This would be measured by cosine similarity. We will not use the Levenshtein distance because different products output text in different orders, especially in the case of multi-column text. While the Levenshtein distance takes these positional differences into account, we are interested in how accurately the text is detected, but not where it is located.
pace
Average response time and distribution of response times will be measured. A maximum of 5 seconds of data processing and transfer time will be allowed per page.
scalability
The same metrics can be tested with a certain number of simultaneous connections. This metric may be the same for all providers (i.e. simultaneous connections may not slow down processing). In such case, AIMultiple may not publish the results of this metric.
Cost
Public cost data published by vendors will be used to calculate the cost of the benchmark. Sellers’ pricing models will also be shared to help buyers compare prices of different loads.
customer service
Reviews on B2B review platforms will be analyzed to assess customer satisfaction.
How will the results be published?
They will be published on AIMultiple.com and will contain graphs that users can use to find the right salespeople for their business. The top three vendors in each of the above categories will be presented.
Each participant will receive
- Detailed results for each document and page with timestamp
- Average result for each document and page
- dataset
Please note that AIMultiple is in the design phase of the benchmark and changes will be made once AIMultiple receives end user feedback and finalizes the benchmark.
If you would like to participate in the AIMultiple OCR benchmarks, reach out to the AIMultiple team at (email protected).











