ABOUT NORCOM
The task
Invoices must be checked for consistency, and abnormalities such as double billing must be made visible. Because this is a complex manual process, it should be automated to the greatest possible extent; with advanced analytics, even rare and complex abnormalities should be easy to find.
The challenge
The invoices were available only as scans of varying quality. The number and order of pages varied, and the information they contained was both structured (tables) and unstructured (free text), each with strong structural variation.
Our solution
We created a pipeline consisting of OCR, table recognition and information extraction. An integral part of the pipeline was an automatic evaluation of the quality of the extraction results, with the option of controlled optimization. Invoices were merged by detecting near-duplicates, and duplicate entries and other anomalies were made visible using advanced analytics. A scalable architecture keeps the pipeline usable even on large data sets and enables the analysis of statistical anomalies.
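To illustrate the near-duplicate step, here is a minimal sketch in Python. It is not the method used in the project; it assumes that OCR text is already available per invoice and compares word shingles with Jaccard similarity. The function names, the shingle size and the threshold of 0.6 are illustrative assumptions.

```python
from itertools import combinations


def shingles(text: str, k: int = 3) -> set:
    """Return the set of k-word shingles of a lower-cased text."""
    words = text.lower().split()
    if len(words) < k:
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two sets (1.0 if both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def near_duplicates(invoices: dict, threshold: float = 0.6) -> list:
    """Return (name_a, name_b, score) for all pairs at or above the threshold.

    `invoices` maps a hypothetical invoice name to its OCR text.
    The pairwise comparison is quadratic, which is fine for a sketch
    but would need an index (e.g. MinHash/LSH) on large data.
    """
    sets = {name: shingles(text) for name, text in invoices.items()}
    pairs = []
    for (n1, s1), (n2, s2) in combinations(sets.items(), 2):
        score = jaccard(s1, s2)
        if score >= threshold:
            pairs.append((n1, n2, score))
    return pairs
```

Two scans of the same invoice that differ only by a single OCR error share most of their shingles and are flagged as a pair, while unrelated invoices score close to zero.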
The customer benefit
Thanks to automation, only a few invoices need to be checked manually, which leads to significant time and cost savings. The detection rate of abnormalities is significantly increased thanks to advanced analytics.
The Extract-App trawls through documents for predefined information. If you don't have time to search hundreds of pages for a small but relevant number in the third-to-last paragraph, leave it to this app.

Functions: annotation, named entity recognition, implementation of rules, aggregation of information, for example in tables and dashboards.

Currently in use, e.g., in fund due diligence
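As a rough illustration of "implementation of rules" and "aggregation of information in tables", the following Python sketch applies regular-expression rules to document text and collects one table row per document. It does not reflect the Extract-App's actual interface; the rule names, patterns and file names are hypothetical.

```python
import re

# Hypothetical rules: field name -> pattern with exactly one capture group.
RULES = {
    "management_fee": re.compile(
        r"management fee of\s+(\d+(?:\.\d+)?)\s*%", re.IGNORECASE
    ),
    "fund_currency": re.compile(r"denominated in\s+([A-Z]{3})\b"),
}


def extract(text: str, rules: dict = RULES) -> dict:
    """Apply every rule and return the first match per field, or None."""
    row = {}
    for field_name, pattern in rules.items():
        match = pattern.search(text)
        row[field_name] = match.group(1) if match else None
    return row


def aggregate(documents: dict) -> list:
    """Build one table row (a dict) per document name."""
    return [{"document": name, **extract(text)} for name, text in documents.items()]
```

Fields that no rule matches stay empty (None) rather than being guessed, so gaps remain visible in the resulting table.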
Someone keeps track! The Labeling app thoroughly checks documents and enriches them with metadata. This way, no information is lost, and anyone who searches will find the right thing!
Functions: Weak Learning & Machine Learning, Speech Recognition, Author Recognition, Classification, Named Entity Recognition

Currently in use, e.g., to determine contractual partners
This app is an esthete. If you don't like bare facts and figures, you've come to the right place. Clear graphics, colorful diagrams, meaningful graphs - the Reporting app offers all of this.

Functions: Creation of dashboards, graphs, charts, indication of deviations, correlations, time-period comparisons

Currently in use, e.g., for monitoring production processes
Your advantages with DaSense
tested
Organization
- any order Dimensions
DaSense offers multidimensional storage structures, so-called facets, which can be combined and filtered as desired. There are also clear annotations for documents and clear versioning.
​
Features:
- Property facets: Multidimensional filing structure based on document properties such as language, document type, etc.
- Workflow facets: Multidimensional storage structure according to processing status, evaluation, etc.
- Annotations: Linking properties to individual parts of the document, i.e. sentences, sections or images

Advantages:
- Supplementing the existing folder structure with practically relevant categories
- Illustration of complex relationships
- Linking multiple facets
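
The idea of combining and filtering facets can be sketched in a few lines of Python. This is only an illustration of the concept, not DaSense's data model or API; the `Document` class, the facet names and the `filter_by_facets` helper are assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    name: str
    # Facet name -> value, e.g. property facets ("language") and
    # workflow facets ("status") side by side.
    facets: dict = field(default_factory=dict)


def filter_by_facets(documents: list, **wanted) -> list:
    """Return the documents whose facets match every requested value."""
    return [
        doc for doc in documents
        if all(doc.facets.get(key) == value for key, value in wanted.items())
    ]


docs = [
    Document("contract_01.pdf", {"language": "de", "type": "contract", "status": "reviewed"}),
    Document("contract_02.pdf", {"language": "en", "type": "contract", "status": "open"}),
    Document("invoice_07.pdf", {"language": "de", "type": "invoice", "status": "reviewed"}),
]

# Combine a property facet with a workflow facet.
german_reviewed = filter_by_facets(docs, language="de", status="reviewed")
```

Unlike a folder tree, where each document sits in exactly one place, the same document can be reached through any combination of facets.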