Teklia has developed for its clients automatic document understanding systems based on Machine Learning and Deep Learning in a wide range of application domains.

an

For the French National Archives, We have developed a transcription platform for handwritten indexes processing, assisted by automatic handwriting recognition and entity extraction. Automatic processing of 800,000 records and 100,000 pages of registers.

Handwriting Entities
nationallibrarynorway

Within the framework of the collaborative research project Hugin-Munin funded by the Research Council of Norway, TEKLIA is developing adaptive techniques for the recognition of handwritten documents in Norwegian.

Handwriting
uqac

For the Balsac project, TEKLIA has performed document structure analysis, handwritten text recognition and personal information extraction in 2.7 million parish record pages from Quebec between 1880 and 1920.

Handwriting Entities Acts
cicr_logo

Automatic structure analysis of tables, printed and handwritten text recognition, validation with crowdsourcing using Callico.

OCR Meta-data Table analysis
bsg

OCR and extraction of meta-data from 500,000 printed index cards.

OCR Meta-data
bis

OCR improvement for the project Parlementary archives of the French revolution.

OCR
lexisnexis

Data extraction, classification and summarization of case law decisions.

nationaalarchief

For the National Archives of the Nederlands, TEKLIA has developed page classification models for processing documents from the archives of the Ministry of Colonies from 1814 to 1849.

Classification Handwriting
irhtcnrs

TEKLIA is collaborating with IRHT since several years to develop solutions for the processing of medieval handwritten documents, within the framework of the HORAE and HOME projects.

Handwriting Entities Acts
ephe

TEKLIA contributes to the development of new features for the eScriptorium project: development of a search engine, setting up user quotas, tracking Machine Learning tasks, etc. TEKLIA also provides system administration for several major eScriptorium instances.

mozilla

Development of the Fuzzing platform (for automated vulnerability detection) and of a tool for automatic classification of patch sets for the Firefox software, to reduce CI costs.

necker

TEKLIA collaborates with Hopital Necker-Enfants Malades in the Macadamia project to develop a plateform for named entity recognition and numerical information extraction from medical records.

navalgroup

Evaluation of document search engines and technologies for automatic summarisation and classification of documents based on contextual embeddings. Performance analysis of search engines (ElasticSearch, OpenSearch).

soge

Normalization and organization of internal control procedures. Text distance, concept extraction, word embeddings.

loreal

Unstructured and textual information extraction from product test reports using machine learning and OCR. Training of named-entity extraction models, incremental and active learning.

CNPassurances

Automatic prediction of priority levels from complaint mail through semantic analysis. Text recognition, topic detection, document classification.

tnp

Clustering of IT tickets by topics, automatic tickets classification and triaging through density-based spatial clustering, keyword extraction and classification.

numen

Automatic extraction of financial information in scanned invoices.

cnp

Automatic redaction of confidential information in fiscal forms. Automatic processing (document clustering, document classification) of a 2.5 million archive documents.

mctrct

Automatic extraction of construction rules from local urban planning documents (Plan locaux d'urbanisme).