Unstructured and textual information extraction from product test reports using machine learning and OCR. Training of named-entity extraction models, incremental and active learning.