Normalization and organization of internal control procedures. Text distance, concept extraction, word embeddings.