Abstract
A method of constructing a dataset for identifying a plurality of latent concepts in a Natural Language Processing model is provided. The method includes executing a clustering process on a first dataset, preparing a second dataset, defining a hierarchical concept tag-set from the second dataset, and annotating the hierarchical concept tag-set.
Original language | English |
---|---|
Patent number | US2023325426 |
IPC | G06F 16/ 38 A I |
Priority date | 7/04/23 |
Publication status | Published - 12 Oct 2023 |