Unstructured data (or unstructured information) refers to information that either does not have a pre-defined data model or is not organized in a pre-defined manner. Unstructured information is typically text-heavy, but may contain data such as dates, numbers, and facts as well. This results in irregularities and ambiguities that make it difficult to understand using traditional computer programs as compared to data stored in fielded form in databases or annotated (semantically tagged) in documents. (Source: Wikipedia)
Dealing with unstructured data: Techniques such as data mining, natural language processing (NLP), text analytics, and noisy-text analytics provide different ways to find patterns in, or otherwise interpret, this information.
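As a small illustration of finding patterns in text-heavy data, here is a minimal Python sketch that pulls dates and numbers out of a free-form sentence using regular expressions. The sample sentence, the `extract_fields` helper, and the assumption that dates appear in `YYYY-MM-DD` form are all made up for this example; real text analytics or NLP pipelines handle far messier input.

```python
import re

# Hypothetical sample text, invented for this example.
SAMPLE = "Invoice 1042 was sent on 2021-03-15 and paid on 2021-04-02 for 250.75 USD."

# Assumption: dates are written as YYYY-MM-DD.
DATE_RE = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")
# Integers or simple decimals such as 1042 or 250.75.
NUMBER_RE = re.compile(r"\b\d+(?:\.\d+)?\b")


def extract_fields(text):
    """Return the dates and standalone numbers found in free-form text."""
    dates = DATE_RE.findall(text)
    # Blank out the dates first so their digits are not counted as numbers.
    remainder = DATE_RE.sub(" ", text)
    numbers = [float(n) for n in NUMBER_RE.findall(remainder)]
    return {"dates": dates, "numbers": numbers}


if __name__ == "__main__":
    print(extract_fields(SAMPLE))
```

Regular expressions only work when the pattern is predictable. For irregular or ambiguous text, the techniques listed above (data mining, NLP, text analytics) are usually the better fit.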