Sunday, November 16, 2014

Unstuctured Data

Unstructured data (or unstructured information) refers to information that either does not have a pre-defined data model or is not organized in a pre-defined manner. Unstructured information is typically text-heavy, but may contain data such as dates, numbers, and facts as well. This results in irregularities and ambiguities that make it difficult to understand using traditional computer programs as compared to data stored in fielded form in databases or annotated (semantically tagged) in documents. - wiki

Dealing with unstructured data: Techniques such as data miningNatural Language Processing(NLP), text analytics, and noisy-text analytics provide different methods to find patterns in, or otherwise interpret, this information.

No comments:

Post a Comment