International Journal on Science and Technology

E-ISSN: 2229-7677     Impact Factor: 9.88

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Data Mining from Unstructured Documents

Author(s) Rajalakshmi Thiruthuraipondi Natarajan
Country United States
Abstract Data mining is the process of identifying and extracting valuable data by scanning large volumes of structured and unstructured data; the extracted data forms the basis for further processing with data analytics tools for cleansing, categorization, and organization. The source data may not fit any particular template and can be in any format, ranging from plain text to media files, so the mining process is responsible for understanding the message, extracting the relevant information, and converting it to a standard format. Before reaching its final stage, the data undergoes several rounds of cleansing to eliminate irrelevant information and select the data set the organization intends to use, with the best possible turnaround time. At each stage of the analysis, the data needs to become cleaner and more distinct, and to give a clearer view of the areas in which it will be used.
This document provides insight into data mining and its potential impact on the market. It explores the various sources and types of data that may be involved, how to cleanse that data, and the ways the resulting information can be used to develop a retail business. It also provides guidance on pattern recognition and on properly compartmentalizing the data so that it is readily available to target groups for research and marketing.
Keywords Data Mining, Data Analytics, Data Cleansing, Data Sorting, Data Compartmentalization, Data Organization, Processing Raw and Unstructured Data, Pattern Recognition
Field Engineering
Published In Volume 14, Issue 3, July-September 2023
Published On 2023-07-12
Cite This Data Mining from Unstructured Documents - Rajalakshmi Thiruthuraipondi Natarajan - IJSAT Volume 14, Issue 3, July-September 2023. DOI 10.5281/zenodo.14631493
DOI https://doi.org/10.5281/zenodo.14631493
Short DOI https://doi.org/g8zdq9