International Journal on Science and Technology
E-ISSN: 2229-7677
•
Impact Factor: 9.88
A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal
Home
Research Paper
Submit Research Paper
Publication Guidelines
Publication Charges
Upload Documents
Track Status / Pay Fees / Download Publication Certi.
Editors & Reviewers
View All
Join as a Reviewer
Reviewer Referral Program
Get Membership Certificate
Current Issue
Publication Archive
Conference
Contact Us
Plagiarism is checked by the leading plagiarism checker
Call for Paper
Volume 16 Issue 1
2025
Indexing Partners
Data Mining from Unstructured Documents
Author(s) | Rajalakshmi Thiruthuraipondi Natarajan |
---|---|
Country | United States |
Abstract | Data Mining is the process of identifying and extracting valuable data by scanning through large volumes of structured and unstructured data, which would form the base for further processing using data analytics tools for cleansing, categorization and organization, etc. This source data might not fit to a certain template and can be of any format ranging from plan test to media files and it is the responsibility of the mining process to understand the message, extract relevant information and finally convert to a standard format. Prior to its final stage, these data undergo several rounds to cleansing to eliminate irrelevant information and pick the right set of data intended by the organization with the best turnaround time possible. At each stage of the analysis, the data needs to gets cleaner and distinctive and provide a vision as to the areas it will be used. This document provides insight on data mining and its potential impact in market. This explores the various sources and the type of data that might be associated with it and how to cleanse and various ways the information can be used for the development of a retail business. This also provides guidance on the patten recognition and the proper compartmentalization of the data so that it is readily available to the target groups for research and marketing |
Keywords | Data Mining, Data Analytics, Data Cleansing, Data sorting, Data Compartmentalization, Data Organization, Processing Raw and Unstructured Data, Pattern Recognition |
Field | Engineering |
Published In | Volume 14, Issue 3, July-September 2023 |
Published On | 2023-07-12 |
Cite This | Data Mining from Unstructured Documents - Rajalakshmi Thiruthuraipondi Natarajan - IJSAT Volume 14, Issue 3, July-September 2023. DOI 10.5281/zenodo.14631493 |
DOI | https://doi.org/10.5281/zenodo.14631493 |
Short DOI | https://doi.org/g8zdq9 |
Share this
doi
CrossRef DOI is assigned to each research paper published in our journal.
IJSAT DOI prefix is
10.71097/IJSAT
Downloads
All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.