Free Online Articles Directory
18.11.2008 Sign In Register Hello Guest
Email:
Password:
Remember Me 
forgot your password?


Data Mining Phases/process

Author: Mandeep Singh Author Ranking Blue | Posted: 24-08-2008 | Comments: 0 | Views: 109 | Rating:  (124) Article Popularity - Blue (?) Got a Question? Ask.
Sign Up Now!

 

Data Mining is process of finding meaningful information from huge volume of data. Main objective of data mining is to find hidden trends in the data. It may be customers purchasing behaviour, Sale trends, finding new cross selling opportunities and many more. Data mining is step by step process starts with business understanding and ends with possible solution of the problem in hand. CRISP-DM is widely known data mining industry standard and used by most of organization that provides data mining services. There are six stages in CRISP-DM.


  1. Business Understanding

  2. Data Preparation

  3. Data Understanding

  4. Modeling

  5. Model Evaluation

  6. Deployment 

 

These step are interactive (interactive diagram @   http://www.vbams.com/VBAMS/Datamining.html), that mean at any stage you can come back to previous stage. For example at the modeling stage you are still not sure about trends in data, at that point you can go back to data understanding phase to properly understand the trends in the data.

 

Business Understanding: This is the first phase in CRISP-DM Model and includes

-       Understand necessary business process

-       Understand the problem

-       Plan how to solve the problem while considering resources in hand.

-       Define objectives and goals what you are going to achieve.

 

Data Understanding: This is second phase in CRISP-DM model and includes

-          Go through historical data

-          Try to relate data with each other

-          Find hidden trends in data (not too deep)

 

Data Preparation: This is third phase in CRISP-DM model and includes

-          In “Data Understanding” phase we go through historical data and collect that we need (Data Sampling).

-          In this stage, format the data into desired form.

-          Handle missing and noisy data (In next article I will discuss in detail about how to handle noisy and missing data, so don’t miss that one.)

 

Modeling: Fourth phase of CRISP-DM model

-          Develop model for future prediction

-          Try different modeling technique.

-          Try different parameters to improve the results

-          Pick those models that look appropriate at this phase and evaluate them in next phase.

 

Model Evaluation:  Fifth phase of CRISP-DM model

-          Important stage in CRISP-DM model

-          Model need to evaluated in terms of response time, confidence level, cost, error rate and many other

-          Determine how this model is helpful in achieving objectives and goals defined in the first stage.

 

Deployment: Final phase of CRISP-DM model

-          Create reports, so that end user can easily use this model to improve the business performance.

 

Special thanks to the team of CRISP-DM org who are making efforts in building CRISP-DM 2.0 for the industry.

 Don't forgot to check next article

"How to Handle Missing Data"

Mandeep Singh

www.mandeepvba.blogspot.com

 

Rate this Article: Current: 0 / 5 stars - 0 vote(s).

Article Source: http://www.articlesbase.com/project-management-articles/data-mining-phasesprocess-534269.html

Print this Article Print article   Email to a Friend Send to friend   Publish this Article on your Website Publish this Article   Send Author Feedback Author feedback  
About the Author:

Mandeep Singh
www.vbams.com
www.mandeepvba.blogspot.com

Submitting articles has become one of the most popular means to drive traffic to your website and promote yourself and your business. Join us today - It's Free!

Article Comments

Comment on this article Comment on this article
Your Name
Your Email:
Comment Body
Enter Validation Code: Captcha


Related Articles

The New 50/50 Rule
By: Hazen Martin | 30/10/2008 | Entrepreneurship
We need to change the way we look at ourselves as business owners.

VoIP and Your Business
By: Laura Rucker | 14/11/2008 | VoIP
If I were a business owner, I would still be reviewing the capabilities and functions of many VoIP systems, since every business is going to have different needs and preferences. While VoIP is not a one-size-fits-all telecom product, hopefully businesses will not ask themselves "Should I use VoIP?", but instead ask, "Which type of VoIP should I use?"

The Many Forms of Insight Within an Enterprise
By: R. L. Fielding | 13/06/2008 | Management
No matter what role an employee plays within your organization, they are all able to generate insights into the changing nature of your marketplace. No matter what industry you are in, it is important to constantly evaluate your competitive positioning.

10 Things You Should Know About VoIP (voice Over Internet Protocol)
By: Laura Rucker | 14/11/2008 | VoIP
Before rolling out VoIP for your business, read this article about the 10 things you should know about this system. These tips can save you time, money and headaches, not to mention allow for a smoother transition. VoIP technology is a great system, but knows your facts before taking the plunge and deploying it in your office.

Data mining guide
By: Mansi Gupta | 09/05/2006 | Software
Data mining is also known as Knowledge Discovery in Databases (KDD). Data mining is the process of automatically searching large volumes of data for patterns. Data is derived from the word datum, being its plural term.

Increasing your Revenue by 50-85% Will not Cost More Money, Time, or Energy Than you are Already Investing
By: Vivek Kr Bhojnagarwala | 09/08/2007 | Outsourcing
Increasing your revenue by 50-85% will not cost more money, time, or energy than you are already investing. In fact, with the right tools, you can maximize your return with LESS time, LESS effort, and a lot LESS money. Contact one of our campaign managers at The Global Associates now and find out how to start outsourcing!

Multi-core Technology Will Have a Disruptive Impact on Your Business
By: Jose Allan Tan | 02/02/2008 | Hardware
"Typical 4-way servers will handle jobs that previously required midrange or high-end symmetric multi-processing systems. As a result, we will move complex business processes, databases, data mining, and inquiry and data intelligence applications onto industry-standard multi-core servers." -- Vernon Turner, vice president and general manager of Enterprise Computing, IDC

Poker Calculator Report: Data Mining Prevention by Poker Sites or What to Do About Wrecklessjoe55
By: Marty Smith | 12/02/2008 | Online Gambling
The disdain poker sites have for these types of software is that you have never played with WrecklessJoe55 and you shouldn’t know that information until YOU have ascertained it, not someone else. Yes, just like a regular live poker room. The Poker Stars security staff basically once told me that that is the guideline with which they want to emulate and all security policy emanates from that thinking.

Got a Question? Ask.

Ask the community a question about this article:

Frequently Asked Questions

When are they comin home
By: stmlewis | 09-10-2008
when does 3rd battalion 6th marines come home

Essbase 6.5 custom code for external authentication
By: chan | 23-08-2008
I have creeated a DLL from a custom C code using Visual Studio. I want to replace the exisitng DLL in Essbase 6.5 on window NT with this new DLL. How do i do this. The code is for external authentication.

Im going to ntc traing in november. Do this mean I ...
By: Yung Misery | 23-06-2008
Im going to ntc traing in november. Do this mean I'm about to deploy  

What are the requirements to setup a small ...
By: jyotivinay | 20-04-2008
what are the requirements to setup a small intranet?

Intuition vs Instinct
By: enyaple | 15-11-2007
What specific characteristics distinguish intuition from instinct? Are they interdependent, or are they each complete alone?

Data mining in service of the humanities
By: MOSHEL | 25-03-2007
I would like to get some information on applications of data mining in studies of classical texts.

Q&A Powered by:
Powered by Yedda 

Latest Project Management Articles

Project Management – the Career for You?
By: projectmanuk | 17/11/2008
Project Management is an increasingly popular career choice for young graduates. According to statistics produced by the Middlesex University National Centre for Project Management, 1.5 – 2 million people earn their living as Project Managers in the UK alone.

Microsoft Project Training - Where Do I Start?
By: Steve Twine | 10/11/2008
Microsoft Project is the world's most popular project management software - but even regular users would often benefit from a better understanding of its full capabilities. Moreover new reporting and communications features in the 2007 release are well worth understanding.

Automation Infatuation-mobile Work Place Automation Makes Chemical Plant Operations More Reliable and Efficient
By: Brady Moritz | 06/11/2008
Throughout the chemical process industry, it’s difficult to think about plant monitoring without visualizing personnel on daily rounds laboriously filling out paper log sheets. Besides being a manual-intensive routine, this traditional monitoring approach primarily focuses only on documentation.

Energy Independence for 800 Year Old Mill on Remote Italian Island
By: Beth Shady | 30/10/2008
Pacific Solar Radiant, Inc. a Santa Cruz based design/build mechanical engineering, plumbing and heating company traveled to a remote Italian island to make an 800 year old mill completely sustainable and off-grid.

How to Build a Library
By: Samuel Bryant | 29/10/2008
When identifying a need for a library a person must identify the needs to the organization or community.

Dust Collectors
By: Oleg Chetchel | 28/10/2008
The use of centrifugal force to throw a dust particle to the periphery of an air stream has been used in the cyclone collector for many years. Dry centrifugal dust collectors can be divided into two basic groups cataloged by their effectiveness in removal of smaller dust particles.

Pacific Timesheet Announces New Iphone Support
By: Jason Trend | 25/10/2008
Pacific Timesheet announces its leading project timesheet software is the first to support the Apple iPhone.

How You Can Make Huge Amount of Money Over the Internet
By: Eddiee | 21/10/2008
It would also help if you can identify their buying power (to see if they can afford your products), their online behavior (to easily make your marketing strategies more focused), and their preferences. Check on your competitors. You can stay on top of the game if you know the strengths and weaknesses of those people that you are up against. Identify the elements that they are using in creating and selling their products online and top them by exerting more efforts and energy on your product cr

More from Mandeep Singh

Need of Sampling and Sampling Methods
By: Mandeep Singh | 24/09/2008 | Marketing Tips
In the area of Marketing Research Sampling is very important topic. If our initial steps are not correct we can never get those results that we expect from any marketing campaign. So in this article we will discuss about need of sampling and sampling methods.

How to Handle Missing Values
By: Mandeep Singh | 27/08/2008 | Information Technology
To analyse the situation first thing you need is historical data (relevant to the problem). Once you have data, its need to be in proper format so it can be easily analyse. There may be number of problems you have to face while bringing the data into proper format. One problem is how to handle missing or inconsistent data. In this article we go through few basic techniques that used to solve this problem.

Traditional Face to Face Learning and Global/online Learning & Supporting Tools
By: Mandeep Singh | 22/08/2008 | Online Education
Global learning offers many advantage to students but it can not eliminate the traditional way of teaching where face of face workshops are very important in some fields. In this article we will deeply discuss about traditional face to face learning, global/online learning and some tools that support the global learning. Valuable references are given in the end of this article, which can be helpful in your research.

Article Categories





Give Feedback

Sign up for our email newsletter

Receive updates, enter your email below