Free Online Articles Directory
17.11.2008 Sign In Register Hello Guest
Email:
Password:
Remember Me 
forgot your password?


Chi-square Analysis for Attribute Data

Author: Steven Bonacorsi Author Ranking Blue | Posted: 27-09-2007 | Comments: 0 | Views: 2,370 | Rating:  (344) Article Popularity - Bronze (?) Got a Question? Ask.
Sign Up Now!

What Is A Chi-Square Test?

- The probability density curve of a chi-square distribution is asymmetric curve stretching over the positive side of the line and having a long right tail.

- The form of the curve depends on the value of the degrees of freedom.

Types of Chi-Square Analysis:

- Chi-square Test for Association is a (non-parametric, therefore can be used for nominal data) test of statistical significance widely used bivariate tabular association analysis.

- Typically, the hypothesis is whether or not two different populations are different enough in some characteristic or aspect of their behavior based on two random samples.

- This test procedure is also known as the Pearson chi-square test.

- Chi-square Goodness-of-fit Test is used to test if an observed distribution conforms to any particular distribution. Calculation of this goodness of fit test is by comparison of observed data with data expected based on the particular distribution.

When to apply a Chi-Squared Test:

- Chi-Squared test is used to determine if there is a statistically significant difference in the proportions for different groups. To accomplish this, it breaks all outcomes into groups.

What the Chi-Squared Test does:

- It starts by determining how many defects, for example, would be “expected” in each group involved.

- It does this by assuming that all groups have the same defect rate (which Minitab approximates from the data provided).

- Minitab then compares the expected counts with what was actually observed.

- If the numbers are different by a large enough amount, Chi-Square determines that the groups do not have the same proportion.

Chi-Square Requirements:

- Data is typically attribute (discrete). At the very least, all data must be able to be categorized as being in some category or another).

- Expected cell counts should not be low (definitely not less than 1 and preferable not less than 5) as this could lead to a false positive indication that there is a difference when, in fact, none exists.

Chi-Square Hypotheses:

- Ho: The null hypotheses (P-Value > 0.05) means the populations have the same proportions.

- Ha: The alternate hypotheses (P-Value <= 0.05) means the populations do NOT have the same proportions.

Note: if the expected cell counts are below 5, Minitab will print a warning. The warning is generated because of the fact that with the expected count in the denominator, a small value potentially creates an artificially large chi-square statistic. This is particularly troublesome if more than 20% of the cells have expected counts less than 5 and the contribution to the overall chi-square statistic is considerable.

Additionally, if any of the expected cell counts are below 1, Minitab will not even produce a p-value since the chi-square statistic is sure to be artificially inflated. In either of these cases, the binomial distribution (Minitab: Stat/ ANOVA/ Analysis of Means) may be able to be used.

Lastly: Attribute Gage R&R (AR&R) or Kappa Test is needed with an acceptable level of measurement system error prior to running a Chi-Square Analysis

Tips:

- Determine the subgroups and categories to be tested for variation (differences in proportions) as part of your data collection plan.

- Define the operational definitions for success/defect, the stratifications layers (subgroups) and the Cause & Effect diagram (fishbone) to pre-determine where the team believes differences in proportions may exist.

- Continuous (Variable) data can usually be converted into Discrete (Attribute) data by using categories

(Example: cycle time (continuous 1 hr, 1.5 hr, 2 hr) can be categorized into Cycle Time Met = 1 where success is cycle time 8 hrs.)

Tricks

- An (MSA) Attribute R&R (Kappa Analysis) for discrete data or Gage R&R for continuous (variable) data is used prior to calculating the Chi-Square Test to ensure that the measurement variation 10% then the variation you will see in the Chi- Square Test is not valid as too much of the variation seen is coming from your measurement system (10% MSA error) and not your process variation.

Rate this Article: Current: 5 / 5 stars - 3 vote(s).

Article Source: http://www.articlesbase.com/project-management-articles/chisquare-analysis-for-attribute-data-221414.html

Print this Article Print article   Email to a Friend Send to friend   Publish this Article on your Website Publish this Article   Send Author Feedback Author feedback  
Steven BonacorsiAbout the Author:

Steven Bonacorsi is a Senior Master Black Belt instructor and coach. Steven Bonacorsi has trained hundreds of Master Black Belts, Black Belts, Green Belts, and Project Sponsors and Excutive Leaders in Lean Six Sigma DMAIC and Design for Lean Six Sigma process improvement methodologies.

Submitting articles has become one of the most popular means to drive traffic to your website and promote yourself and your business. Join us today - It's Free!

Article Comments

Comment on this article Comment on this article
Your Name
Your Email:
Comment Body
Enter Validation Code: Captcha


Got a Question? Ask.

Ask the community a question about this article:

Frequently Asked Questions

Under d2 how do you get 16 O ...
By: Bonnie | 05-10-2008
Under d2 how do you get 16                   O        E       diff.     d2          Partial chi-square(d2/e) Short legs  44        40       4        16         16/40=0.400 Long legs    116     120      4        16         16/120= 0.133  

How to take the squared (O-E) and divide it by the ...
By: thazel | 12-03-2008
How to take the squared (O-E) and divide it by the expected frequency.

More info
By: sbonacorsi | 03-10-2007
For a full version of this topic, send me an e-mail at sbonacorsi@comcast.net

Q&A Powered by:
Powered by Yedda 

Latest Project Management Articles

Project Management – the Career for You?
By: projectmanuk | 17/11/2008
Project Management is an increasingly popular career choice for young graduates. According to statistics produced by the Middlesex University National Centre for Project Management, 1.5 – 2 million people earn their living as Project Managers in the UK alone.

Microsoft Project Training - Where Do I Start?
By: Steve Twine | 10/11/2008
Microsoft Project is the world's most popular project management software - but even regular users would often benefit from a better understanding of its full capabilities. Moreover new reporting and communications features in the 2007 release are well worth understanding.

Automation Infatuation-mobile Work Place Automation Makes Chemical Plant Operations More Reliable and Efficient
By: Brady Moritz | 06/11/2008
Throughout the chemical process industry, it’s difficult to think about plant monitoring without visualizing personnel on daily rounds laboriously filling out paper log sheets. Besides being a manual-intensive routine, this traditional monitoring approach primarily focuses only on documentation.

Energy Independence for 800 Year Old Mill on Remote Italian Island
By: Beth Shady | 30/10/2008
Pacific Solar Radiant, Inc. a Santa Cruz based design/build mechanical engineering, plumbing and heating company traveled to a remote Italian island to make an 800 year old mill completely sustainable and off-grid.

How to Build a Library
By: Samuel Bryant | 29/10/2008
When identifying a need for a library a person must identify the needs to the organization or community.

Dust Collectors
By: Oleg Chetchel | 28/10/2008
The use of centrifugal force to throw a dust particle to the periphery of an air stream has been used in the cyclone collector for many years. Dry centrifugal dust collectors can be divided into two basic groups cataloged by their effectiveness in removal of smaller dust particles.

Pacific Timesheet Announces New Iphone Support
By: Jason Trend | 25/10/2008
Pacific Timesheet announces its leading project timesheet software is the first to support the Apple iPhone.

How You Can Make Huge Amount of Money Over the Internet
By: Eddiee | 21/10/2008
It would also help if you can identify their buying power (to see if they can afford your products), their online behavior (to easily make your marketing strategies more focused), and their preferences. Check on your competitors. You can stay on top of the game if you know the strengths and weaknesses of those people that you are up against. Identify the elements that they are using in creating and selling their products online and top them by exerting more efforts and energy on your product cr

More from Steven Bonacorsi

Mistake Proofing
By: Steven Bonacorsi | 02/10/2008 | Management
Mistake proofing is a technique for eliminating errors. It is based upon the premise that it is good to do something right the first time; it is even better to make it impossible to do it wrong the first time. The idea is to make it impossible to make a mistake. You may also hear the term, Poka-Yoke or Error Proofing applied to mistake proofing.

Cause and Effect Diagrams (fishbone Diagrams)
By: Steven Bonacorsi | 25/04/2008 | Management
The first such cause-and-effect diagram was used by Kaoru Ishikawa in 1943 to explain to a group of engineers at the Kawasaki Steel Works how various work factors could be sorted and related. In recognition of this, these diagrams sometimes are called Ishikawa diagrams. They are also called fishbone diagrams, because they look something like fish skeletons.

Critical Path Mapping
By: Steven Bonacorsi | 24/04/2008 | Project Management
The activity network diagram has had a relatively long history, dating back to the 1930s. In the 1950s, the technique emerged as the Program Evaluation Research Technique (PERT) and as the Critical Path Method (CPM). There are several ways to represent the output of the PERT/CPM process.

Box Plots
By: Steven Bonacorsi | 21/04/2008 | Management
Box-and-whisker diagrams, or Box Plots, use the concept of breaking a data set into fourths, or quartiles, to create a display. The box part of the diagram is based on the middle (the second and third quartiles) of the data set. The whiskers are lines that extend from either side of the box. The maximum length of the whiskers is calculated based on the length of the box. The actual length of each whisker is determined after considering the data points in the first and the fourth quartiles.

Dot Plots
By: Steven Bonacorsi | 20/03/2008 | Management
A dot plot graphically records variable data in such a way that it forms a picture of the combined effect of the random variation inherent in a process and the influence of any special causes acting on it. To understand the power of dot plots as a basic tool, it first helps to visualize how variation occurs.

Run Charts
By: Steven Bonacorsi | 13/03/2008 | Project Management
Run charts can be very valuable in helping your search for sources of variation. They are easy to plot and easy to interpret. The sampling is uncomplicated, and there are no statistical computations to make. They can also be applied to almost any process or any data.

Scatter Diagrams
By: Steven Bonacorsi | 10/03/2008 | Training
A scatter diagram shows the correlation between two variables in a process. These variables could be a Critical-To-Quality (CTQ) characteristic and a factor affecting it, two factors affecting a CTQ or two related quality characteristics. Dots representing data points are scattered on the diagram. The extent to which the dots cluster together in a line across the diagram shows the strength with which the two factors are related.

Histograms
By: Steven Bonacorsi | 09/03/2008 | Management
A histogram is a tool that allows you to understand at a glance the variation that exists in a process.

Article Categories





Give Feedback

Sign up for our email newsletter

Receive updates, enter your email below