When working with data sets, practitioners sometimes encounter metrics with physical limits, such as out-of-roundness and loss-of-moisture measurements. In these scenarios, the data distribution is truncated at the physical limit, so it no longer meets the criteria for a normally distributed population. With non-normal data, estimates and predictions based on the normal distribution are not accurate, which creates the need for alternative methods of analysis.
Standard Methods
Typically, when data does not fit the normal distribution but the prediction or estimation calculations assume normality, the data is transformed and the transformed data is assessed for normality. If the transformed data fits the normal distribution, the calculations are performed on the transformed data using specification limits transformed in the same way. Alternatively, if another distribution is found that fits the non-normal data better, the capability of the process can be calculated using that distribution. However, if no alternative distribution fits the data and the data cannot be transformed into a normally distributed data set, other methods of analysis are necessary.
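The transformation route can be sketched as follows. This is a minimal illustration, not a full procedure: it assumes a fixed Box–Cox parameter (in practice the parameter is estimated from the data and the transformed data is then tested for normality, neither of which is shown). The helper names, the sample values and the upper limit of 4.0 are hypothetical. The one-sided index used here, (USL - mean) / (3 * sigma), is the standard upper-side component of Cpk.

```python
import math
import statistics


def box_cox(x, lam):
    """Box-Cox power transform of a strictly positive value for a fixed lambda."""
    if x <= 0:
        raise ValueError("Box-Cox requires strictly positive values")
    if lam == 0:
        return math.log(x)
    return (x ** lam - 1.0) / lam


def transformed_cpu(data, usl, lam):
    """Upper one-sided capability index computed on the transformed scale."""
    t_data = [box_cox(x, lam) for x in data]
    t_usl = box_cox(usl, lam)  # the specification limit is transformed too
    mean = statistics.mean(t_data)
    sigma = statistics.stdev(t_data)
    return (t_usl - mean) / (3.0 * sigma)


# Hypothetical measurements and upper specification limit, for illustration only.
sample = [0.8, 1.1, 1.3, 0.9, 1.6, 2.1, 1.0, 1.2, 1.4, 0.7]
cpu_log = transformed_cpu(sample, usl=4.0, lam=0)  # lam=0 is the log transform
```

The key point is that the data and the specification limit go through the same transform before the capability index is computed.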
Alternative Method
Because truncated data sets have their point of central tendency at a physical limit, common transformation methods such as Box–Cox and Johnson are often not sufficient. The following method is a practical way to obtain a realistic estimate of the population's standard deviation for the normal distribution. It also avoids violating the assumption of normality when the Cpk calculation based on the normal distribution is used. With this correction, practitioners can predict the spread of the data and assess capability in the direction of the upper specification limit. Before using the correction, however, practitioners must verify that the sample is large enough to approximate the normal distribution.
Empirical results from both theoretical and production data support the theory that the standard deviation of physically limited data can be estimated by proceeding as if the data were not truncated. Theoretically, this means extending the data beyond the physical limit of the measurement.
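One possible reading of "extending the data beyond the physical limit" is to mirror each observation about the limit and take the standard deviation of the resulting symmetric data set. The sketch below follows that reading, which is an assumption and not necessarily the exact procedure behind the article's results. The function name `sigma_from_mirrored` and the simulated data are hypothetical.

```python
import math
import random


def sigma_from_mirrored(data, limit):
    """Standard deviation of the data mirrored about a physical limit.

    Mirroring each x to 2*limit - x gives a symmetric set centered on the
    limit, so its population variance is the mean squared distance of the
    original observations from the limit.
    """
    if not data:
        raise ValueError("data must not be empty")
    return math.sqrt(sum((x - limit) ** 2 for x in data) / len(data))


# Simulated one-sided data: a normal with sigma = 2.0 truncated at its center (0.0).
rng = random.Random(1)
one_sided = [abs(rng.gauss(0.0, 2.0)) for _ in range(20_000)]
estimate = sigma_from_mirrored(one_sided, limit=0.0)
```

With this simulated input, the estimate lands close to the underlying value of 2.0, whereas the ordinary standard deviation of the one-sided data would be noticeably smaller.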
The empirical evidence provides a ratio, or correction factor, between the standard deviation of the truncated distribution and that of the theoretical normal distribution. The equation is:
Here, the truncated-distribution term is the standard deviation calculated from the physically limited data set, in which one side of the data is truncated.
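As an independent sanity check on the idea of such a ratio, the idealized case can be simulated: a normal distribution truncated exactly at its center, which is a half-normal distribution. For that case, the textbook ratio of the truncated standard deviation to the underlying one is sqrt(1 - 2/pi), roughly 0.603. This is a theoretical value for the idealized case only and may differ from the article's empirically derived factor. The function name and parameters below are hypothetical.

```python
import math
import random
import statistics


def truncated_to_normal_ratio(sigma=1.0, n=50_000, seed=0):
    """Ratio of the SD of one-sided (half-normal) data to the underlying sigma."""
    rng = random.Random(seed)
    # Folding a zero-mean normal at 0 gives the same distribution as
    # keeping only the values on one side of the center.
    truncated = [abs(rng.gauss(0.0, sigma)) for _ in range(n)]
    return statistics.pstdev(truncated) / sigma


simulated = truncated_to_normal_ratio()
theoretical = math.sqrt(1.0 - 2.0 / math.pi)  # half-normal SD divided by sigma
```

Dividing a standard deviation computed from one-sided data by a ratio of this kind recovers an estimate of the spread the data would have had without the physical limit.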