|
||
|
Viewing By Month : May 2008 / Main
May 25, 2008P values. One-tail or two-tail ? When comparing two groups, you must distinguish between one- and two-tail P values. Some books refer to one-sided and two-sided P values, which mean the same thing. What does one-sided mean? Assuming the null hypothesis is true, what is the chance that randomly selected samples would have means as far apart as (or further than) you observed in this experiment with either group having the larger mean?
Assuming the null hypothesis is true, what is the chance that randomly selected samples would have means as far apart as (or further than) observed in this experiment with the specified group having the larger mean?
When is it appropriate to use a one-sided P value?
Here is an example in which you might appropriately choose a one-tailed P value: You are testing whether a new antibiotic impairs renal function, as measured by serum creatinine. Many antibiotics poison kidney cells, resulting in reduced glomerular filtration and increased serum creatinine. As far as I know, no antibiotic is known to decrease serum creatinine, and it is hard to imagine a mechanism by which an antibiotic would increase the glomerular filtration rate. Before collecting any data, you can state that there are two possibilities: Either the drug will not change the mean serum creatinine of the population, or it will increase the mean serum creatinine in the population. You consider it impossible that the drug will truly decrease mean serum creatinine of the population and plan to attribute any observed decrease to random sampling. Accordingly, it makes sense to calculate a one-tailed P value. In this example, a two-tailed P value tests the null hypothesis that the drug does not alter the creatinine level; a one-tailed P value tests the null hypothesis that the drug does not increase the creatinine level. Recommendation
Common misunderstandings about P values. Kline (see book listing below) lists commonly believed fallacies about P values, which I summarize here: Reference: RB Kline, Beyond Significance Testing: Reforming Data Analysis Methods in Behavioral Research, 2004.
The Mann-Whitney test doesn't really compare medians. You'll sometimes read that the Mann-Whitney test compares the medians of two groups. But this is not precisely correct. Consider this example:
The graph shows each value obtained from control and treated subjects. The two-tail P value from the Mann-Whitney test is 0.0288, so you conclude that there is a statistically significant difference between the groups. But the two medians, shown by the horizontal lines, are identical. The Mann-Whitney test compared the distributions of ranks, which is quite different in the two groups even though the medians are the same. It is not correct, however, to say that the Mann-Whitney test asks whether the two groups come from populations with different distributions. The two groups in the graph below clearly come from different distributions, but the P value from the Mann-Whitney test is high (0.46).
The Mann-Whitney test compares sums of ranks -- it does not compare medians and does not compare distributions. The Mann-Whiteny test is a comparison of medians only when you assume that the distributions of the two populations have the same shape, even if they are shifted (have different medians). If you accept this assumption, then a small P value from a Mann-Whitney test leads you to conclude that the difference between medians is statistically signficant. More generally, the P value answers this question: What is the chance that a randomly selected value from the population with the larger median is greater than than a randomly selected value from the other population?
May 20, 2008Before-after graphs with different colors for different subjects. When you enter data on a column table and choose a before-after graph, Prism plots all the symbols the same way. You can choose different colors or shapes for "before" than for "after" (which is not helpful). And you can right click on each symbol and change its color (and that of the connecting line). But this approach would be very tedious. . Prism 5 can, in fact, create before-after graphs with multiple colors for different subjects. The trick is to enter the data enter the data on a Grouped table. Follow these steps or examine this Prism file. 1. Create a Grouped table.
Choose the appropriate number of "replicates" (subjects) for your data. Be sure to choose to plot each replicate, and to connect each replicate. 2. Enter the data.
Note that the arrangement of data is different than with a column table. The before-after pairs are stacked into subcolumns. This table has two rows, because it plots just before and after. If you had more time points, add more rows. This table has two data sets, male and female, because we want symbols with two different appearances. Use as many data sets as you want. If you want each subject to have its own appearance, create a table with no subcolumns, and enter each subject into its own data set. 3. Polish the graph.
Also see this related example for creating column scatter graphs with multiple symbol colors.
How to turn off automatic snapping. When you move a text object in Prism 5, it will automatically snap into alignment with bars on graphs, groups of bars, the center of the page, and other text objects. This almost always is a great feature, as it lets you quickly move text to an appropriate spot. But sometimes, you may find that the automatic snapping prevents you from fine-tuning a graph.
The Standard Addition Method for determing concentrations. Prism can easily interpolate from a linear or nonlinear standard curve. You perform the assay at a number of known concentrations, fit a line or curve, and interpolate the uknown values. But there is a problem with interpolating from a standard curve. The results can be incorrect when the unknown sample are contaminated with other substances that alter the assay. This is known as the 'matrix effect problem'. The Standard Addition Method is a way to bypass this problem. You don't need to perform the assay with known concentrations of substance. Instead you add various known concentrations (including zero) of known substance to a constant amount of the unknown. This ensures that all the samples have the same amount of unknown, including any substances that interfere with the assay. Fit the data with linear regression. The value you want to know is how much of the known substance has to be added to double the signal. There is an easier, somewhat trickier, way to find out: Extrapolate the line down to Y=0. One of the parameters that Prism reports is the X intercept, which will be negative. Take the absolute value, and that is the concentration of the unknown substance. The confidence inerval for the X intercept gives you the confidence interval for the concentration of the uknown. Simply multiply both confidence limits by -1. To plot the data in Prism, you'll want to extend the linear regression line to start at an X value equal to the X intercept (a choice in the Linear regression parameters dialog). You may also want to move the origin to the lower left, a choice on the first tab of the Format Axis dialog. Here is a graph created with this Prism file.
|
||