Frequently Asked Questions
The FAQs below will be divided into categories including those above. Some of the FAQs have a few mathematical formulae which may be seen using mathplayer which may be downloaded for free from here. Computer code is featured in some of the FAQs and is highlighted in a (grey coloured) text box. This code can be copied directly to your personal PC file (e.g. into a SPSS syntax file) however you also need to copy any small piece of text immediately outside the text box when you copy the code in order to copy over the linebreaks or else you will find the text is copied over as a single line!
If you wish to search for a particular word and are using Internet Explorer just press Control-f and a 'find' slot will appear above the page in your browser. Also please note that macros in the EXCEL spreadsheets will not work (by default) unless they are enabled.
How many observations do I need to make z-scoring (assuming Normality) feasible?
Are there any on-line statistical tables for commonly used distributions?
What is the relationship between the z, t, chi-square and F distributions?
What is the probability of a repetition in a sequence of size k taken from K stimuli?
How do I evaluate Multinomial probabilities of pooled trial frequencies of locations of peaks which can occur in one of K positions in each trial?
How do I fit a Weibull distribution to scores over time using MATLAB and notes on using non-linear least squares?
Why does SPSS give a row of asterisks instead of a mean in the outputted pivot table?
What does a number containing an 'E' signify and how do I remove it?
What is the relationship between the Pearson zero-order correlation and a simple regression estimate?
What is the relationship between predictor variable correlations and the presence of a suppressor variable?
How do I check for outliers in a simple regression with one predictor variable?
How do I explain adding 0.5 to the cells for the odds ratio, d' (dprime) or logit transform?
Can I use an ordinary linear regression instead of logistic regression to test inference about proportions?
What is heterogeneity of variance in SPSS Probit and Logit (and Poisson) Regressions?
What is overdispersion in handling proportions or count data and how do I handle it?
How do I perform tests of Marginal Homogeneity between two raters measuring the same items?
When should I use a logit analysis as opposed to an arcsine transformed ANOVA?
Why don't I see fiducial confidence intervals doing a probit analysis in version 16 of SPSS?
Meta-analysis issues: 1) How do I measure publication bias? 2) How do I obtain a confidence interval for pooled estimates from sets of log odds, Cohen's d and Pearson correlations using a) meta-analysis and b)for Cohen's d as suggested for combining two studies to assess replication of a result?
How do I do a matched pairs comparison on dichotomous data using covariates in SPSS?
How do I handle messy data in SPSS to produce duration times from dates and frequencies from strings?
How do I compare a mean to a constant in EXCEL (one-sample t)?
How do I calculate a 95% confidence interval for the group means from one-way ANOVAs (using either a within or between subjects factor)?
How do I compute a t-test and F ratio using only summary measures in 2x2 between and within subjects ANOVAs?
A two-way analysis of variance to assess treatment and pre-test effects (Solomon's four group design)
In ANOVA and Regression, what do the various different types of Sums of Squares mean, and does the choice matter?
A note about different sums of squares in unbalanced factorial ANOVAs
A note on comparing confidence intervals and statistical significance
How do I obtain the standard deviation from the standard error of the mean (s.e.m.) and how does this and the mean vary with sample size?
What is the variance of the mean of my transformed data and the variance of combinations of means?
What is the value of the error variance for raw data using that of its scalar transformed data?
What is the relationship between the mean and the median from a sample and how do I compute 95% CIs for the median?
What is an optimal cut-off for the discretization/splitting of a continuous variable and how do we use this in a regression?
A quick approximation for Normally distributed expected order statistics
How do I derive an expected total for a subset of items based on the expected overall total?
How do I compute a difference in errors adjusted for overall number of errors when the errors have different signs?
How do I format data for input into a repeated measures analysis in SPSS?
What does the TRANSPOSE ALL DATA option in the RESTRUCTURE menu do in SPSS?
Using the VARSTOCASES command in SPSS to convert repeated measures formatted data to a random effects (including multilevel) model format (and the CASESTOVARS command for the reverse operation).
How do I perform a repeated measures analysis of variance in SPSS?
How do I perform a repeated measures analysis of variance in SPSS involving repeated measures factors with more than 99 levels?
How do I perform a repeated measures analysis of variance in R including correcting for sphericity?
How do I perform a repeated measures analysis of variance in GENSTAT (and MATLAB)?
How do I perform a repeated measures analysis of variance in MINITAB (and use an analogous procedure in SPSS to produce post-hocs on a repeated measures factor)?
How do I perform a non-standard comparison of means in a repeated measures anova in SPSS?
What is the effect of dropping a between by within subjects interaction on other terms in a mixed anova?
How do I test for an interaction involving a continuous variable (moderation analysis)?
What is the difference between a hierarchical and a stepwise regression? Also:a reference for an alternative, decision trees
How do I perform a stepwise regression in SPSS involving interactions?
Why don't I need to use a covariate which differs between randomised groups in an ANCOVA?
Can I do an analysis of covariance using a regression (including computation of covariate adjusted means) and use this to adjust for regression to the mean?
How do I handle errors in variables to estimate slopes and intercepts in a linear regression?
Inappropriate use of a constant covariate in repeated measures ANCOVA
How do I adjust for varying covariates in a repeated measures ANOVA in SPSS?
How do I obtain an interaction in SPSS to describe how a fixed covariate influences a repeated measures interaction?
What summary measures can I use to describe repeated measures?
How do I test for a trend, or contrast, between group means in a one-way ANOVA representing different subjects and also check location of asymptotes on a curve?
Interpreting the intercept term in a regression when covarying out predictors of within-subject difference scores
What is the importance of including an intercept term in the SPSS (between subjects ANOVA) univariate procedure?
How do I obtain covariance terms involving the intercept term in a linear regression using SPSS?
Are there any primer publications explaining rationale and application of single case studies?
How do I compute linear trend coefficients for single cases?
How do I compare a within subjects group difference to that of a single case?
How do I compare all pairwise comparisons in a between subjects anova (and in a repeated measures anova)?
Why won't SPSS let me do post hoc tests involving a within-subjects factor?
How do I test whether one independent variable has an influence on a dependent variable other than via the mediation of a second independent variable? (Sobel Test)
How do I get SPSS to do a Chi-squared analysis of a two-way frequency table?
How do I compare a list of observed frequencies with a list of my own expected frequencies?
Using a chi-square to see if two or more proportions are equal
How do I know which elements contribute to a relationship in a two-way frequency table?
How do I obtain Fisher's exact test and chi-square for a two-way table in EXCEL or a 2x2 table on the web?
When do I use the correction for continuity when performing a chi-square analysis on a 2x2 table?
How do I do a linear trend of proportions in a Chi-squared analysis in SPSS?
How do I test for a strictly increasing or decreasing series on a set of individuals (not necessarily linear)?
How do I test for the presence of an unknown ordering across subjects using 2 Dimensional data?
Repeated measures, Mixed models and Split-plot designs: A Rant
What is the formula for Mauchly's W used for testing sphericity in univariate repeated measures anova?
Why do I get a NAME? appearing in cells when I try to run an EXCEL spreadsheet program?
How do I split comma delimited data occurring in a single cell into separate columns in EXCEL?
How do I make bibliographic citations of a SPSS or R procedure?
Why does SPSS give me a file definition error message when I run syntax?
How do I get my SPSS output file to open in a new version of SPSS?
How do I e-mail a SPSS data file created on a PC so that it is readable on a MAC (and vice-versa)?
How do I read R syntax code into a R session and obtain primers on other issues to get started with R?
How to avoid "$ operator is invalid for atomic vectors" in R
How do I convert a Microsoft file application (e.g. powerpoint) into a pdf file?
How do I scatterplot observations which have the same set of co-ordinates?
How do I do a multiple line plot of cluster profiles for different factors in SPSS?
How do I produce a bar chart including one with empty categories and other features in SPSS?
Improved Confidence Limits for a binomial proportion and differences in binomial proportions
How do I compare observed numbers correct to those expected by chance in a multi-choice task?
How do I produce an interactive bar chart of percentages in SPSS?
How do I put one or more regression lines on a scatterplot in SPSS Version 12.0 and above and in R?
How do I plot user defined regression lines (such as those from ANCOVA) in R?
Why does the R-squared change when I fix the intercept in my regression line from its least squares value?
How do I produce a clustered boxplot and other more advanced graphics in SPSS?
How do I convert a pdf file into a MSWord or MSPowerpoint file?
How do I convert a powerpoint show file so it can print handouts?
How do I add in the analysis toolkit (and other add-ins) into EXCEL?
Rank-based and other correlations (percentage bend) which are robust to non-Normal data
How do I measure agreement to see if two measures are measuring the same thing (Bland and Altman plots)?
Can I do a correlation between two variables where one variable is less than or equal to the other?
How do I produce nonparametric Spearman partial correlations using SPSS?
What is Collinearity in multiple regression, and what do I do about it?
How do I look for the best fitting model with an unknown changepoints on ordered data?
How can I design an experiment so that conditions are counterbalanced for order?
How do I compute the standard error of X1 in a regression also featuring X2?
What is the relationship between regressions involving variables A and B to those involving B-A and A+B in predicting an outcome?
How do I test to see if a mean adjusted for a covariate equals zero in a single group?
How do I compute the standard error of ''beta'' in a linear regression in SPSS?
How do I compute Akaike's and Bayesian information criteria (AIC, BIC) to compare regression models and how do I interpret them?
How do I check Normality assumptions in repeated measures analyses in SPSS?
What is the difference between within subjects effects and within subjects contrasts in SPSS?
When should I use a Multivariate Analysis of Variance (MANOVA)?
Testing normality including rules of thumb for skew, kurtosis in SPSS
Are there any references suggesting advantages to using parametric tests as opposed to nonparametric ones?
How do I compare a specific pair of groups post-hoc in SPSS using the Kruskal-Wallis test?
How do I compare the distributions and magnitudes of a set of positive and negative values (including accessing results from nonparametric tests in SPSS 19 and later)?
Post-hoc nonparametric pairwise comparisons of a one-way within subjects factor
How do I know whether to use an exact or asymptotic p-value with a Mann-Whitney or Kruskal-Wallis test?
A note about using ranked outcomes in t-tests and ANOVAs including nonparametric interactions and Quade's test
What is the expected total discrepancy score in a R choice task?
How do I adjust p-values for number of comparisons using SPSS and R?
Why does the value of one in the F distribution have a p-value which is less than 1?
Is there an optimal ratio of cases to predictor variables I should have before doing a multivariate analysis or any guide as to total sample size?
What is the relationship between significance tests of regression coefficients and of correlation coefficients?
How do I compute the semi-partial correlation coefficient in R?
How do I compare two squared (semi-partial) correlation coefficients (R-squareds) from different samples?
How do I compare two squared (semi-partial) correlation coefficients from the same sample?
How do I adjust R-squareds for the number of predictors in a model?
What are random effect and multilevel models, when do I use them and are there effect sizes?
Where can I find out about using random effects (including multilevel) models in R (including obtaining proportion of variance explained by a variable) & in SPSS?
What does an error message concerning the Hessian matrix suggest when running a mixed (random effects) model?
What does 's' denote in describing a General Linear Model (GLM) and a note on Generalized Linear Models in SPSS?
What is a Generalized Additive (Mixed) Model (GA(M)M) and when do I use it?
How do I construct dummy variables for use in SPSS linear regression?
What is the role of a Part or Semi-Partial Correlation in a regression?
How do I test if a correlation is zero and compute its confidence interval?
How many degrees of freedom are associated with a test of whether a zero-order or multiple correlation equals zero?
How do I estimate a pooled correlation using multiple scores from a set of subjects?
How do I adjust p-values to test if more than one correlation is zero?
How do I obtain a 95% confidence interval for a correlation (or slope in a simple regression) in SPSS?
How do I adjust a correlation for group differences (using partial/semi-partial correlations)?
How do I work out reliability for three or more items? (Cronbach's alpha, composite reliability and Raykov's rho)
When and how do I evaluate a one-sided p-value and quote a one-sided 95% confidence interval?
What sample sizes do I need for doing tests with a given power?
What are polychoric correlations and how do I compute them and use in SPSS?
Which matrix of loadings do I use doing a principal components extraction or non-PC analysis with a direct oblimin rotation when doing a factor analysis?
A guide to the pros and cons of choosing a method for producing factor scores from a factor analysis
How do I assess the importance of variables in a Normal Discriminant analysis?
What is the difference between principal components analysis, principal axis analysis and other factor extraction methods?
How do I interpret variables which load on more than one factor?
How many factors/components should I retain in a factor/principal components analysis?
How do I interpret variables which load on more than one factor?
How do I handle missing data in multivariate analyses in SPSS?
Using SPSS syntax to impute last observation carried forward (LOCF) for missing values in SPSS
How can I detect identical cases (duplicates) in SPSS without having an ID number?
How do I produce truncated exponential random variables using MATLAB?
How do I obtain parameter estimates for finite Normal mixture distributions using SPSS and BMDP?
How do I find out how many people have a score above a certain value?
How do I compute percentile thresholds for exponential data and use these in outlier detection?
Additional kappa statistic evaluation in SPSS, benchmarks on size and a measure of inter-rater agreement based on Euclidean distances
How do I compute consistency across subjects using an intraclass correlation?
A note on correcting for restriction of ranges which underestimate Pearson correlations
How do I obtain sums and means of partially complete cases in SPSS?
How do I obtain the mean of several variables, each minus the same constant?
How do I compute z-scores in SPSS and what is their relationship to comparing group means?
How do I use cumulative distribution functions to compute p-values in SPSS, EXCEL and R?
How do I adjust for age in comparing survival times of two different groups?
What is the relationship between intercept and slope for scores at two time points?
How do I produce random variables which follow a negative skew distribution?
Simulations sampling from data with replacement (bootstrapping)
Generating multivariate data with a required correlation matrix
How do I obtain the formulae behind statistics outputted by SPSS algorithms?
How do I compute a leaving-one-out error rate for a logistic regression in SPSS?
Why are the standard errors so large in logistic regression?
How do I choose between different logistic regression models?
How do I interpret output from a Multinomial logistic regression?
Which discriminant analysis should I use to obtain thresholds to indicate levels of abnormality using a single variable?
How do I plot and interpret a ROC curve in assessing strength in two group prediction?
Which output criteria should I use when using the casewise results option with the Normal discriminant method in SPSS?
What do I do if I have unequal group covariance matrices when doing a MANOVA?
How do I find p-values using critical values as input in SPSS?
How do I do False Discovery Rate (FDR) corrections for multiple tests?
A quick guide to choice of sample sizes for Cohen's effect sizes
How do I convert a t-statistic (and an Odds Ratio) into an effect size?
How do I compute Cohen's d in SPSS and EXCEL and its and eta-squared confidence interval in SPSS, R or EXCEL?
How do I compute effect sizes (including variance adjusted ones)?
How do I do power calculations in SPSS, EXCEL, R and using web freeware?
How do I do power (sample size) calculations on Poisson counts?
How do I do power calculations using formulae for one sample t and sign tests?
How do I work out sample size for apriori specificities and sensitivities?
How do I compare group means in a non-standard post-hoc contrast?
How do I compare a set of group means with a control group mean?
How do I manually compute t-statistics to compare means in a repeated measures ANOVA having 3 or more groups?
Formulae for interaction sums of squares in balanced designs
Return to Statistics main page
These pages are maintained by Ian Nimmo-Smith and Peter Watson (/center)
[Last updated on 1 July 2006]