Statistical functions¶

This page describes the statistical functions that are available in Phonometrica.

Global functions¶

chi2_test(X)¶

Computes Pearson’s chi-squared (\(\chi^2\)) test on X, which must be a two-dimensional array. The m rows in the array represent the m levels of a categorical variable, and the n columns represent the n levels of another categorical variable. Each cell represents the unnormalized frequency count for the combination of the two variables. This test evaluates the null hypothesis that the two variables are independent.

This function returns an object with the following fields:

chi2: the \(\chi^2\) value
df: the number of degrees of freedom
p: the p-value

See also: lm(), poisson()

mean(x[, dim])¶

Returns the mean of the array x. If dim is specified, returns an Array in which each element represents the mean over the given dimension in a two dimension array. If dim is equal to 1, the calculation is performed over rows. If it is equal to 2, it is performed over columns.

poisson(y, X[, robust[, max_iter]])¶

Fits a Poisson regression model. y is a set of N observations which represent count data (i.e. non-negative integers), and X is an N by M matrix for a model with M regression coefficients, including the intercept which must be the first coefficient. (In general, it should be a column of 1’s.) If robust is true (it is false by default), Phonometrica will use the so-called “robust variance sandwich estimator” to adjust the standard errors for mild violations of the assumption that the mean is equal to the variance. If max_iter is provided, it indicates the maximum number of iterations that the solver should perform to estimate the coefficients (200 by default).

This function returns an object with the following fields:

beta: an array of estimates for the regression coefficients. The first entry is the intercept
se: an array representing the standard errors of the regression coefficients
z: an array of z-values for the regression coefficients (z[i] is the z-value for beta[i])
p: an array of p-values for a Wald test which evaluates the null hypothesis that each regression coefficient is equal to 0 (p[i] is the p-value for beta[i])
niter: the number of iterations performed by the numerical solver
converged: a Boolean value indicating whether the solver has converged to a solution. It is true if niter < max_iter

Note: the model is fitted numerically using the Limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) approximation method.

Table of Contents

Previous topic

Next topic

Statistical functions¶

Global functions¶