Chengwei LEI, Ph.D.    Associate Professor

Department of Computer and Electrical Engineering and Computer Science
California State University, Bakersfield

 

Statistical Tools



There are so many statistical tools help researchers and analysts understand the datasets.

Here are some basic examples.






Statistical Dependence

 

A variable is considered dependent if it depends on an independent variable. Dependent variables are studied under the supposition or demand that they depend, by some law or rule (e.g., by a mathematical function), on the values of other variables.

 

For this verison of Army dataset, please calculate the distance.






p-value

 

A p-value measures the probability of obtaining the observed results, assuming that the null hypothesis is true. The lower the p-value, the greater the statistical significance of the observed difference. A p-value of 0.05 or lower is generally considered statistically significant.

 


Confidence Interval

 

Informally, in frequentist statistics, a confidence interval (CI) is an interval which is expected to typically contain the parameter being estimated.

 


 

We own an orange farm. Every 10 oranges will be packaged as one box.

Here are the orange sizes from one box:

[85, 92, 78, 89, 90, 82, 75, 86, 88, 91]

 

What is the mean value of oranges' size? 

Are you sure?

 





ANOVA table

 

An ANOVA table is a table that organizes calculations to compare variances between groups. It's a statistical tool used to determine if there are significant differences between means.

  How to calculate