Descriptive Statistics in Python
Descriptive Statistics in Python
How to Calculate Point-Biserial Correlation in Python
Point-biserial correlation is used to measure the relationship between a binary variable, x, and a continuous variable, y.
Similar to the Pearson correlation coefficient, the point-biserial...
Descriptive Statistics in Python
How to Create Frequency Tables in Python
A frequency table is a table that displays the frequencies of different categories. This type of table is particularly useful for understanding the distribution of values...
Descriptive Statistics in Python
How to Calculate Jaccard Similarity in Python
The Jaccard similarity index measures the similarity between two sets of data. It can range from 0 to 1. The higher the number, the...
Descriptive Statistics in Python
Equal Frequency Binning in Python
In statistics, binning is the process of placing numerical values into bins.
The most common form of binning is known as equal-width binning, in which we divide a...
Descriptive Statistics in Python
How to Calculate Mahalanobis Distance in Python
The Mahalanobis distance is the distance between two points in a multivariate space. It’s often used to find outliers in statistical analyses that involve several variables.
This...
Descriptive Statistics in Python
How to Remove Outliers in Python
An outlier is an observation that lies abnormally far away from other values in a dataset. Outliers can be problematic because they can affect the results...
Descriptive Statistics in Python
How to Create a Covariance Matrix in Python
Covariance is a measure of how changes in one variable are associated with changes in a second variable. Specifically, it’s a measure of the degree...
Subscribe
- Never miss a story with notifications
- Gain full access to our premium content
- Browse free from up to 5 devices at once
Must read