If you are a data science student, you must have used this term many times. In Linear Regression, Logistic Regression we use this often. Infact, the most basic statistic like a sample mean is an MLE for the population mean. We use the term MLE solution without knowing what it actually is. Intuitively the name... Continue Reading →
Dumbing down p-value
Prerequisites: To understand this post, you should have exposure to hypothesis testing, and you should have used p-value in your tests. This post is solely about interpreting the perplexing p-value. Let's jump right in! What is p-value? "p-value  is the probability of obtaining test results at least as extreme as the result actually observed, under the... Continue Reading →
Hypothesis Testing in Linear Regression
If you need some introduction to Linear Regression, Go here ! https://vipanchiks.wordpress.com/2022/05/28/linear-regression-a-detailed-introduction/ You must be familiar with the output of a linear regression model. We get a p - value for each variable. What is this p-value? Hypothesis Testing : For a simple linear regression, as you know, the equation will look like this: If... Continue Reading →
Linear Regression – A detailed introduction
What is Regression? We use this term very often in Machine Learning and Statistics. What is the meaning of this term? It is a method used in statistics to determine the relation between on variable (dependent) and the other variable (independent). Literal meaning of regression is stepping back towards the average. So , where from... Continue Reading →
Understanding SST – ANOVA
When population means are same / similar : When the population means are same or nearer, SSW will be high, in other words, most of SST will go into SSW . Checking it in python : See that, SSW is more compared to SSB When population means are different: When population means are different, then... Continue Reading →
Math in ANOVA – Explained
If you don't know what ANOVA is, no worries. Got you covered! https://vipanchiks.wordpress.com/2022/05/17/anova-analysis-of-variance-introduction/ Mathematically understanding ANOVA: The purpose of ANOVA is to compare the population means, remember? How do we do that? Lets dive in! Method: Lets say we have r populations. So, r samples. Calculate the mean of each sample : xi Now, combine... Continue Reading →
ANOVA – Analysis of Variance: Introduction
To carryout comparison of means of several populations we use Analysis of Variance. ANOVA is a statistical method for determining the existence of differences among several population means Aczel - Sounderpandian Why is it called Analysis of Variance? Though we are comparing different population means for difference, the technique requires the analysis of different forms... Continue Reading →
Hypothesis Testing – Introduction
A hypothesis is an statement which can be tested for truth. If someone claims that average stress levels among students of a college are greater than equal to 6, we can't go by that statement can we? We have to check if the statement is true or false. How do we do that? Intuition: A... Continue Reading →
Confidence Interval
If you want to make an inference on the population parameter from a sample statistic., then you use this concept. Let's say we want to find the average cost of PGs all over HSR layout. Instead of going to every PG (population) and finding the mean price, we pick a sample randomly and find the... Continue Reading →
Chi-Squared Distribution
You need to know normal distribution before this. In case you want to read: https://vipanchiks.wordpress.com/2022/03/30/probability-in-normal-distribution/ https://vipanchiks.wordpress.com/2022/03/30/normal-distribution-introduction/ The Chi - squared distribution is a plot of Z - square, where Z are all points on independent standard normal distributions. Remember the Z scores? What is a standard normal distribution? stats. Norm(loc = 0,scale=1) Lets see how... Continue Reading →