Python for Data Analysis: Hypothesis Testing and T-Tests

No video

Python for Data Analysis: Hypothesis Testing and T-Tests

Рет қаралды 63,258

Күн бұрын

This video covers the basics of statistical hypothesis testing and t-tests in Python. This video explains the basics of statistical hypothesis testing and shows how to run one-way, two-way and paired t-tests in Python.
Subscribe:
► www.youtube.co...
This is lesson 24 of a 30-part introduction to the Python programming language for data analysis and predictive modeling. Link to the code notebook below:
Python for Data Analysis: Hypothesis Testing and T-Tests
www.kaggle.com...
This guide does not assume any prior exposure to Python, programming or data science. It is intended for beginners with an interest in data science and those who might know other programming languages and would like to learn Python.
I will create the videos for this guide such that you should be able to learn a lot just watching on KZfaq, but to get the most out of the guide, it is recommended that you create a Kaggle account so that you can copy and edit each lesson so that you can follow along and run code yourself.
Introduction to Python Playlist:
• Python for Data Analysis
Link to the Python for Data Analysis written guide index page:
www.kaggle.com... .
⭐ Kite is a free AI-powered coding assistant that integrates with popular editors and IDEs to give you smart code completions and docs while you’re typing. It is a cool application of machine learning that can also help you code faster! Check it out here: www.kite.com/g...

Пікірлер: 26

@martyzeenyc1210 3 жыл бұрын

I want to thank you for these videos as I'm struggling in my college data science course. This has helped me massively!

@grainofsalt2113 2 жыл бұрын

you explained this concept in the simplest way i have ever seen

@florenciaortega6543 3 жыл бұрын

Thank you very much!!! I couldn't find anywhere else this test as well explained as you did it. So accurated explanation. Thank you! A+ !

@marinastolet7799 2 жыл бұрын

This is exactly what I needed, thank you.

@michaelolubode6168 2 жыл бұрын

Thanks for this video. I knew the theory of hypothesis testing and can do it on paper but it wasn't easy to do the same in python until I saw your video. Thanks for the simplicity.

@jongcheulkim7284 2 жыл бұрын

Thank you so much. This is very helpful.

@neelroy3 2 жыл бұрын

which statistical test can be used to find difference between two groups' percentage values?

@kits1111 Жыл бұрын

you are so awesome !! you explained so well ...

@arashkashefian1735 6 ай бұрын

thank you very useful video. just wondering for two sample or paired tests, is there a way to test if null hypothesis is not just 0 but some none-zero value. For example if S1 is the first sample and S2 is the second sample, then how do we test the hypothesis that S1 - S2 > 1

@atom8926 Жыл бұрын

How you have learn statistics? Please mention some good resource to learn

@valda313 3 жыл бұрын

Shouldn't normality testing be done before performing ttests? (Otherwise, great video, thanks 👍🏻)

@DataDaft 3 жыл бұрын

That is a good point Valda. The distributions should be normal for the t-test, which can be checked through informal means like inspecting a histogram or normal qqplot or more formally with a test like scipy.stats.shapiro(). If the sample data is large enough though, say 50+, that might be adequate for the sample data due to the normality of the sampling distribution via the central limit theorem, but I'm not sure there's a good hard-and-fast rule as to when things are "not normal enough." It is probably a good idea to also run a non-parametric test like the Mann-Whitney test for independent samples and Wilcoxon Signed Rank test for paired samples if normality questionable.

@valda313 3 жыл бұрын

@@DataDaft thanks for response! When I compute these types of statsicial testing, I always firstly do normality test (scipy.stats.shapiro). Based on the result of normality testing I choose either from parametric ttests or nonparametric tests (like Wilcoxon or Mann-Whitney).

@DataDaft 3 жыл бұрын

@@valda313 Thanks for the input! it is helpful to have knowledge viewers fill in gaps (or make me aware of errors). Helps everyone learn.

@ayush9psycho 2 жыл бұрын

quality material!!

@kartiksharma-yw7qf 3 жыл бұрын

You are damnnn good m loving it to study with you.

@durgabhavanikonamarthi6448 3 жыл бұрын

what do we do to our model if we accept an alternative hypothesis?

@iddymanhunter1 3 жыл бұрын

Amazing!!!

@johnnybastos3390 Жыл бұрын

can i say that a p-value = false positive probability?

@atom8926 Жыл бұрын

Awesome

@svitirur1665 3 жыл бұрын

Do I need hypothesis testing in machine learning modeling,? or lets say when should i do hypotheses testing in dataset , as a data scientist

@DataDaft 3 жыл бұрын

Hypothesis testing is a core statistical idea that plays a role in many other concepts in data science and machine learning. Basically any time you have a situation where you want to investigate whether one sample of data differs another (or from a population), hypothesis testing is something to consider. For example, it is at the core of A/B testing which is used to choose between two different options, like which version of an ad or website attracts more clicks.

@janabark6415 3 жыл бұрын

Please, what if I have a different number of records for each of the testing group. For instance 2000 records for control and 2050 for test group. Can I use python function : t_stat, p_val= ss.ttest_ind(df_cnt.exp_rev,df_trt.exp_rev)? I got result: T-score = 0.16434444604672976 # There is 16 % deviation from H0 mean # p-value = 0.8694662602367074 # p-value is > than significance level i.e. 0.05 # Therefore I am rejecting H1 the treatment did not performed better than the control Can I interpret it like this? Thank you very mucho in advance.