No video

How to do the Titanic Kaggle Competition

  Рет қаралды 74,349

Aladdin Persson

Aladdin Persson

Күн бұрын

This video is for those who want to get started doing #kaggle.
❤️ Support the channel ❤️
/ @aladdinpersson
Paid Courses I recommend for learning (affiliate links, no extra cost for you):
⭐ Machine Learning Specialization bit.ly/3hjTBBt
⭐ Deep Learning Specialization bit.ly/3YcUkoI
📘 MLOps Specialization bit.ly/3wibaWy
📘 GAN Specialization bit.ly/3FmnZDl
📘 NLP Specialization bit.ly/3GXoQuP
✨ Free Resources that are great:
NLP: web.stanford.e...
CV: cs231n.stanford...
Deployment: fullstackdeepl...
FastAI: www.fast.ai/
💻 My Deep Learning Setup and Recording Setup:
www.amazon.com...
GitHub Repository:
github.com/ala...
✅ One-Time Donations:
Paypal: bit.ly/3buoRYH
▶️ You Can Connect with me on:
Twitter - / aladdinpersson
LinkedIn - / aladdin-persson-a95384153
Github - github.com/ala...

Пікірлер: 78
@OSShubho
@OSShubho 3 жыл бұрын
Thanks for sharing this simple and elegant beginner friendly code. Your approach are very clear and understandable.
@grantfoster2663
@grantfoster2663 2 жыл бұрын
Really helpful to someone to see someone work through a really simple solution as someone moving from to python from R!
@jayz4581
@jayz4581 2 жыл бұрын
People get 100% because this dataset is so classic and they are always finding the best features or maybe use ensemble methods. But your intro is so straightforward for me to start at kaggle. Thanks!
@ccuuttww
@ccuuttww 3 жыл бұрын
100% its mean over fitting of course u can do more stuff to boost your performance PCA, boost sampling, cross validation, even prior parameter
@AladdinPersson
@AladdinPersson 3 жыл бұрын
I agree, you can try/do a lot to more to make it even better, for this one I tried to keep it minimal and simple
@adilsonmedronha706
@adilsonmedronha706 2 жыл бұрын
Actually it is not overfitting because this accuracy were measure through test set (unseen data), not train set.
@suhass6628
@suhass6628 3 жыл бұрын
Well done mate!. Thanks for this. Hopefully you will do more Kaggle stuff. Will follow everything
@AladdinPersson
@AladdinPersson 3 жыл бұрын
Yeah it will for sure, got another video coming soon on a bit more advanced competition
@fentazimohamedreadh5274
@fentazimohamedreadh5274 Жыл бұрын
Thank you so much!!! It was really helpful to get started in Kaggle competitions^^
@suhass6628
@suhass6628 3 жыл бұрын
And the 100% people, rumour has it that some people have got the info of the people from the actual Titanic records which is publicly available. So it would give 100% obviuosly
@AladdinPersson
@AladdinPersson 3 жыл бұрын
Makes sense!
@viralmedia.007
@viralmedia.007 8 ай бұрын
so are kaggle competitions genuine?? i always wonder how would people get 100% correct predictions or is this specific to this competition only? moreover they come with such huge prize pools
@suhass6628
@suhass6628 8 ай бұрын
@@viralmedia.007 yes the actual competitions which have prize money very genuine. The rigged ones are usually very basic or fir which data is already available publicly
@aexairkeys
@aexairkeys 3 жыл бұрын
awesome job! love the simplicity. keep going!
@shaikhkashif9973
@shaikhkashif9973 Жыл бұрын
Bro for *Embarked*u should go for Nominal encoding not a label because it's names of ports
@hailongnguyen7176
@hailongnguyen7176 10 ай бұрын
Yeah you right
@AbdulRehman-nu2pb
@AbdulRehman-nu2pb Жыл бұрын
Thankyou so much for sharing this elegent and simple , beautifully written code. As a bigineer your code is a holy grail !!!
@maxvettel7337
@maxvettel7337 Жыл бұрын
This is what I really need as a beginner
@deepudeepak1390
@deepudeepak1390 3 жыл бұрын
I did the same approach when I started my kaggle journey 😀 .. .. request from my side please make some viedos on transfer learning in natural language processing thank you
@gauravms6681
@gauravms6681 3 жыл бұрын
remember me when this channel is gonna go hit : )
@jose3538
@jose3538 3 жыл бұрын
Remember me too!
@kyoujinko
@kyoujinko 2 жыл бұрын
This made so much sense, thank you.
@nabshieshty
@nabshieshty 9 ай бұрын
nice vid, did my assignment with this
@jinks6887
@jinks6887 2 жыл бұрын
Thanks I've subscribed. Very simple yet informative content.
@sanskarram992
@sanskarram992 3 жыл бұрын
Very helpful for begineers .................. Thanks for such content.
@gezahagnnegash9740
@gezahagnnegash9740 2 жыл бұрын
Thanks a lot. As a beginners, it's helpful for me!
@denisvoronov6571
@denisvoronov6571 2 жыл бұрын
Perfect for the beginner!
@SussyBaka-ci5xi
@SussyBaka-ci5xi 11 ай бұрын
helped a lot! thank you!
@MDEMANURRAHAMAN-
@MDEMANURRAHAMAN- 2 жыл бұрын
That was really helpful. Thanks
@teamsonnyliston
@teamsonnyliston Жыл бұрын
Thanks a lot man you helped me
@sena1663
@sena1663 2 жыл бұрын
That was easy and helpful :) Thanx!!
@LameGamerYT
@LameGamerYT 2 жыл бұрын
GOD LEVEL VIDEO THANKS SO MUCH!
@danasharon4752
@danasharon4752 2 жыл бұрын
Thank you!
@minhlong1920
@minhlong1920 3 жыл бұрын
Thank you sm dude
@juan.forero_
@juan.forero_ 3 жыл бұрын
Thank you bro!!
@mtk-0_0
@mtk-0_0 Жыл бұрын
appreciate good effort!
@newkamphora
@newkamphora 2 жыл бұрын
Thank you, very helpful ;)
@kefahelhelou9418
@kefahelhelou9418 Жыл бұрын
Thanks a lot
@Leopar525
@Leopar525 2 жыл бұрын
I really like your style of thinking and explaining. Could you please advise on any (free or not) courses/articles or anything you believe is good for beginners?
@classicemmaeasy2292
@classicemmaeasy2292 Жыл бұрын
Very short,simple and explanatory, but you use machine learning techniques all through, you don't really explore and visualize the data. This video is awesome by the way,and beginner's friendly
@adayinthelife5496
@adayinthelife5496 Жыл бұрын
I think your code is excellent, but it freaks me out how many data scientists only see their accuracy as a result. Understanding and presenting the results in meaningful way is key to any science. So... who was likely to survive??
@abdoali-nl2yt
@abdoali-nl2yt Жыл бұрын
thanks for you
@yuliusharjoseputro2069
@yuliusharjoseputro2069 3 жыл бұрын
Hi, thanks for your tutorial. I've implemented your code, but why the accuracy that I got is different with you?
@mukundkrishna2789
@mukundkrishna2789 2 жыл бұрын
For logistic regression, isn't it necessary to do feature scaling before training? When I searched in the net, it was specified that we should do feature scaling for logistic regression
@Borzacchinni
@Borzacchinni 2 жыл бұрын
Thanks for the video! Do you happen to be from Norway perhaps?
@pranjalsingh1389
@pranjalsingh1389 Жыл бұрын
Why did we not used fit.transform on test set
@LeonidasParigoris
@LeonidasParigoris Жыл бұрын
Thanks for this! I have a question, at 3:45 how are you able to avoid writing the whole directory of the file and just say "train.csv", instead of writing the whole snake of the directory e.g. "C:\\Users\\etcetc\\Python\\titanic\\train.csv"?
@danilomontalvo5756
@danilomontalvo5756 3 жыл бұрын
everything else works for me except predictions when getting to 14:43 it just says "AttributeError: 'function' object has no attribute 'predict'"
@arnelecleir4876
@arnelecleir4876 2 жыл бұрын
In this case (using a regression), is it possible to just use stata? I feel like most of the actions performed here would have been easier/quicker in stata… I’m asking this since I now how to work with stata and am currently learning data science via datacamp/kaggle and want to compare some tools :)
@ouhjnadmacabenta3054
@ouhjnadmacabenta3054 2 жыл бұрын
Hi bro how did you set up the CSV file on the jupyter because my CSV file was not defined thanks
@maitrijain7758
@maitrijain7758 Жыл бұрын
Ur code gives error when we predict x test
@udbhavprasad3521
@udbhavprasad3521 3 жыл бұрын
Can you make a video about XGBoost; their is not many resources for that
@RpSKhaira
@RpSKhaira Жыл бұрын
Noob here, question: why did you clean your data through a function? Why not just run those exact commands outside of the function?
@timgen-iu1qo
@timgen-iu1qo Жыл бұрын
i think because he had 2 tables with input data and it's easier to write 1 function and call it 2 times than writing the algorithm 2 times and change something for each table
@AIPlayerrrr
@AIPlayerrrr 3 жыл бұрын
Planning to do more real ones in the future?
@AladdinPersson
@AladdinPersson 3 жыл бұрын
yes
@AladdinPersson
@AladdinPersson 3 жыл бұрын
Got any ideas of some you think would be useful?
@AIPlayerrrr
@AIPlayerrrr 3 жыл бұрын
I watched a lot of video of yours and I think you are very likely to place high as you are really knowledgeable. You explain thing very well. I think you can try the recent human protein competition. It’s a fun weakly supervised classification problem.
@talha_anwar
@talha_anwar 3 жыл бұрын
Upvoted
@Ajay_Pathak_
@Ajay_Pathak_ 3 жыл бұрын
I'm having errors while fitting the model It says Float() must be str or .... Not method
@nevilleodhiambo1370
@nevilleodhiambo1370 2 жыл бұрын
How did you handle this
@maitrijain7758
@maitrijain7758 Жыл бұрын
I also get the same error
@prannayhemachandran966
@prannayhemachandran966 Жыл бұрын
got solution for this????
@Ajay_Pathak_
@Ajay_Pathak_ Жыл бұрын
I have no idea what I did after this error , I might have even left it entirely 😂, sorry guys !
@magikarp1743
@magikarp1743 Жыл бұрын
can someone pls help me out here ? at 14:55 on running it shows "value error: X has 8 features per sample; expecting 7"
@timgen-iu1qo
@timgen-iu1qo Жыл бұрын
I have same error, haven't you solved it yet?
@magikarp1743
@magikarp1743 Жыл бұрын
@@timgen-iu1qo yea i got my mistake... in the 2nd cell i wrote test = pd.read_csv("train.csv") instead of test = pd.read_csv("test.csv")... silly of me
@timgen-iu1qo
@timgen-iu1qo Жыл бұрын
@@magikarp1743 IMAGINE, same mistake... Thanks 😂😂
@vishalgoklani
@vishalgoklani 3 жыл бұрын
LogisticRegression??? where's the neural network? :)
@AladdinPersson
@AladdinPersson 3 жыл бұрын
In the moment it felt like it would be overkill, in retrospect I regret it :3
@karlagonzalez6808
@karlagonzalez6808 3 жыл бұрын
Do u know how to find the most popular name among male Titanic passengers?
@krnl1304
@krnl1304 2 жыл бұрын
One with the maximum frequency should be the most. So use count() and max()
@gurudevdatta3960
@gurudevdatta3960 3 жыл бұрын
im getting an error while spilting the data can you help me? or if you dont mind an you send your number please i will send screenshot to you?
@mehermanoj45
@mehermanoj45 3 жыл бұрын
Plz speed runing datasets like games😂
@AladdinPersson
@AladdinPersson 3 жыл бұрын
How you mean? :P
@mehermanoj45
@mehermanoj45 3 жыл бұрын
@@AladdinPersson pick a random dataset and try how fast can u go from downloading to inference.
@Honest_Reply900
@Honest_Reply900 2 жыл бұрын
Well done. thanks for you efforts! 100% accuracy? I am sure they have cheated :)
If Barbie came to life! 💝
00:37
Meow-some! Reacts
Рет қаралды 77 МЛН
Dad Makes Daughter Clean Up Spilled Chips #shorts
00:16
Fabiosa Stories
Рет қаралды 2,3 МЛН
Please Help Barry Choose His Real Son
00:23
Garri Creative
Рет қаралды 23 МЛН
Magic? 😨
00:14
Andrey Grechka
Рет қаралды 20 МЛН
Beginner Kaggle Data Science Project Walk-Through (Titanic)
38:16
How to do the Titanic Kaggle competition in R - Part 1
35:07
Data Science Dojo
Рет қаралды 100 М.
How to Win Kaggle Competitions - Competition Master Advice
18:18
ML Was Hard Until I Learned These 5 Secrets!
13:11
Boris Meinardus
Рет қаралды 281 М.
Titanic Survival Prediction in Python - Machine Learning Project
53:38
World Number 1 On Kaggle with Christof Henkel #36
1:08:13
AI Stories Podcast - Neil Leiser
Рет қаралды 15 М.
Anthony Goldbloom - How to Win Kaggle Competitions
44:18
Weights & Biases
Рет қаралды 53 М.
[Paper Review]: Deep Neural Networks for YouTube Recommendations
22:52
Aladdin Persson
Рет қаралды 4,1 М.
If Barbie came to life! 💝
00:37
Meow-some! Reacts
Рет қаралды 77 МЛН