8 worst data scientists to work with
5:59
6 Stages of Data Science Project
6:47
Пікірлер
@amitbarak9475
@amitbarak9475 4 күн бұрын
I dont understand why group by is necesery in the last select query
@kadourkadouri3505
@kadourkadouri3505 6 күн бұрын
looks like Hadoop is more or less deprecated
@TheRaju991
@TheRaju991 13 күн бұрын
I am genuinely concerned if anyone attempted any of that.
@sravankumartallozu4805
@sravankumartallozu4805 15 күн бұрын
I might misunderstood the challenge! Doesn't below code solve the problem? select user1, count(distinct user2) / (cast (select count(distinct user1 from facebook_friends) as float) ) * 100 as popularity from facebook_friends group by user1 order by popularity desc
@motakku3423
@motakku3423 17 күн бұрын
such a professional video, I'm really thankful :)
@sophie20324940
@sophie20324940 19 күн бұрын
I believe the choice between using ntile(100) with num <= 2 and ntile(50) with num = 1 depends on the total number of records. In my tests on Redshift, if the dataset exceeds 50 records, both methods yield the same result. However, when the dataset contains fewer than 50 records, ntile(50) with num = 1 returns only one record, while ntile(100) with num <= 2 still returns two records. Therefore, it's better to clarify with the interviewer regarding the expected outcome, especially since it's not feasible to evenly distribute the data into buckets in such scenarios. Considering that the result cannot be fractional, returning one row might be preferable. However, better to communicate and discuss these considerations.
@elliott421
@elliott421 20 күн бұрын
What kind of psychopath would use a logistic regression instead of a linear regression when the situation would call for a linear regression?! I need to go lay down now.
@programmable_life
@programmable_life 20 күн бұрын
Makes me remember one time before presentation i tried changing accuracy by changing random_state 😂
@samuelakwantui3124
@samuelakwantui3124 21 күн бұрын
😂 Hilarious… 1:29 … so funny …
@samuelakwantui3124
@samuelakwantui3124 21 күн бұрын
I’m saving this video as a road map .lol. Thanks, I’m grateful for the info 🙌🏽
@metinunlu_
@metinunlu_ 23 күн бұрын
haha sarcasm is strong with this one, love it
@tylervenner3128
@tylervenner3128 23 күн бұрын
This is so funny 😂😂
@supermarkethobo9567
@supermarkethobo9567 27 күн бұрын
If your boss doesn't like the figures you come up with, just change them! Sometimes you just gotta smooth the numbers
@davidedeh8448
@davidedeh8448 27 күн бұрын
Thanks for always churning out great contents Nathan. I look forward to more of your insights.
@stratascratch
@stratascratch 27 күн бұрын
Thanks! We're trying to experiment with a lot of different types of content (technical, career-focused, comedy, etc). So if there's any topic or video that you're really liking, let us know!
@Delchursing
@Delchursing 28 күн бұрын
Made more api progress today than with days of Internet and ai. Thank you. I love how pragmatic this video is.
@user-uv5lr7bk8n
@user-uv5lr7bk8n 29 күн бұрын
i mean why you had to hide API key?
@andreypopov6166
@andreypopov6166 Ай бұрын
Python data sciences libs. API naming is just horrible (((
@katarzynapuawska8052
@katarzynapuawska8052 Ай бұрын
All ML/DS courses taught me the latter steps - data cleaning, building a model, data visualization. I learned the deployment part from different resources. So it looks like I lack the beggining steps, that is a real-life data that is updated frequently, using APIs to retrieve it and using a DB in cloud to store it. Thank you for this insightful video. I already feel inspired, I think I will try to do something with Spotify data, if it's publicly accessible :) You are a great motivator.
@karunakaranr2473
@karunakaranr2473 Ай бұрын
Thank you for your time and effort. Explained in detail...
@murilozangari
@murilozangari Ай бұрын
thank you!
@elextures
@elextures Ай бұрын
im having a problem with channel id, in CHANNEL_ID, i put my own channel, but each time i press response, i get a different channel
@phillipmountbatton6710
@phillipmountbatton6710 Ай бұрын
am I the only one who doesn't see the link to download the data? Would love to work on this project on my own. Thank you!
@stratascratch
@stratascratch Ай бұрын
Sorry about that! Here it is redivis.com/datasets/1e0a-f4931vvyg/tables
@adamadnan9173
@adamadnan9173 Ай бұрын
very interesting🔥🔥
@adamadnan9173
@adamadnan9173 Ай бұрын
After watching this video, I will return after becoming an expert data scientist.🔥
@metinunlu_
@metinunlu_ Ай бұрын
This is lots of information with short time, love it!
@magicalflyingnemo9586
@magicalflyingnemo9586 2 ай бұрын
Just a heads up, the notebooks you show at the end of each algorithm section is switched up for GB, K-Means and PCA
@bennguyen1313
@bennguyen1313 2 ай бұрын
What if the data file is not in a perfect tabular / csv format? For example, I have a large text file like this: #Time (HHH:MM:SS): 002:34:02 # T(ms) BUS CMD1 CMD2 FROM SA TO SA WC TXST RXST ERROR DT00 DT01 DT02 DT03 DT04 DT05 DT06 DT07 # ===== === ==== ==== ==== == ==== == == ==== ==== ====== ==== ==== ==== ==== ==== ==== ==== ==== 816 B0 D84E BC RT27 2 14 D800 2100 0316 0000 0000 0000 0000 CCCD 0000 817 A0 DC50 RT27 2 BC 16 D800 2120 0000 4080 3000 0000 3000 0000 0000 #Time (HHH:MM:SS): 002:34:03 # T(ms) BUS CMD1 CMD2 FROM SA TO SA WC TXST RXST ERROR DT00 DT01 DT02 DT03 DT04 DT05 DT06 DT07 # ===== === ==== ==== ==== == ==== == == ==== ==== ====== ==== ==== ==== ==== ==== ==== ==== ==== 056 B0 D84E BC RT27 2 14 D800 2100 0316 0000 0000 0000 0000 CCCD 0000 057 A0 DC50 RT27 2 BC 16 D800 2120 0000 4080 3000 0000 3000 0000 0000 How can get just the data from DT00 thru DT07 into an array, without doing lots of preprocessing to scrub out the repeating #Time headers that appear throughout the file?
@davidedeh8448
@davidedeh8448 2 ай бұрын
Thanks for this video Nathan. Your facts are not just true for Data Scientist but also Software engineer and programmers.
@EzinneOgwo-Ude
@EzinneOgwo-Ude 2 ай бұрын
Please I am building a recommendation system for diet and exercise. Where can I get my data set or API. A link will be appreciated.
@gcdrivephotos-fl5cq
@gcdrivephotos-fl5cq 2 ай бұрын
Was gpt 4 used?
@stratascratch
@stratascratch 2 ай бұрын
yes!
@JayMaverick
@JayMaverick 2 ай бұрын
Funny how all the experts say chatgpt is useless, but the tech priests keep hyping it as the savior and doom of humanity.
@stratascratch
@stratascratch 2 ай бұрын
It definitely will eliminate some jobs where tasks are mundane and repetitive. For example, automating exploratory data analysis can be done by AI. But it still has a ways to go in order to make certain decisions.
@huanchenli4137
@huanchenli4137 2 ай бұрын
Wait, we can upload data to chatgpt? How do I do that?
@crtp47
@crtp47 2 ай бұрын
in ChatGPT Plus
@stratascratch
@stratascratch 2 ай бұрын
there are plugins that you can use that allow you to upload files into chatgpt
@huanchenli4137
@huanchenli4137 2 ай бұрын
can you teach me? Has to be gpt4? @@crtp47
@huanchenli4137
@huanchenli4137 2 ай бұрын
can you teach me? Has to be gpt4?@@stratascratch
@piero8284
@piero8284 2 ай бұрын
The only caveat I see in using real data for personal projects is its availability. Most APIs offering relevant functionalities, such as those for cryptocurrencies and market data, require a paid subscription for extensive use. While free APIs are available, it is essential to assess the data based on your project objectives, as well as the quantity of data you require.
@chaitanyas374
@chaitanyas374 3 ай бұрын
🎯 Key Takeaways for quick navigation: 00:00 *📝 Importance of Code Structure* - Understanding the importance of code structure for comprehension and logic, - Exploring a real SQL interview question to demonstrate code structuring, - Highlighting the significance of well-structured code in job interviews and professional work environments. 02:41 *🛠️ Refactoring Nested Queries* - Refactoring nested queries for clarity and improved logic flow, - Demonstrating the use of common table expressions (CTEs) for better code organization, - Emphasizing the importance of understanding and manipulating data before applying additional logic. 07:14 *🔄 Ensuring Consistent Aliases* - Utilizing consistent and descriptive aliases for tables and columns, - Improving code readability and comprehension through clear naming conventions, - Highlighting the significance of explicitness in code for future reference and collaboration. 10:42 *📊 Eliminating Unnecessary ORDER BY Clauses* - Streamlining code by reducing redundant ORDER BY clauses, - Optimizing code execution for efficiency and performance, - Emphasizing the importance of minimizing unnecessary code execution for code optimization. 12:47 *🧹 Simplifying Code with CASE Statements* - Simplifying code logic using CASE statements for concise and efficient code, - Demonstrating optimization by reducing the number of common table expressions (CTEs), - Streamlining code structure for improved readability and maintenance. 13:02 *🔍 Choosing Between HAVING and WHERE Clauses* - Differentiating between HAVING and WHERE clauses based on usage and optimization, - Opting for WHERE clauses over HAVING clauses for simpler and faster query execution, - Highlighting the importance of understanding SQL nuances for code optimization.
@kennethstephani692
@kennethstephani692 3 ай бұрын
Great video!!
@CodyJelinek
@CodyJelinek 3 ай бұрын
This is brilliant. I took way too much time searching and googling and watching hours long videos to find pretty much everything I need in a clear crisp straight forward 30min video. Minus a tiny bit of API beginner information and knowing the basics of Python, this was perfect. Well done. I imagine the next video will be everything I need and nothing that I don’t to take the next step of processing/analyzing the data. I am hitting play on that……. Now!
@berndkaufmann6934
@berndkaufmann6934 3 ай бұрын
Good Explanation
@jorgesanabria6484
@jorgesanabria6484 3 ай бұрын
Feels good to be an ML engineer right now then aha
@SuhasKM-tl1rg
@SuhasKM-tl1rg 3 ай бұрын
Quality content, as usual from the hero. Thank you, wish I'd have a mentor like you who I knew personally.
@stratascratch
@stratascratch 3 ай бұрын
Thanks Suhas. Really appreciate the kind words
@mocabeentrill
@mocabeentrill 3 ай бұрын
@3:03 That's @NicolusRenotte. I love him no homo.
@manuelbonilla1638
@manuelbonilla1638 3 ай бұрын
my database created folder is empty :(
@BillyT83
@BillyT83 3 ай бұрын
Extremely useful video on the "MUST TO KNOW" Python libraries for everyone who wants to dive into Data Science roles! I think that Dash is also a popular framework for building interactive Python dashboards! Thank you for sharing!!!
@emrekayax07
@emrekayax07 3 ай бұрын
Thank you this is helpfull
@rajgopalmanoharan
@rajgopalmanoharan 3 ай бұрын
This is an amazing session, thank you
@deniskolmykov3988
@deniskolmykov3988 3 ай бұрын
StrataScratch Platform. Hmm ... 🤔
@matthewd4972
@matthewd4972 3 ай бұрын
Really appreciate the tutorial, tremendously helpful getting my own api project started.
@Daro2023
@Daro2023 3 ай бұрын
woaw super helpful
@hubbiemid6209
@hubbiemid6209 4 ай бұрын
🔥
@digitalpractice883
@digitalpractice883 4 ай бұрын
I streamlit good enough for a data driven app, I guess you are going to say no, for obvious reasons as cloud experience is crutial, thank again for the video
@digitalpractice883
@digitalpractice883 4 ай бұрын
I would like to express my sincere gratitude to the creator of this KZfaq video for providing excellent content and being truly inspirational. The video not only taught me valuable skills in working with real data, APIs, databases in the cloud, and building models, but it also emphasized the importance of making an impact and getting validation in the field of data science. The way the creator explained the decision-making process when building models and the underlying math behind them was incredibly helpful. Additionally, the interview questions shared in the video provided valuable insights into the best practices for cleaning data, validating models, and optimizing them. Moreover, I greatly appreciate the emphasis on sharing code and insights with the data science community. Learning how to deploy an application and create interactive dashboards or APIs opens up endless possibilities for collaboration and knowledge sharing. Overall, this video has been an exceptional resource for me, and I want to thank the creator for their dedication in providing such valuable content. They have truly inspired me to continue my journey in data science and strive for excellence in every project I undertake.