Netflix Data Cleaning and Analysis Project | End to End Data Engineering Project (SQL + Python)

  Рет қаралды 29,537

Ankit Bansal

Ankit Bansal

2 ай бұрын

In this video we will implement an end to end ELT project. ELT stands for Extract, Load and Transform . We will use Netflix dataset to clean and analyze the data using SQL and Python.
LinkedIn: / ankitbansal6
High quality Data Analytics affordable courses: www.namastesql.com/
End to End ETL project : • End to End Data Analyt...
Netflix dataset: www.kaggle.com/datasets/shiva...
GitHub Project Link: github.com/ankitbansal6/netfl...
Zero to hero(Advance) SQL Aggregation:
• All About SQL Aggregat...
Most Asked Join Based Interview Question:
• Most Asked SQL JOIN ba...
Solving 4 Trick SQL problems:
• Solving 4 Tricky SQL P...
Data Analyst Spotify Case Study:
• Data Analyst Spotify C...
Top 10 SQL interview Questions:
• Top 10 SQL interview Q...
Interview Question based on FULL OUTER JOIN:
• SQL Interview Question...
Playlist to master SQL :
• Complex SQL Questions ...
Rank, Dense_Rank and Row_Number:
• RANK, DENSE_RANK, ROW_...
#sql #dataengineering #projects

Пікірлер: 79
@ankitbansal6
@ankitbansal6 2 ай бұрын
Please like the video as it takes a lot of effort to record a video of more than 1 hour. It will motivate me to create more long form videos. GitHub and all related links in the the description box. Thanks for watching !!!
@simplytech4u898
@simplytech4u898 2 ай бұрын
Thank you Ankit this is really amzing .. once started and finished in one go...
@Hope-xb5jv
@Hope-xb5jv Ай бұрын
10:22 Try many times but not get korean name in sql database i created a table and put insert also but it shows only ???? now i surrender😒
@kumarsumit6117
@kumarsumit6117 Ай бұрын
Use nvarchar
@vijayakanthanannamlai
@vijayakanthanannamlai Ай бұрын
love it Ankit... what an effort
@ishmeenkaur8299
@ishmeenkaur8299 Ай бұрын
really good work, easy to understand.
@ritu-pf1jy
@ritu-pf1jy Ай бұрын
Great efforts sir
@pavitrashailaja850
@pavitrashailaja850 2 ай бұрын
Great effort in putting the whole project together 🤟🏻
@ankitbansal6
@ankitbansal6 2 ай бұрын
Thanks a ton!
@livelovelaugh4050
@livelovelaugh4050 2 ай бұрын
Thank you so much Sir 🙏 . Thank you for giving hope for people like me . Keep inspiring ✨
@ankitbansal6
@ankitbansal6 2 ай бұрын
It's my pleasure
@neeraj_dama
@neeraj_dama 2 ай бұрын
well-done.
@msk-pl3hw
@msk-pl3hw Ай бұрын
It was a really nice project. Had a good hands on in sql.
@ankitbansal6
@ankitbansal6 Ай бұрын
Great 😊
@Random_World_
@Random_World_ Ай бұрын
Thanks for this project
@ankitbansal6
@ankitbansal6 Ай бұрын
My pleasure
@rahulrachhoya2716
@rahulrachhoya2716 2 ай бұрын
Thanks so much @Ankit this valueable video for me. I have an interview with red hat in up coming 3 days as an associate data analysts. I learn lot from your Videos. You are litterly SQL king because you write in very simple manner so that every one can understand . You are my mentor with your videos I am able to solve questions like you . Salute you @Ankit 😎😎😎
@saikanth447
@saikanth447 Ай бұрын
@rahulrachhoya2716 I have seen career portal, no such DA role, can you help me for the same, as we are on the same boat, thanks in advance .
@manishpal2937
@manishpal2937 Ай бұрын
thanks Ankit, the effort you put in your lectures is admirable, learned a lot of new things today from this video 💌
@ankitbansal6
@ankitbansal6 Ай бұрын
My pleasure 😊
@pavanmadamset
@pavanmadamset Ай бұрын
Thank You Very Much Sir
@ankitbansal6
@ankitbansal6 Ай бұрын
Most welcome
@MiteshYadav
@MiteshYadav Ай бұрын
Awesome, can we have series on Python from basics that can be useful for analysis..
@saikatofficial420
@saikatofficial420 Ай бұрын
Thanks a lot sir for this valuable project.Can you please make a video on cross apply . I have watched your SQL course didn't find it .
@simplytech4u898
@simplytech4u898 Ай бұрын
Hi Ankit there is column duration in netflix_raw table having values with min ,season so if need to find avg of duration for season as well how to get the details ,I believe we need to populate the values like other table we did. can you guide how we can do it..
@user-xl4zd8yu1e
@user-xl4zd8yu1e 2 ай бұрын
Thank you very much sirji.... 🙏🙏🙏
@ankitbansal6
@ankitbansal6 2 ай бұрын
Most welcome
@sakshiawadhiya7267
@sakshiawadhiya7267 Ай бұрын
I am facing issues in jupyter notebook like path not exist
@eemayo5889
@eemayo5889 Ай бұрын
Thanks a lot. Could you please show how to download data from API? Great content btw.
@ankitbansal6
@ankitbansal6 Ай бұрын
Check the first part of this video kzfaq.info/get/bejne/q7JgYJmcy8-sY5s.html
@niravshah5038
@niravshah5038 2 ай бұрын
Even after giving data type as nvarchar, I cannot see other characters rather than english in my database
@MayankGadiya-uq1el
@MayankGadiya-uq1el Ай бұрын
please do a detail video on how to do connection from jupyter to sql and explain all engine conn, sqlalchemy etc
@ankitbansal6
@ankitbansal6 Ай бұрын
Watch previous project video
@adityajoshi2797
@adityajoshi2797 26 күн бұрын
Please help me any of video to give me to create directory of kaggle in local machine.m
@mohammadfurquan241
@mohammadfurquan241 2 ай бұрын
Thanks alot sir. I have a suggestion please at the end of the video or in description please put how someone can mention this project in resume with project description in bullet points I am a fresher so it will help me alot. Thank you so much sir ❤
@tanyachugh1640
@tanyachugh1640 2 ай бұрын
Hi @Ankit Bansal, Are there any additional settings needs to be done in SQL server management studio for the special characters to be visible. I have followed the steps twice, but still it is showing question mark for me.
@ankitbansal6
@ankitbansal6 2 ай бұрын
Data type should be nvarchar ?
@tanyachugh1640
@tanyachugh1640 2 ай бұрын
@@ankitbansal6yes I am giving nvarchar only
@TarunDhimanOfficial
@TarunDhimanOfficial Ай бұрын
@@ankitbansal6 even after using nvarchar, special characters are still showing as ? ? ? ?.
@piyushsharma8294
@piyushsharma8294 Ай бұрын
check reply to @VaibhaviSuresh-bw8hq
@roopesh3837
@roopesh3837 2 ай бұрын
In Netflix table why its 8807 it should be 8804 after removing 3 duplicates and where clause is removed by mistake?
@ankitbansal6
@ankitbansal6 2 ай бұрын
You are right where clause I missed to retain unique rows. My bad.
@mansinayak3360
@mansinayak3360 15 күн бұрын
Hi Ankit, I can't see the Japanese characters in title post changing the dtype to nvarchar it's showing question marks. I've been searching what could be the reason. Need you suggestion to resolve this.
@Kelvin2568
@Kelvin2568 8 күн бұрын
Do you solve it? I have the same problem after changing the data type
@austinmkruahsr.615
@austinmkruahsr.615 Ай бұрын
This is wonderful, can I use this same method for postgresql? Please help me...
@ankitbansal6
@ankitbansal6 Ай бұрын
Yes
@shubhamravikar6029
@shubhamravikar6029 Ай бұрын
Hi @Ankit Bansal, I have tried a lot in creating a table using the nvarchar but still it shows the ??? Question mark sign and I have seen all the replies in the comment box but I couldn't find the solution for it. Please help it out so that I can proceed with the project.
@ankitbansal6
@ankitbansal6 Ай бұрын
You can leave it as it is and proceed to the next tasks .
@shubhamravikar6029
@shubhamravikar6029 Ай бұрын
@@ankitbansal6 Okay, Thanks
@manishasaxena9829
@manishasaxena9829 Ай бұрын
at 28:40, you said that we can't see null because of string split.. Just my thought, isn't it because you removed null at 8:44?
@ankitbansal6
@ankitbansal6 Ай бұрын
I didn't remove it. It was just checking the max length and that time removed in analysis only. Not in actual data
@manishasaxena9829
@manishasaxena9829 Ай бұрын
@@ankitbansal6 oh yes, you're right, my bad. Your content is really helpful and very easy to follow. keep uploading such videos. Thank you!
@abhinavumrao8453
@abhinavumrao8453 Ай бұрын
For question number 2 for SQL analysis. Your inner join with netflix table how you are joining on ng.show_id = nc.show_id.....shouldn't be ng.show_id = n.show_id ?? Please clarify my doubt 🙋‍♂️ 🙏.
@abhinavumrao8453
@abhinavumrao8453 Ай бұрын
And if its wrong , how it still gave output for below mapping?? ng.show_id = nc.show_id
@itsyogijangir
@itsyogijangir 2 ай бұрын
How can we removed special sign like ₹ sign symbol in MSSQL server ,i am not able to do it.
@adilmajeed8439
@adilmajeed8439 2 ай бұрын
Use replace function
@itsyogijangir
@itsyogijangir Ай бұрын
@@adilmajeed8439 not working for ₹ sign.
@rachitkeelpur
@rachitkeelpur Ай бұрын
Please help me to by this combo course, i want to learn SQL in Hindi and python in English
@ankitbansal6
@ankitbansal6 Ай бұрын
Send email to sql.namaste@gmail.com
@simplytech4u898
@simplytech4u898 2 ай бұрын
How to use PostgreSQL here if MS SQL is not present any ref video will be helpful..
@ankitbansal6
@ankitbansal6 2 ай бұрын
You can just Google . It's a simple change.
@simplytech4u898
@simplytech4u898 2 ай бұрын
i have figure it out how to import in postgreSQL thakns for amzing project video
@BhakthiYoutube
@BhakthiYoutube 2 ай бұрын
Is it end to end data engineering project ? Looks like etl only rught
@mohammadfurquan241
@mohammadfurquan241 2 ай бұрын
It's a end to end ETL project which comes under Data Engineering. Hope you got it.
@gamingfun5309
@gamingfun5309 2 ай бұрын
Sir how I can connect with mysql
@LearnDataSceince
@LearnDataSceince 2 ай бұрын
import pandas as pd import pymysql from sqlalchemy import create_engine # Database connection details username = 'your username' password = 'your password' host = 'host' port = 'port number' database = 'your database name' # Create pymysql connection connection = pymysql.connect(host=host, port=port, user=username, passwd=password, db=database) df = pd.read_csv('netflix_titles.csv') connection_string = f"mysql+mysqlconnector://{username}:{password}@{host}/{database}" engine = create_engine(connection_string) try: df.to_sql('netflix_raw', con=engine, index=False, if_exists='append') print("DataFrame written to MySQL table 'netflix_raw' successfully.") except Exception as e: print(f"Error: {e}")
@ankitbansal6
@ankitbansal6 2 ай бұрын
Just Google. It's a simple change
@VaibhaviSuresh-bw8hq
@VaibhaviSuresh-bw8hq Ай бұрын
Hi @ankitbansal6, Thanks for making this video its really helpful and informative. I am also trying to implement the same but encountering one small issue, I am not able to convert the special characters into string even after changing the table definition to nvarchar still I ma getting the value as '????'. Can anyone help me with this? I have also tried to load the data using the encoding encoding='utf-8' in my pyspark script.
@piyushsharma8294
@piyushsharma8294 Ай бұрын
There seems to be a problem with collation & along with 'nvarchar', we need to change the collation for database as well. You can fix that by writing this code: ALTER DATABASE [Database_Name] SET SINGLE_USER WITH ROLLBACK IMMEDIATE; GO ALTER DATABASE [Database_Name] Latin1_General_100_CS_AS_KS_WS_SC_UTF8; GO ALTER DATABASE [Database_Name] SET MULTI_USER; GO just adjust your database name in below [Database_Name] & it should work fine! [Edit: these is slight change in the collation name]
@vaibhavisuresh04
@vaibhavisuresh04 Ай бұрын
Okay Thankyou!😊 I will try
@tanyachugh1640
@tanyachugh1640 Ай бұрын
@@vaibhavisuresh04 Hi, Could you please let me know, if the issue got resolved or not?
@sukhwinder101
@sukhwinder101 Ай бұрын
bhai tumhara sql to bot bhadiya hai
@ladiashrith5230
@ladiashrith5230 Ай бұрын
Still I am getting Questions marks for title even it is nvarchar, how can I resolve it?😒 @ankithbansal6
What does a Data Analyst actually do? (in 2024) Q&A
14:27
Tim Joo
Рет қаралды 24 М.
50 YouTubers Fight For $1,000,000
41:27
MrBeast
Рет қаралды 135 МЛН
Nutella bro sis family Challenge 😋
00:31
Mr. Clabik
Рет қаралды 13 МЛН
Smart Sigma Kid #funny #sigma #comedy
00:25
CRAZY GREAPA
Рет қаралды 27 МЛН
Top 5 FREE Resources to 10X Your Data Engineering Skills
11:49
Jash Radia
Рет қаралды 48 М.
End to End Data Analytics Project (Python + SQL)
46:52
Ankit Bansal
Рет қаралды 108 М.
SQL Interview questions | Data Analyst | Part - 1
11:56
The ML Mine
Рет қаралды 3,7 М.
Cracked Myntra as Data Analyst with 1 Year Experience
13:56
Ankit Bansal
Рет қаралды 15 М.
SQLAlchemy: The BEST SQL Database Library in Python
16:39
ArjanCodes
Рет қаралды 56 М.
50 YouTubers Fight For $1,000,000
41:27
MrBeast
Рет қаралды 135 МЛН