AWS Tutorials - AWS Glue Data Quality - Automated Data Quality Monitoring

  Рет қаралды 8,472

AWS Tutorials

AWS Tutorials

Жыл бұрын

AWS Data Quality is an automated serverless services to monitor and evaluate data qualilty both at rest and in move within the ETL job. It can evaludate qualilty for both statistics and values of the data. Learn how to use AWS Data Quality to evaluate data at rest as well as in move.

Пікірлер: 20
@mranaljadhav8259
@mranaljadhav8259 Жыл бұрын
Welcome back sir, waiting for your more videos .. I learned alot from you... Thanks for providing this tutorials for free
@AWSTutorialsOnline
@AWSTutorialsOnline Жыл бұрын
So nice of you
@lucasoliveira7309
@lucasoliveira7309 4 ай бұрын
Great video, i was already going to resolve that with a lambda, so more easy with glue data quality, thank you
@pathakhemant-eb3du
@pathakhemant-eb3du Жыл бұрын
hey I love your tutorials, Thank You for making our life simpler. so I want know that can we do data warehouse testing with this tool when tables is in Redshift
@AWSTutorialsOnline
@AWSTutorialsOnline Жыл бұрын
Currently it support hive metastore with S3 bucket only.
@hsz7338
@hsz7338 Жыл бұрын
Thank you for taking us through the new feature that AWS Glue offers. Do you see Glue Data Quality replacing Glue Data Brew, at least from the Data Quality perspective?
@AWSTutorialsOnline
@AWSTutorialsOnline Жыл бұрын
I don't think data quality in Brew will be replaced. Both will exist. Brew is more for adhoc data preparation and Glue job for automated. Both need data quality feature for their purposes.
@chengchangyu
@chengchangyu Жыл бұрын
thanks for the video. very details.
@AWSTutorialsOnline
@AWSTutorialsOnline Жыл бұрын
Glad it was helpful!
@arunr2265
@arunr2265 Жыл бұрын
Welcome back brother. waiting for your videos. Hope everything is fine
@AWSTutorialsOnline
@AWSTutorialsOnline Жыл бұрын
All good. Sorry for a long pause from my side,
@arun.ayilliath
@arun.ayilliath Жыл бұрын
Great demo! The retry count should have been 0 to prevent re-running.
@AWSTutorialsOnline
@AWSTutorialsOnline Жыл бұрын
agree. I realized it later
@yinggamonkulsarapitak7948
@yinggamonkulsarapitak7948 Жыл бұрын
Great vid! Thanks! Can this Data Quality integrated with CI/CD and Terraform?
@AWSTutorialsOnline
@AWSTutorialsOnline Жыл бұрын
You mean to be able to configure Data Quality using infrastructure as Code. I am not sure - I did not check CloudFormation or Terraform. But it does support APIs for sure.
@scotter
@scotter Жыл бұрын
In your Glue demo, it *seemed* you skipped showing a part. How did you get from a file being dropped into the S3 bucket/sales to it becoming a table? I'm looking for the most code-light way to set this up so my lambda will somehow be triggered once the file is turned into a table, so my lambda can then run the rules defined in console and then write a log file to other S3 bucket/folder of which rows/columns failed. Thank you!
@AWSTutorialsOnline
@AWSTutorialsOnline Жыл бұрын
The table is created using AWS Crawler. I did not mention that because I have covered than my other tutorials.
@user-tm2dw4iv9k
@user-tm2dw4iv9k Жыл бұрын
Is it possible to this entire thing using boto3 in python
@cheluveshab9525
@cheluveshab9525 Жыл бұрын
Hi Brother, I’m a big fan of yours. I have learned many things from your channel and thanks a lot. Please provide your LinkedIn.
@AWSTutorialsOnline
@AWSTutorialsOnline Жыл бұрын
Many Thanks, sorry for a long pause from my side.
AWS Tutorials - AWS Glue Studio integration with Code Repository
20:20
AWS Tutorials - Data Quality Check using AWS Glue DataBrew
42:50
AWS Tutorials
Рет қаралды 9 М.
I Can't Believe We Did This...
00:38
Stokes Twins
Рет қаралды 122 МЛН
What it feels like cleaning up after a toddler.
00:40
Daniel LaBelle
Рет қаралды 56 МЛН
AWS Tutorials - Partition Data in S3 using AWS Glue Job
36:09
AWS Tutorials
Рет қаралды 17 М.
AWS Tutorials - Working with Data Sources in AWS Glue Job
42:06
AWS Tutorials
Рет қаралды 9 М.
AWS Tutorials - Handling PII Data in AWS Glue
35:12
AWS Tutorials
Рет қаралды 4,1 М.
Data Pipelines Explained
8:29
IBM Technology
Рет қаралды 144 М.
Implementing Effective Data Quality
41:46
datasourcetv
Рет қаралды 50 М.
Отдых для геймера? 😮‍💨 Hiper Engine B50
1:00
Вэйми
Рет қаралды 1,2 МЛН
1$ vs 500$ ВИРТУАЛЬНАЯ РЕАЛЬНОСТЬ !
23:20
GoldenBurst
Рет қаралды 1,8 МЛН
Samsung Galaxy 🔥 #shorts  #trending #youtubeshorts  #shortvideo ujjawal4u
0:10
Ujjawal4u. 120k Views . 4 hours ago
Рет қаралды 8 МЛН
АЙФОН 20 С ФУНКЦИЕЙ ВИДЕНИЯ ОГНЯ
0:59
КиноХост
Рет қаралды 1,1 МЛН