
Should I shuffle samples with cross-validation?

844 views

Data School

By default, the cross_val_score function in scikit-learn does not shuffle samples. In this lesson, you’ll learn when you might need to shuffle and how to do it.
P.S. This is a lesson from my NEW course, "Master Machine Learning with scikit-learn." You can enroll here: courses.datasc...
For all paid courses, I offer location-based discounts (up to 75%) to people in 160+ countries. Check your discount here: courses.datasc...
Enroll in a FREE Data Science course here: courses.datasc...
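
As a rough sketch of the shuffling mentioned in the lesson summary above: one common way to shuffle is to pass a splitter object with shuffle=True as the cv argument of cross_val_score, instead of an integer. The dataset (iris), the model (LogisticRegression), and the random_state value are assumptions chosen for illustration and are not taken from the lesson itself.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold, StratifiedKFold, cross_val_score

# Illustrative data and model (assumptions, not from the lesson).
# The iris samples are stored sorted by class, so sample order matters here.
X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=1000)

# Without shuffling, KFold takes contiguous blocks of rows as test folds.
unshuffled_test = next(KFold(n_splits=5).split(X))[1]

# With shuffle=True, rows are randomly assigned to folds.
# random_state makes the shuffle reproducible.
shuffled_test = next(KFold(n_splits=5, shuffle=True, random_state=1).split(X))[1]

# Default behavior: an integer cv does not shuffle.
# For a classifier, an integer cv uses stratified folds.
scores_default = cross_val_score(model, X, y, cv=5)

# Shuffled, stratified folds passed explicitly as a splitter object.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)
scores_shuffled = cross_val_score(model, X, y, cv=cv)

print(scores_default.mean(), scores_shuffled.mean())
```

Shuffling is most relevant when the samples are stored in a non-arbitrary order, as in this illustrative dataset. For a classifier, an integer cv already stratifies by class, which is why the first call above does not fail on data sorted by class.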

Comments: 5
@dataschool 3 months ago
This is a lesson from my NEW course, "Master Machine Learning with scikit-learn." You can enroll here: courses.dataschool.io/master-machine-learning-with-scikit-learn
@sedighehnadaei1895 2 months ago
As always, you did great. Thank you so much ❤
@dataschool 2 months ago
You are so welcome!
@aleksandartta 3 months ago
Hello Kevin, thank you very much. I have two questions: 1) After hyperparameter tuning and cross-validation, should the final model be the one trained on the whole dataset (meaning train + validation set)? Am I right? 2) Do we need cross-validation if the dataset is very big (and how do we know how big)? That is, when is cross-validation not necessary?
@dataschool 3 months ago
Great questions! 1. Yes, re-train the tuned model on the entire dataset (meaning all samples for which you know the target value). 2. Yes, cross-validation is a useful model evaluation procedure with any size dataset, with the possible exception of a very tiny dataset. (Below a certain number of samples, no model evaluation procedure is particularly useful.) Hope that helps!
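
A minimal sketch of point 1 in the reply above (re-training the tuned model on all samples), assuming the tuning is done with GridSearchCV. The dataset, the model, and the parameter grid are illustrative assumptions and do not come from the comment thread. With the default refit=True, GridSearchCV re-fits the best parameter combination on all the data passed to fit, so no separate re-training step is needed in this setup.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

# Illustrative data, model, and grid (assumptions, not from the thread).
X, y = load_iris(return_X_y=True)
param_grid = {"C": [0.1, 1, 10]}

# Each candidate value of C is evaluated with 5-fold cross-validation.
grid = GridSearchCV(LogisticRegression(max_iter=1000), param_grid, cv=5)
grid.fit(X, y)

# Because refit=True by default, best_estimator_ has already been
# re-trained on all of X and y using the best parameters found.
final_model = grid.best_estimator_

print(grid.best_params_, grid.best_score_)
predictions = final_model.predict(X[:5])
```

The cross-validated score in best_score_ serves as the performance estimate, while final_model is the estimator to use for predictions on new data.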