Image Classification with scikit-learn

  Рет қаралды 1,181

:probabl.

:probabl.

Күн бұрын

Scikit-Learn is most known for building machine learning models for tabular data, but that doesn't mean that it cannot do image classification. In this video we'll show you how by using a plugin from the ecosystem called "embetter".
Video Chapters:
00:00 Introduction
01:14 General Pipelines
03:46 Image Pipelines
05:33 Embetter
07:41 Benchmark
To learn more about embetter, you can check out the project here:
github.com/koaning/embetter
The code for all of our videos can be found on this Github repository:
github.com/probabl-ai/youtube...
The code for this specific episode can be found here:
github.com/probabl-ai/youtube...
If you're keen to see more videos like this, you can follow us over at @probabl_ai.

Пікірлер: 7
@MrChristonik
@MrChristonik 2 ай бұрын
Hi, great work. Would it make sense to continue with feature extraction (as per the tabular logic) after the Clip Encoder is applied and the numeric output is provided?
@probabl_ai
@probabl_ai 2 ай бұрын
(vincent here). Nothing is stopping you from doing that. You could have a pipeline with two image feature extractors if you really wanted to. In general though, for rapid prototyping, I've found that a single CLIP encoder already does plenty of heavy lifting. For text use-cases I've sometimes done a bag-of-words featurizer next to a BERT featurizer, which had some merits to it.
@muthukamalan.m6316
@muthukamalan.m6316 3 ай бұрын
why not cv2😢
@serafeiml1041
@serafeiml1041 3 ай бұрын
Why not just flatten the image into a 1D vector and use this as input to the model?
@vincentd.warmerdam1422
@vincentd.warmerdam1422 3 ай бұрын
You might end up with 1D vectors of different size. Different images will have different sizes, and the estimator would expect a fixed size. The embedding trick will always output the same dimensions going out.
@vincentmaladiere1285
@vincentmaladiere1285 3 ай бұрын
and also the embeddings provided by the CLIP encoder are likely to bring much more predictive power to the table than the raw image itself, e.g. the scores at the end of the CV would probably be lower
@Mayur7Garg
@Mayur7Garg 3 ай бұрын
You might lose spatial information that way.
The Quantile Trick
12:53
:probabl.
Рет қаралды 1,4 М.
Image classification with Python and Scikit learn | Computer vision tutorial
32:28
Computer vision engineer
Рет қаралды 45 М.
La final estuvo difícil
00:34
Juan De Dios Pantoja
Рет қаралды 30 МЛН
Be kind🤝
00:22
ISSEI / いっせい
Рет қаралды 22 МЛН
Эффект Карбонаро и бесконечное пиво
01:00
История одного вокалиста
Рет қаралды 6 МЛН
ELE QUEBROU A TAÇA DE FUTEBOL
00:45
Matheus Kriwat
Рет қаралды 29 МЛН
Pipelines for convenience, *and* safety
16:42
:probabl.
Рет қаралды 638
Introducing: Sample Space
1:31
:probabl.
Рет қаралды 368
Histograms for faster boosting
10:24
:probabl.
Рет қаралды 900
Expanding on GMMs
57:06
:probabl.
Рет қаралды 547
Welcome to the Scikit-learn MOOC!
0:57
Inria Learning Lab
Рет қаралды 2,5 М.
Why tree gradients give you a boost
11:16
:probabl.
Рет қаралды 1,9 М.
La final estuvo difícil
00:34
Juan De Dios Pantoja
Рет қаралды 30 МЛН