No video

Training a Code-First Model in Azure ML

  Рет қаралды 2,958

Kevin Feasel

Kevin Feasel

Күн бұрын

In this video, we walk through the code in an Azure Machine Learning project and see how the pieces fit together.
LINKS AND INFO
Catallaxy Services - www.catallaxys...
Demo code and notebook -- github.com/fea...
Chicago parking tickets dataset -- cspolybasepubl...
Chicago parking tickets database -- sqlsunday.com/...
One-Hot encoding in machine learning -- www.geeksforge...

Пікірлер: 20
@wayne7936
@wayne7936 7 ай бұрын
Love how to you go through each logical block of code, block-by-block with highlighting. This series is gold.
@caiyu538
@caiyu538 7 ай бұрын
Great lectures.
@emrynhofmann5757
@emrynhofmann5757 Ай бұрын
Great video! - Just wondering why you chose to use YAML files over the python SDK's CommandComponent()? - Is one better than another, or is it more of a personal preference?
@KevinFeasel
@KevinFeasel Ай бұрын
Entirely personal preference. They'll both work the same way, so the only reason I prefer the YAML version at all is because I can also use similar files with the az cli and run R or .NET code to perform similar deployments without changing quite as much.
@user-ng4pp7fp6t
@user-ng4pp7fp6t 2 ай бұрын
Hi Kevin, can you pls record videos with a higher volume. Not able to hear. Good job though on the content. Thanks
@KevinFeasel
@KevinFeasel 2 ай бұрын
I think I can do that for future videos. I know I typically have the gain turned way down to minimize outside noises but I should be able to do something for the videos.
@BusinessDuck69
@BusinessDuck69 3 ай бұрын
Can you explain your directory stucture a little bit here? In the previous video we made a notebook to perform EDA on the dataset. That was a ipynb file in User/username. Is your 'Pipeline' folder, a folder in this same directory?
@KevinFeasel
@KevinFeasel 3 ай бұрын
The Pipeline folder is separate from the Notebook folder. All of the code is available at github.com/feaselkl/Beyond-the-Basics-with-AzureML. Inside the Pipeline folder, we have three sub-folders: - config/ -- This stores the YAML files that we use for each step in the process. - scripts/ -- This stores the Python scripts that we use for each process. Because I am a simple man who tries not to complicate things too overly much, we have one Python script per config file and the names are very similar. - data/ -- This contains a couple data files we use for testing purposes. We also have two scripts, deploy-train.py and deploy-score.py, which handle training and scoring, respectively. There's no strict requirement to use this folder structure with Azure ML, though it does follow some reasonable practices around separating YAML and code.
@BusinessDuck69
@BusinessDuck69 3 ай бұрын
@@KevinFeasel Hello Kevin, Thanks for the detailed response, I appreciate it. I apologize for not being a bit more clear because you explained most of this directory structure in your video already. What I actually want to know is where your repository, 'beyond-the-basics-with-azureml' lies within the AzureML filesystem? Is it a folder within Authoring > Notebooks -> Users/username? I'm not super familiar with Azure ML yet, so I'm not sure if there is another file system somewhere. In the previous video, I took your advice and used VScode so that I could commit my notebook to a Github repo. What ended up happening is it published the entire folder as a repo, so everything within my compute instance (which is Logs, and Users/username/ChicagoParking.ipynb). Since the video only shows us the directory structure inside the repo, I want to know what is the parent folder of 'beyond-the-basics-with-azureml', so I can organize my repos in a more professional way. Thanks again, I love these videos so far!
@KevinFeasel
@KevinFeasel 3 ай бұрын
@@BusinessDuck69 Ah, I understand the question now. The short answer is, you don't need to upload the files at all. We run the code locally, rather than on our compute instance. We ran the notebook on the compute instance because that infrastructure is already there. Locally, I just have them in a \SourceCode\ folder that holds all of my Git repos. That said, if you want to run the code from your compute instance rather than local machine, you could upload the files into /Users/{username}/beyond-the-basics-with-azureml/, like you have it. Ultimately, think of the compute instance as a data science VM, and the storage associated with it is accessible across the team, but more as a set of scratch files. So admittedly, my AML folder structures sometimes look like a bit of a mess. The "real" outcome should be whatever you put into source control.
@tianhaoluo6782
@tianhaoluo6782 2 ай бұрын
In order to submit the job at 20:14, do we need to set up Azure CLI and/or SDK ahead of time? On my end it looks like it is using my local version of Python. Thank you!
@KevinFeasel
@KevinFeasel 2 ай бұрын
You should have the az cli installed beforehand, yes. I have the pre-requisites in the readme file for the demo code: github.com/feaselkl/Beyond-the-Basics-with-AzureML When it does run, it will use your local Python installation, yes. But that will then make a call to the Azure ML API and request executing code on a compute instance there, so you're not doing actual model training locally, just orchestrating it locally.
@user-jt3bl8jk7k
@user-jt3bl8jk7k 5 ай бұрын
First of all, thank you very much for making these videos, very informative! I have one question though, regarding the output parameters of the pipeline components. It seems you're never specifying the actual path of the output directories for several components, how do they know where to write the files to? Am I missing something?
@KevinFeasel
@KevinFeasel 5 ай бұрын
You are correct that I don't specify the directories for output data for the steps. I do specify the input directory in deploy-train.py's build_pipeline() function on lines 85-100. For example, line 86 has a call to replace_missing_values(input_data=raw_data), where raw_data is the ChicagoParkingTicketsFolder folder input I created in Azure ML. The output location of this first step is something Azure ML injects during my call, not something I explicitly provide. I do, however, have a pointer to wherever that location is, using {step_name}.outputs.output_data. That's how I'm able to chain each of the steps together, and within each Python script, I grab the input and output data as arguments that the central calling code generates after I submit a job.
@user-jt3bl8jk7k
@user-jt3bl8jk7k 5 ай бұрын
@@KevinFeaselWonderful, thank you for the explanation. So, there's no need to explicitly provide values for the component's outputs!
@camilacarrasco7193
@camilacarrasco7193 8 ай бұрын
Great video! Just one question, how can I use an Azure Virtual Machine as a compute cluster? In the docs I read this is possible by attaching the Machine in the workspace, but I haven't found any practical example on how to configure the pipeline to run it in a selected Azure Virtual Machine
@KevinFeasel
@KevinFeasel 7 ай бұрын
Sorry for the late response--I just caught this. It looks like the examples Microsoft has provided are around the v1 SDK. Examples: learn.microsoft.com/en-us/python/api/azureml-core/azureml.core.compute.remote.remotecompute?view=azure-ml-py github.com/Azure/MachineLearningNotebooks/blob/master/how-to-use-azureml/training/train-on-remote-vm/train-on-remote-vm.ipynb I haven't seen any examples using the v2 SDK, though.
@TimoKinnunen
@TimoKinnunen 11 ай бұрын
A method needs arguments, but is parameter used as argument or vice versa? Naming is important. I see arguments are used instead of parameters. To call a method should be done using parameters and inside method parameters become arguments, What do you think about parameter and argument?
@KevinFeasel
@KevinFeasel 11 ай бұрын
I was fairly sloppy with "argument" versus "parameter" in that video, not following the rule that parameters are the labels defining variables and that arguments are the actual values passed in. There's a good chance that, as a result, I've used the wrong term at least once in the video. It's something I'll aim to improve in future videos.
@TimoKinnunen
@TimoKinnunen 11 ай бұрын
@@KevinFeasel Thank you for your answer, the video is great and I understood only parts of it. I am not writing about your specific use in this video and write about parameter and argument more generally. I know parameter and argument is used interchangeable in the whole programming community in our world. And it is annoying me. If I call a method then call what I send into the method parameters. The method needs arguments to do it's job. Yes. They are parameters sent to the method. I am not complaining your use of parameter and argument. The actual values outside the method (or function) are parameters and when they arrive inside method they are arguments, I think. The actual values from variables are now in the method, as you correctly point out. Example: Main() and command-line arguments: static void Main(string[] args){...}. This is what we learn! Why not like this: static void Main(string[] params){...}.
Scoring a Code-First Model in Azure ML
12:41
Kevin Feasel
Рет қаралды 1,5 М.
Code-First Programming with Azure ML
15:26
Kevin Feasel
Рет қаралды 4,2 М.
Blue Food VS Red Food Emoji Mukbang
00:33
MOOMOO STUDIO [무무 스튜디오]
Рет қаралды 20 МЛН
SPILLED CHOCKY MILK PRANK ON BROTHER 😂 #shorts
00:12
Savage Vlogs
Рет қаралды 50 МЛН
王子原来是假正经#艾莎
00:39
在逃的公主
Рет қаралды 16 МЛН
Performing Automated Machine Learning with AzureML
14:03
Kevin Feasel
Рет қаралды 10 М.
Deploying a Model with the Azure ML Designer
14:11
Kevin Feasel
Рет қаралды 7 М.
How to Integrate Azure ML with Power BI
5:07
Addend Analytics - Microsoft Gold Partner
Рет қаралды 402
An Intro to MLflow and Azure ML
11:43
Kevin Feasel
Рет қаралды 4,7 М.
A Primer on Machine Learning Operations (MLOps)
15:20
Kevin Feasel
Рет қаралды 1 М.
Azure Machine Learning Studio
41:33
Dr Tun
Рет қаралды 77 М.
How to run a Jupyter notebook in Azure ML Studio
14:26
CloudSchool TV
Рет қаралды 489
The cloud is over-engineered and overpriced (no music)
14:39
Tom Delalande
Рет қаралды 563 М.
How I would learn Machine Learning (if I could start over)
7:43
AssemblyAI
Рет қаралды 755 М.