I can't believe you only have 30k subs. You're the best R teacher I've encountered so far.
@haraldurkarlsson1147 2 days ago
Very interesting. I still think that in this dataset little is gained by imputation. With so few missings it has practically no effect. However, the exercise itself is of great value because it demonstrates how (and perhaps when) to impute different types of missing values. I would like to see more on imputation and perhaps a bit of a dive into the different types of missings (MAR, MCAR and MNAR), which boggle the mind. Thanks for another great video.
@munenedenis3377 3 days ago
Thank you for this Prof.
@user-ve3no9pm9w 3 days ago
for working_day I would sort the data (already done) then group_by day and then tidyr::fill and then ungroup
@EquitableEquations 3 days ago
Yep! That works great unless midnight is missing, in which case the wrong day gets filled 🤓
@user-ve3no9pm9w 2 days ago
@EquitableEquations True, I forgot that. After some consideration, an if/else should do the trick when the value is either 0 or 1 each day; otherwise, I guess a fill in both directions.
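A minimal sketch of the grouped fill discussed in this thread, assuming a toy data frame with hypothetical columns `day`, `hour`, and `working_day` (not the video's actual data). It contrasts a plain downward fill, where a missing midnight value inherits the previous day's value, with a fill in both directions inside each day.

```r
library(dplyr)
library(tidyr)

# Toy data (hypothetical): working_day is constant within a day,
# and the midnight (hour 0) value of day 2 is missing.
bikes <- tibble(
  day         = c(1, 1, 1, 2, 2, 2),
  hour        = c(0, 1, 2, 0, 1, 2),
  working_day = c(1, NA, 1, NA, 0, 0)
)

# Ungrouped downward fill: day 2's midnight wrongly inherits day 1's value.
naive <- bikes |>
  arrange(day, hour) |>
  fill(working_day, .direction = "down")

# Grouped fill in both directions: each day only borrows from itself.
filled <- bikes |>
  arrange(day, hour) |>
  group_by(day) |>
  fill(working_day, .direction = "downup") |>
  ungroup()
```

This only makes sense when the variable really is constant within each group; a day with no observed value at all stays NA.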
@tzvetanzlatanov6349 3 days ago
Excellent explanation and well presented, appreciated! It would be nice to make a video about using a formula in the smoothing function in ggplot2. For example, sometimes a squared term is used as a predictor in Poisson regression, and then just the glm method in the smoothing function will not be sufficient.
@EquitableEquations 3 days ago
I totally agree! On my list for sure. I have a vid about the ggplot loess smoother coming out next month, so maybe after that would be the right time.
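As a rough illustration of the request above, here is a sketch that passes both a formula with a squared term and a Poisson family to `geom_smooth()`. The data are simulated and the names `sim` and `p` are hypothetical; this is not taken from any of the channel's videos.

```r
library(ggplot2)

# Simulated count data (hypothetical) with a quadratic trend on the log scale.
set.seed(1)
sim <- data.frame(x = runif(200, -2, 2))
sim$y <- rpois(200, lambda = exp(1 + 0.5 * sim$x - 0.4 * sim$x^2))

# formula adds the squared term; method.args passes the family on to glm().
p <- ggplot(sim, aes(x, y)) +
  geom_point(alpha = 0.4) +
  geom_smooth(
    method      = "glm",
    formula     = y ~ x + I(x^2),
    method.args = list(family = poisson)
  )
```

Printing `p` draws the points with the fitted Poisson curve. Without the `formula` argument, the smoother would fit `y ~ x` only.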
@logmasta 3 days ago
Nice!😊
@pipertripp 4 days ago
For folks using RStudio: if you want info about a function, put the cursor on the function and hit the F1 key. It does the same thing as ?my_func in the console but is faster if you have already included the function in your code.
@EquitableEquations 4 days ago
You can find materials supporting this vid (and others) at github.com/equitable-equations/youtube.
@ichigokurosaki-ei9sq 4 days ago
❤
@spbynum 6 days ago
How do you do code chunks for imported datasets? I keep getting “object not found.”
@EquitableEquations 4 days ago
Hi! I recommend working in a project (kzfaq.info/get/bejne/g8qEp7d_nba5oYk.html), which makes file management much simpler. For more sophisticated file structures, you can also use the {here} package (kzfaq.info/get/bejne/pc5jlZacmtuxZ3k.html). Good luck!
@damirb6294 7 days ago
Interpretation of the results is missing.
@johneagle4384 8 days ago
Thank you so much. These error messages should be given in English and not only in jargon.
@tmitra001 8 days ago
hi it is not saving in smits data after you split! right?
@ilikeanimals7328 8 days ago
I come from Python (Pandas) and have been wanting to increase my skillset to include R. This was an excellent introduction, thank you!
@thebennator 9 days ago
Any advice for a 20-year-old freshman in data science who has no skills at all and is trying to build himself a good income?
@EquitableEquations 9 days ago
Hi! In addition to becoming expert in one programming environment to start, I recommend finding a domain of particular interest and building relationships there. Most data science requires a lot of domain knowledge and collaboration.
@mugomuiruri2313 10 days ago
halo dr
@mugomuiruri2313 10 days ago
good
@EquitableEquations 11 days ago
Cool links: Heather Mattie on Google Scholar: scholar.google.com/citations?user=f_02DZ0AAAAJ. On LinkedIn: www.linkedin.com/in/heather-mattie-71b98039/. On X: @HeatherMathy. Obermeyer et al., Dissecting racial bias in an algorithm used to manage the health of populations: www.science.org/doi/10.1126/science.aax2342
@mohammadrezababaei8871 11 days ago
Great videos. I've been watching the whole playlist, and it really helped me review all the necessary ggplot arguments. Keep it up, and please add more advanced lessons.
@hassetdereje7640 12 days ago
This was very helpful, thank you so much.
@haraldurkarlsson1147 12 days ago
Nice presentation. In a Quarto book with chapters, how do you reference one chapter from another? If your chapters have title: Cross Refs with ---- above and below, then how do you add a cross-ref to that chapter? The Quarto website suggests using a level-one heading (#), e.g. # Cross Ref #sec-crossref, but I don't see how that works with the ---- setup. Any ideas (other than just changing all the chapter titles from the ---- format)?
@SyedMuhammadIqbal-c9d 14 days ago
I didn't understand the concept of k and what it would be. Can you solve an example by letting k equal some value?
@EquitableEquations 13 days ago
Hi! It's helpful to think of k as carrying information about the size (broadly speaking) of the rejection region. It's related to the level of significance and can be computed once that's specified, but this isn't typically helpful to do. The important thing in the NP-lemma is that such a k *exists*.
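To make the role of k concrete, here is a sketch for one standard case. It assumes a sample X_1, ..., X_n from N(μ, 1), the simple hypotheses H_0: μ = μ_0 versus H_1: μ = μ_1 with μ_1 > μ_0, and the convention of rejecting when the ratio L(μ_0)/L(μ_1) is at most k. These choices are illustrative assumptions and may not match the setup used in the video.

```latex
\begin{align*}
\frac{L(\mu_0)}{L(\mu_1)}
  &= \exp\!\Big( n\bar{x}\,(\mu_0-\mu_1) + \tfrac{n}{2}\,(\mu_1^2-\mu_0^2) \Big) \le k \\
\iff \quad \bar{x} &\ge c,
  \qquad c = \frac{\mu_0+\mu_1}{2} - \frac{\ln k}{n(\mu_1-\mu_0)} .
\end{align*}
% Size condition under H_0, where \bar{X} \sim N(\mu_0, 1/n)
% and z_\alpha is the upper-\alpha quantile of N(0, 1):
\[
P_{\mu_0}\big(\bar{X} \ge c\big) = \alpha
\quad\Longrightarrow\quad
c = \mu_0 + \frac{z_\alpha}{\sqrt{n}} .
\]
```

So k and the cutoff c determine each other. Once α is fixed, c follows from the second line, and the first line could be solved for k, but the test only ever needs c.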
@rafalsza 15 days ago
why not in 5 minutes?
@EquitableEquations 14 days ago
Play it at 10x and learn it in 4 why not??
@araeisi 15 days ago
I do appreciate it. R is a new programming language to me, and I was browsing YouTube to see what I should do and where to start. Yours, in my opinion, is the best one. Thank you.
@josenavarrete6151 17 days ago
Does the fill command work only on NA cells, or also on empty cells?
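A small sketch bearing on this question, assuming "the fill command" means `tidyr::fill()` and using a toy column invented for illustration. `fill()` replaces only `NA`; an empty string counts as a real value, so it has to be converted with `dplyr::na_if()` first.

```r
library(dplyr)
library(tidyr)

# Toy column (hypothetical) mixing real NAs with an empty string.
cells <- tibble(group = c("a", NA, "", NA, "b"))

# fill() only replaces NA; the empty string is kept and even carried forward.
filled_as_is <- cells |> fill(group)

# Converting "" to NA first lets fill() treat both kinds of gap the same way.
filled_clean <- cells |>
  mutate(group = na_if(group, "")) |>
  fill(group)
```

Here `filled_as_is$group` still contains empty strings, while `filled_clean$group` carries "a" down through every gap.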
@geokola 18 days ago
Will have to watch this video many more times at my pace. Otherwise, very nice and simplified video. Thanks Richard.
@lowbudgettravelerbd 19 days ago
thanks a lot
@johnbainbridge1931 19 days ago
Hi apologies I'm not quite sure how to search within the channel but does Andrew have an intro into Shiny apps?
@EquitableEquations 19 days ago
Not yet, I'm sorry! If you go to the channel's homepage it should be searchable. I'm doing machine learning stuff mostly right now.
@hafidhahmadrizqipratama7608 19 days ago
Thank you so much for the tutorial video! Very helpful for me on learning a lot about data!!!
@loveumesi3577 20 days ago
Great tutorial! Thank you! I've been frustrated with replicating R commands that I don't really understand, but your videos have made that clearer.
@amanmalik1683 21 days ago
Thanks for helping me deal with R in just one video!!!
@felinamegan6352 21 days ago
thanks this was a great help!
@haraldurkarlsson1147 21 days ago
Just a note - the conv_multiunit function from the measurements package does a nice job on unit conversions (miles per gallon to km per liter)
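For readers who want to see the arithmetic behind that conversion, here is a hand-rolled sketch in base R. It does not use the measurements package, and `mpg_to_kml` is a hypothetical helper name; it assumes US gallons.

```r
# Exact definitions: 1 mile = 1.609344 km, 1 US gallon = 3.785411784 L.
km_per_mile  <- 1.609344
l_per_gallon <- 3.785411784

# Hypothetical helper, not part of the measurements package.
mpg_to_kml <- function(mpg) mpg * km_per_mile / l_per_gallon

# Convert the first few fuel-economy values from the built-in mtcars data.
kml <- mpg_to_kml(mtcars$mpg[1:3])
```

One mile per gallon works out to roughly 0.425 km per liter, so the conversion is a single multiplication.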
@haraldurkarlsson1147 22 days ago
Very nice. I am torn on whether one should start with base R or jump straight into the tidyverse when teaching R. I get the feeling that one does not appreciate the beauty and simplicity of the tidy (and pipe) approach until one has "suffered" through a little bit of base R's nested approach. After all, some things may not run or may be needlessly complex in the tidyverse but simple in base R. Any thoughts? Thanks.
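A tiny sketch of the contrast described here, using only base R (the native pipe `|>` requires R 4.1 or later). The vector `x` is made up for illustration.

```r
x <- c(1, 4, 9, 16)

# Nested base R: read from the inside out.
nested <- round(mean(sqrt(x)), 1)

# Native pipe: the same steps, read left to right.
piped <- x |> sqrt() |> mean() |> round(1)
```

Both give the same result. The difference is only in reading order, which becomes more noticeable as the chain of steps grows.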
@haraldurkarlsson1147 22 days ago
Andrew, your point about the missings is interesting, but it only amounts to about a percent (113 hrs) and may thus not be worth trying to impute. Besides that, there could be a simple explanation for the missing hours. Perhaps the bike company was closed for a holiday and/or business-related activities. Imputation might therefore not be warranted.
@EquitableEquations 22 days ago
Yep, totally! I actually have another vid set for release next week that talks about all of this before jumping in to add zeros. Those missing values are pretty much all at odd hours in the middle of the night. The effect on the model is on the order of 0.7 bikes per hour, which is surprisingly high.
@haraldurkarlsson1147 21 days ago
@EquitableEquations By the way, I see you are teaching in the Chicago region. I attended uni up there and have family in the area.
@haraldurkarlsson1147 22 days ago
There is only ONE value with heavy rain/snow, so why not just skip it? (Run Bikeshare |> count(weathersit) for a table.)
@haraldurkarlsson1147 22 days ago
Good discussion - especially of why you chose to use Poisson.
@carlosveramendi2226 23 days ago
Believe me, tons of hours and days get wasted before you finally run across valuable content like this. Thank you so much.
@nimababariya3852 24 days ago
hi, do you offer live online lessons to teach R?
@EquitableEquations 23 days ago
Hi! I do sometimes, but I'm not able to take on new clients right now, I'm sorry. Busy times!
@CaribouDataScience 24 days ago
Good stuff!
@serhatakay8351 25 days ago
There is nothing better than good old statistics in the era of ml/nn
@EquitableEquations 25 days ago
You can find materials supporting this vid (and others) at github.com/equitable-equations/youtube. Here's the link to my vid on the {broom} package: kzfaq.info/get/bejne/hd9hkqSQv86niWQ.html
@alarabalasaygun7936 29 days ago
awesome video!
@yourivyleaguegirl a month ago
For those who couldn't get the arrange function to work: library(dplyr); df <- diamonds_sm; df2 <- df %>% arrange(color); View(df2)
@yourivyleaguegirl a month ago
I'm getting an error: library(tidymodels) is not found, and "initial_split" is also not found.
@yourivyleaguegirl a month ago
How do you download air-quality files?
@yourivyleaguegirl a month ago
Please provide the csv/xls file or link
@samuelolaoluwa150 a month ago
You're awesome. I had to come back here, having watched one of your videos, Learning R in 39 minutes. I have a question, please. I'm doing the data analysis for my research, titled "Modeling of selected classification methods and comparison of their performance in the classification of anaemia patients among Nigerian children." I haven't used R much before this work, and I ran into an issue with my dataset. One of my predictor variables is continuous. When I declared its class in R as numeric, all values for that variable were overwritten as NAs. I can't check each cell because there are almost 11,000 observations. How do I find where the problem is?
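One way to track this down without checking cells by hand, sketched in base R on a made-up vector (`raw` stands in for the original, unconverted column). The idea is to convert a copy and list only the entries that were present but failed to convert.

```r
# Toy predictor (hypothetical) read in as text, with a few malformed entries.
raw <- c("11.2", "9,8", "10.5", "n/a", NA, "8.7g")

# Convert a copy, silencing the coercion warning so the result can be inspected.
converted <- suppressWarnings(as.numeric(raw))

# Rows that held a value in the raw data but became NA after conversion.
bad_rows <- which(is.na(converted) & !is.na(raw))

# Distinct offending strings, instead of scanning every cell.
bad_values <- unique(raw[bad_rows])
```

If every value turns into NA, the usual suspects are a decimal comma, a unit or currency symbol attached to each number, or a column that is a factor (convert it with `as.character()` before `as.numeric()`). These are common causes rather than a diagnosis of this particular dataset.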