Poisson regression in R
25:20
21 күн бұрын
Poisson regression
9:44
Ай бұрын
Simple linear regression in R
10:07
Bibliographies with R and Quarto
10:25
Cross references in R and Quarto
18:14
Return of the starwars data set
13:38
LaTeX for R users
26:08
5 ай бұрын
The gamma function
3:15
6 ай бұрын
Today I learned in R: Quarto
15:03
Write your own R functions
15:32
7 ай бұрын
Log-normal calculations using R
7:50
The log-normal distribution
9:07
8 ай бұрын
If-else statements in R
12:29
10 ай бұрын
Avoiding data dredging
12:11
11 ай бұрын
Пікірлер
@PatoLorenz
@PatoLorenz 23 сағат бұрын
Thanks for the great explanation!
@geogebraphtraining2501
@geogebraphtraining2501 Күн бұрын
Great job! Thank you very much!
@geogebraphtraining2501
@geogebraphtraining2501 Күн бұрын
I can't believe you only have 30k subs. You're the best R teacher I've encountered so far.
@haraldurkarlsson1147
@haraldurkarlsson1147 2 күн бұрын
Very interesting. I do still think that in this dataset little is gained by imputation. With so few missings it has practically no effect. However, the exercise itself is of great value because it demonstrates how (and perhaps when) to impute different types of missing values. I would like to see more on imputation and perhaps a bit of dive into the different types of missings (MAR, MCAR and MNAR) which boggle the mind. Thanks for another great video.
@munenedenis3377
@munenedenis3377 3 күн бұрын
Thank you for this Prof.
@user-ve3no9pm9w
@user-ve3no9pm9w 3 күн бұрын
for working_day I would sort the data (already done) then group_by day and then tidyr::fill and then ungroup
@EquitableEquations
@EquitableEquations 3 күн бұрын
Yep! That works great unless midnight is missing, in which case the wrong day gets filled 🤓
@user-ve3no9pm9w
@user-ve3no9pm9w 2 күн бұрын
@@EquitableEquations true, forgot that. after some considerations and if else should do the trick when the value is either 0 or 1 each day, otherwise i guess a fill in both directions
@tzvetanzlatanov6349
@tzvetanzlatanov6349 3 күн бұрын
Excellent explanation and well presented, appreciated! Would be nice to make a video about using formula in smoothing function in ggplot2, e.g. sometimes square term is used as a predictor in poisson regression and then just glm method in smoothing function will not be sufficient.
@EquitableEquations
@EquitableEquations 3 күн бұрын
I totally agree! On my list for sure. I have a vid about the ggplot loess smoother coming out next month, so maybe after that would be the right time.
@logmasta
@logmasta 3 күн бұрын
Nice!😊
@pipertripp
@pipertripp 4 күн бұрын
for folks using R studio, if you want info about a function, put the cursor on the function and hit the F1 key. It does the same thing as ?my_func in the console but is faster if you have already included the function in your code.
@EquitableEquations
@EquitableEquations 4 күн бұрын
You can find materials supporting this vid (and others) at github.com/equitable-equations/youtube.
@ichigokurosaki-ei9sq
@ichigokurosaki-ei9sq 4 күн бұрын
@spbynum
@spbynum 6 күн бұрын
How do you do code chunks for imported datasets? I keep getting “object not found.”
@EquitableEquations
@EquitableEquations 4 күн бұрын
Hi! I recommend working in a project (kzfaq.info/get/bejne/g8qEp7d_nba5oYk.html), which makes file management much simpler. For more sophisticated file structures, you can also use the {here} package (kzfaq.info/get/bejne/pc5jlZacmtuxZ3k.html). Good luck!
@damirb6294
@damirb6294 7 күн бұрын
interpretation of the results missing
@johneagle4384
@johneagle4384 8 күн бұрын
Thank you so much. These error messages should be given in English and not only in jargon.
@tmitra001
@tmitra001 8 күн бұрын
hi it is not saving in smits data after you split! right?
@ilikeanimals7328
@ilikeanimals7328 8 күн бұрын
I come from Python (Pandas) and have been wanting to increase my skillset to include R. This was an excellent introduction, thank you!
@thebennator
@thebennator 9 күн бұрын
any advices for a 20 yearold freshman in data science and has no skills at all trying to build himeself a good income
@EquitableEquations
@EquitableEquations 9 күн бұрын
Hi! In addition to becoming expert in one programming environment to start, I recommend finding a domain of particular interest and building relationships there. Most data science requires a lot of domain knowledge and collaboration.
@mugomuiruri2313
@mugomuiruri2313 10 күн бұрын
halo dr
@mugomuiruri2313
@mugomuiruri2313 10 күн бұрын
good
@EquitableEquations
@EquitableEquations 11 күн бұрын
Cool links: Heather Mattie on Google Scholar: scholar.google.com/citations?user=f_02DZ0AAAAJ On LInkedIn: www.linkedin.com/in/heather-mattie-71b98039/ On X: @HeatherMathy Obermeyer et. al, Dissecting racial bias in an algorithm used to manage the health of populations: www.science.org/doi/10.1126/science.aax2342
@mohammadrezababaei8871
@mohammadrezababaei8871 11 күн бұрын
Great Videos. been watching the whole playlist and really helped me reviewing all the necessary ggplot arguments. keep it up and add more advanced lessons please.
@hassetdereje7640
@hassetdereje7640 12 күн бұрын
this was evry helpful thank you so much
@haraldurkarlsson1147
@haraldurkarlsson1147 12 күн бұрын
Nice presentation. In a quarto book with chapters how do you reference to one chapter from another. It your chapters have: title: Cross Refs with ---- above and below, then how do you add a cross-ref to that chapter. The quarto website suggest using number one heading (#) e.g. # Cross Ref #sec-crossref . but I don't see how that works with the ---- setup. Any ideas (other just changing all the chapter titles from the ---- format???
@SyedMuhammadIqbal-c9d
@SyedMuhammadIqbal-c9d 14 күн бұрын
I didn't understand the concept of k what it would be, can you solve by letting k equals to some value
@EquitableEquations
@EquitableEquations 13 күн бұрын
Hi! It's helpful to think of k as carrying information about the size (broadly speaking) of the rejection region. It's related to the level of significance and can be computed once that's specified, but this isn't typically helpful to do. The important thing in the NP-lemma is that such a k *exists*.
@rafalsza
@rafalsza 15 күн бұрын
why not in 5 minutes?
@EquitableEquations
@EquitableEquations 14 күн бұрын
Play it at 10x and learn it in 4 why not??
@araeisi
@araeisi 15 күн бұрын
I do appreciate it. R to me is a new programming language and I was browsing TY to see what should I and where to start. Yours in my opinion is the best one. Thank you
@josenavarrete6151
@josenavarrete6151 17 күн бұрын
Does the fill command works only on NA cells or also in empty cells?
@geokola
@geokola 18 күн бұрын
Will have to watch this video many more times at my pace. Otherwise, very nice and simplified video. Thanks Richard.
@lowbudgettravelerbd
@lowbudgettravelerbd 19 күн бұрын
thanks a lot
@johnbainbridge1931
@johnbainbridge1931 19 күн бұрын
Hi apologies I'm not quite sure how to search within the channel but does Andrew have an intro into Shiny apps?
@EquitableEquations
@EquitableEquations 19 күн бұрын
Not yet, I’m sorry! If you go to the channels homepage it should be searchable. I’m doing machine learning stuff mostly right now
@hafidhahmadrizqipratama7608
@hafidhahmadrizqipratama7608 19 күн бұрын
Thank you so much for the tutorial video! Very helpful for me on learning a lot about data!!!
@loveumesi3577
@loveumesi3577 20 күн бұрын
Great tutorial! Thank you! I've been frustrated with replicating R commands that I don't really understand, but your video have made that clearer
@amanmalik1683
@amanmalik1683 21 күн бұрын
Thanks for making me dealwith R in Just one Video !!!
@felinamegan6352
@felinamegan6352 21 күн бұрын
thanks this was a great help!
@haraldurkarlsson1147
@haraldurkarlsson1147 21 күн бұрын
Just a note - the conv_multiunit function from the measurements package does a nice job on unit conversions (miles per gallon to km per liter)
@haraldurkarlsson1147
@haraldurkarlsson1147 22 күн бұрын
Very nice. I am torn whether one should start with base R or jump straight into the tidyverse when teaching R. I get the feeling that one does not appreciate the beauty and simplicity of the tidy (and pipe) approach until one has "suffered" through a little bit of base R s nested approach. After all some things may not run or be needlessly complex in the tidyverse but simple in base R. Any thoughts? - Thanks.
@haraldurkarlsson1147
@haraldurkarlsson1147 22 күн бұрын
Andrew, Your point about the missings is interesting but it only amounts to a percent (113 hrs) and may thus not be worth trying to impute. Besides that there could be simple explaination for the missing hours. Perhaps the bike company was closed for a holiday and/or buisness-related activities. Imputation might therefore not be warranted.
@EquitableEquations
@EquitableEquations 22 күн бұрын
Yep, totally! I actually have another vid set for release next week that talks about all of this before jumping in to add zeros. Those missing values are pretty much all at odd hours in the middle of the night. The effect on the model is on the order .7 bike per hour, which is surprisingly high.
@haraldurkarlsson1147
@haraldurkarlsson1147 21 күн бұрын
@@EquitableEquations By the way I see you are in the Chicago region teaching. I attended uni up there and have family in the area.
@haraldurkarlsson1147
@haraldurkarlsson1147 22 күн бұрын
There is only ONE value with heavy rain/ snow so why not just skip it??? (run Bikeshare |> count(weathersit) for a table).
@haraldurkarlsson1147
@haraldurkarlsson1147 22 күн бұрын
Good discussion - especially of why you chose to use Poisson.
@carlosveramendi2226
@carlosveramendi2226 23 күн бұрын
Believe me, tons of hours and days are wasted to finally run over with content of value like this. Thank you so much
@nimababariya3852
@nimababariya3852 24 күн бұрын
hi, do you offer live online lessons to teach R?
@EquitableEquations
@EquitableEquations 23 күн бұрын
Hi! I do sometimes, but I'm not able to take on new clients right now, I'm sorry. Busy times!
@CaribouDataScience
@CaribouDataScience 24 күн бұрын
Good stuff!
@serhatakay8351
@serhatakay8351 25 күн бұрын
There is nothing better than good old statistics in the era of ml/nn
@EquitableEquations
@EquitableEquations 25 күн бұрын
You can find materials supporting this vid (and others) at github.com/equitable-equations/youtube. Here's the link to my vid on the {broom} package: kzfaq.info/get/bejne/hd9hkqSQv86niWQ.html
@alarabalasaygun7936
@alarabalasaygun7936 29 күн бұрын
awesome video!
@yourivyleaguegirl
@yourivyleaguegirl Ай бұрын
for those who couldn't get arrange function done: library(dplyr) df=diamonds_sm df2<-df %>%arrange(color) view(df2)
@yourivyleaguegirl
@yourivyleaguegirl Ай бұрын
I'm getting error in library(tidymodels)=not found and also in "initial_split" not found
@yourivyleaguegirl
@yourivyleaguegirl Ай бұрын
How do you download air-quality files?
@yourivyleaguegirl
@yourivyleaguegirl Ай бұрын
Please provide the csv/xls file or link
@samuelolaoluwa150
@samuelolaoluwa150 Ай бұрын
You're awesome. I heard to come back here, having watched one of your videos; learning R in 39 minutes. I have a question please I'm doing the data analysis for my research titled; modeling of selected classification methods and comparison of their performance in the classification of anaemia patients among Nigerian children. Now, I've not been using much of R before this work. So I ran into an issue with my dataset. I have one on of my predictor variables to be continuous. When I declared its class in R as numeric, all values for that variable were overwritten as NAs. I can't start checking each cell because there are almost 11,000 observations. How do I fish out where the problem is?
@samuelolaoluwa150
@samuelolaoluwa150 Ай бұрын
You're awesome. I heard to come back here, having watched one of your videos; learning R in 39 minutes. I have a question please I'm doing the data analysis for my research titled; modeling of selected classification methods and comparison of their performance in the classification of anaemia patients among Nigerian children. Now, I've not been using much of R before this work. So I ran into an issue with my dataset. I have one on of my predictor variables to be continuous. When I declared its class in R as numeric, all values for that variable were overwritten as NAs. I can't start checking each cell because there are almost 11,000 observations. How do I fish out where the problem is?