C++ Super Optimization: 1000X Faster

  Рет қаралды 275,583

Dave's Garage

Dave's Garage

Күн бұрын

Obey this one simple C++ trick for a 1000x performance gain: constexpr! Check out the free sample of my book on the Spectrum at Amazon: amzn.to/42TfSJX
Language Racing Series, Github Code: github.com/PlummersSoftwareLL...

Пікірлер: 777
@DavesGarage
@DavesGarage 3 ай бұрын
Thanks to everyone for the kind words on the intro... I was a little worried about using it, but had fun making it, so went out on a limb!
@c4cypher
@c4cypher 3 ай бұрын
Entertaining and informative, thank you.
@NoLoseJustLearn
@NoLoseJustLearn 3 ай бұрын
Loved it
@systemloc
@systemloc 3 ай бұрын
I laughed SO HARD when constexpr popped up. I haven't laughed that hard in awhile. Thank you for that.
@jstro-hobbytech
@jstro-hobbytech 3 ай бұрын
Dave. Can you do me a favor. I'll even donate 100cad to a charity of your choice. In the fastled library there's a udl in the tetris game written by Aaron liddement. I for the life of me do not understand the bitwise based udl code he has written to define the tertris blocks. I get what he's doing but can't find any other similar cases of the bitwise code using hex to and unsigned long data like the way he's done it. Scott Manley made a wearable matrix using his code years ago which made me see what was going on under the hood. I don't like using code I don't understand and it's driven me nuts ever since. I too am on the spectrum and have a software degree I never used until I had to medically retire a while back. I love your channel and have been around since the beginning and even if you could have a quick look and reccomend a book or something if you don't have the time to make a video. It's one of those things that gets stuck in my head from time to time and it kills me that I do not understand what he's doing fully. I suspect it's intentionally obfuscated and as someone who spends his retirement doing charity work and learning it's been a thorn in my side. I am 45 and plan on using my past credits toward a compsci or ee degree (as a bucket list item and to help me tutor programmers and develop free courses for a local non-profit makerspace). I've googled, bought books, stopped short of chatgpt because I don't want the info handed to me on a silver platter because I prefer to learn hands on. I understand how transistors perform math using different binary rules because I like to do things by coding and then trying the same wirh passive circuit components. He rewrote the sprite class so it no longer uses the udl but it still drives me nuts that I can't figure it out. I'm also a fellow Canadian but I hail from the ns side od the nfld ferry. I'm waiting for approval for funding for a 7 week inpatient program for how to live with ptsd and I'd like to spend my free time coding c++ while I'm away. I was going to bring a guitar but I'd rather work on a skill that I'm not as good at as I'm deciding which degree to pursue. I have a year and a half of credits which transfer to compsci or ee but I don't want to drag around my overpriced ee lab when I could write code instead with a decently powerful laptop. I plan to travel to Utah this summer to camp under the milky way before I hit the books again. I've spent the last 4 years donating half my disposable income from my pension to others so they can learn and put a proper electronics section in the local makerspace. My ptsd keeps me from being able to leave the house and teach there in person or attend the labs I'll have to attend when I go back to school. Sorry. I ramble. I figured you'd get what I mean when something is stuck in your head and can't move past it. I could link the git repository if you like. Id be extremely thankful and would make good on my donation.
@egeres14
@egeres14 3 ай бұрын
Omg, my sides 😂, that intro was so good
@unclerojelio6320
@unclerojelio6320 3 ай бұрын
KZfaq kept recommending this to me and I was starting to get annoyed until I finally noticed it was a Dave’s Garage video.
@EricBRowell
@EricBRowell 3 ай бұрын
😂
@DavesGarage
@DavesGarage 3 ай бұрын
I was wondering about that, but sticking my head in the thumbnail seemed weird... maybe I should at first?
@unclerojelio6320
@unclerojelio6320 3 ай бұрын
@@DavesGarage Yeah, the lack of your mug is why I didn’t immediately click on the video.
@JamesPotts
@JamesPotts 3 ай бұрын
​@@DavesGaragequick work!
@danwilson7581
@danwilson7581 3 ай бұрын
@DavesGarage it's much better than sticking your head in other places.
@ernestg.harveyjr7087
@ernestg.harveyjr7087 3 ай бұрын
That opening was fkn hilarious
@drewwilson8756
@drewwilson8756 3 ай бұрын
Thumbnail was pretty good as well.
@Avighna
@Avighna 3 ай бұрын
Agreed!!!
@DM-qm5sc
@DM-qm5sc 3 ай бұрын
Its from "Better call Saul" if you didnt know. Good show!
@Roboprogs
@Roboprogs 3 ай бұрын
@@DM-qm5scso, Steven Ogg (I think I got “Trevor”’s real name right) had a guest bit? My wife used to watch Sol, but I only sampled a few episodes.
@taltal1122
@taltal1122 3 ай бұрын
Please note: constexpr just declares that it is POSSIBLE to calculate at compile time. It does not actually guarantee it unless the context demands it (as a template argument for example). The compiler will do its best but it might just call the function. c++20 added consteval which forces compile time evaluation.
@skilz8098
@skilz8098 Ай бұрын
Thank you for pointing that out! I'm sure many are aware of this, yet there are just as many who aren't and for those who are new to it, this is really good information to be aware of. As a side note, this could also potentially be compiler dependent (implementation defined). I'm not 100% sure on this based on the literature of the standard so don't quote me on it. However, using MSVC vs Clang, vs GCC, vs Intel, vs MingW, etc... may provide the same results if they support and implement this feature of the language which by now I'm sure all major compilers do. Yet their methods of doing so can and will vary which also depends on the various compiler flags that are being used during compilation. It all depends on how each compiler parses, tokenizes and analyzes or handles the source code it is given to compile as well as how the programmer - user instructs the compiler or gives hints to the compiler to do its job. For simple expressions I would assume that almost all compilers would generate the same assembly, binary results. However for more complex expressions, the results may and can vary. Again, this is a really good and underrated comment. It is truly good advice to give!
@ColinBroderickMaths
@ColinBroderickMaths 28 күн бұрын
It is guaranteed if you just constexpr the result. That is guaranteed to calculate at compile time or be a compile error.
@skilz8098
@skilz8098 28 күн бұрын
@@ColinBroderickMathsSure when the assignment operator is involved for a specific type. Now as for the evaluation of an expression, mileage may vary.
@hopperpl
@hopperpl 3 ай бұрын
We use constexpr a lot for FIR/DCT/DFT (fourier transforms, wavelets) constant generation; this removes literally thousands of magic float numbers in the code, that are so error prone. Miss a number, flip a digit, mixup digit order. No code reviewer would ever notice if the 17th digit of PI is 3 or 2. The code generation only uses sqrt/sin/cos etc for which we have constexpr implementations. constexpr simply guarantees that the value is never ever calculated at runtime, and that's the real beauty of it.
@serhii-ratz
@serhii-ratz 3 ай бұрын
thank's for sharing practical example
@sergeykolesnik1171
@sergeykolesnik1171 3 ай бұрын
this is not an optimization, this is meta programming. Your algorithm doesn't become faster because of the constexpr. It runs just as slow, but during compilation.
@Dom-zy1qy
@Dom-zy1qy 3 ай бұрын
​@@sergeykolesnik1171That was my takeaway
@privateger
@privateger 3 ай бұрын
​@@sergeykolesnik1171Precomputation absolutely is an optimization. You don't tend to compile as often as you run.
@ra2enjoyer708
@ra2enjoyer708 3 ай бұрын
@@sergeykolesnik1171 Algorithms don't have "speed", only complexity. "Fast" algos merely trade time complexity in exchange for space complexity and shit themselves the moment you start to hit RAM limits. I.e. a typical hash table has assumptions about the maximum size of the key storage and upon hitting this limit in best case scenario crashes, in the worst case rears its ugly side effects.
@bobthemagicmoose
@bobthemagicmoose 3 ай бұрын
I had a VERILOG class where a question on the exam was: create a component that calculates the factorial of 32 (or some number, I don’t remember). I hardcoded the result and returned the value :) got an A
@NinjaRunningWild
@NinjaRunningWild 3 ай бұрын
😂 Doesn't speak highly of grading in classes.
@Divine_Evil
@Divine_Evil 3 ай бұрын
Yeah typical unclear requirements... It should have been more generic. Something like: Having an 8bit input N, handshake start/result_rdy interface, clk, reset and an output 31 bit... Calculate the N! when start is asserted and then assert result_rdy when the correct value is done.
@NotMarkKnopfler
@NotMarkKnopfler 3 ай бұрын
🤣🤣@@Divine_Evil
@pilotashish
@pilotashish 3 ай бұрын
don’t ask stupid questions if you don’t want stupid answers
@muggzzzzz
@muggzzzzz 3 ай бұрын
@@pilotashish Actually, there was nothing stupid in this question. The question was to compute a factorial of 32, not to make a function that computes a factorial for any given integer number (32 in particular case), and the program did exactly that.
@Evan490BC
@Evan490BC 3 ай бұрын
I think it would be more accurate if we said that the `constexpr` specifier indicates that the expression *may* be evaluated at compile time, in a constant expression context, but not necessarily. And a `constexpr` function specifier may return such a value. If we want to force the function to produce a compile time constant expression (under some constraints), we must use `consteval`.
@lpprogrammingllc
@lpprogrammingllc 3 ай бұрын
Aside from performance gains, the best use of constexpr is unit testing. First, it's important to note that, unlike inlined internal functions, the constexpr functions themselves _are_ included in the program itself, unless stripped by a later step (such as `--whole-program` in g++), so you can have a program that uses constexpr to automatically return any values below some threshold and fall back to runtime evaluation _of the same code_ if in excess of that value. This means you can write fairly complicated state machines (first case I saw was a complete 6502 emulator) with a complete set of unit tests using `constexpr`. Then, they'll read the "real" data at runtime. By embedding the unit tests in the main compilation unit, you get the neat property that if it compiles, it's correct (so long as you don't encounter a compiler bug). You can even give it simple integration tests (like 10 steps of Conway's game of life on the virtual 6502).
@JohnMullee
@JohnMullee 3 ай бұрын
Nice
@ThePC007
@ThePC007 3 ай бұрын
Doesn't that just mean that compiling now takes just as long as running the actual test would?
@craigslist6988
@craigslist6988 3 ай бұрын
very cool and pretty well explained, thanks!
@AnttiBrax
@AnttiBrax 3 ай бұрын
​@@ThePC007That was implied in the latter part of the comment: "if it compiles, it is correct". So instead of running compile && test you just run compile.
@ThePC007
@ThePC007 3 ай бұрын
@@AnttiBrax But then, what do you gain from doing that? Since unit tests are compiled once, and the run once, there shouldn’t really be much of a performance differnce. I guess you might benefit from your IDE showing the error right inside your code, though, assuming it can actually catch such bugs.
@nunyobiznez875
@nunyobiznez875 3 ай бұрын
It's also worth mentioning that C23, which ironically should have just been submitted for final publication..today as a matter of fact, is adding 'constexpr' to C as well.
@aniketbisht2823
@aniketbisht2823 3 ай бұрын
Nope, just constexpr variables not expression/function. So it's not that useful. Also C doesn't have "if constexpr" like C++ to make compile time code path decisions.
@nunyobiznez875
@nunyobiznez875 3 ай бұрын
@@aniketbisht2823 Nope what? It would have taken you much less time to go to google and type "c23 constexpr".
@zsomborgyenge4359
@zsomborgyenge4359 3 ай бұрын
@@aniketbisht2823 macro magic can do this in c
@nezbrun872
@nezbrun872 3 ай бұрын
"Recursion: a sexy computer science concept that's rarely practical in real life" Ain't that the truth. Furthermore, recursive solutions are also typically write-only. Ex colleague of mine came up with a SQL Server recursive algo to write check amounts in words, very clever I thought, although it was left to me to debug it when it crashed the production server one day... it'd eaten up a couple of 100GB of RAM. Turns out he hadn't written it at all, it'd been copy and pasted from Stack Overflow.
@TheNewFaceOfHSP
@TheNewFaceOfHSP 3 ай бұрын
Aren't they always? I've been doing this for 10+ years and every single time I think of something clever, somebody has usually already made it (to one degree or another).
@NinjaRunningWild
@NinjaRunningWild 3 ай бұрын
There's a reason NASA doesn't allow it in their coding though. Error-prone & hard to debug.
@JohnDoe-sq5nv
@JohnDoe-sq5nv 3 ай бұрын
You're missing the most important part of recursion: They are FUN! Nah, but seriously I often solve problems by recursion because it is easier for me to think that way. The benefit when you write something recursively is that you through smart use of parameters can easily control the data in every context, that is in every call in the recursion. I understand, however, that they are more complicated to debug for others who aren't used to thinking recursively. Therefore it is, as always, extremely important to properly test your recursive functions for all possible inputs that can come through the public interface, and never, ever, have them read or write data not in the input parameters.
@isodoublet
@isodoublet 3 ай бұрын
@@NinjaRunningWild No, that's not why. Recursive solutions in fact tend to be much easier to understand and debug than iterative solutions. The problem is that they don't fail gracefully: try to solve too big a problem and you'll get a literal stack overflow, whereas an iterative solution would typically run just fine.
@fantasypvp
@fantasypvp 3 ай бұрын
Recursive solutions generally use up a lot more memory than iterative ones due to each function call adding memory to the stack
@TomMinnick
@TomMinnick 3 ай бұрын
Probably one of my favorite scenes in Better call Saul. Steven Ogg is awesome. I also live your mashup here with it!
@whannabi
@whannabi 3 ай бұрын
Big shots in that show
@sietzethebest
@sietzethebest 3 ай бұрын
Mostly used constexpr to generate lookup tables at compile time. I remember writing a cloth physics simulation that could use a lookup table of size 256 to avoid multiple multiplications and divisions. And as the table is small it will most likely be stored in a cache somewhere so it was really fast. Got an A+ on that assignment. :)
@revenevan11
@revenevan11 2 ай бұрын
That's a great example of a use for that, feel like I just understood the practicality a lot better now. Thanks!
@isodoublet
@isodoublet 3 ай бұрын
To me, the biggest benefit of constexpr is that it significantly simplifies some stuff that previously would have to be written with template metaprogramming. Thanks to constexpr, just about anything that could be a regular operation on values can now be done in mostly ordinary-looking C++ instead of template stuff, which makes it much more likely to be done at all -- nobody wants to maintain template metaprograms.
@johnshaw6702
@johnshaw6702 3 ай бұрын
Well I wouldn't necessarily say that. I created my own meta templates before they became part of the standard. It was quite fascinating at the time and a very productive learning experience. The introduction of 'constexpr' was an extension of what some people were doing with templates already. It just allows you to write them in a more easily understood manor.
@isodoublet
@isodoublet 3 ай бұрын
@@johnshaw6702 It's true, templates were already Turing-complete so there was nothing you couldn't do before that you can now. But template metaprogramming is very hard to teach, and the syntax makes the code rather difficult to understand and debug. It's also purely functional (excepting the stateful template instantations thing which IMO should not be used in serious code), which is a completely different paradigm to normal C++. So constexpr stuff, while not giving any strictly "new" capabilities, makes the existing capabilities easy enough to use that I can convince my team to use them, for example, and be confident that people will "get it" and continue to get it over time. I don't shy away from template metaprogramming myself and there's some great tricks you can still only do using it, but the more that can be done using familiar syntax and patterns, the better.
@johnmckown1267
@johnmckown1267 3 ай бұрын
So the compile time becomes O(2^n)?
@michaelfuller34
@michaelfuller34 3 ай бұрын
Only if your compile environment has the requisite resources 😂
@Obscurai
@Obscurai 3 ай бұрын
Yep, it's a cheat. Trading compile time for run time.
@Bolidoo
@Bolidoo 3 ай бұрын
Well that finonacci implementation is horrible. A simple iterative one can get you O(n), a smarter one O(log n) and an aproximate one (using the formula for the nth finonacci number) runs in O(1). Also constexpr is not really needed, if you run the compiler with optimizations activated (O3) it will probably do this by itself.
@dismuter_yt
@dismuter_yt 3 ай бұрын
Yes, but you can get O(1) both for runtime and compile time by calculating the value once on the side and putting it in manually in the code. So if it actually never changes, you'll have paid for the computation only once, and not at every compilation.
@AnttiBrax
@AnttiBrax 3 ай бұрын
​@@ObscuraiIt sounds like a cheat if you only concentrate on the drag race example. The benefits to constexpr were really badly sold in the vid I think. There are a lot of better examples of great real world applications in the comments.
@gabydewilde
@gabydewilde 3 ай бұрын
You can hard code the results in any language of course but if you want to enjoy some time off while compiling it's perfect.
@attilatorok5767
@attilatorok5767 3 ай бұрын
constexpr can never be as powerful as a python script that generates some code called in your makefile. It's dirty but it works.
@MorebitsUK
@MorebitsUK 3 ай бұрын
Hi Dave, I've been trying constexpr and static constexpr lately, in a calculator for room heating ratings. Instead of having an array load in at runtime, I made the array static constexpr; low and behold the lookups were instantaneous and the it was 10x faster than the one that didn't do that. On a side note, a C++ Guru showed how it works in that case and all the array elements are coded as a lookup table in the code, making it 100x quicker. Great video as usual, keep up the content. 😃
@mbnqpl
@mbnqpl 3 ай бұрын
Wait, so I can make my hello world 1000x faster?
@feel65
@feel65 Ай бұрын
print(“hi”)
@germanassasin1046
@germanassasin1046 3 ай бұрын
My favorite application of constexpr is compile time regular expression library, it evaluates your regex at compile time so you don’t have to worry about runtime cost of creating a regex. Also pretty cool one is in the fmt library where it checks whether the format string you provided is valid, btw this library partially made it into c++20 and was mostly completed in c++23 and I love it. Man, people are so creative.
@BMW_M-Performance
@BMW_M-Performance 3 ай бұрын
You are one of the reasons I started learning C/C++. I am extremely grateful for this channel, I love your content! Programming is just great.
@clemensschotte
@clemensschotte 3 ай бұрын
During the era of the Commodore 64, we relied on precalculated paths utilizing sine and cosine functions, which were too slow for real-time usage in demos and games. Consequently, when you observe sprites gracefully traversing the screen, it's all precalculated. Today, you can adopt a similar approach, such as with a website, where you generate as much content as possible beforehand and serve the already existing pages.
@snemarch
@snemarch 3 ай бұрын
Even on a 486 (with hardware FPU support for SIN/COS), which is some order of magnitude faster than the C=64, there was a lot of speed to gain by using lookup tables. But at least we could do LUT + interpolation instead of fully pre-calculated paths :P
@revenevan11
@revenevan11 2 ай бұрын
This makes me chuckle a bit, thinking about how people used to look up the trig tables in the back of their math book to evaluate something, and I grew up thankful I never really had to beyond an exercise, because we have computers to evaluate those for any value for us... only to know realize that in some cases the fastest way for a computer to do so is sometimes to make a table ahead of time it can look up to get the answer! (or part of the answer at least) 😅
@yarost12
@yarost12 3 ай бұрын
We had a C++ logging library that ran on embedded hardware, it was used to convert numerical debug level values into strings. There was a heap of functions that converted different log message types into readable strings. A few developers from the customer side complained that this lib alone could use the CPU to 100% when debugging was enabled slightly above default. We made those conversions constexpr and switched them to literal types and dropped the load quite significantly. What has to be mentioned is that constexpr functions will consume more stack space and programms will be larger, so you have to be careful with that. Sometimes it can be worth it, when you're fighting for kilobytes.
@0ADVISOR0
@0ADVISOR0 3 ай бұрын
Man, let me tell you, this exact screen 1:49 is one of the reasons I love this Channel. The code neatly on the left the face not too big not too small and rounded soft edges on the right, THIS is quality!
@Pedantic2025
@Pedantic2025 3 ай бұрын
two things on this: 1. the [[assume(expr)]] attribute in C++ does something similar (it indicates to the compiler that it can assume expr holds. 2. using C++20's requires for templates, you can achieve a similar effect, but have the code detect how deep it has recursed and switch to a runtime implementation
@Evan490BC
@Evan490BC 3 ай бұрын
Nice! I regard C++20's Concepts more as *declarative* type constraints (so I prefer `constexpr` for this case). In essence, you are suggesting something similar to Rust's procedural macros, if I'm not mistaken.
@ljw4103
@ljw4103 3 ай бұрын
If you made a c++ tutorial I would buy it.
@theograice8080
@theograice8080 3 ай бұрын
I concur! I've wanted to learn C++ for ages. I started to try to grok the low-level details of systems programming years ago as a teen, but modern OS UIs abstract away so much of what is really happening that any program seems like a miracle to me.
@NinjaRunningWild
@NinjaRunningWild 3 ай бұрын
@@theograice8080Try game programming & write your own engine in DirectX, OpenGL, or SDL. You'll learn all you need to know along the way.
@theintjengineer
@theintjengineer 3 ай бұрын
@@theograice8080, get the book Beginning C++23 by Ivor Norton and start there.
@lennymclennington
@lennymclennington 3 ай бұрын
@@theograice8080 if you actually want to learn C++, use learncpp. It is a good tutorial that is suitable to teach you about software development in C++ from zero prior knowledge about programming. If you already know some programming languages you can still use it and just skip a few of the first chapters.
@Carstuff111
@Carstuff111 3 ай бұрын
That opening.....man this is just one of the great bonuses to this channel! Here to learn and got a great laugh as a bonus right off the bat!
@mgancarzjr
@mgancarzjr 3 ай бұрын
"I bet it's constexpr." If I'm remembering correctly, constexpr had some other very interesting effects. It may be in the book C++ High Performance by Andrist and Sehr.
@blackcitadel37
@blackcitadel37 3 ай бұрын
there's constexpr young brother too, consteval.
@zamf
@zamf 12 күн бұрын
constexpr is so powerful that if you go far enough in theory you can have your binary contain every possible state that your program can be in (calculated at compile-time) and then just "jump" to that program state based on your inputs that are only known at runtime. Of course, this would mean the binary could grow to an unimaginable size but in theory it would run in a split second. For example, Jason Turner (a big proponent of constexpr in the C++ community) suggested a game engine where all the states of the game are calculated at compile time using constexpr functions and then the whole game would be a sequence of pre-drawn frames that appear on the screen based on user input. And in this theoretical engine the FPS could reach the CPU frequency.
@JamieBainbridge
@JamieBainbridge Ай бұрын
Awesome video. I'm a C99 grognard but had always wondered what this keyword meant. Your presentation is information dense but easy to understand with lots of laughs too. Thanks!
@BartoszZasieczny
@BartoszZasieczny 3 ай бұрын
Once I got a task to implement loading some lookup table that was supposed to be loaded form a csv file. There was an equation and argument values were a discrete finite set with even intervals between values. I threw out the CSV file and jsut implemented a constexpr function what would calculate the lookup table at compile-time, put all the values inside a vector and then just read them. Awesome stufff.
@Hadzz95
@Hadzz95 3 ай бұрын
Great video Dave! Would love to see more videos about algorithms
@johnshaw6702
@johnshaw6702 3 ай бұрын
I loved your coverage of this. Many years ago I fell in love with templates, especially meta expressions. The introduction of 'constexpr' just simplified what I already was doing. If you understand templates, then you see the limitations of the C++ language up close and personal, as well as the power. I believe your examples could be done using templates alone, it just wont be as clean. I created template code that doesn't directly know the integral type being used. When an 'int' was 16 bits and a 'long' was 32 bits, and you wanted the minimum of a 32 bit integer, it would result in a 'long' integer. But when a compiler came with a standard library that said an 'int' was 32 bits, that was chosen instead. I realy didn't like to depend on the standard library to determine the choice, but it's the only way I knew how to do it. Letting the compiler dertermine something at complie time is a great way to reduce runtime overhead.
@samanthajarosek9756
@samanthajarosek9756 3 ай бұрын
I love this channel, I am always learning amazing things
@damindra
@damindra 3 ай бұрын
Great video, just wrote a fibonacci program in c++ as I am learning , and this will be great to test against your constexpr code! Love your videos keep them coming
@ACCPhil
@ACCPhil 3 ай бұрын
Using tail recursion massively speeds things up. I wonder if I can paste code __int64 Fibtail(__int64 n, __int64 p = 0, __int64 c =1) { if (n == 0) return p; else if (n == 1) return c; else return Fibtail(n - 1, c, p+c); }
@emjizone
@emjizone 3 ай бұрын
Good to understand and remember. I'm much more used to writing static code than programming the final compiler to build static code, but this promises a level of machine-level optimization that I've too often neglected. *Thank you* Dave from @DavesGarage for this valuable lesson. In any case, I would never calculate functions such as fibonacci or factorial by top-down recursion in production, of course, not even with memorization. This silly approach only serves to demonstrate concepts.
@jbuchan12
@jbuchan12 3 ай бұрын
This is awesome Dave thanks, I'm a c# software engineer, and I am learning c++ atm. This is very good timing.
@pete540Z
@pete540Z 3 ай бұрын
Thanks, Dave. I look forward to a video on the ct_sqrt function mentioned.
@firiasu
@firiasu 3 ай бұрын
You're awesome man! Like C++ itself!
@joshuawillis7874
@joshuawillis7874 3 ай бұрын
Last thing I expected: Dave using a Breaking Bad meme to introduce his video... Being able to talk to our compilers directly is one of the reasons I love systems languages! Have you explored Rust very much, Dave? It has some of the most impressive metaprogramming features I've seen - including full on injection into its AST to create custom syntax.
@coldlyanalytical1351
@coldlyanalytical1351 3 ай бұрын
Forth could do that in the 1980s .. a brilliant langauge .. but that sort of 'refine the language' trick creates write-once-read-never code.
@jonathanlapierre8620
@jonathanlapierre8620 3 ай бұрын
Unless I do not understand here what you are basically saying is that Program A aka the compiler, does the heavy lifting of compute and pre-resolve the sieve which is pushed in the binaries of Program B aka the actual competition submission. Although this is a "clever" niched solution, it's no more useful than having the sieve solution injected from a JSON that was pre-calculated by another program before. I am no C++ guru and I love that I have learned something but it seems this is no different than compilers environment variables logics. As for the sieve, I totally get why it would not be included in the competition. Big fan. Awesome stuff. I should have started with that. Ciao!
@bretthunter2828
@bretthunter2828 3 ай бұрын
I thought the same - if you want fib(35) hard coded why bother even using a constexp "function" but just put the value in directly - you are just as likely mistype the functional code as you would type the exact number in from google or calculator. The only upside would be if you are unsure at the time of writing the main program what fib number you want to use - and hence easy to change to say fib(22)
@MatthewMakesAU
@MatthewMakesAU 3 ай бұрын
​@@bretthunter2828it's a lot easier to review code for errors than a magic number
@germanassasin1046
@germanassasin1046 3 ай бұрын
⁠@@bretthunter2828what you experience is what is called a failure of imagination. I also thought like you, but when I saw what kind of sorcery people came up with I was shocked. For example I saw a compile time regex parser library, if you know regex it can construct state machine at compile time which is an incredible speed up. And it was simple to use!
@-_James_-
@-_James_- 3 ай бұрын
To give a concrete example to what M-FeD was saying, if you have a string and you need to know which of one possible values it contains, you can switch on the runtime-calculated hash of the string and have case statements of compile-time calculated hashes of possible strings to compare against. The alternative would be a ton of if...else if... statements and a bazillion string compares. E.g.: switch( hash( str )) { case hash( "colour" ): break; case hash( "size" ): break; //etc... } All you need is a couple of hash functions that can hash strings or string literals, and you're good to go.
@l.piekha100
@l.piekha100 10 күн бұрын
Your videos are simply awesome and perfect, thank you so much.
@xnadave
@xnadave 3 ай бұрын
So, I can stop writing code generators for LUTs? Very cool. I was already vaguely aware of how this worked, but seeing a concrete analysis is very helpful. Thanks for sharing.
@mikeh_nz
@mikeh_nz 3 ай бұрын
Neat! I loved the breaking bad intro and hadn’t heard of constexpr, so googled when it was the throat shot I hadn’t heard of this - I must be somewhat ancient back at the original c++ versions Yeah I guess the downside is the that timebuild is broken due to some intern doing a fib(2^100) in their code. Love the video
@bowiemtl
@bowiemtl 3 ай бұрын
I went from learning about tidbits of early windows programs to learning about relevant code features. Thank you sir!
@drop0ne_f20
@drop0ne_f20 Күн бұрын
I freaking love the intro Dave!!!
@palaashatri
@palaashatri 3 ай бұрын
Hahaha love the intro, and the video. Will definitely share it with my colleagues!
@AdvancedSoul
@AdvancedSoul 3 ай бұрын
The coolest compile time practical code I've seen in use in the wild is Zig's regex compiler
@flooooooooooooooooo
@flooooooooooooooooo 3 ай бұрын
Hey dave what theme do you use in VS code? It looks like the one I use called gerry themes
@KillerMZE
@KillerMZE 3 ай бұрын
"if constexpr" is super useful for making templated functions that act mostly the same except for a small snippet. For example I've successfully used it for a container that needed both a Peek and a Read function, that did pretty much the same thing with the difference of either advancing an internal variable or not
@corvinyt
@corvinyt 3 ай бұрын
This is just shifting the algorithm runtime from program run time to compile time.
@diogeneslaertius3365
@diogeneslaertius3365 3 ай бұрын
Exactly my thought. Why doesn't he just hard-code the result of the function, this way we don't even need to bother the compiler. "Hey folks, I optimized the program so it runs 1000000 times faster when you call factorial(1e7). How? I hard-coded the constant with the result."
@eDoc2020
@eDoc2020 3 ай бұрын
@@diogeneslaertius3365 Because then you have hardcoded magic values in your code. If that magic value is incorrect how will you know? If it's done with code you can fix errors more easily.
@AelfricLake
@AelfricLake 3 ай бұрын
Yeah in a complex program you'll just hold every possible result in memory? What...
@eDoc2020
@eDoc2020 3 ай бұрын
@@AelfricLake Not every possible result, only hardcoded results for specific constant inputs given at compile time. It's of limited usefulness because most inputs won't be known at compile time.
@futuza
@futuza 3 ай бұрын
Sure, but it's still a useful tool no?
@TIME1minute
@TIME1minute 3 ай бұрын
Hi Dave! I work with Qt C++ right now and your videos are helping me make the quality of my work better! Thanks again! Do you think you'll do any more videos on compile-time execution, i.e. template arguments and properly using constexpr in class definitions?
@monawoka97
@monawoka97 3 ай бұрын
Constexpr is really amazing from a compiler tech perspective, but it's still hard to think of circumstances where it enables something genuinely new. These examples are effectively using constexpr to bake a lookup table into the exe. It's really cool that this is possible, but you could achieve this in C. It's obviously more tedious. Either an executable as part of your build script that code gens the lookup table as a pre-process if you want it bundled into the executable - or just loading in an external file at runtime with the lookup table. It's still pretty neat. Definitely ergonomic.
@Ch40zz
@Ch40zz 3 ай бұрын
constexpr is very useful for code obfuscation, string encryption etc. enjoy doing that in C
@mgancarzjr
@mgancarzjr 3 ай бұрын
If you can throw constexpr in front of a function, you turn it, potentially, into a lookup table without having to find and replace all the code where the function is called.
@strehlow
@strehlow 3 ай бұрын
@@mgancarzjr or just re-code the function to do the lookup rather than compute the result. The calling code wouldn't need any changes.
@mgancarzjr
@mgancarzjr 3 ай бұрын
@@strehlow how do you propose letting each instance of the function know where to look for the lookup table? Genuinely curious. I can only think of static class members being able to automatically access the same memory.
@strehlow
@strehlow 3 ай бұрын
@@mgancarzjr The function definition has the lookup table. The calling code doesn't know or care how the function produces the result. It doesn't need direct access to the lookup table. This is no different than refactoring a function with a more efficient algorithm, or any other optimizations. As long as it still produces the same mapping of input to output, why would the calling code need any changes?
@erroneum
@erroneum 3 ай бұрын
Another great thing about constexpr is that you don't need to pass only constant arguments in (unlike non-type template parameters or consteval functions), but if you have even basic optimizations turned on, when you do pass in constant arguments, the compiler should evaluate the result and substitute it in. As an example, I'm writing a calculator, but I want to be able to use machine integers for speed, but switch to GMP when they get too large. To do so, I needed to implement the logic to check if there would be an overflow/underflow, so I made 3 functions, willOverflowAddition, willOverflowSubtraction, and willOverflowMultiplication, which are template constexpr functions for two signed integers. In practice they won't be being used in constant expressions, but for the optimizer it's helpful to afford it any opportunities to simplify the code you can.
@langtonmwanza6689
@langtonmwanza6689 29 күн бұрын
Someone explain this to me, because it seams a bit pointles, if the function input is known before compile time then whats thr point of the function just replace it with the result no? Once the code has been compiled I can't put a different value for that function input so whats actually the point?
@marksmit6086
@marksmit6086 3 ай бұрын
I use constexpr with a hash method for strings. That way in a switch/case statement strings can be used as a selector. This makes code more readable, I love that.
@rohann8073
@rohann8073 26 күн бұрын
this is awesome thanks Mr. Plummer
@ycombinator765
@ycombinator765 3 ай бұрын
So constexpr just compiles my code before I hit the compile button? noice! Thanks for this banger
@mario50000
@mario50000 3 ай бұрын
no constant expressions are evaluated at compile time the point is that the expression is constant and thus won't need to be calculated during runtime as it was already evaluated and baked in during compilation
@BradHouser
@BradHouser 3 ай бұрын
Maybe I am missing something. I see limited value if you want to find the Fibonacci Series of N values, when N has to be known at compile time. It appears to be trading compile time to runtime. How many times do I expect to run it?
@philmarsh7723
@philmarsh7723 3 ай бұрын
Thanks! I was wondering if compilers e.g. gcc run these multithreaded?
@VoidloniXaarii
@VoidloniXaarii 3 ай бұрын
Thank you! Might be that your explanation finally made it stick 4 me
@oblivion_2852
@oblivion_2852 3 ай бұрын
How big was the primes table that was submitted? Also I assume there was a function that could use the table when a number was queried not in the table?
@DavidLindes
@DavidLindes 3 ай бұрын
This is cool. And wasn't there when I learned C++ (which I know a bit of, but don't use super often for the kinds of work I do), so... I'd missed the memo! Thanks for sharing!
@MoSiddeek
@MoSiddeek 3 ай бұрын
Great work! I still think these are contrived examples however. How does the consexpr implementation compare to hardcoding pre-populated isPrime[] or fib[]? I can't see the advantage really.
@pleasurewasmine3173
@pleasurewasmine3173 Ай бұрын
i envy your memory and general cognitive ability Dave. You are a fortunate person to have such a beautiful mind. in a world where everything seems to be a race against time, you have seen this over the years and better yet you were able to keep up with it.
@TeuFortMan
@TeuFortMan 3 ай бұрын
Is a compiler can't collapse function by compile time without explicit constexpr?
@HritikV
@HritikV 3 ай бұрын
the starting scene was magnetic ! taught so much by baiting us all
@jeffscheeler2532
@jeffscheeler2532 3 ай бұрын
Most of this went way over my head!
@idaknow716
@idaknow716 3 ай бұрын
Thank you for another interesting programming video.. I confess that much of it is jibberish, but with the meager programming i have learned thus far, I am left with this question.. Do you recommend any curriculum, online or institutional to learn programming? Reading the code from previous videos and from your github, I can understand the flow as if in a fog, but lack the base knowledge that might make it more clear. Just curious to ask. Loved the intro too.. I have seen so many internet meme's using that scene... :)
@hhlavacs
@hhlavacs 3 ай бұрын
Nice video about an important topic. Thanks Dave! constexpr is really a super power! People have implemented constexpr ray tracers! Just one side note: There is a closed form formula for Fibonacci numbers (no recursion required), which can be evaluated in microseconds as well, without any computational limitations on the number. It is based on linear algebra, eigenvectors and the eigenvalues (1+sqrt(5))/2 and (1-sqrt(5))/2.
@milanstevic8424
@milanstevic8424 7 күн бұрын
in other words, phi and 1-phi where phi is defined as phi = 1+1/phi (or phi^2 = phi+1) and is known as golden ratio and 1-phi is known as a golden ratio conjugate (1-phi = -1/phi) phi can be declared as a constant phi = 1.6180339887498948482045868343656 whereas 1-phi = -0.6180339887498948482045868343656 so due to how floating point numbers work, it's best to instead define apc = 0.6180339887498948482045868343656 (absolute phi conjugate) and then apply that by doing 1.0+apc (instead of phi) and -apc (instead of conjugate)
@johannesschneider1784
@johannesschneider1784 3 ай бұрын
This is such an excellent explanation of constexpr omg
@martuscellifaria
@martuscellifaria 3 ай бұрын
Hey Dave, I am working on a company, and much of our old software is written with MFC. Does it still makes sense using Windows CString in 2024, or slowly migrating it to std::string should be considered?
@ThePC007
@ThePC007 3 ай бұрын
Can constexpr be used to hash strings during compile time? That could be incredibly useful for a lot of things, including networking, EEPROM writing/loading, or even efficient event systems.
@BMW_M-Performance
@BMW_M-Performance 3 ай бұрын
That intro was awesome!
@michaelbauers8800
@michaelbauers8800 3 ай бұрын
I pre upvoted due to this (a) being Dave's Garage which I respect, (b) C++, and (c) constant expressions, which I love. const expressions are easier to understand than some crazy template metaprogramming to generate primes :)
@commanderlake7997
@commanderlake7997 Ай бұрын
Why not have the pre-computed values be integrated with the code by the IDE so you don't have to wait for it to compile every time? Or just use the windows calculator?
@iantr001
@iantr001 3 ай бұрын
I've used constexpr functions combined with static_assert to do compile-time verification of program internals. For example, a console app's command line switch definitions were double-checked for correctness. I.e., Did I define the same switch twice or were they all defined using the assumed casing. If the verification constexpr function called by static_assert returned false, the compile would fail.
@yephick
@yephick 3 ай бұрын
Writing embedded code (a USB camera) this `constexpr` allows me to pre-determine many things, including building the complete USB descriptors' set, and just "bake it in" into firmware image
@charlesgi5058
@charlesgi5058 3 ай бұрын
@Dave's Garage Can you please make a video explaining what a NUMA core is in task manager? When you right click the processor graphics it says show as and NUMA threads is greyed out. Thanks
@nezbrun872
@nezbrun872 3 ай бұрын
You need a machine with multiple CPU sockets to enable it. Each CPU has it own directly connected memory, although both CPUs can access the entire shared address space. For CPU1 to access CPU2's memory, it goes through a high speed processor to processor interconnect (called QPI or UPI on Intel), but this adds overhead, so typically the OS favours affinity between CPU and its directly connected memory when allocating memory to threads. Logically access is transparent to the application. There's also the issue of cache coherency for the OS to consider, as each CPU has its own cache.
@cj09beira
@cj09beira 3 ай бұрын
NUMA stands for Non Uniform Memory Access, it exists to deal with computers that either have multiple sockets or that for other reasons have one portion of memory have a different access time than another, (for example it takes longer to move data across sockets). some single socket cpus also perform better with NUMA enabled as they have some memory channels more closely integrated to some core than others (like 1st threadripper generation)
@fiffy6572
@fiffy6572 3 ай бұрын
i didn't know baout constexpr, thank you!
@danwilson7581
@danwilson7581 3 ай бұрын
I wonder if using constexpr would change the way 'msft minutes' are shown during a file transfer or copy..
@XzatonJW
@XzatonJW 3 ай бұрын
Constexpr in C++20 is amazing, youre also allowed to do dynamic allocations provided you dont leak the memory out of the constexpr context. For example, you can use std vector, and so long as you copy the data into an array, or similar, its totally allowed
@AJMansfield1
@AJMansfield1 3 ай бұрын
If you do some tricky stuff with arena allocators, even that's not necessarily impossible either. You can absolutely have a constexpr function returning pointers into an rodata segment of the compiled binary.
@thomaswilliams690
@thomaswilliams690 3 ай бұрын
Wow, amazing! Now to try some things.
@CODEDSTUDIO
@CODEDSTUDIO 3 ай бұрын
Please what is your take on Microsoft use/ adapting of Rust Lang
@hugoandres5480
@hugoandres5480 2 ай бұрын
What do you think of the future of c++ against go and rust?, thanks.
@user-yr1uq1qe6y
@user-yr1uq1qe6y 3 ай бұрын
Is this anything like Dart const? I still haven't wrapped my head around best practices with that one.
@oalfodr
@oalfodr 3 ай бұрын
If all the numbers are crunched at the compile time, what are use cases for this?
@arnold5328
@arnold5328 Ай бұрын
I‘m so happy to find this randomly
@Ikxi
@Ikxi 3 ай бұрын
Ok, question If the compiler does it all, won't file size be beeg? Just a noob programmer, so I don't really know these things.
@DavesGarage
@DavesGarage 3 ай бұрын
No.. it DOES take longer to compile, but then the answer is put in where the code would otherwise go, so it's tiny. But of course, it only works for things that CAN be figured out at compile time... but it's surprising what can!
@tanzeelahmed9871
@tanzeelahmed9871 3 ай бұрын
What's the practical usage of this? Any industrial application involves input from some party to some extent, which won't allow us to hardcode the starting expression? Also, aren't we trading off time between compilation and execution?
@TheVoiceofTheProphetElizer
@TheVoiceofTheProphetElizer 3 ай бұрын
As goofy as it may sound, I dream of a day where the LLMs get so proficient at ASM for the X86 architecture that we can go back and optimize everything and show ourselves we don't need to continually increase chip horsepower.
@danwilson7581
@danwilson7581 3 ай бұрын
I like this vid Dave. Its back in the 'slightly overhead' range, as im a net dev type, with lots of net. Much better than boring network snoozefests. By the way, ever develop anything that plays with tcp sliding windows?
@aniketbisht2823
@aniketbisht2823 3 ай бұрын
You can also use the entirety of (C++23) and allocate at compile time (C++20) to perform arbitrary computations at compile time. Constexpr should be the reason alone for many C bois to consider using C++ (another compelling features being RAII, templates and type-safety).
@shm236
@shm236 Ай бұрын
Finally, just what my todo app needed.
@pcdoodle1
@pcdoodle1 3 ай бұрын
that intro is gold
@JxH
@JxH 3 ай бұрын
In a sense, it's a small step away from just working out the answer in advance, and then just outputting the answer at "Run" (sic) time.
@Kadotus
@Kadotus 3 ай бұрын
Thanks Dave!
@khaledf3977
@khaledf3977 7 күн бұрын
What greet way of explaining thanks
@hongkongcantonese501
@hongkongcantonese501 3 ай бұрын
I never thought a C++ joke(!) would be so gd funny! Gonna re-read Petzhold for the gags. :)
@CremeDeLaCoulee
@CremeDeLaCoulee 3 ай бұрын
For the sieve prime problem, it shouldn't be necessary to separately calculate the square root of the upper bound before starting the algorithm. It seems like a more efficient algorithm might be ... As we're marking off the multiples of primes, keep track of the multiplier in a local variable incremented at each pass, and if the multiplier is less than or equal to prime itself after we've passed the upper bound, then we know we are at/past the square root of the upper bound and can call it quits on marking off more multiples of primes. The cost of this approach is two new operations -- incrementing the multiplier in the inner loop (while crossing off each multiple of the prime) and a comparison at the end of each prime loop to check whether this multiplier is less than or equal to the prime itself (and if so, then break from the outer loop). The second operation is a substitute for comparison against the pre-calculated square root, so isn't really an additional operation, it is just placed after rather than before. So, unless the upper bound is the square of a prime, we will have to run one additional prime to know we have reached this point, as compared to the pre-calculated square root which will tell us before we run that prime. The time-saving on this approach would depend partially on the efficiency of the dedicated square root algorithm (I'm skeptical on the efficiency of the one shown...). And the relative cost of the increment operation that's tracking the multiplier. My instinct is that since the multiplier is a local variable on the stack whereas crossing out the prime multiples is an array/memory operation presumably on the heap, its likely the increment will be done concurrently by the CPU while waiting for the memory-write operation to complete, and thus the cost will be nonexistent in practice on a modern CPU. I could be wrong... in which case the additional cost of this incrementing would grow O(n^2) with n = the upper bound... but then what is the complexity of the recursive square root??? Plus there's no chance of optimizing that with concurrent memory operations that need to be done anyways... hmmmmmm?????
@kensmith5694
@kensmith5694 3 ай бұрын
Thoughts on using this: 1) It could be used to precompute the twiddle factors for an FFT if you are working to make the FFT really fast. It is an alternative to writing a program to compute them and then including the text into your C++ source. 2) In a hardware thing I needed to know the best combination of what was loaded into two registers of a device. One register could selected between 7 clock sources and the other was the (integer) factor to divide by. The goal was to hit an exact frequency as best I could. I solved this again by writing code that figured it out and spit it out as text to include.
@hebozhe
@hebozhe 3 ай бұрын
I'm subbing for the intro, alone.
Bill Gates' Easter Egg!
23:39
Dave's Garage
Рет қаралды 202 М.
Modern C++: Unique Ptrs and Vectors
16:25
Dave's Garage
Рет қаралды 131 М.
狼来了的故事你们听过吗?#天使 #小丑 #超人不会飞
00:42
超人不会飞
Рет қаралды 56 МЛН
100❤️
00:19
Nonomen ノノメン
Рет қаралды 38 МЛН
Normal vs Smokers !! 😱😱😱
00:12
Tibo InShape
Рет қаралды 118 МЛН
顔面水槽をカラフルにしたらキモ過ぎたwwwww
00:59
はじめしゃちょー(hajime)
Рет қаралды 36 МЛН
Interview With A Sr JavaScript Dev | Prime Reacts
24:43
ThePrimeTime
Рет қаралды 156 М.
When Optimisations Work, But for the Wrong Reasons
22:19
SimonDev
Рет қаралды 784 М.
Fast Inverse Square Root - A Quake III Algorithm
20:08
Nemean
Рет қаралды 4,9 МЛН
The Downsides Of C++ | Prime Reacts
21:23
ThePrimeTime
Рет қаралды 124 М.
Programmer VS The Human Benchmark Test | Number Memory
17:47
Code Bullets Day Off
Рет қаралды 398 М.
The Bubble Sort Curve
19:18
Lines That Connect
Рет қаралды 391 М.
E01: Stupid C++ Tricks with Dave
15:43
Dave's Garage
Рет қаралды 317 М.
Faster than Rust and C++: the PERFECT hash table
33:52
strager
Рет қаралды 509 М.
i cant stop thinking about this exploit
8:40
Low Level Learning
Рет қаралды 328 М.
Теперь это его телефон
0:21
Хорошие Новости
Рет қаралды 1,7 МЛН
Трагичная История Девушки 😱🔥
0:58
Смотри Под Чаёк
Рет қаралды 376 М.
Обзор игрового компьютера Макса 2в1
23:34
3.5.A Solar Mobile 📱 Charger
0:39
Gaming zone
Рет қаралды 319 М.
What percentage of charge is on your phone now? #entertainment
0:14