Lex Fridman Podcast - #215 – Wojciech Zaremba: OpenAI Codex, GPT-3, Robotics, and the Future of AI

Starting point is 00:00:00 The following is a conversation with Wochek Zaremba, co-founder of OpenAI, which is one of the top organizations in the world doing artificial intelligence research and development. Wochek is the head of language and co--3, and who knows, 4, 5, 6, and, and, and plus one. And he also previously led OpenAI's robotic efforts. These are incredibly exciting projects to me that deeply challenge and expand our understanding of the structure and nature of intelligence. The 21st century, I think, may very well be remembered for a handful of revolutionary AI systems and their implementations.

Starting point is 00:00:53 GPT, Codex, and applications of language models and transformers in general to the language and visual domains may very well be at the core of these AI systems. To support this podcast, please check out our sponsors. They're listed in the description. As usual, I'll do a few minutes of ads now, no ads in the middle. I try to make these interesting, so hopefully you don't skip, but if you do, please still check out the sponsor links in the description.

Starting point is 00:01:23 It's the best way to support this podcast. I use their stuff, I enjoy it, maybe you will too. This show is brought to you by PaperSpace Gradient, which is a platform that lets you build, train, deploy machine learning models of any size and complexity. I love how powerful and intuitive gradient is. If you're already in the machine learning world or are interested in joining it, I highly recommend FastAI, which is a set of courses by Jeremy Howard and others that introduced you to machine learning.

Starting point is 00:01:58 The reason I bring that up is they use and recommend paper space gradient. One of the nice things you can host notebooks on there, you can swap out the computer instance at any time, so you can start out with a small scale CPU, GPU instance, and then swap out something super powerful once you're ready, once you need it. I'm also really excited about what they're calling workflows, which provides a way to automate machine learning pipelines on top of gradient

Starting point is 00:02:26 compute infrastructure. It makes it really easy to build a production app because all the orchestration is reduced to a simple Yamal file. To give gradient a try, visit gradient.run slashlex and use the sign up link there. You'll get 15 bucks in free credits, which you can use to power your next machine learning application. That's gradient.run slashlex. If you're interested in future machine learning projects, I'm doing terms of tutorials and educational stuff. Maybe gradients is worth to try because I'm likely to use gradient as part of that.

Starting point is 00:03:00 Anyway, like I said, gradient.runslashlex. This episode is also brought to you by Indeed, a hiring website. I've used them as part of many hiring efforts I've done for the teams I've led in the past. They have tools like Indeed Instant Match giving you quality candidates who resummes on Indeed, fit your job description immediately. I think hiring is the most important component of a successful company, no matter what stage your company is in. I've been lucky to get a chance to interact with some very successful people in the entrepreneurial

Starting point is 00:03:37 and the business world, and I think that universally the thing that is common between them is the ability to build a great team. So you should definitely use the best tools for the job and indeed should be in your toolbox. Right now, I'll get a free $75 sponsored job credit to upgrade your job post at indeed.com slash Lex. Get it at indeed.com slash Lex. Offer valid through September 30, terms of conditions apply. to thousands of nonfiction books and condenses them down into just 15 minutes that you can read or listen to. I can recommend a bunch of books including Sapiens by Yvaharari. I recently announced that I'm

Starting point is 00:04:33 doing a book a day for a few days and actually had some chaos come up in my life which through a wrench into that. But I will return to this project and I'm excited by the idea of sort of spending an entirety of a day reading and really sort of living through the story but I use Blinkist to select possible not fiction books and I also use Blinkist to reflect on the books I've read in the past whether I want to revisit them. Go to Blinkist.com slash Lex to start your free seven-day trial and get 25% off a Blinkist Premium membership as Blinkist.com slash Lex to start your free seven day trial and get 25% off a Blinkist Premium membership as Blinkist.com slash Lex spelled B-L-I-N-K-I-S-T, Blinkist.com slash Lex.

Starting point is 00:05:16 This show is also brought to you by Grammarly, a writing assistant tool that checks spelling, grammar, sentence structure, and readability. Grammarly Premium, the version you pay for, offers a bunch of extra features. My favorite is the clarity check, which helps detect rambling, overcomplicated chaos that many of us can descend into. I'm a big fan of quality of writing, emerging through the editing process. So the stream of consciousness, we just throw a bunch of stuff on the page and figure out which of its sticks. For me, the end result of a great piece of writing is as simple as possible.

Starting point is 00:05:59 And like Einstein said, not too simple. But I find it's almost difficult to arrive at something that's too simple. Grammarly is available on basically any platform and major sites and apps like Gmail and Twitter and so on. Do more than just spell check. Get your point across more effectively with Grammarly Premium. Get 20% off by signing up at Grammarly.com slash Lex.

Starting point is 00:06:23 That's Grammarly.com slash Lex. That's Grammarly.com slash Lex. This episode is also brought to you by Aithleep and it's Pod Pro mattress. It controls temperature with an app, it's like magic. It's back with sensors. It can cool down to as low as 55 degrees and he's side of the bed separately. I've been hanging out with Andrew Heuberman this week and he's actually just a mastermind of temperature control and all kinds of devices. He had this nice explanation of what parts of the body to apply cooling to for it to be actually effective.

Starting point is 00:06:57 It's fascinating to see what kind of effect temperature control has on you. I think during sleep, for me, it does magic. Even with air conditioning, just having a warm blanket and a cold bed, it makes me really look forward to the naps and the sleep. They have a pot pro cover,

Starting point is 00:07:14 so you can just add that to your mattress without having to buy theirs. But their mattress is nice, just so you know. Again, it can track a bunch of metrics, like heart rate variability, but cooling alone is honestly worth the money. Go to 8sleep.com slash Lex to get special savings. That's 8sleep.com slash Lex.

Starting point is 00:07:32 This is the Lex Friedman Podcast, and here's my conversation with Wojczyk, Siremba. You mentioned that Sam Altman asked about the Fermi Paradox and other people at OpenAI had really sophisticated interesting answers. So that's when you knew this is the right team to be working with. So let me ask you about the Fermi paradox about aliens. Why have we not found overwhelming evidence for aliens visiting Earth? I don't have a conviction in the answer, but rather probabilistic perspective on what might be a possible answer.

Starting point is 00:08:27 It's also interesting that the question itself even can't touch on your typical question of what's the meaning of life. Because if you assume that we don't see aliens because they destroy themselves, that kind of upweights the focus on making sure that we want destroy ourselves. At the moment, the place where I am actually with my belief, and these things also change over the time, is I think that we might be alone in the universe, which actually makes life more,

Starting point is 00:08:59 or let's say consciousness life more kind of valuable. And that means that we should more appreciate it. Have you always been alone? See what's your intuition about our galaxy, our universe? Is it just sprinkled with graveyards of intelligence civilizations? Or are we truly is life intelligent life truly unique? At the moment, I believe that it is unique.

Starting point is 00:09:21 But I would say I could also, there was like a some footage released with UFO objects, which makes me actually doubt my own belief. Yes. Yeah, I can tell you one crazy answer that I have heard. Yes. So, apparently when you look actually at the limits of computation, you can compute more if the temperature of the universe would drop down.

Starting point is 00:09:48 So one of the things that Aliens might want to do if they are truly optimizing to maximize amount of compute, which you know, maybe can lead to, or let's say simulations or so, it's instead of wasting current entropy of the universe, because you know, we by leaving we are actually somewhat wasting entropy, then you can wait for the universe to cool down, such that you have more computation. That's kind of a funny answer, I'm not sure if I believe in it,

Starting point is 00:10:15 but that would be one of the reasons why you don't see aliens. It's also possible to some people say that, maybe there is not that much point in actually going to other galaxies if you can go inwards. So there is no limits of what could be an experience if we could connect machines to other brains while there are still some limits if we want to explore the universe. Yeah, there could be a lot of ways to go inwards too. Once you figure out some aspect of physics we haven't

Starting point is 00:10:47 figured out yet, maybe you can travel to different dimensions. I mean, travel in three dimensional space may not be the most fun kind of travel. There may be like just a huge amount of different ways to travel and it doesn't require a spaceship going slowly in 3D space to space time. It also feels in one of the problems is that speed of light is low and the universe is vast and it seems that actually most likely if we want to travel very far then we would instead of actually sending spaceships with humans that wait a lot, we would send something similar to what Yuri Miller is working on. These are like a huge sale which is at first powered, there is a shot of laser from an Earth and it can't propel it to quarter of speed of light. and sale itself contains the kilograms of equipment and that might be the way to actually transport matter through universe.

Starting point is 00:11:51 But then when you think what would it mean for humans, it means that we would need to actually put their 3D printer and you know 3D print a human on other planets, I don't know play them YouTube or let's say or like a 3D print like huge human right away or maybe a womb or so With our current techniques of archaeology If if a civilization was born and died Long long enough ago on earth we wouldn't be able to tell and so that makes me really sad

Starting point is 00:12:22 And so I think about earth in that same way how can we leave some remnants if we do destroy ourselves? How can we leave remnants for aliens in the future to discover? Like here's some nice stuff we've done. Like Wikipedia and YouTube, do we have it like in a satellite orbiting Earth with a hard drive? Like how do we say, how do we back up human civilization for the good parts, or all of it is good parts, so that it can be preserved

Starting point is 00:12:53 longer than our bodies can. It's a difficult question. It also requires the difficult acceptance of the fact that we may die. And if we die, we may die suddenly as a civilization. So let's see, I think it kind of depends on the cataclysm. We have observed in other parts of the universe that birds of gamma rays. These are high energy rays of light that actually can

Starting point is 00:13:22 apprehend to kill and tiger galaxy. So there might be actually nothing event to nothing to protect us from it. I'm also when I'm looking actually at the pass civilization. So it's like Aztecs or so they disappeared from the surface of the Earth. And one can ask, why is it the case? And the way I'm thinking about

Starting point is 00:13:44 this, you know, that definitely they had some problem that they couldn't Why is it the case? And the way I'm thinking about it is, you know, that definitely they had some problem that they couldn't solve. And maybe there was a flat and all of a sudden they couldn't drink, there was no potable water and they all died. And I think that so far, the best solution to such a problem is, I guess, technology. So I mean, if they would know that you can just boil water and then drink it after, then that would save their civilization.

Starting point is 00:14:13 And even now when we look at the current pandemic, it seems that once again, actually science comes to rescue. And somehow science increases size of the action space. And I think that's a good thing. Yeah, but nature has a vastly larger action space. Yeah, but still it might be a good thing for us to keep on increasing action space. Okay, looking at past civilizations, yes, but looking at the destruction of human civilization, perhaps expanding the action space will add actions that are easily acted upon, easily executed, and as a result destroy us.

Starting point is 00:14:58 So let's see, I was pondering why actually even we have a negative impact on the globe. Because, you know, if you ask every single individual, they would like to have clean air. They would like healthy planet, but somehow it actually is not the case that as a collective, we are not going in this direction. I think that there exists very powerful system to describe what we value. That's capitalism. It assigns actually monetary values to various activities

Starting point is 00:15:27 At the moment the problem in the current system is that there are some things which we value There is no cost assigned to it. So even though we value clean air or maybe we also Value lack of distraction on that's a internet or so at the moment These quantities, you know companies corporations can follow them for free So in some sense I Wish or like that I guess purpose of politics to To align the incentive systems and we are kind of maybe even moving in this direction that first issue is even to be able to measure the incentive systems. And we are maybe even moving in this direction.

Starting point is 00:16:05 That first issue is even to be able to measure the things that we value, then we can actually assign them on a tariff value to them. Yeah, and that's so it's getting the data and also probably through technology enabling people to vote and to move money around in a way that is aligned with their values. And that's very much a technology question.

Starting point is 00:16:29 So like having one president and Congress and voting that happens every four years is something like that. That's a very outdated idea. There could be some technological improvements to that kind of idea. So I'm thinking from time to time about these topics, but it also feels to me that it's a little bit like a, it's hard for me to actually make correct predictions what is the appropriate thing to do.

Starting point is 00:16:53 I extremely trust Sam Altman, our CEO on these topics. Here, I'm more on the side of, I guess, naive hippie that's your life philosophy. Well, I think self-doubt and I think hippie implies optimism. Those two things are pretty good with operate. I mean, still it is the hard for me to actually understand how that politics works or like how this, like, exactly how the things would play out. And Sam is really ex-self-nated. What do you think is rarest in the universe?

Starting point is 00:17:39 You said we might be alone. What's hardest to build is another engineering way to ask that. Life, intelligence or consciousness. So like you said that we might be alone, which is the thing that's hardest to get to. Is it just the origin of life? Is it the origin of intelligence? Is it the origin of consciousness? So let me at first explain you my kind of mental model, what I think is needed for life to appear. So I imagine that at some point there was this primordial soup of amino acids and maybe in some proteins were turning into some other proteins through reaction. And you can almost think about this cycle of what turns into what?

Starting point is 00:18:31 There is a graph, essentially describing which substance turns into some other substance. And essentially, life means that all the sudden in the graph has been created a cycle such that the same thing keeps on happening over and over again. That's what this need needed for life to happen. And in some sense, you can think almost that you have this gigantic graph and it needs like a sufficient number of edges for the cycle to appear. Then from perspective of intelligence and consciousness, my current intuition is that they might be quite intertwined.

Starting point is 00:19:05 First of all, it might not be that it's like a binary thing that you have intelligence or consciousness. It seems to be more a continuous component. Let's see, if we look for instance on the even networks recognizing images, people are able to show that the activations of these networks correlate very strongly with activations in visual cortex of some monkeys.

Starting point is 00:19:32 The same seems to be true about language models. Also, if you, for instance, look if you train agent in 3D world, at first, it barely recognizes what is going on. Over the time, it kind of recognizes foreground from background. Over the time, it kind of knows where there is a foot, and it just follows it. Over the time, it actually starts having a 3D perception. So it is possible, for instance, to look inside of the head of an agent and ask what would it see if it looks to the right. And the crazy thing is, you know, initially when

Starting point is 00:20:11 the agents are where they trained, these predictions are pretty bad. Over the time, they become better and better. You can still see that if you ask what happens when the head is turned by 360 degrees, for some time, they think that the different thing appears. And at some stage, they understand, actually, that the same thing supposed to appear. So they get an understanding of 3D structure. It's also very likely that they have inside some level of symbolic reasoning, like they're

Starting point is 00:20:43 particularly symbols for other agents. So when you look at data agents, they collaborate together. And they have some anticipation of if they would win battle, they have some expectations with respect to other agents. I might be too much anthropomorphizing and how the things look look look for me. But then the fact that they have a symbol for other agents

Starting point is 00:21:13 makes me believe that at some stage as the, you know, as they are optimizing for skills, they would have also symbol to describe themselves. This is like a very useful symbol to have. And this particularity, I would call it like a self consciousness or self awareness. And still it might be different from the consciousness. So I guess the way how I'm understanding the word consciousness, I'd say the experience of drinking a coffee or let's say experience of being a bat. That's the meaning of the word consciousness. It doesn't mean to be awake. Yeah, it feels, it might be also somewhat related to memory and recurrent connections.

Starting point is 00:21:54 So it's kind of like if you look at anesthetic drugs, they might be like a, they essentially they they disturb brainwaves such that they maybe memory is not not formed. And so there's a lessening of consciousness when you do that. Correct. And so that's the one way to intuit what is consciousness. There's also kind of another element here. It could be that it's this self-awareness module that you described. Plus, the actual subjective experience is a storytelling module that tells us a story about what we're experiencing. The crazy thing, so let's say, I mean, in meditation, they teach people not to speak story inside of the head. And there is also some fraction of population who doesn't have actually narrator. I know people who don't have a narrator, and you know, they have to use external people in order to kind of solve tasks that require internal narrator. So it seems that it's possible to have the experience without the talk.

Starting point is 00:23:14 What are we talking about when we talk about the internal narrator? Is that the voice when you're like, we be people? That's what you are referring to. Well, I was referring more on it, like, not an actual voice. I meant, like, there's some kind of, like subjective experience feels like it's, it's fundamentally about storytelling to ourselves. It feels like, like, the feeling is a story that is much much simpler abstraction than the raw sensory information. Most useful aspect of it is that

Starting point is 00:24:11 because I'm conscious, I think there's an intricate connection to me not wanting to die. So like, it's a useful hack to really prioritize not dying. Like those seem to be somehow connected. So I'm telling a story of like it's richly feels like something to be me.

Starting point is 00:24:33 And the fact that me exists in this world, I wanna preserve me. And so that makes it a useful agent hack. So I will just refer maybe to that first part, as you said, about the kind of story of describing who you are. I was thinking about it even, so obviously I like thinking about consciousness. I like thinking about AI as well, and I'm trying to see analogies of these things in AI

Starting point is 00:25:01 what would it correspond to? So, you know, open AI, train a model called GPT, which can generate pretty amusing text on arbitrary topic. And one way to control GPT is by putting into prefix at the beginning of the text some information what would be the story about. You can have even chat with GPD by saying that the chat is with Lex or Elon Musk or so. And GPD would just pretend to be you or Elon Musk or so and it almost feels that this story that we give ourselves to describe our life it's almost like a things that you put into context of GPD. Yeah, the primary is the and but the context we provide the GPT is multimodal. So GPT itself is multimodal.

Starting point is 00:26:08 GPT itself hasn't learned actually from experience of single human, but from the experience of humanity. It's a chameleon. You can turn it into anything. And in some sense, by providing context, it, you know, behaves as the thing that you wanted it to be. And it's interesting that the, you know, people have a stories of who they are. And as is the, this story is they help them to operate in the world. But it's also, you know, interesting, guess various

Starting point is 00:26:39 people find it out through meditation or so, that there might be some patterns that you have learned when you were a kid that actually are not serving you anymore. And you also might be thinking that that's who you are and that's actually just a story. Yeah, so it's a useful hack, but sometimes it gets us into trouble, it's a local aftermath. It's a local aftermath. You wrote that Stephen Hawking tweeted, Stephen Hawking asked what breathes fire into equations, which meant what makes given mathematical equations realize the physics of a universe. Similarly, I wonder what breathes fire into computation, what makes given computation conscious.

Starting point is 00:27:22 Okay. makes given computation conscious. Okay, so how do we engineer consciousness? How do you breathe fire and magic into the machine? So it seems clear to me that not every computation is conscious. I mean, you can, let's say just keep on multiplying one matrix over and over again, and maybe gigantic matrix, you can put a lot of computation. I don't think it would be conscious.

Starting point is 00:27:46 So in some sense, the question is, what are the computations which could be conscious? I mean, so one assumption is that it has to do purely with computation that you can abstract away matter. Other possibilities, it's very important was the realization of computation that it has to do with some four skills or so and they bring consciousness. At the moment my intuition is that it can be fully abstracted away. So in case of computation, you can ask yourself, what are the mathematical objects or so that could bring such a properties. So for instance, if we think about the models, AI models, what they truly try to do

Starting point is 00:28:30 are like models like GPD, as they try to predict the next world or so, and this turns out to be equivalent to compressing text. And because in some sense, compression means that you learn the model of reality and you have just to remember where are your mistakes. The battery are in predicting the end. And in some sense, when we look at our experience, also when you look for instance at the card driving, you know in which direction it will go, you are good like in prediction.

Starting point is 00:29:07 And you know, it might be the case that the consciousness is intertwined with compression. It might be also the case that self-consciousness has to do with compressors trying to compress itself. So, okay, I was just wondering what are the objects in mathematics or computer science which are mysterious that could have to do with consciousness. And then I thought, you see in mathematics there is something called Gideot-Yoram which means if you have sufficiently complicated mathematical system, it is possible to point the mathematical system back on itself.

Starting point is 00:29:50 In computer science, there is something called halting problem. It's somewhat similar construction. So I thought that if we believe that other assumption that consciousness has to do with compression, then you could imagine that the ACR keep on compressing things, then at some point it actually makes sense for the compressor to compress itself. Meta compression. Consciousness is meta compression. That's an idea. And in some sense, some sense, the crazy...

Starting point is 00:30:26 I love it. Thank you. But do you think if we think of a touring machine, a universal touring machine, can that achieve consciousness? So is there something beyond our traditional definition of computation that's required? So, it's a specific computation and I said this computation has to do with compression. And the compression itself, maybe other way of putting it is like you are internally creating the model of reality.

Starting point is 00:30:55 In order like a you try insight to simplify reality in order to predict what's going to happen. And that also feels somewhat similar to how I think actually about my own conscious experience. So clearly I don't have access to reality. The only access to reality is through you know, cable going to my brain. And my brain is creating a simulation of reality. And I have access to the simulation of reality. Are you bringing a chance aware of the hotter prize, Marcus Hutter, he made this prize for compression of Wikipedia pages, and there's a few qualities to it. One, I think has to be perfect compression, which makes, I think that little quirk makes it

Starting point is 00:31:41 much less applicable to the general task of intelligence because it feels that intelligence is always going to be messy. Like perfect compression feels like it's not the right goal, but nevertheless a very interesting goal. So for him, intelligence equals compression. And so the smaller you make the file, given a large Wikipedia page, the more intelligent the system has to be. And that makes sense. So you can make perfect compression if you store errors. And I think that actually what he meant is you have algorithm plus errors. And by the

Starting point is 00:32:17 way, who there is a he was a he was a he is the advisor of chanck, who is a deep-mind co-founder. Yeah, so there's an interesting, and now he's a deep-mind, there's an interesting network of people. He's one of the people that I think seriously took on the task of what would an AGI system look like? I think for a long as time, took on the task of what would an AGI system look like? I think for a long as time, the question of AGI

Starting point is 00:32:49 was not taken seriously or rather rigorously and he did just that. Like mathematically speaking, what would the model look like? If you remove the constraints of it having to be, if you remove the constraints of it having to be having to have a reasonable amount of memory, reasonable amount of running time complexity, computation time, what would it look like? And essentially, it's a half math, half philosophical discussion of how would it, like, a reinforcement learning type of framework look like for an AGI.

Starting point is 00:33:26 Yeah, so he developed a framework even to describe what's optimal with respect to reinforcement learning. Like there is a theoretical framework, which is, as you said, under assumption there is infinite amount of memory and compute. There was actually one person before his name is Solomonov, who they're extended Solomonov work to reinforcement learning,

Starting point is 00:33:47 but there exists a theoretical algorithm which is optimal algorithm to build intelligence and I can actually explain to the algorithm. Yes. Let's go, let's go. So the task itself, you can- Can I just pause how absurd it is for brain in a skull trying to explain the algorithm for intelligence.

Starting point is 00:34:09 Just go ahead. It is pretty crazy. It is pretty crazy that the brain itself is actually so small and it can ponder. How does that algorithm that optimally solves the problem of intelligence? Okay. So what's the other? So let's see. So first of all, the task itself is describe as, you have infinite sequence of zeros and ones.

Starting point is 00:34:33 Okay, you read N bits and you are about to predict N plus one bit. So that's the task and you could imagine that every task could be casted as such a task. So if, for instance, you have images and labels. You can just turn every image into sequence of zeros and ones. Then label, you concatenate labels. And that's actually the,

Starting point is 00:34:54 and you could, you could start by having training data first. And then afterwards, you have test data. So theoretically, any problem could be casted as the problem of predicting zeros and ones on this infinite type. So, let's say you read already n bits and you want to predict n plus 1 bit. And I will ask you to write every possible program that generates these N bits. So, and you can have, you choose programming language. It can be in Python or C++. And the difference between programming languages

Starting point is 00:35:32 might be, there is a difference by constant. As symptomically, your predictions will be equivalent. So, you read N bits, you enumerate all the programs that produce these N bits in their output. And then in order to predict n plus 1 bit, you actually await the programs according to their length. And there is like a some specific formula how you await them. And then the n plus 1 bit prediction is the prediction from each of this program according to that weight. Like statistically, statistically, yeah.

Starting point is 00:36:09 So the smaller the program, the more likely you are to pick the its output. So that's that algorithm is grounded in the hope or the intuition that the simple answer is the right one. It's a formalization of it. Yeah. And it also means like if you would ask the question after how many years would you know, Sun explode. You can say, it's more likely that there is two to some power because they're short on their program. Yeah. And then other. Why I don't have a good intuition about how different the space of short programs are from the space of large programs.

Starting point is 00:36:54 Like, what is the universe for short programs? Like run things. So, as I said, the things have to agree with Enbits. So even if you need to start, if you have a very short program and they're like, I feel some, if it's not a perfect with prediction of Enbits, you have to start errors. What are the errors?

Starting point is 00:37:16 And that gives you the full program that agrees on Enbits. Oh, so you don't agree perfectly with the Enbits and you store a longer program, slightly longer program because it contains these extra bits of errors. That's fascinating. What's your intuition about the programs that are able to do cool stuff like intelligence and consciousness? Are they perfectly, is there if-then statements in them?

Starting point is 00:37:48 So, is there a lot of exceptions that they're storing? So, you could imagine if there would be tremendous amount of each statements, then they wouldn't be that short. In case of neural networks, you could imagine that what happens is when you start with an initialized neural network, it stores internally, many possibilities how the problem can be solved, and HDD is kind of magnifying some paths which are slightly similar to the correct answer, so it's kind of magnifying correct programs. And in some sense, HDD is a search algorithm

Starting point is 00:38:29 in the program space and the program space is represented by kind of the wiring inside of the neural network. And there's like an insane number of ways how the features can be computed. Let me ask you the high level basic question that's not so basic. What is deep learning? Is there a way you'd like to think of it that is different than like a generic textbook definition? The thing that I hinted at just a second ago is maybe the closest to how I'm thinking

Starting point is 00:39:00 these days about deep learning. So now the statement is neural networks can represent some programs, seems that various modules that we are actually adding up to, like we want networks to be deep because we want multiple steps of the computation. And deep learning provides the way to represent space of programs which is searchable and it's searchable with stochastic gradient descent. So we have an algorithm to search over human goes number of programs and gradient descent kind of bubbles up the things that are tend to give correct answers. So a neural network with fixed weights,

Starting point is 00:39:48 that's optimized. Do you think that is a single program? So there is a work by Christopher Olaj, where he works on inter-pratability of neural networks, and he was able to identify inside of the neural network, for instance, a detector of a wheel for a car, or the detector of a mask for a car.

Starting point is 00:40:13 And then he was able to separate them out and assemble them together using a simple program for a car detector. That's like, if you think of traditionally defined programs, that's like a function within a program that this particular neural network was able to find. And you can tear that out, just like you can copy and paste from Stack Overflow.

Starting point is 00:40:35 That, so any program is a composition of smaller programs. Yeah, I mean, the nice thing about the neural networks is that it allows the things to be more fuzzy than in case of programs, in case of programs, you have this like a branching this way or that way. And the neural networks, they they have an easier way to to be somewhere in between or to share things. What to use the most beautiful, surprising idea and deep learning and the utilization of

Starting point is 00:41:07 these neural networks, which by the way for people who are not familiar, neural networks is a bunch of, what would you say? It's inspired by the human brain. There's neurons, there's connection between those neurons, there's inputs and there's outputs and there's millions or billions of those neurons and the learning happens by adjusting the weights on the edges that connect these neurons. Thank you for giving definition I supposed to do it but I guess you have enough empathy to listen there's to actually know that that might be useful. No that's like so I'm asking Plato of like what is the meaning of

Starting point is 00:41:46 like he's not going to answer you're being philosophical and deep and quite profound talking about the space of programs which is just very interesting but you also people who are just not familiar with the hell we're talking about when we talk about deep learning. Anyway, sorry, what is the most beautiful or surprising idea to you in in all the time you've worked at Deplorany? You worked on a lot of fascinating projects, applications of neural networks. It doesn't have to be big and profound. It can be a cool trick.

Starting point is 00:42:17 Yeah, I mean, I'm thinking about the trick, but like it's still amusing to me that it works at all. That let's say that the extremely simple algorithms to create in descent, which is something that I would be able to derive on the piece of paper to high school student, when put at the scale of thousands of machines actually can create the behaviors, which we call kind of human-like behaviors.

Starting point is 00:42:46 So in general, any application is to cast a gradient descent and neural networks is amazing to you. So, or is there a particular application in natural language, reinforcement learning, and also, would you attribute that success to, is it just scale? What profound insight can we take from the fact that the thing works for gigantic sets of variables? I mean, the interesting thing is these algorithms, they were invented decades ago. And people actually gave up on the idea. And you know, back then they thought that we need profoundly different algorithms

Starting point is 00:43:37 and they spent a lot of cycles on very different algorithms. And I believe that, you know, we have seen that various And I believe that we have seen that various innovations that say like transformer or drop out or so they can pass the help. But it's also remarkable to me that this algorithm from 60s or so. I mean, you can even say that the gradient descent was invented by light needs, I guess 18th century or so, that actually is the core of learning. In the past people are, it's almost like out of the maybe an ego, people are saying that it cannot be the case that such a simple algorithm is the, you know, could solve complicated problems. So they were in search for the other algorithms.

Starting point is 00:44:28 And as I'm saying, like I believe that actually, we are in the game where there is, there are actually frankly three levers. There is compute, there are algorithms, and there is data. And if we want to build intelligence systems, we have to pull all three levers, and they are actually multiplicative. It's also interesting, so you ask, is it only compute?

Starting point is 00:44:49 People internally they did the studies to determine how much gains they were coming from different levers. And so far we have seen that more gains came from compute than algorithms. But also we are in the world that in case of compute, is a kind of you know exponential increase in funding and at some point It's impossible to invest more. It's impossible to you know invest $10 trillion We are speaking about that. Let's say all taxes in US But you're talking about money. There could be innovation in the compute. That's that's true as well So I'm in there like a few pieces. So one piece is human brain is an incredible supercomputer.

Starting point is 00:45:30 And they're like a, it has 103 nanoparameters. Or like if you try to count the body's quantities in the brain, they're like a neuron synapses. They're small number of neurons. There is a lot of synapses. It's unclear even how to map synapses to parameters of neural networks, but it's clear that there are many more. So it might be the case that our networks are still somewhat small.

Starting point is 00:46:04 It also might be the case that they are more efficient in brain or less efficient by some huge factor. I also believe that there will be, at the moment we are at the stage that these new round networks, they require a thousand X or like a huge factor of more data than humans do. And it will be a matter of of more data than humans do. And it will be a matter of, there will be algorithms that vastly decrease sample complexity, I believe so. But that place where we are heading today

Starting point is 00:46:33 is Dark Domains, which contains million X more data. And even though computers might be 1000 times slower than humans in learning, that's not a problem. Like, for instance, I believe that it should be possible to create super human therapist by, and they're like even simple steps of doing it. And the core reason is there is just machine will be able to read way more transcripts

Starting point is 00:47:06 of therapies and then it should be able to speak simultaneously with many more people and it should be possible to optimize it all in parallel. And well, there's now you're touching on something I deeply care about and think is way harder than we imagine. What's the goal of the therapist? What's the goal of a therapist? What's the goal of therapy? So, okay, so one goal, now this is terrifying to me, but there's a lot of people that contemplate suicide, self-affirmative depression, and they could significantly be helped with therapy,

Starting point is 00:47:41 and the idea that an AI algorithm might be in charge of that. It's like a life and death task. It's the stakes are high. So one goal for a therapist, whether a human or AI is to prevent suicide ideation, to prevent suicide. How do you achieve that? suicide ideation to prevent suicide. How do you achieve that? So, let's see.

Starting point is 00:48:07 So, to be clear, I don't think that the current models are good enough for such a task, because it requires insane amount of understanding and patty, and the models are far from this place. But it's... But do you think that understanding empathy that signals isn't the data? I think there is something now in the data.

Starting point is 00:48:26 Yes, I mean, there are plenty of transcripts of conversations. And it is possible to understand personalities. It is possible from it to understand if conversation is friendly, amicable, antagonistic. It is, I believe that, you know, given the fact that the models that we train now, they can have, they are chameleons that they can have any personality.

Starting point is 00:48:57 They might turn out to be better in understanding personality of other people than anyone else. And they're too pathetic. To be empathetic. Yeah, interesting, but I wonder if there's some level of multiple modalities required to be able to be empathetic of the human experience, whether language is not enough to understand death, to understand fear, to understand childhood trauma, to understand wit and humor required when you're dancing with a person

Starting point is 00:49:35 who might be depressed or suffering, both humor and hope and love and all those kinds of things. So, there's another underlying question which is self-supervised versus supervised. So can you get that from the data by just reading a huge number of transcripts? I actually, so I think that reading a huge number of transcripts is a step one. It's like at the same way as you cannot learn

Starting point is 00:50:02 to dance just from YouTube by watching it, you have to actually try it out yourself. Yeah. So I think that here, that's a similar situation. I also wouldn't deploy the system in the high-stakes situations right away, but kind of see gradually where it goes. And obviously initially, it would have to go hand in hand with humans. But at the moment, we are in the situation that actually, there is many more people who actually would like to have a therapy or speak with someone,

Starting point is 00:50:35 then there are therapies out there. I was so fondamentally, I was thinking, what are the things that Fundamental, I was thinking what are the things that can vastly increase people well-being? Therapy is one of them. I think meditation is other one. I guess maybe human connection is a third one, and I guess pharmacologically it's also possible. Maybe direct brain stimulation or something like that, but these are pretty much options out there. Then let's say the way I'm thinking about the H.I. and Devor is by default that's an endeavor to increase amount of wealth. And I believe that we can vast the increase amount of wealth for everyone. And simultaneously, so I mean, there are like two endeavors that make sense to me. One is like essentially increase amount of wealth. And second one is increase overall human well-being. And those are coupled together.

Starting point is 00:51:27 And they can, I would say, these are different topics. One can help another. And therapists is a funny word, because I see friendship and love as therapy. I mean, so therapists broadly define as just friendship as a friend. So therapists has a very kind of clinical sense to it, but what is human connection? You're like not to get all Kamu and Dusty Eski on you, but life is suffering and we draw,

Starting point is 00:52:01 we seek connection with other humans as we desperately try to make sense of this world in the deep, overwhelming loneliness that we feel inside. So I think connection has to do with understanding. And I think that almost like a lack of understanding causes suffering. If you speak with someone until you you, do you feel ignored that actually causes pain? If you are feeling deeply understood that actually, they might not even tell you what to do in life, but like pure understanding or just being heard. Understanding is a kind of, it's a lot, you know, just being heard. Feel like you're being heard. Like somehow that's alleviation temporarily

Starting point is 00:52:47 of the loneliness. That if somebody knows you're here with their body language, with the way they are, with the way they look at you, with the way they talk, you feel less alone for a brief moment. Yeah, very much agree. So I thought in the past about somewhat similar question to yours, which is what is love, rather what is connection.

Starting point is 00:53:12 And, and obviously I think about these things from AI perspective, what would it mean? So I said that the, you know, intelligence has to do with some compression, which is more or less like you can say, almost understanding of what is going around. It seems to me that other aspect is there seem to be reward functions and you can have a reward for food, for maybe human connection, for let's say, warmth, sex, and so on. And it turns out that the body's people might be optimizing slightly different reward functions. They essentially might care about different things.

Starting point is 00:53:55 And in case of love, at least the love between two people, you can say that the, you know, boundary between people dissolves to such extent that they end up optimizing each other reward functions. Mm-hmm. Yeah. Oh, that's interesting. Celebrate the success of each other.

Starting point is 00:54:18 Yeah, or in some sense, I would say love means helping others to optimize their reward functions. Not your reward functions, not the things that you think are important, but the things that the person cares about, you'll try to help them to optimize it. So love is, you think of two reward functions, it just is a condition. Yeah.

Starting point is 00:54:40 You combine them together. Yeah, pretty much. Maybe like with a weight, it depends like the dynamic of the relationship. Yeah, I mean, you could imagine that if you are fully optimizing someone's reward function without the earth, then then maybe are creating code dependency or something like that. Yeah. I'm not sure what's the appropriate weight, but the interesting thing is I even think that the individual person, we ourselves, we are actually less of a unified insight. So for instance,

Starting point is 00:55:09 if you look at the donut on the one level, you might think, oh, this is like a lookstaste, I would like to eat it on other level, you might tell yourself, I shouldn't be doing it because I want to gain muscles. So, and you know, you might do it regardless, kind of against yourself. So it seems that even within ourselves, they're almost like a kind of intertwined person as. And I believe that the self-love means that the love between all these personas, which also means being able to love, love yourself when we are angry or stressed or so. Combining all those reward functions of the different selves you have. Yeah, and accepting that they are there.

Starting point is 00:55:50 Like, you know, often people, they have a negative self-talk or they say, I don't like when I'm angry. And like a triton imagine, triton imagine, if there would be, like a small baby, like a five- old angry and then you're like, you shouldn't be angry like a stop being angry. But like, instead, actually, you want the legs to come over, give him a hug and it's fine. You can't be angry as long as you want. And then he would stop. Or maybe not. Or maybe not, but you cannot expect it even.

Starting point is 00:56:25 Yeah, but still that doesn't explain the why of love. Like why is love part of the human condition? Why is it useful to combine the reward functions? It seems like that doesn't I mean, I don't think reinforcement learning frameworks can give us answers to why even even the hotter framework has an objective function that's static. So we came to existence as a consequence of evolutionary process and in some sense the purpose of evolution is survival and then this complicated optimization objective baked into us let's say compression which might help us us operate in the reward and it baked into us various reward functions. Yeah. And then to be clear, at the moment, we are operating in the regime, which is somewhat out of distribution, where the event evolution optimized us.

Starting point is 00:57:17 It's almost like love is a consequence of cooperation that we've discovered is useful. Correct. In some way, it's even the case if you... I just love the idea that love is like the out of distribution. Or it's not out of distribution. It's like as you said, it evolved for cooperation. Yes. And I believe that the co... like an in some sense, cooperation ends up helping each of us individually, so it makes sense evolutionary. And there is a in some sense, and you know, love means there is this solution of boundaries

Starting point is 00:57:47 that you have a shared reward function. And we evolve to actually identify ourselves with larger groups. So we can identify ourselves with a family. We can identify ourselves with a country. To such extent that people are willing to give away their life for country. So there is, we are wired actually even for love.

Starting point is 00:58:09 And at the moment, I guess, maybe it would be somewhat more beneficial if we would identify ourselves with all the humanity as a whole. So you can clearly see when people travel around the world, when they run into person from the same country, they say, which CPR and all this, like all the same, they find all these similarities, they find some, they befriend those folks earlier than others. And so there is like a sense, some sense of the belonging. And I would say, I think it would be overall good thing to the world for people to move towards, I think it's even called open individualism, move toward the mindset of a larger and larger groups. So the challenge there, that's a beautiful vision and I share it to expand that circle

Starting point is 00:58:59 of empathy, that circle of love towards the entirety of humanity, but then you start to ask, well, where do you draw the line? Because why not expand it to other conscious beings? And then finally, for our discussion, something I think about is why not expanded to AI systems. Like we start respecting each other when the person, the entity on the other side has the capacity to suffer because then we develop a capacity to sort of empathize.

Starting point is 00:59:31 And so I could see AI systems that are interacting with humans more and more having conscious like displays. So like they display consciousness through language and through other means. And so then the question is like, well, is that consciousness? Because they're acting conscious. And so, you know, the reason we don't like torturing animals is because they look like they're suffering when they're tortured. And if AI looks like it's suffering when it's tortured, how is that not requiring

Starting point is 01:00:12 of the same kind of empathy from us and respect and rights that animals do it? I think it requires empathy as well. I mean, I would like, guess us or humanity, or so make a progressing understanding what consciousness is because I don't want just to be speaking about the philosophy, but this force they are alive. And I think that there is actually a path to understand exactly what consciousness is. And in some sense, it might require essentially putting probes inside of a human brain, what Neuralink does. So to go there, I mean, there's several things

Starting point is 01:01:04 with consciousness that make it a real discipline, which is one is rigorous measurement of consciousness. And then the other is the engineering of consciousness, which may or may not be related. I mean, you could also run into trouble, like for example, in the United States, for the Department of DOT Department of Transportation, and a lot of different places put a value in human life. I think DOT's values $9 million per person. So in that same way, you can get into trouble if you put a number on how conscious it being is. Because then you can start making policy. If a cow is 0.1 or like 10% as conscious as a human, then you can start making calculations and might get you into trouble.

Starting point is 01:01:54 But then again, that might be a very good way to do it. I would like to move to that place that actually we have scientific understanding what consciousness is. And then we'll be able to actually assign value. And I believe that there is even the path for the experimentation in it. So, you know, we said that, you know, you could put the probes inside of the brain. There is actually few other things that you could do with devices like neural link. So you could imagine that the way even to measure

Starting point is 01:02:26 if AI system is conscious is by literally just plugging into the brain. I mean that assumes that it's kind of easy, but the plugging into the brain and asking person, if they feel that their consciousness expanded. This direction of course has some issues. You can say, you know, if someone takes a psychedelic drug, they might feel that their consciousness expanded, even though that drug itself is not conscious. Right. So, like, you can't fully trust the self-report of a person saying their consciousness is expanded or not. Let me ask you a little bit about psychedelics.

Starting point is 01:03:01 There have been a lot of excellent research on different psychedelics. Scybin MDMA, even DMT, drugs in general, marijuana too. What do you think psychedelics do to the human mind? It seems they take the human mind to some interesting places. Is that just a little hack, a visual hack, or is there some profound expansion of the mind? So, let's see. I don't believe in magic, I believe in science, in causality. Still, let's say, and then, as I said, I think that the brain, that our subjective experience of reality is,

Starting point is 01:03:49 we live in the simulation run by our brain, and the simulation that our brain runs, they can be very pleasant or very halish. Drugs, they are changing some hyperparameters of the simulation. It is possible thanks to change of these hyperparameters to actually look back on your experience and even see that the given things that we took for granted, they are changeable. So they allow to have an amazing perspective. There is also, for instance, the fact that after the empty people can see the full movie inside of their head gives me further belief that brain can generate the full movie that the brain is actually learning

Starting point is 01:04:35 the model of reality to such extent that it tries to predict what going to happen next. Yeah, very high resolution. So it can replay realities. That's true. Extremely high resolution. Yeah, it's also kind of interesting to me that somehow there seems to be some similarity between these drugs and meditation itself. And I actually started even these days to think about meditation as a psychedelic. Do you practice meditation? I practice meditation. I once, a few times on the retreats,

Starting point is 01:05:08 and it feels after a second or third day of meditation. There is almost a sense of tripping. What is the meditation retreat in tail? You wake up early in the morning and you meditate for the extended period of time. And that's just so you can... Yeah, so it's optimized even though there are other people, it's optimized for isolation. So you don't speak with anyone, you don't actually look into other people's eyes. And you know, you don't actually look into other people's eyes. And you know, you see down that chair, say, if it's a meditation, it tells you to focus on the breath. So you try to put all the all attention into breathing and breathing in and breathing out. And the

Starting point is 01:06:01 crazy thing is that as you focus attention like that, after some time, there's times starts coming back like some memories that you'll completely forgotten. It almost feels like you have a mailbox, and then you are just like an archiving email one by one. And at some point, there is this like an amazing feeling of getting to mailbox zero, zero emails. And it's very pleasant. It's kind of, it's crazy to me that

Starting point is 01:06:38 once you resolve these inner stories, or like an inner traumas, then once there is nothing left, the default state of human mind is extremely peaceful and happy. Extreme like a, some sense, it feels that the, it feels at least to me the way

Starting point is 01:07:04 how when I was a child that I can look at any object and it's very beautiful. I have a lot of curiosity about the simple things and that's where the usual meditation takes me. Are you, what are you experiencing? Are you just taking in simple sensory information and they're just enjoying the rawness of that sensory information. So there's no memories or all that kind of stuff you're just enjoying being. Yeah pretty much. I mean still there is a

Starting point is 01:07:37 that it's it's thoughts are slowing down sometimes they pop up but it's also somehow the extended meditation takes you to the space that they are way more friendly, way more positive. There is also this thing that we've extracted. It almost feels that we are constantly getting little bit of a reward function and we are just spreading this reward function on a body's activities. But if you stay still for the extent period of time, it kind of accumulates, accumulates, accumulates. And there is a sense, there is a sense that at some point it passes on threshold and it feels as

Starting point is 01:08:33 Drop is falling into kind of ocean of love and bliss and that's like a this is like a very pleasant and that's I'm saying like a That corresponds to the subjective experience some people I guess in spiritual community they describe it that that's the reality and I I would say I believe that there are like all sorts of subjective experience that one can have. And I believe that for instance meditation might take you to the subjective experiences which are very pleasant, collaborative. And I would like a word to move toward a more collaborative place. Yeah, I would say that's very pleasant. And I enjoyed doing stuff like that. I, I, I wonder how that maps to your mathematical model of love

Starting point is 01:09:13 with the reward function combining a bunch of things. It seems like our life, then is we're just, we have this reward function and we're accumulating a bunch of stuff in it with weights. It's like multi-objective. And what meditation is is you just remove them, remove them until the weight on one or just a few is very high. And that's where the pleasure comes from. Yeah, so something similar, how I'm thinking about it. So I told you that there is a story of who you are. And I think almost about it as a text prependant to GPP.

Starting point is 01:09:59 Yeah. And some people refer to it as ego. Okay. It's like a story who you are, okay? So ego. Okay, this I can start it. Who, who, who you are? Okay. So ego is the prompt for GPT three. Yeah, GPT. Yes. And that's the description of you.

Starting point is 01:10:12 And then with meditation, you can get to the point that actually you experience things we found the prompt. And the experience things like as they are, you are not biased over the description how they supposed to be. That's very pleasant. And then we've respected the reward function. It's possible to get to the point that there is this solution of self. And therefore you can say that the you're having a, you're like a your brain attempts to simulate the reward function of everyone else or like everything. That's that there is this like a love which feels like at oneness with everything and that's also you know very beautiful very pleasant at some point you might have a lot of altruistic thoughts during the at that moment and then that self always comes back. How would you recommend if somebody is interested in meditation like a big thing to take on as a project. Would you recommend

Starting point is 01:11:05 a meditation retreat? How many days, what kind of thing would you recommend? I think that actually retreat is the way to go. It almost feels that meditation is a psychedelic, but when you take it in the small dose, you might better defy it. Once you get the high dose, actually, you're going to feel it. So even cold turkey, if you haven't really seriously meditated for prolonged period of time, just go to retreat. Yeah, I like it. I mean, days start weekend one weekend. So like two, three days. And it's like a, it's interesting that first or second day, it's hard. And and at some point it becomes easy. There's a lot of seconds in a day. How hard is the meditation retreat just sitting there in a chair? So the thing is actually it literally just depends on your

Starting point is 01:11:59 on death, your own framing. Like if you are in the mindset that you are waiting for it to be offered or you are waiting for Nirvana to happen, it will be very unpleasant. And in some sense, even the difficulty, it's not even in the lack of being able to speak with others. Like you are sitting there, your legs will hurt from sitting. In terms of like the practical things, do you experience kind of discomfort, like physical discomfort of just sitting like your butt being numb, your legs being sore, all that kind of stuff? Yes, you experience it. And then the they teach you to observe it, rather than it's like a, the crazy thing is, you at first might have a feeling toward trying to escape it. Yeah. And that becomes very

Starting point is 01:12:45 apparent that that's extremely unpleasant. And then you just just observe it. And at some point, it just becomes, it just is. It's like, I remember that we've, Iljat told me some time ago that, you know, he takes the cold shower. And he's mindset of taking a cold shower. And his mindset of taking a cold shower was to embrace suffering. Yeah, excellent. I do the same. Does he or style? Yeah, it's my style.

Starting point is 01:13:12 I like this. So my style is actually, I also sometimes take cold showers. It is purely observing how the water goes through my body, like a purely being present, not trying to escape from there. Yeah. And I would say then it actually becomes pleasant. It's not like, ah! Well, that's interesting. I'm also, that's the way to deal with anything really difficult, especially in the physical space, is to observe it, to say it's pleasant. Hmm, I would use a different word. You're, you're accepting of the full beauty of reality, I would say, because

Starting point is 01:13:57 say pleasant. But yeah, in some sense, it is pleasant. That's the only way to deal with a shower is to become an observer and to find joy in it Same with like really difficult physical Exercise or like running for a really long time endurance events Anytime you're exalt any kind of pain. I think the only way to survive it is not to resist it. It's to observe it You mentioned Ilya Ilya Satskever. It's very he's I like I think the only way to survive it is not to resist it. You should observe it. You mentioned Ilya. Ilya is a skewer. It's very... I wear a chief sandist, but I'll say he's very close friend of mine.

Starting point is 01:14:32 You co-founded OpenAid with you? I've spoken with him a few times. He's brilliant. I really enjoy talking to him. His mind, just like yours, works in fascinating ways. Both of you are not able to define deep learning simply. What's it like having him as somebody you have technical discussions with on in space machine learning, deep learning AI, but also life. What's it like when these two agents get into a self-place situation in a room? What's it like collaborating with him?

Starting point is 01:15:12 So I believe that we have extreme respect to each other. So I love Elias insight, both like I guess about consciousness, life, AI. But in terms of the, it's interesting to me because you're a brilliant, thinker in this space of machine learning, like intuition, like digging deep in what works, what doesn't, why it works, why it doesn't, and so is Ilya. I'm wondering if there's interesting deep discussions you've had with him in the past or disagreements that were very productive. So I can say, I also understood over the time where are my strengths. So obviously we have plenty of AI discussions.

Starting point is 01:16:03 And you know, I myself have plenty of ideas, but I consider Ilya one of the most prolific AI scientists in the entire world. And I think that I realized that maybe my super skill is being able to bring people to collaborate together, that I have some level of empathy, that is unique in AI world. And that might come from either meditation, psychedelics,

Starting point is 01:16:32 or let's say I read just hundreds of books on this topic. And I also went through a journey of, you know, I develop all sorts of algorithms. So I think that maybe I can, that's my super human skill. Ilya is one of the best AI scientists, but then I'm pretty good in assembling teams. And I'm also not holding to people. I'm growing people and then people become managers at OpenAI.

Starting point is 01:16:59 That's a room many of them, like a research man. So you find places where you're excellent and he finds like his deep scientific insights as where he is and you find ways you can deposit pieces fit together. Correct. You know, ultimately, for instance, let's say, Ilya, he doesn't manage people. That's not what he likes or so. I like hangout with people. I by default, I'm an extrovert, then I cared about people. Oh, it's interesting. Okay. Okay. Cool. So that that has perfectly together. But I mean, I also just like your intuition about various problems

Starting point is 01:17:39 and machine learning, he's definitely one I really enjoy. I remember talking to him about something I was struggling with, which is coming up with a good model for pedestrians, for human beings that cross the street in the context of autonomous vehicles. And he immediately started to like formulate a framework within which you can evolve a model for pedestrians like through self-play all that kind of mechanisms. The depth of thought on a particular problem, especially problems he doesn't know anything about is fascinating to watch. It makes you realize like yeah the the

Starting point is 01:18:23 limits that the human intellect might be a limitless. Or just impressive to see a descend of a comaple clever ideas. Yeah, I mean, so even in the space of deep learning when you look at various people, there are people now who invented some breakthroughs once, but there are very few people who did it multiple times. And you can think if someone invented it once, that might be just a sure luck. And if someone invented it multiple times, you know, if a probability of inventing it once is one over a million, then probability of inventing it twice

Starting point is 01:18:58 or three times would be one over a million squared or to the power of three, which would be just impossible. So it literally means that it's given that it's not the lack. And Eliy is one of these few people who have a lot of these inventions in his arsenal. It also feels that, for instance, if you think about folks like Gauze or Euler, at first they read a lot of books, and then they did thinking, and then they figure out math. And that's how it feels with Ilya. At first, he read stuff, and then he spent his thinking cycles.

Starting point is 01:19:43 And it's a really good way to put it. When I talk to him, I see thinking. He's actually thinking. Like, he makes me realize that there's like deep thinking that the human mind can do. Like, most of us are not thinking deeply. Like, you really have to put a lot of effort to think deeply. Like I have to really put myself in a place where I think deeply about a problem. It takes a lot of effort. It's like airplane taking off for something. You have to achieve deep focus. He's just, what is it? His brain is like a vertical takeoff in terms of airplane analogy. So it's interesting. But I mean,

Starting point is 01:20:27 Cal Newport talks about this as an idea of deep work. It's, you know, most of us don't work much at all in terms of like, like deeply think about particular problems, whether it's a math, engineering, all that kind of stuff. You want to go to that place often, and that's real hard work. And some of us are better than others at that. So I think that the big piece has to do with actually even engineering or environment that sets that it's conducive to that. So see, both Ilya and I on the frequent basis, we kind of disconnect ourselves from the world in order to be able to do extensive amount of thinking. Yes.

Starting point is 01:21:07 So, Ilya, usually, he just leaves iPad at hand. He loves his iPad. And for me, I'm even sometimes, you know, just going for a few days to different location to Airbnb. I'm turning off my phone and there is no access to me. And that's extremely important for me to be able to actually just formulate new thoughts to do deep work rather than to be reactive. And the older I am, the more of this

Starting point is 01:21:39 that I can run on tasks at hand. Before I go on to that thread, let me return to our friend GPT. Let me ask you another ridiculous big question. Can you give an overview of what GPT3 is or like you say in your Twitter bio GPTN plus one? How it works and why it works. So, GPT-3 is a human-ghost neural network. Let's assume that we know what is neural network, okay, by the definition. And it is trained on the entire internet

Starting point is 01:22:15 just to predict next word. So let's say it sees part of the article and it, the only task that it has at hand, it is to say what would be the next word, what would be the next word. And it becomes really exceptional at the task of figuring out what's the next word. So you might ask, why would this be an important task?

Starting point is 01:22:40 Why would it be important to predict what's the next word? And it turns out that a lot of problems can be formulated as a text completion problem. So, GPD is purely learning to complete the text. And you could imagine, for instance, if you are asking a question, who is president of the United States, then GPD can give you an answer to it. It turns out that many more things can be formulated this way. You can format text the way that you have sent as an English.

Starting point is 01:23:13 You make it even look like some content of a website, elsewhere which would be teaching people how to translate things between languages. So it would be EN, colon, text in English, FR, colon, and then you ask people, and then you ask model to continue. And it turns out that the such a model is predicting translation from English to French. The crazy thing is that this model can be used for way more sophisticated tasks. So you can format text such that it looks like a conversation between two people, and that might be a conversation between you

Starting point is 01:23:50 and Elon Musk. And because the model read all the texts about Elon Musk, it will be able to predict Elon Musk words as it would be Elon Musk. It will speak about colonization of Mars about sustainable future and so on. And it's also possible to even give arbitrary personality to the model.

Starting point is 01:24:12 You can say, here is a conversation that we've a friendly AI bot. And the model will complete the text as a friendly AI bot. So I mean, how do I express how amazing this is? So just to clarify a conversation, generating a conversation between me and Elon Musk, it wouldn't just generate good examples of what Elon would say. It would get the syntax all correct. So like interview style, it would say like Elon

Starting point is 01:24:46 Cohen and Lex Cohen, like it's not just like, uh, inklings of, uh, semantic correctness. It's like the whole thing, grammatical, syntactic, semantic, it's just really, really impressive generalization. Yeah, I mean, I also want to, you know, provide some caveat. So it can generate few paragraphs of coherent text, but as you go to longer pieces, it actually goes off the rails. If you try to write a book, it won't work out this way. What way does it go off the rails, by the way? Is there interesting ways in which it goes off the rails? What falls apart first? So the model is trained on the all-deexisting data that is out there, which means that it is not trained on its own mistakes. So for instance, if it would make a mistake, then I kept, so to give you

Starting point is 01:25:46 an example. So let's say I have a conversation with a model pretending that is Elon Musk, and then I start putting some, I'm start actually making up things which are not factual. I would say, I like Twitter. But I got you, sorry. I don't know, I would say that Elon is my wife. And the model will just keep on carrying it on. As if it's true. Yes. And in some sense, if you would have a normal conversation with Elon, he would be what the fuck?

Starting point is 01:26:22 Yeah, there would be some feedback between. So the model is trained on things that humans have written. But through the generation process, there's no human in the loop feedback. Correct. That's fascinating. It makes us. So it's magnified. It's like the errors get magnified and magnified. And it's also interesting.

Starting point is 01:26:44 I mean, first of all, humans have the same problem. It's just that we will make fewer errors, and magnify the errors slower. I think that actually what happens with humans is if you have a wrong belief about the word as a kid, then very quickly, we'll learn that it's not correct because they are grounded in reality and they are learning from your new experience. Yes. But do you think the model can correct itself too?

Starting point is 01:27:10 Wanted through the power, the representation, and so the absence of Elon Musk being your wife, information on the internet, want to correct itself. There won't be examples like that. So the errors will be subtle at first. Satellite first. And in some sense, you can also say that the data that is net out there is the data, which would represent how the human learns.

Starting point is 01:27:37 That's an, and, and maybe model would be learned through in such a data than it would be better off. How intelligent is GPT 33, do you think? When you think about the nature of intelligence, it seems exceptionally impressive. But then if you think about the big AGI problem, is this footsteps along the way to AGI? So, let's see.

Starting point is 01:28:01 Seems that intelligence itself is a multiple axis of it. And I would expect that the the systems that we are building they made end up being superhuman on some axis and subhuman on some other axis. It would be surprising to me on all axis simultaneously they would become superhuman. Of course people ask this question is GPT a spaceship that that would take us to to Moon or are we putting a building a

Starting point is 01:28:31 ladder to heaven that we are just building bigger and bigger ladder and we don't know in some sense which one of these two is better. I'm trying it. I like stairway to heaven as a good song. I'm not exactly sure which one is better. We are saying like the spaceship to the moon is actually effective. Correct, so people who criticize GPD, they say the archive is just building a taller ladder and it will never reach the moon. And at the moment, I would say the way I'm thinking is this is like a scientific question. And I'm also in heart, I'm a builder creator.

Starting point is 01:29:12 And like I'm thinking, let's try out, let's see how far it goes. And so far we see constantly that there is a progress. Yeah. So do you think GPT-4, GPT, GPT N plus 1 will, there'll be a phase shift, like a transition to a place where it would be truly surprised. Then again, like GPT 3 is already very, like, truly surprising. The people that criticize GPD 3 as is there, as what is it ladder to heaven, I think to quickly get accustomed to how impressive it is, that the prediction of the next word

Starting point is 01:29:52 can achieve such depth of semantics, accuracy of syntax, grammar, and semantics. Do you think GPD 4 and 5 and 6 will continue to surprise us? I mean, definitely, there will be more impressive models. There is a question, of course, if there will be a phase shift. And also, even the way I'm thinking about these models is that when we build these models, you know, we see some level of the capabilities, but we don't even fully understand everything that the model can do.

Starting point is 01:30:29 And actually, one of the best things to do is to allow other people to probe the model, to even see what is possible. Hence the using GPD as an API and opening it up to the world. Yeah, I mean, so when I'm thinking from perspective of, I get there like a, obviously, various people that have concerns about AI, including myself. And then when I'm thinking from perspective,

Starting point is 01:30:56 what's the strategy even to deploy these things to the world? Then the one strategy that I have seen many times working is the iterative deployment that you deploy slightly better versions and you allow other people to criticize you. So you actually, or try it out, you see where are their fundamental issues. And it's almost you don't want to be in that situation

Starting point is 01:31:19 that you are holding into powerful system and there's like a huge overhang, then you'll deploy it and it might have a random chaotic impact on the world. So you actually want to be in this situation that the are the gradually deploying systems. I asked this question, Emilio, let me ask you this question.

Starting point is 01:31:40 I've been reading a lot about Stalin and power. If you're in possession of a system that's like AGI that's exceptionally powerful, do you think your character integrity might become corrupted, like famously power corrupts and absolute power corrupts and absolutely power corrupts absolutely. So I believe that you want at some point to work toward distributing the power. I think that you want to be in this situation that actually HGI is not controlled by a small number of people, but essentially by a larger collective. So the thing is, that requires a George Washington style move. In the ascent to power, there's always a moment

Starting point is 01:32:33 when somebody gets a lot of power and they have to have the integrity and the moral compass to give away that power. That humans have been good and bad throughout history at this particular step. And I wonder, I wonder, we like blind ourselves in, for example, between nations, a race towards, yeah, AI race between nations. We might blind ourselves and justify to ourselves the development of AI without distributing the power, because we want to defend ourselves against China, against Russia, that kind of logic.

Starting point is 01:33:14 I wonder how we design governance mechanisms that prevent us from becoming power hungry and in the process destroying ourselves. So, let's see. I have been thinking about this topic quite a bit, but I also want to admit that once again, I actually want to rely way more on some Altman, on it, he wrote and he wrote an excellent block on how even to distribute wealth. And he's propos, he proposes in his block to tax equity of the companies rather than profit and to distribute it. And this is, this is an example of Washington move.

Starting point is 01:34:00 I guess I personally have insane trust in some. He already spent plenty of money running a universal basic income project that gives me, I guess, maybe some level of trust to him, but I also, I guess, love him as a friend. Yeah, I wonder because we're sort of summoning a new set of technologies. I

Starting point is 01:34:26 wonder if we'll be cognizant like you're describing the process of open AI, but it could also be at other places like in the US government, right? both China and the US are now Full-stream ahead on autonomous weapons systems development. And that's really worrying to me because in the framework of something being a national security danger or a military danger, you can do a lot of pretty dark things that blind our moral compass. And I think AI will be one of those things. In some sense, the mission and the work you're doing in OpenAI is like the counterbalance to that. So you want to have more OpenAI

Starting point is 01:35:14 unless autonomous weapons systems. I like this statement. It's like it will be clear, like, this is interesting and I'm thinking about it myself but this applies that I put my trust actually in Sam's hands because it's extremely hard for me to reason about it. Yeah, I mean one important statement to make is it's good to think about this. Yeah, no question about it. No question. Even low level, quote unquote, engineer. Even like low level quote unquote engineer. Like there's such a, I remember I programmed a car, our C car. They went really fast like 30, 40 miles an hour.

Starting point is 01:35:56 And I remember I was like sleep deprived. So I programmed it pretty crampily. And it like the code froze. So it's doing some basic computer vision and it's going around on track, but it's going full speed. And there's a bug in the code that the car just went. It didn't turn. It went straight full speed and smashed into the wall.

Starting point is 01:36:21 And I remember thinking the seriousness with which you need to approach the design of artificial intelligence systems and the programming of artificial intelligence systems is high because the consequences are high. That little car smashing into the wall, for some reason I immediately thought of an algorithm that controls nuclear weapons, having the same kind of bug. And so the lowest level engineer and the CEO of a company all need to have this seriousness in approaching this problem and thinking about the worst case consequences.

Starting point is 01:36:56 So I think that is true. I mean, what I also recognize in myself and others even asking this question is that it evokes a lot of fear. And the fear itself ends up being actually quite debilitating. The place where I arrived at the moment might sound cheesy or so, but it's almost to build things out of love rather than fear. build things out of love rather than fear. Yeah. I can focus on how I can maximize the value, how the systems that I'm building might be useful.

Starting point is 01:37:35 I'm not saying that the fear doesn't exist out there, and it totally makes sense to minimize it, but I don't want to be working because I'm scared. I want to be working out of passion, out of curiosity, out of the, you know, looking forward for the positive future. With the definition of love arising from a rigorous practice of empathy. So not just like your own conception of what is good for the world, but always listening to others. Correct. Like at the love where I'm constantly doing reward functions of others. Others to in fit limit to infinity is like a sum of like one to N where N is seven billion or whatever it is.

Starting point is 01:38:17 Not not projecting my reward functions on others. Yeah, exactly. Okay. Can we just take a step back to something else super cool, which is OpenAI Codex? Can you give an overview of what OpenAI Codex and GitHub co-pilot is, how it works, and why the hell it works so well? So we've GPT3, we noticed that the system, you know, the system training on all the language out there started having some rudimentary coding capabilities. So we're able to ask it, you know, to implement addition

Starting point is 01:38:52 function between two numbers. And indeed, it can write Python or JavaScript code for that. And then we thought, we might as well just go full steam ahead and try to create a system that is actually good at what we are doing every day ourselves, which is programming. We optimize models for proficiency in coding. We actually even created models that both have a comprehension of language and codex is API for these models. So it's first pre-trained on language and then I don't know if you can say fine tuned because there's a lot of code but it's language in code. It's language in code. It's also optimized for various things like let's say low latency and so on.

Starting point is 01:39:41 Codex is the API, the similar to GPT-3. We expect that there will be proliferation of the potential products that can use coding capabilities and I can speak about it in a second. Copilot is a first product and developed by GitHub. So as we're building models, we wanted to make sure that these models are useful. And we work together with GitHub on building the first product. Copilot is actually, as you code, it suggests you code completions. And we have seen in the past, there are like a various tools that can suggest how to, like a few characters of the code

Starting point is 01:40:18 or the line of code. The thing about Copilot is it can generate 10 lines of code. You, it's often the way how it works is you often write in the comment what you want to happen because people in comments they describe what happens next. So, this day is when I code instead of going to Google to search for the appropriate code to solve my problem, I say, oh, for this array, could you smooth it? And then, you know, it imports some appropriate libraries and they use this numpy convolution or so, I that I was not even aware that exists. Many does that appropriate thing. So you, you write a comment, maybe the header of a function and it completes the function. Because you don't know what is the space of all the

Starting point is 01:41:05 possible small programs that can generate. What are the failure cases, how many edge cases, how many subtle errors there are, how many big errors there are. It's hard to know, but the fact that it works at all and a large number of cases is incredible. It's like a it's a kind of search engine into code that's been written on the internet correct, so for instance When you search things online then usually you get to the some particular case like if you go to stack overflow and people describe the one particular situation And then they seek for a solution, but in case of

Starting point is 01:41:43 Copilot it's aware of your entire context and in context is, oh, these are the libraries that they are using. That's the set of the variables that is initialized. And on the spot, it can actually tell you what to do. So the interesting thing is, and we think that the Copilot is one possible product using Copilot, but there is a place for many more. So internally, we tried out to create other fun products.

Starting point is 01:42:09 So it turns out that a lot of tools out there, let's say Google Calendar or Microsoft Word or so, they all have an internal API to build plugins around them. So there is a way, in the sophisticated way, to control calendar, or Microsoft Word. Today, if you want more complicated behaviors from these programs, you have to add a new button for every behavior. But it is possible to use Codex and tell, for instance, to calendar, could you schedule an appointment with Lex next week after 2pm and eat the right corresponding piece of code?

Starting point is 01:42:48 And that's the thing that actually you want so interesting So the you figure out is there's a lot of programs with which you can interact through code So there you can generate that code from natural language That's fascinating and that's somewhat like also closest to what was the promise of Siri or Alexa. So previously all these behaviors, they were had hard coded. And it seems that codex on the fly

Starting point is 01:43:15 can pick up the API of, let's say, given software. And then it can turn language into use of this API. So without hard coding, you can find, it can translate to machine language, corrected to a, so for example, this would be really exciting for me, like for Adobe products like Photoshop, which I think action script,

Starting point is 01:43:37 that I think there's a scripting language that communicates with them same with Premiere. And do you could imagine that that allows event to do coding by voice on your phone? So for instance, in the past, as of today, I'm not editing word documents on my phone because it's just the keyboard is too small. But if I would be able to tell to my phone, you know, make the header large, then move the paragraphs around, and it does actually what I want. So I can tell you one more cool thing,

Starting point is 01:44:07 or even how I'm thinking about codecs. So if you look actually at the evolution of computers, we started with very primitive interfaces, which is a punch card, and punch card, so shall you make a holes in the plastic card to indicate zeros and ones. And during that time, there was a small number of specialists who were able to use computers. And by the way, people even suspected that there is no need for many more people to use computers. But then we moved from punch cards to at first assembly and see.

Starting point is 01:44:44 And these programming languages, they were slightly higher level. They allowed many more people to code. And they also led to more of a proliferation of technology. And further on, there was a jump to say from C++ to Java and Python. And every time it has happened, more people are able to code. And we build more technology. And it's even, you know, hard to imagine now is someone will tell you that you should write code in assembly instead of, let's say, Python or Java or Java script. And code exists yet another step toward kind of bringing computers closer to humans, such that you communicate with a computer with your own language

Starting point is 01:45:28 rather than with a specialized language. And I think that it will lead to increase a number of people who can code. Yeah, and the kind of technologies that those people will create as it's a new mobile, it could be a huge number of technologies we're not predicting at all, because that's less and less requirement of having a technical mind, a programming mind. You're not opening it to the world of other kinds of minds, creative minds, artistic minds,

Starting point is 01:46:00 all that kind of stuff. I would like, for instance, biologists who work on DNA to be able to program and not to need to spend a lot of time learning it. And I believe that's a good thing to the world. And I would actually add, I would add, so at the moment, I'm a managing codex team and also language team. And I believe that there is like a plenty of brilliant people out there. And they should apply.

Starting point is 01:46:22 Oh, okay, yeah, awesome. So what's the language in the codex? really on people out there. And they should apply. Oh, okay, yeah, awesome. So what's the language in the codex? So those are kind of their overlapping teams. It's like GBT, the raw language, and then the codex is like applied to programming. Correct, and they are quite intertwined. There are many more teams involved

Starting point is 01:46:40 making these models extremely efficient and deployable. For instance, there are people who are working to make our data centers amazing, or there are people who work on putting these models into production, or even pushing it at the very limit of the scale. So all aspects from the infrastructure to the actual machine. So I'm just saying there are multiple things while the team working on Codex and language, I guess I'm directly managing them.

Starting point is 01:47:12 I would love to hire more. Yeah, if you're interested in machine learning, this is probably one of the most exciting problems and like systems to be working on. It's actually, it's pretty cool. Like what the program synthesis, the generating of programs is very interesting, very interesting problem that has echoes of reasoning and intelligence in it.

Starting point is 01:47:37 And I think there's a lot of fundamental questions that you might be able to sneak up to by generating programs. Yeah, one more exciting thing about the programs is that so I said that the, you know, the in case of language that one of the travels is even evaluating language. So when the things are made up, you'll, you need somehow either a human to say

Starting point is 01:48:01 that this doesn't make sense. Or so in case of program, there is one extra lever that we can actually execute programs and see what they evaluate to. So the process might be somewhat more automated in order to improve the qualities of generations. That's fascinating. So like the, wow, that's really interesting.

Starting point is 01:48:22 So for the language, the simulation to actually execute it, that's a human mind. Yeah. For programs, there is a computer on which you can evaluate it. Wow, that's a brilliant little insight that the thing compiles and runs. That's first and second you can evaluate on a do automated unit testing. And in some sense, it seems to mean that we'll be able to make a tremendous progress.

Starting point is 01:48:54 We are in the paradigm that there is way more data and there is like a transcription of millions of software engineers. Yeah. Yeah. Yeah. So, I mean, you just mean because I was going to ask you about reliability. The thing about programs is you don't know if they're going to, like, a program that's controlling

Starting point is 01:49:17 a nuclear power plant has to be very reliable. So I wouldn't start with controlling nuclear power plant. I'd be one day, but that's not actually, that's not on the current roadmap. That's not the step one. You know, it's the Russian thing. You just want to go to the most powerful destructive thing right away, run by JavaScript, but I got you.

Starting point is 01:49:37 So it's a lower impact, but nevertheless, when you make me realize it is possible to achieve some lows of reliability by doing testing. And you could imagine that maybe there are ways for a model to write even code for testing itself and so on. And there are ways to create the feedback loops that the model could keep on improving. By writing programs, they generate tests. For instance. For instance. For instance.

Starting point is 01:50:05 And that's how we get consciousness because it's met a compression. That's what you're going to write. That's the prompt that generates consciousness. Compressor of compressors. You just write that. Do you think the code that generates consciousness will be simple? So let's see. I mean, ultimately the core idea behind will be simple. So let's see, I mean, ultimately the core idea behind will be simple, but there

Starting point is 01:50:27 will be also decent amount of engineering involved. Like in some sense, it seems that, you know, spreading these models on many machines, it's not that trivial. And we find all sorts of innovations that make our models more efficient. I believe that first models that I guess are conscious of like a truly intelligent, they will have all sorts of tricks. But then again, there's a which is certain argument that maybe the tricks are temporary thing. Yeah, they might be temporary things. And in some sense, it's also even important to know that even the cost of a trick. So sometimes people are eager to put the trick while forgetting that there is a cost of maintenance or like long term cost.

Starting point is 01:51:21 Long term cost or maintenance or maybe even flexibility of code to actually implement new idea. So even if you have something that gives you 2x but it requires you know 1000 lines of code, I'm not sure if it's actually worth it. So in some sense, you know, if it's 5 lines of code and 2x, I would take it. And we see many of this, but also that requires some level of, I guess, lack of attachment to code that we are willing to remove it. Yeah. So you led the OpenAI robotics team.

Starting point is 01:51:57 Can you give an overview of the cool things you're able to accomplish, what are you most proud of? So when we started robotics, we knew that actually reinforcement learning works and it is possible to solve very complicated problems. Like for instance, AlphaGo is an evidence that it is possible to build superhuman and gall players. Delta too is an evidence that is possible to build superhuman agents playing data. So I asked myself a question, you know, what about robots out there? Could we train machines to solve arbitrary task in the physical world?

Starting point is 01:52:34 Our approach was, I guess, let's pick a complicated problem that, if we would solve it, that means that we made some significant progress in the domain, and then we went after the problem. So, we noticed that actually the robots out there, they are kind of at the moment optimized per task. So you can have a robot that it's like, if you have a robot opening a battle, it's very likely that the end factor is a battle opener. And in some sense, that's a hack to be able to solve a task, which makes any task easier. And ask myself, so what would be a robot

Starting point is 01:53:12 that can actually solve many tasks? And we conclude that a human hands have such a quality that indeed, they are, you know, you have five kind of tiny arms attached individually, they can manipulate pretty broad spectrum of objects. So we went after a single hand, like I tried to solve Ruby, skipping single-handed. We picked this task because we thought that there is no way to hard-code it. And as also we picked a robot on which it would be hard to hard-coded and we went after the solution such that it could generalize to other problems and just to clarify it's one

Starting point is 01:53:53 robotic hand solving the Rubik's cube. The hard part is in the solution to the Rubik's cube. It's the manipulation of like having it not fall out of the hand, having it use the five baby arms to what is it, like rotate different parts that Ruby scooped to achieve the solution. Correct. Yeah. So what was the hardest part about that? What was the approach taken there?

Starting point is 01:54:19 What are you most proud of? Obviously, we have like a strong belief in reinforcement learning. And one path it strong belief in reinforcement learning. And you know, one path it is to do reinforcement learning in the real world. Other path is to the simulation. In some sense, the tricky part about the real world is at the moment, how models they require a lot of data. There is essentially no data. And I think we decided to go through the path of the simulation and

Starting point is 01:54:46 in simulation, you can have infinite amount of data. The tricky part is the fidelity of the simulation. And also, can you in simulation represent everything that you represent otherwise in the real world? And you know, it turned out that, you know, because there is lack of fidelity, it is possible to what we arrived at is training a model that doesn't solve one simulation, but it actually solves the entire range of simulations, which are very, in terms of like, what's exactly the friction of the Q-Port-A-Wade or so, and the single AI that can solve all of them

Starting point is 01:55:26 ends up working well with the reality. How do you generate the different simulations? So, you know, there's plenty of parameters out there. We just pick them randomly, and an in simulation model just goes for thousands of years, and keeps on solving group X-Cube in each of them. And the thing is the neural network that we used, it has a memory, and as it presses, for instance, the side of the cube, it can sense,

Starting point is 01:55:54 oh, that's actually this site was difficult to press. I should press it stronger, and throughout this process kind of learns even how to solve this particular instance of the Ruby Fubic Event Mass. It's kind of like a, you know, sometimes when you go to a gym and after bench press, you try to lift the glass and you kind of forgot and your hand goes like a half upright away because kind of you got used to maybe different weight and it takes a second to adjust. Yeah.

Starting point is 01:56:32 And this kind of of memory, the model gains through the process of interacting with the QVN simulation. I appreciate you speaking to the audience with the bench press, all the bros in the audience, probably working out right now. There's probably somebody listening to this actually doing bench press. So maybe put the bar down and pick up the water bottle and you'll know exactly what check is talking about. Okay, so what was the hardest part of getting a whole thing to work? So the hardest part is at the moment when it comes to physical work, when it comes to robots, they require maintenance. It's hard to replicate them a million times.

Starting point is 01:57:18 It's also hard to replay things exactly. I remember this situation that one guy at our company, he had a model that performs way better than other models in solving Rubik's field. And you know, he kind of didn't know what's going on, why is that? And it turned out that, you know, he was running it from his laptop that had better CPU, or maybe local GPU as well. And because of that, there was less of a latency, and the model was the same.

Starting point is 01:57:54 And that actually made solving Ruby XCube more reliable. Saying some sense, there might be some saddlebacks like that when it comes to running things in the real world. Even hinting on that, you could imagine that the initial models, There might be some saddlebacks like that when it comes to running things in the real world. Even hinting on that, you could imagine that the initial models you would like to have models which are insanely huge neural networks and you would like to give them even more time for thinking. When you have these real time systems, then you might be constrained actually by the amount

Starting point is 01:58:23 of latency. Ultimately, I would like to build a system that it is worth for you to wait five minutes, because it gives you the answer that you are willing to wait for five minutes. So latency is a very unpleasant constraint under which to operate? Correct. Also, there is actually one more thing which is tricky about robots. There is actually no not much data. So the data that I'm speaking about would be a data of first person experience from the robot and like a gigabytes of data like that. Involved half gigabytes of data like that of robots solving parties problems. It would be very easy to make a progress on robotics.

Starting point is 01:59:05 And you can see that in case of text or code, there is a lot of data like a first person perspective data on writing code. Yeah, so you had this, you mentioned this really interesting idea that if you were to build like a successful robotics company, so OpenAid's mission is much bigger than robotics. This is one of the things you've worked on. But if it was a robotics company, they you wouldn't so quickly dismiss supervised learning. Correct. You would build a robot that was perhaps

Starting point is 01:59:40 what, like an empty shell like dumb and they would operate under totally operation. So you would invest that's just one way to do it. Invest in human super like direct human control of the robots as it's learning and over time add more and more automation. That's correct. So let's say that's how I would build robotics company today. If I would be building robotics company, which is spent 10 million dollars or so recording human trajectories controlling a robot. After you find a thing that the robot should be doing, there's a market fit for it.

Starting point is 02:00:17 You can make a lot of money with that product. Correct. Yeah. So I would record data and then I would essentially train supervised learning model on it. That might be the path today. Long term, I think that actually what is needed is to train powerful models over video. So you have seen maybe a models that can generate images like Dali. And people are looking into models generating videos.

Starting point is 02:00:46 They're like a part of this algorithmic question is even how to do it. And it's unclear if there is enough compute for this purpose. But I suspect that the models that which would have a level of understanding of video, same as GPT has a level of understanding of text, could be used to train robots to solve tasks. They would have a lot of common sense. If one day, I'm pretty sure one day, there will be a robotics company, by robotics company, I mean the primary source of income is from robots that is worth over $1 trillion.

Starting point is 02:01:27 What do you think that company will do? I think sell driving cars now. It's interesting because my mind went to personal robotics, robots in the home. It seems like there's much more market opportunity there. I think it's very difficult to achieve. I mean, this might speak to something important, which is I understand self-driving much better than understand robotics in the home.

Starting point is 02:01:52 So I understand how difficult it is to actually solve self-driving to a level, not just the actual computer vision and the control problem and just the basic problem of self-driving, but creating a product that would undeniably be that would cost less money, like it would save you a lot of money, like, or it is the magnitude less money that could replace Uber drivers, for example. So car sharing is autonomous. They create a similar or better experience in terms of

Starting point is 02:02:23 how quickly you get from A to B or just whatever, the pleasantness of the experience, the efficiency of the experience, the value of the experience, and at the same time, the car itself costs cheaper. I think that's very difficult to achieve. I think there's a lot more low-hanging fruit in the home. That could be. I also want to give you a perspective on home. That could be. I also want to give you perspective on how challenging it would be at home or maybe kind of depends on the exact problem

Starting point is 02:02:53 that you'd be solving. Like if we're speaking about these robotic arms and hence these things, they cost tens of thousands of dollars or maybe a hundred k and you know maybe obviously maybe if there would be economy of scale these things would be cheaper but actually for any household to buy the price would have to go down to maybe a thousand bucks. Yeah, I personally think that so self-driving car provides a clear service. I don't think robots in the home, they'll be a trillion dollar company will just be all about service, meaning it will

Starting point is 02:03:32 not necessarily be about like a robotic arm that helps you, I don't know, open a bottle or watch the dishes or any of that kind of stuff. it has to be able to take care of the whole, the therapist thing you mentioned. I think that's, of course, there's a line between what is a robot, what is not, like, does really need a body, but, you know, some AI system was some embodiment, I think. So the tricky part, when you think actually what's the difficult part is, think. So the tricky part when you think actually what the difficult part is when the robot has like when there is a diversity of the environment with which the robot has to interact that becomes hard. So you know, on one spectrum, you

Starting point is 02:04:14 have industrial robots as they are doing over and over the same thing, it is possible to some extent to prescribe the movements. And with very small amount of intelligence, the movement can be repeated millions of times. Also, you know, various pieces of industrial robots where it becomes harder and harder. I kept, for instance, in case of Tesla, maybe a matter of putting a rack inside of a car.

Starting point is 02:04:42 And, you know, because the rack kind of moves around, it's not that easy, it's not exactly the same every time. Then it's having the case that you need actually humans to do it. Well, welding cars together, it's a very repetitive process. Then in case of self-driving itself, the difficulty has to do with the diversity of the environment, but still the card itself.

Starting point is 02:05:08 The problem that you're solving is you try to avoid even interacting with things. You're not touching anything around, because touching itself is hard. And then if you would have in the home robot that has to touch things, and like if these things they change the shape, if there is a huge variety of things to be touched, then that's difficult. If you are speaking about the robot, which there is head that is smiling in some way with cameras, that if it doesn't touch things, that's relatively simple. So to both agree and to push back. So you're referring to touch, like, agree and to push back. So you're referring to touch like soft robotics, like the actual touch.

Starting point is 02:05:54 But I would argue that you could formulate just basic interaction between like non-contact interaction is also kind of touch. And that might be very difficult to solve. That's the basic, not the agreement, but that's the basic open question to me, with self-driving cars and the agreement with Elon, which is how much interaction is required to solve self-driving cars, how much touch is required. You said that in your intuition touch is not required. In my intuition to create a product that's compelling to use, you're going to have to interact with pedestrians, not just avoid pedestrians, but interact with them.

Starting point is 02:06:36 When we drive around in major cities, we're constantly threatening everybody's life with our movements. And that's how they respect us. There's a game they're ready going out with pedestrians. And I'm afraid you can't just formulate autonomous driving as a collision avoidance problem. So I think it goes beyond, like a collision avoidance is the first order approximation. But then, at least in case of testa, they are gathering data from people driving their cars. And I believe that's an example of supervised learning data that they can train their models on. And they are doing it, which can give a model

Starting point is 02:07:12 dislike another level of behavior that this needed to actually interact with their real world. Yeah, it's interesting how much data is required to achieve that Well, what do you think of the whole Tesla autopilot approach the computer vision based approach with multiple cameras And as a data engine it's a multi-task multi-headed neural network and is this fascinating process of Similar to what you're talking about with the robotics approach what you're talking about with the robotics approach, which is you deploy your own network and then there's humans that use it.

Starting point is 02:07:49 And then it runs into trouble in a bunch of places and that stuff is sent back. So like the deployment discovers a bunch of edge cases and those edge cases is sent back for supervised annotation thereby improving your own network. And that's deployed again. It goes over and over until the network becomes really good at the task of driving, becomes safer and safer. We need to get that kind of

Starting point is 02:08:12 approach to robotics. I believe that's the way to go. So in some sense, even when I was speaking about collecting trajectories from humans, that's like a first step, and then you deploy the system, and then you have humans revising all the issues. And in some sense, like at this approach, converges to a system that doesn't make mistakes, because for the cases where there are mistakes, you got their data, how to fix them, and the system will keep on improving. So there's a very, to me, difficult question of how long that converging takes, how hard it is. The other aspect of autonomous vehicles probably applies to certain robotics applications

Starting point is 02:08:52 is society. They put, as the quality of the system converges, so one there's a human factors perspective of psychology, humans being able to supervise those, even with tele-operation, those robots. And the other is society willing to accept robots. Currently society is much harsher on self-driving cars than it is on human-driven cars, in terms of the expectation of safety. So the bar is set much higher than for humans.

Starting point is 02:09:20 And so if there's a death in a autonomous vehicle that's seen as much more Much more dramatic than a death in a human driven vehicle Part of the success of deployment of robots is freaking out how to make robots part of society Both on the just a human side on the media journalist side and also on the policy government side. And that seems to be, maybe you can put that into the objective function to optimize. But that is definitely a tricky one. And I wonder if that is actually the trickiest part for self-driving cars or any system that's safety critical. It's not the algorithm, it's the society accepting it.

Starting point is 02:10:13 Yeah, I would say I believe that the part of the process of deployment is actually showing people that they give and things can be trusted. And trust is also like a glass that is actually really easy to crack it and damage it. And I think that's actually very common with innovation that there is some resistance toward it. And it's just a natural progression. So in some sense people will have to keep on proving that indeed these systems are for being used. And I would say I also found out that often the best way to convince people is by letting them experience it. Yeah, absolutely. That's the case with Tesla Autopilot, for example. That's the case with, yeah, with basically robots in general.

Starting point is 02:11:01 It's kind of funny to hear people talk about robots. There's a lot of fear, even with like, legged robots. But when they actually interact with them, there's joy. I love interacting with them. And the same with the car. With the robot, if it starts being useful, I think people immediately understand. And if the product is designed well, they fall in love. You're right. It's actually even similar when I'm thinking about Copilot, the Geek Hub Copilot.

Starting point is 02:11:30 There was a spectrum of responses that people had. And ultimately, the important piece was to let people try it out. And then many people just loved it. Especially like programmers. Yeah, programmers. But like some of them, they came Especially like programmers. Yeah programmers but like some of them you know they came with a fear. Yeah. But then you tried out and you think actually that's cool. And you know you can try to resist the same way as you know you could resist moving from punch cards to let's say

Starting point is 02:11:58 C++ or so. And it's a little bit futon. So we talked about generation of program, generation of language, even self-supervised learning in the visual space for robotics and then reinforcement learning. What's you in like this whole beautiful spectrum of AI? Do you think is a good benchmark, a good test to strive for, to achieve intelligence. That's a strong test of intelligence. It started with Alan Turing and the Turing test. Maybe you think natural language conversation is a good test. So, you know, it would be nice if, for instance, machine would be able to sell three month hypothesis in math. That would be, I think that would be very impressive. So theorem proving, is that to you,

Starting point is 02:12:49 proving theorems is a good, oh, like one thing that the machine did, you would say damn, exactly. Okay, that would be quite impressive. I mean, the tricky part about the benchmarks is, you know, as we are getting closer, we have to invent new benchmarks. There is actually no ultimate benchmark out there. Yeah, see, my thought with the RIMON hypothesis would be the moment the machine proves it, would say, okay, well, then the problem was easy. That's what happens. And I mean, in some sense,

Starting point is 02:13:23 that's actually what happens over the years in AI that like we get used to things very quickly. You know something, I talk to Rodney Brooks. I don't know if you know that is. He called Alpha Zero homework problem. Because he was saying like there's nothing special about it. It's not a big leap. And I didn't, well, he's coming from one of the aspects that we referred to is he was part of the founding of I robot

Starting point is 02:13:47 Which deployed now tens of millions of robot in the home. So if you see robots There are actually in the homes of people as the legitimate Instantiation of artificial intelligence and yes Maybe an AI that plays a silly game like Go and Chess is not a real accomplishment, but to me it's a fundamental leap, but I think we assume it's then say, okay, well then that game of Chess or Go wasn't that difficult compared to the thing that's currently unsolved. So my intuition is that from perspective of the evolution of these AI systems, we'll at first see the tremendous progress in digital space. The main thing about digital space is also that

Starting point is 02:14:32 everything is, there is a lot of recorded data. Plus, you can very rapidly deploy things to billions of people. While in case of physical space, the deployment takes multiple years. You have to manufacture things and you know, delivering it to actual people, it's very hard. So, I'm expecting that first and the prices in digital space of goods they would go down to, let's say, March, in Alcalc are 2-0. And also the question is how much of our life will be in digital, because it seems like

Starting point is 02:15:08 we're heading towards more and more of our lives being in the digital space. So innovation in the physical space might become less and less significant. Like why do you need to drive anywhere if most of your life is spend in virtual reality? I still would like, you know, to at least at the moment, my impression is that I would like to have a physical contact with other people and that's very important to me. We don't have a way to replicate it in the computer. It might be the case that over the time it will change. Like, 10 years from now. Why not have like an arbitrary infinite number of people you can interact with, some of them are real, some are not, with arbitrary characteristics

Starting point is 02:15:48 that you can define based on your preferences. I think that's maybe where we are heading and maybe I'm resisting the future. Yeah. I'm telling you, I, if I got to choose, if I could live in Elder Scrolls Skyrim versus the real world, I'm not so sure I would stay with the real world.

Starting point is 02:16:10 Yeah, I mean, the question is, will VR be sufficient to get us there, or do you need to, you know, black electrodes in the brain? Yeah. And it would be nice if these electrodes wouldn't be invasive. Yeah. Or at least, like, probably non-destructive. But in a digital space, do you think we'll be able to solve the touring test,

Starting point is 02:16:33 the spirit of the touring test, which is, do you think we'll be able to achieve compelling natural language conversation between people, like, have friends that are AI systems on the internet. I thought all I think is doable. Do you think the current approach to GBT will take us there? There is the part of at first learning all the content out there, and I think that still system should keep on learning as it speaks with you. And I think that should work. That question is how exactly to do it and you know obviously We have people at the open air asking these questions and

Starting point is 02:17:10 The kind of at first pre-training on all existing content is like a backbone. And is a decent backbone Do you think AI needs a body Connecting to our robotics question to truly connect with humans or can most of the connection be in the digital space? So let's see we know that there are people who met each other online and they felt in love Yeah, so it seems that it's conceivable to establish connection which is purely true internet to establish connection which is purely through internet.

Starting point is 02:17:46 And of course, it might be more compelling the more modalities you add. So it would be like your proposal like a tender, but for AI, you like swipe right left and have the systems or AI and the other is humans and you don't know which is which. That would be our formulation of Turing test. The moment AI is able to achieve more swipe right or left,

Starting point is 02:18:11 whatever, the moment it's able to be more attractive than other humans, it passes the Turing test. Then you would pass the Turing test in attractiveness. It's right. Well, no, like attractiveness just to clarify. That would be conversation. Not just visual, right? It's also attractiveness with wit and humor and whatever, whatever makes conversation pleasant for humans.

Starting point is 02:18:35 Okay. All right. So, so you're saying it's possible to achieve in a visual space. In some sense, I would almost ask that question, why wouldn't that be possible? Right. Well, I have this argument with my dad all the time, he thinks that touch and smell are really important. So they can be very important. And I'm saying the initial systems they won't have it,

Starting point is 02:18:59 still I wouldn't, I can't, there are people being born without these senses. And, you know, I believe that they can still fall in love and have meaningful life. Yeah, I wonder if it's possible to go close to all the way by just training on transcripts of conversations. Like, I wonder how far that takes us. So I think that actually still you want images. I would like, so I don't have kids, but like I could imagine that having AI tutor,

Starting point is 02:19:30 it has to see kids drawing some pictures on the paper and then facial expressions, all that kind of stuff. We use dogs and humans, use their eyes and to communicate with each other. I think that's a really powerful mechanism of communication. Body language too, that words are much lower bandwidth. And for body language, we still, you know, we kind of have a system that displays an image of its artificial expression on the computer. It doesn't have to move you know, mechanical pieces or so. So I think that,

Starting point is 02:20:04 now that there is like kind of a progression, you can imagine that text might be the simplest to tackle, but this is not a complete human experience at all. You expand it to let's say images both for input and output. And what you describe is actually the final, uh, I guess frontier, what makes us human the fact that we can touch each other or smell or so. And it's the hardest from perspective of data and deployment. And I believe that these things might happen gradually. Are you excited by that possibly? This particular application of human to AI friendship and interaction. So let's see. Like would you do look forward to a world you said you're living with a few

Starting point is 02:20:52 folks and you're very close friends with them. Do you look forward to a day where one or two of those friends are AI systems? So if the system would be truly wishing me well, rather than being in the situation that it optimizes for my time to interact with the system. The line between those is a grey area. I think that's the distinction between love and possession. And these things they might be often correlated for humans, but it might find that there are like some friends with whom you haven't spoke for four months.

Starting point is 02:21:30 And then you pick up the phone, it's as the time hasn't passed. They are not holding to you. And I wouldn't like to have AI system that it's trying to convince me to spend time with it. I would like the system to optimize for what I care about and help me in achieving my own goals. But there's some, I mean, I don't know, there's some manipulation, there's some possessiveness, there's some insecurities, there's fragility, all those things are necessary to form a close friendship over time, to go to some dark shit together, some bliss and happiness together.

Starting point is 02:22:10 I feel like there's a lot of greedy self-centered behavior within that process. My intuition, but I might be wrong, instead, human computer interaction doesn't have to go through a computer being a greedy, possessive, and so on. It is possible to train systems maybe that they actually, you know, they are, I guess, prompt that or fine tune or so to truly optimize for what you cared about. And you could imagine that, you know, the way how the process would look like is at some point, we as a human, we look at the transcript of the conversation or like an entire interaction and we say, actually here there was more loving way to go about it and we supervised system toward being more

Starting point is 02:22:59 loving or maybe we trained the systems such that it has a reward function toward being more loving. Yeah. Or maybe the possibility of the system being an asshole and manipulative and possessive every once in a while is a feature, not a bug. Because some of the happiness that we experience when two souls meet each other, when two humans meet each other is a kind of break from the assholes in the world. And so you need assholes in AI as well,

Starting point is 02:23:31 because it'll be like a breath of fresh air to discover an AI that the three previous AIs you had, are it too friendly? Are, no, or cruel or whatever. It's like some kind of mix, And then this one is just right. But you need to experience the full spectrum. And I think you need to be able to engineer assholes. So, I see. Because there's some level to us being appreciate, to appreciate the human experience.

Starting point is 02:24:02 We need the dark and light. So that kind of reminds me, I met a while ago at the Meditation Retreat, one woman, and a beautiful woman, and she had the crush. She had the trouble walking on one deck. I asked her what has happened. And she said that five years ago she was in Maui, Hawaii, and she was eating a salad and some snail fell into the salad and apparently there are neurotoxic snails over there and she got into coma for a year and Apparente there is you know high chance of event just dying, but she was in the coma at some point She regained partially consciousness. She was able to hear people in the room People behave as she wouldn't be there

Starting point is 02:25:07 No, at some point she started being able to speak, but she was mumbling. I could barely able to express herself. At some point, she got into a wheelchair. Then at some point, she actually noticed that she can move her toe. Then she knew that she would be able to walk. Then, you know, that's where she was five years after. She said that since then, she appreciates the fact that she can move her toe. And I was thinking, hmm, do I need to go through such experience to appreciate that I have, I can move my toe. Well, that's really good story, really deep example, yeah. And in some sense, it might be the case that we don't see light if we haven't went through the darkness, but I wouldn't say that we shouldn't assume that that's a case.

Starting point is 02:25:51 It may be able to engineer shortcuts. Yeah, Ilya had this belief that maybe one has to go for a week or six months to some challenging camp. Yeah. To just experience a lot of difficulties. And then comes back and actually everything is bright, everything is beautiful. I'm with Ilya. And it must be a Russian thing. Where are you from originally? I'm Polish. Polish. Okay. I'm tempted to say that explains a lot, but yeah, there's something about the Russian, the

Starting point is 02:26:25 nest necessity of suffering. I believe suffering or rather struggle is necessary. I believe that struggle is necessary. I mean, in some sense, you even look at the story of any superhero in a movie. It's not that it was like, every guy could go easy, easy, easy, easy. I'm like, oh, that's your ground truth. It's the story of superheroes. Okay. You mentioned that you used to do research at night and go to bed at like 6 a.m. or

Starting point is 02:26:53 7 a.m. I still do that often. What sleep schedules have you tried to make for a productive and happy life? Is there, is there some interesting wild sleeping patterns that you engage that you found that works really well for you? I tried at some point decreasing number of hours of sleep like a gradually, half an hour every few days to this. You know, I was hoping to just save time. That clearly didn't work for me.

Starting point is 02:27:22 I got some point, there's like a face shift and I felt tired all the time. That clearly didn't work for me. I got some point. There's like a face shift and I felt tired all the time. You know, there was a time that I used to work during the nights. The nice thing about the nights is that no one disturbs you. And even I remember when I was meeting for the first time with Greg Brockman, his CTO and Chairman of OpenAI. Our meeting was scheduled to 5pm and I overstepped for the meeting. Overslap for the meeting at 5pm. Now you sound like me, that's hilarious, okay? At the moment, in some sense, my sleeping schedule also has to do with the fact that I'm interacting with people. I sleep without an alarm.

Starting point is 02:28:08 So, yeah, the team thing you mentioned, an extrovert thing, because most humans operate during a certain set of hours, you're forced to then operate at the same set of hours. But I'm not quite there yet. I found a lot of joy, just like you said, working through the night because it's quiet because the world doesn't disturb you. And there's some aspect counter to everything you're saying. There's some joyful aspect to sleeping through the mess of the day because people are having meetings and sending emails and there's drama meetings. I can sleep through all the meetings. You know, I have meetings every day and they prevent me from having sufficient amount of time for a focus work.

Starting point is 02:28:56 And then I modified my calendar and I said that I'm out of office Wednesday, Thursday and Friday every day and I'm having meetings only Monday and Tuesday and that buss the positively influenced my mood that I have literally like a three days for fully focused work. Yeah. So there's better solutions to this problem than staying awake all night. Okay. You've been part of development of some of the greatest ideas and artificial intelligence. What would you say is your process

Starting point is 02:29:26 for developing good novel ideas? You have to be aware that clearly there are many other brilliant people around. So you have to ask yourself a question, why they give an idea? Let's say it wasn't tried by someone else. In some sense, it has to do with, you know, it might sound simple,

Starting point is 02:29:51 but like I'm thinking outside of the box and what I mean here. So for instance, for a while, people in academia, they assumed that you have a fixed dataset, and then you optimize the algorithms in order to get the best performance and that was so in great assumption that no one thought about training models on anti-internet or like that maybe some people thought it, but it felt too many as unfair. And in some sense, that's almost like a,

Starting point is 02:30:31 it's not my idea or so, but that's an example of breaking a typical assumption. So you want to be in the paradigm that you're breaking a typical assumption. In the context of the AI community, getting to pick your data set as cheating. Correct. And in some sense, so that was assumption that many people had out there. And then if you free yourself from assumptions, then the other likely to achieve something that others cannot do.

Starting point is 02:31:01 And in some sense, if you are trying to do exactly the same things as others, it's very likely that you're trying to do exactly the same thing as others, it's very likely that you're going to have the same results. Yeah. But there's also that kind of tension, which is asking yourself the question, why haven't others done this? Because I mean, I get a lot of good ideas, but I think probably most of them suck when they meet reality. So actually, I think the other big piece is getting into habit of generating ideas,

Starting point is 02:31:35 training your brain to are generating ideas, and not even suspending judgment of the ideas. So in some sense, I noticed myself that even if I'm in the process of generating ideas, if I tell myself, oh, that was a bad idea, then that actually interrupts the process and I cannot generate more ideas because I'm actually focused on the negative part of why it won't work. But I created also an environment in the way that it's very easy for me to store new ideas. So for instance, next to my bed, I have a voice recorder and it happens to me often, like I wake up in the during the night and I have some idea. In the past, I was

Starting point is 02:32:19 writing them down on my phone, but that means turning off the screen and that wakes me up or like pulling it at paper which requires you know turning on the light. These days I just start recording it. What do you think I don't know if you know who Jim Keller is? I know Jim Keller. He's a big proponent of thinking harder on a problem right before sleep so that he can sleep through it and solve it in a sleep, or like come up with some radical stuff in a sleep, you're trying to get me to do this. So, it happened from my experience perspective. It happened to me many times during the high school days when I was doing mathematics,

Starting point is 02:33:01 that I had the solution to math problem as I woke up. At the moment, regarding thinking hard about the equipment problem is I'm trying to actually devote substantial amount of time to think about important problems, not just before the sleep. I'm organizing amount of the huge chunks of time, such that I'm not constantly working on the urgent problems, but I actually have time to think about the important one. So you do it naturally. But his idea is that you prime your brain to make sure that that's the focus. Often times people have other worries in their life that's not fundamentally deep problems.

Starting point is 02:33:39 I don't know. Just stupid drama in your life and even at work, all that kind of stuff. He wants to kind of pick the most important problem that you're thinking about and go to bed on that. I think that's wise. I mean, the other thing that comes to my mind is also, I feel the most fresh in the morning. So during the morning, I tried to work on the most important things rather than just being pulled by urgent things or checking email or so.

Starting point is 02:34:09 What do you do with the, because I've been doing the voice recorder thing too, but I end up recording so many messages. It's hard to organize. I have the same problem. Now I have heard that Google Pixel is really good in transcribing text, and I might get a Google pixel just for the sake of transcribing text. People who listen to this, if you have a good voice recorder suggestion that transcribes, please let me know. Some of it has to do with open AI codex too.

Starting point is 02:34:38 Some of it is simply the friction. I need apps that remove that friction between voice and the organization of the resulting transcripts and all that kind of stuff. But yes, you're right. Absolutely. Like during, for me, it's walking, sleep, too, but walking and running, especially running, get a lot of thoughts during running. And there's no good mechanism for recording thoughts. So one more thing that I do, I have a separate phone which I which has no apps, maybe it says like audible or let's say no one has this phone number, this kind of my meditation phone. Yeah. And I try to expand the amount of time that that's the phone that I'm having.

Starting point is 02:35:26 It has also Google Maps if I need to go summer. And I also use this phone to write down ideas. Ah, that's really good idea. That's a really good idea. Often actually what I end up doing is even sending a message from that phone to the other phone. So that's actually my way of recording messages. All right, just put them into notes.

Starting point is 02:35:46 I love it. What advice would you give to a young person? High school, college, about how to be successful. You've done a lot of incredible things in the past decade. So maybe maybe you have some... There might be something. There might be something. maybe maybe some something. There might be something. There might be something. I mean, my son, I can simply stick herself. But I would say literally just follow your passion double down on it.

Starting point is 02:36:14 And if you don't know what's your passion, just figure out what could be a passion. So the step might be an exploration. When I was in elementary school, was math and chemistry. And I remember for some time, I gave up on math because my school teacher, she told me that I'm dumb. And I guess maybe an advice would be just ignore people if they don't do that.

Starting point is 02:36:41 You're dumb. You mentioned something offline about chemistry and explosives. What was that about? So let's see. So a story goes like that. I can't. I got into chemistry, maybe I was like a second grade of my elementary school third grade.

Starting point is 02:37:02 I started going to chemistry classes. I really love building stuff. And I did all the experiments that they describe in the book, how to create oxygen with vinygaren and baking soda or so. So I did all the experiments. And at some point, I was, so what's next, what can I do? And the next plus if they also is like, you have a clear reward signal, you know, if the thing worked on it.

Starting point is 02:37:35 So I remember at first I got interested in producing hydrogen, that was kind of funny experiment from school, you can just burn it. And then I moved to nitroclistering, so that's also relatively easy to synthesize. I started producing as a shadow dynamite and that one I think it would be a friend. You remember there was a, no, there was at first like maybe two attempts that I went with a friend to that one a what we built and it didn't work out

Starting point is 02:38:06 and I get third time he was like it won't work like a let's don't waste time and we were I was carrying this this you know that tube with dynamite I don't know pound or so dynamite in my backpack. We're like riding on the bike to the edges of the city. Yeah. And attempt number three. This was be attempt number three. Attempt number three. And now we dig a hole to put it inside. It actually had the, you know, an electric alternator.

Starting point is 02:38:46 We draw a cable behind the tree. I even, I never, I haven't ever seen like an explosion before, so I thought that there will be a lot of sound. But, you know, we're like laying down and I'm holding the cable and the battery. At some point, you know, we kind of like a three to one. And I just connected it. And it felt like at the ground, it was like a more like a sound. And then the soil started kind

Starting point is 02:39:14 of lifting up and started falling on us. Wow. And then the difference is, let's make sure the next time we have helmets. But it's also, you know, I'm happy that nothing happened to me. It could have been the case that I lost the lean bar so. Yeah, but that's childhood of an engineering mind with a strong reward signal of an explosion. I love it. I, my, there's some aspect of chemists, the chemist I know, like my dad, with plasma chemistry, plasma physics, he was very much explosive too. It's a worrying

Starting point is 02:39:54 quality of people that are working chemistry, they love. I think it is that exactly is the strong signal that the thing worked. There is no doubt. There's no doubt. There's some magic. It's almost like a reminder that physics works, that chemistry works. It's cool. It's almost like a little glimpse at nature

Starting point is 02:40:15 that you, yourself, engineer. That's why I really like artificial intelligence, especially robotics, is you create a little piece of nature. And in some sense, even for new explosives, the motivation was creation rather than destruction. Yes, exactly. In terms of advice, I forgot to ask about just machine learning and deep learning for people who are specifically interested in machine learning. How would you recommend again to the field? So I would say, say, rainplament everything, and also there is plenty of courses.

Starting point is 02:40:48 So, like from scratch? So, on different levels of abstraction in some sense, but I would say, rainplament something from scratch, rainplament something from a paper, rainplament something, you know, from podcasts that you have heard about, I would say that's a powerful way to understand things. So, it's often the case that you read the description and you think heard about, I'll say that's a powerful way to understand things. So it's often the case that you read the description and you think you understand, but you truly understand once you build it, then you actually know what

Starting point is 02:41:14 really met it in the description. Is there particular topics that you find people just found love with? I've seen I tend to really enjoy reinforcement learning because it's much more, it's much easier to get to a point where you feel like you created something special, like fun games, kind of things that are rewarding. It's rewarding, yeah. It's supposed to like, re-implementing from scratch, more like supervised learning kind of things.

Starting point is 02:41:47 It's, yeah. So, you know, if someone would optimize for things to be rewarding, then it feels that the things that are somewhat generative, they have such a property. So you have, for instance, adversarial networks, or you have just even generative language models. And you can even see internally we have seen this thing with our releases. So we have

Starting point is 02:42:12 we released recently two models. There is one model called Dali that generates images. And there is other model called Clip that actually you provide various possibilities, what could be the answer to what is on the picture, and it can tell you which one is the most likely. Okay. And in some sense, in case of the first one, Dali, it is very easy for you to understand that actually there is magic going on. And in the case of the second one, even though it is insanely powerful, and you know, people from Vision Community, they, as they started probing it inside, they actually understood

Starting point is 02:42:50 how far it goes. It's difficult for a person at first to see how well it works. And that's the same, as you said, that in case case of supervised learning models, you might not kind of see or it's not that easy for you to understand the strength. Even though you don't believe in magic to see the magic. To see the magic. It's a generative. That's really brilliant. So anything that's generative, because then you are at the core of the creation.

Starting point is 02:43:22 You get to experience creation without much effort, unless you have to do it from scratch. And it feels that, you know, humans are wired. There is some level of reward for creating stuff. Yeah. Like, of course, different people have a different weight on this reward. Yeah.

Starting point is 02:43:39 In the big objective function. In the big objective function of a person. Of a person. You wrote that beautiful is what you intensely pay attention to. Even a cockroach is beautiful if you look very closely. Can you expand on this? What is beauty? So what I wrote here actually corresponds to my subjective experience that I had through extended periods of meditation. It's pretty crazy that at some point the meditation gets you to the place that you have really increased focus, increased attention.

Starting point is 02:44:20 And then you look at the very simple objects that were all the time around you. You can look at the table or on the pen or at the nature. And you notice more and more details, and it becomes very pleasant to look at it. And once again, it kind of reminds me my childhood, like a just pure joy of being. It's also, I have seen even the reverse effect that by default, regardless of what we possess, we very quickly get used to it.

Starting point is 02:44:54 And you know, you can have a very beautiful house. And if you don't put sufficient effort, you're just gonna get used to it. And it doesn't bring any more joy regardless of what you have. Yeah. Well, I actually, I find them material possessions, getting the way of that experience of pure joy. So I've always been very fortunate to just find joy in simple things, just like you're saying. Just like, I don't know, objects in my life, just stupid objects like this cup thing, you

Starting point is 02:45:33 know, just objects. Sounds okay. I'm not being eloquent, but literally objects in the world. They're just full of joy because it's like, I can't believe, one, I can't believe that I'm fortunate enough to be alive to experience these objects. And then two, I can't believe humans are clever enough to build these objects. The hierarchy of pleasure that that provides is infinite. I mean, even if you look at the cap of water, so you know, you see, first, I can level up like a reflection of light. But then you think, you know, man, there's like a trillions upon of trillions

Starting point is 02:46:09 of particles bouncing against each other. There is also the tension on the surface that, you know, if the back could like stand on it and move around. And you think it also has this like a magical property that as you decrease temperature, it actually expands in volume which allows for the legs to freeze on the surface and at the bottom to have actually not freeze which allows for life like a crazy. You look in detail at some object and you think actually in this table, that was just the figment of someone's imagination at some point. And then there was like a thousand of people involved to actually manufacture it and put it here. And by default, no one cares.

Starting point is 02:46:53 And then you can start thinking about evolution, how raw starts from single cell organisms that led to this table. And, and, and these thoughts, they give me life appreciation. Yeah, exactly. And even lack of thoughts, just the pure rosignal also gives their life appreciation. See, the thing is, and then that's coupled, for me, with the sadness that the whole ride ends, and perhaps is deeply coupled,

Starting point is 02:47:21 in that the fact that this experience, this moment, ends, gives it, gives it an intensity that I'm not sure I would otherwise have. So in that same way, I try to meditate on my own death often. Do you think about your mortality? Are you afraid of death? So fear of death is like one of the most fundamental fears that each of us has. We might be not even aware of it, it requires to look inside to even recognize that it's out there. And there is still, let's say, this property of nature that if things would last forever,

Starting point is 02:48:02 then they would be also boring to us. The fact that the things change in some way gives an meaning to them. I also found out that it seems to be very healing to people, to have these short experiences, I guess, psychodolic experiences in which they experience death of self, in which they let go of this fear and then maybe can even increase the appreciation of the moment. And it seems that many people, they can easily comprehend the fact that the money is finite while they don't see that time is finite. I have this discussion with Ilya from Time to Time, he's thinking, man, the life will pass very fast at some point, I will be 40, 50, 60, 70 and then it's over.

Starting point is 02:49:06 This is true, which also makes me believe that every single moment it is so unique that should be appreciated and it also makes me think that I should be acting on my life because otherwise it will pass. I also like this framework of thinking from Jeff Bezos on regret minimization that I would like if I will be at that deathbed to look back on my life and not regret that I haven't done something.

Starting point is 02:49:42 It's usually you might regret that you haven't tried. I'm finally failing. I haven't tried. I was in each year, I turned over currents, tried to live a life that if you had to live it infinitely many times, that would be the, you'd be okay with You'd be okay with that kind of life. So try to live it optimally. I can say that it's almost like I'm undeveloped to me where I am in my life. I'm extremely grateful for actually people whom I met. I would say I think that I'm decently smart and so on. But I think that actually to great extent

Starting point is 02:50:28 where I am has to do with the people who I met. Would you be okay if after this conversation you died? So if I'm dead, then it kind of, I don't have a choice anymore. So in some sense, there's like a plenty of things that I would like to try out in my life. Mm-hmm. I feel that, you know, I'm gradually going one by one and I'm just doing them.

Starting point is 02:50:51 I think that the list will be always infinite. Yeah. So might as well go today. Yeah, I mean to be clear, I'm not looking forward to dying. I would say if there is no choice, I would accept it. But like in some sense, if there would be a choice, if there would be possibility to leave, I would fight for living. I find it's more honest and real to think about, you know, dying today at the end of the day. That seems to me,

Starting point is 02:51:26 at least to my brain, more honest slap in the face. As opposed to, I still have 10 years, like today, then I'm much more bought appreciating the cup and the table, and so on, and less about like silly worldly accomplishments and all those kinds of things. We, we, we have in the company a person who say at some point found out that they have cancer. And that also gives you know, huge perspective. We've respect what matters now. Yeah. And you know, often people in situations like that, they conclude that actually what matters is human connection. And love and that's people conclude also as you have kids, kids is family. You, I think, tweeted, we don't assign the minus infinity reward to our death.

Starting point is 02:52:15 Such your reward would prevent us from taking any risk. We wouldn't be able to cross a road in fear of being hit by a car. So in the objective function, you mentioned fear death might be fundamental to the human condition. So as I said, let's assume that they're like a reward function in our brain. And the interesting thing is even realization how different reward functions can play with your behavior. As a matter of fact, I wouldn't say that you should assign infinite negative reward to anything because that messes up the math. The math doesn't work out. It doesn't work out.

Starting point is 02:52:55 And it's as you said, even, you know, government or some interest companies, you said they assign nine, nine million dollars to human life. Yeah. And I'm just saying it, we respect to, that might be a hard statement to ourselves, but in some sense that there is a finite value of our own life. I'm trying to put it from perspective of being less,

Starting point is 02:53:19 or of being more egoless, and realizing fraudulity of my own life. And in some sense, the fear of death might prevent you from acting, because anything can cause death. Yeah, and I'm sure actually if you were to put death in the objective function, there's probably so many aspects to death and fear of death and the objective function, there's probably so many aspects to death and fear of death and realization of death and mortality. There's just whole components of finiteness of not just your life, but every experience and so on, that you're going to have to

Starting point is 02:53:58 formalize mathematically. And also, you know, that might lead to dramatically. And also, you know, that might lead to you spending a lot of compute cycles on this like a deliberating this terrible future instead of experiencing now. And in some sense, it's also kind of unpleasant simulation to run in your head. Yeah. Do you think there's an objective function that describes the entirety of human life? So usually the way you ask that is, what is the meaning of life? Is there a universal objective function

Starting point is 02:54:38 that captures the why of life? So, I suspect that they will ask this question, but it's also a question that I ask myself many, many times. See, I can tell you a framework that I have these days to think about this question. So I think that fundamental meaning of life has to do with some of our reward actions that we have in brain, and they might have to do with, let's say, for instance, curiosity they might have to do with, let's say, for instance, curiosity or human connection, which might mean understanding others. It's also possible for a person to slightly modify their reward function, usually they

Starting point is 02:55:16 mostly stay fixed, but it's possible to modify reward function. And you can pretty much choose. So in some sense, the reward functions, optimizing reward functions, they will give you a life satisfaction. Is there some sense, the reward functions optimizing reward functions, they will give you a life satisfaction. Is there some randomness in the function? I think when you are born, there is some randomness. Like you can see that some people, for instance, they, they care more about building

Starting point is 02:55:36 stuff. Some people care more about caring for others. Some people, they're all sorts of default reward functions. And then in some sense, you can ask yourself, what is the satisfying way for you to go after this reward function? And you just go after this reward function. And some people ask, are these reward functions real? I almost think about it as, let's say, if you would have to discover mathematics, in mathematics, you

Starting point is 02:56:08 are likely to run into various objects like a complex numbers or differentiation, some other objects and these are very natural objects that arise. And similarly, the reward functions that we are having in our brain, they are somewhat very natural that, you know, there is a reward function for understanding, like a comprehension, a curiosity and so on. So in some sense, they are in the same way natural as their natural objects in mathematics. Interesting. So you know, there's the, the old sort of debate is mathematics invented or discovered. You're saying reward functions are discovered. So nature, nature that provided some, you can still, let's say, expand it throughout the life. Some of the reward functions, they might be

Starting point is 02:56:54 futile. Like, for instance, there might be a reward function, maximize amount of wealth. Yeah. And this is more like a learning reward function. But we know also that some reward functions, if you optimize them, you won't be quite satisfied. Well, I don't know which part of your reward function resulted in you coming today, but I deeply appreciate it if you did spend your valuable time with me. What check is really fun talking to you,

Starting point is 02:57:23 your brilliant, your good human being. And it's an honor to meet you and an honor to talk to you. Thanks for talking to me, Wojtek is really fun talking to you, your brilliant, your good human being, and it's an honor to meet you and an honor to talk to you. Thanks for talking to me, brother. Thank you, Lekstalod. I appreciate that you're questions. You're a good state to me. I had a lot of fun being here. Thanks for listening to this conversation with Wojtek, Sarenba.

Starting point is 02:57:39 To support this podcast, please check out our sponsors in the description. And now, let me leave you with some words from Arthur C. Clarke, who is the author of 2001 A Space Odyssey. It may be that our role on this planet is not to worship God, but to create him. Thank you for listening, and I hope to see you next time. you

Lex Fridman Podcast - #215 – Wojciech Zaremba: OpenAI Codex, GPT-3, Robotics, and the Future of AI

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.