번역: Can MechE’s Survive the AI Revolution? I Asked a Boston Dynamics ML Engineer (ex-MechE)

Transcript Translation

Can MechE’s Survive the AI Revolution? I Asked a Boston Dynamics ML Engineer (ex-MechE) - https://www.youtube.com/watch?v=w2W29zdUGfk

A lot of the top AI engineers that I work with on a day to day basis have a mechanical engineering background. So you can actually be the one to go into AI and take other people's jobs. There's actually not that much math behind it. That's very encouraging. It is not too late for me. No, for you it's too late. Hopefully the problem is that it's interesting, but I'm so bad at presenting it as being interesting. Damn, I wish I was wearing shoes. Can you see my feet? No. Okay. Okay, great. All right. 12:30 at night. Jesse, why don't you introduce yourself? I'm Jesse. I, like Leon was Cornell mechanical engineering. And I kind of quickly diverged from Leon's scope. So Leon was more like pure mechanical. And then I went more robotics programming side. I worked for seven years promotional on self driving cars. And then recently I joined Boston Dynamics doing humanoid robotics. You were a control specialist in college, is that right? Undergrad. It was just kind of vanilla mechanical engineering. I did Cornell autonomous sailboat team, which kind of got me a flavor in robotics where I started getting a taste of what I wanted to do was ironically matlab class. So matlab is this like very mechanical engineering programming language. I was really enjoying it and I didn't realize that there's a whole world of programming. I think as a mechanical engineer, it's easy to think we are mechanical engineering, we are doing machine shop and these things, you know, computer science and programming is a totally different. But actually robotics is this nice major that lets you combine mechanical and programming. It's very lucky that I'm probably the most comfortable in the world with you, Jesse. I'm the least comfortable with you. You recently left Motional as like a super principal, hyper principal controls engineer. Just principal. Just principal. And became a reinforcement learning engineer. You saw a few years back that machine learning was exciting and I think you've been low key working on a pivot. And so robotics is this interesting thing that combines many different majors. So you have computer science, electrical engineers, mechanical engineers that are all working together. I think a lot of industries it's easy to silo yourself in like your thing. But robotics, you kind of have to have a good understanding of each component. You can't just be a mechanical engineer. You have to be a mechanical engineer who knows how to program, who knows a little bit of electrical engineering. So I started in kind of this controls route where I'm like, I don't want to be the one that manufactures or like makes this part. I want to be the one that controls this thing to do something useful in the world. So the classical thing that you do as a controlist engineer is you like start with the pendulum. So like you take this pendulum that's just swinging on the bottom. The problem you're trying to solve is how do you swing the pendulum so that it's balancing itself. And so as a controls engineer, you start with these very simple canonical problems and you build up from that, like how can you sense the world, how can you use the sensors in the system that you have and the actuators and the motors to actually do something useful? In robotics you have these very different industries. So you have humanoid robots or you have self driving cars. And at face value they seem very different, but actually a lot of the underlying algorithms are similar. So like controls is this field where you take sensor input from the outside world and you determine how to actuate the motors that allow you to get the system like the pendulum to do what you want. So like swing up and balance itself. But kind of as you go along this path, you realize that these more complicated systems, like a self driving car or a humanoid robot, are more complicated, that are more nuanced. Not just this input and output system. You have like other people around you, other pedestrians, other agents that are interacting with you. As you build up these systems, you realize some of these classical algorithms that you learned in school are too simple, naive, and you actually have to like add more and more complexity. And at some point you start going down the path of machine learning and you realize that machine learning and AI is the way to control these more complicated systems, to look at the world around you with cameras and perception as the sensors. And now you have the actuator for self driving cars that could be like the steering wheel and the gas and brake. And how do you look at the world around you and then actuate the system to safely interact with the other cars and the other pedestrian. You might start with these simple control algorithms like pid, go to more complicated things like model predictive control or trajectory optimization. And then as you encounter more and more complicated situations, you realize that kind of the final solution is like machine learning or AI. The hot take or the sound bite is like mechanical engineers. You might be thinking like, oh, this AI is going to take our jobs. As a mechanical engineer, you might be like, okay, that's like computer science, that's AI. That's a different field. As a mechanical engineer, actually, I'm finding that there's a lot of AI engineers who have the background of a mechanical engineer. And so you can actually be the one to go into AI and take other people's jobs. Mmm, Saucy. Not all the mechis are controls or math savvy as you, for the meatheads like myself who are more spatial or keen to design blocks with holes. How would someone like that kind of get into the controls and robotic stuff if they don't have a knack for dynamics and hard maths? Yeah, I think it's a misconception that, like, you need math to go into these fields. Like, a lot of programming and computer science, they don't even know, like, sine and cosine. It's kind of a wide range of things it can do in robotics and programming. A lot of it's not theoretical, and actually a lot of modern robotics is not based on a theoretical understanding of what works. Like, AI is kind of a classical example now of no one knows why it works. There's not that much theory behind it. It just kind of does. There's actually not that much math behind it. You can kind of download a library like Pytorch or Scikit, learn, and you just throw a bunch of training data at a problem and get the AI model to learn something. And you don't really need to know the theory and you can kind of just watch it work and have it solve problem that you care about. Say someone is a mechanical engineer in another industry interested in getting into robotics and wants to do a side project. What is the scope of a meaningful side project to make their way into the robotics space? Yeah, I mean, one thing that I like about robotics is that it's kind of easy to iterate and to work on these side projects, because a lot of it's programming. A good place to start could be to write your own simulation where you're not needing to actually build anything physical. You can kind of write a simulation. For instance, like, I started with, like, an autonomous sailboat. So, like, my project project wasn't building the sailboat, it was creating a simulation of a sailboat. As far as projects that you can kind of start, like a mechatronics or robotics project, there's a lot of existing kits that you can buy. Robotics kits are nice because it kind of walks you through the basic steps. But I think once you learn the basics, like once you kind of take a couple introduction classes to robotics, I think it's more meaningful to jump immediately to, like, your own thing. And so, like, an example of something that I did was these balancing bots that have just two wheels, like A segue. And this is going back to the pendulum idea, where like this pendulum is this very simple system in mechanical engineering. You can have a pendulum, a pendulum naturally wants to fall over. And a segue, or like a robot on two wheels, is a system that naturally wants to fall over. You can create your own motor controller, have your own imu, your own power system, and develop a robot pretty simply with maybe only 100 lines of code that reads the IMU to detect the angle, and then controls the motors to keep the robot upright. So like, as the robot starts tipping over this way, the motors will kind of carry the base under it and it's a self stabilizing system, as one example. But I think you can come up with a lot of interesting, relatively simple projects, whether it's like purely simulation or involves some simple hardware that you can kind of work on your own idea and see it to fruition. How did you transition from controls engineering into reinforcement learning? Yeah, that's a good question. So there's this famous, this paradox called Moravec's paradox, that's making the observation that everyone thought that the first job that would be automated was like the manufacturing job, that a robot would replace a person in a factory doing these simple pick an object up here and place it here. And people assumed that the thing that would be hard to replace would be like an artist or like a poet or something that requires creativity. Moravec's paradox is this observation that actually the types of things that are being automated are these creative jobs. And actually the hardest thing that we're trying to automate is like this simple thing of like, can I pick up this object here and move it here? Kind of operating in the physical world. Controls gives you these, this mathematical framework that is kind of theoretically back. For instance, one of the options here is like trajectory optimization, where you find like a closed form thing that says like, this is the optimal solution to move this here and, and put it here. But it turns out that in any meaningfully useful situation, it becomes too complicated to like formulate this thing in this classical way of controls. And that reinforcement learning is simply controls that's backed by machine learning. So reinforcement learning is this process of trying to have this agent or this robot that's operating this world that takes in some observation of what is around it, you know, the state of this table and some objects on this table, and is now taking an action, taking its hand and moving this object here. So that is a controls problem. But for more complicated systems, which are the systems that we see in like day to day industry with types of systems that you're trying to automate. It turns out that classical controls often struggle in these more complicated settings. And reinforcement learning using AI as the brains of this controller is gaining popularity as a way to solve these problems. Pick and place and end effectors is this long standing problem that's been plaguing Amazon and humanoid robotics. What is the state of the art or the bleeding edge of this problem and the technologies? Amazon would profit billions and billions of dollars a year if they could simply figure out how to take an object or a package and move it from this box to this box. And it seems like such a trivial problem to solve. We have AI showing up in all these places like ChatGPT, and yet we can't figure out how to move an object from here to here reliably enough to make this system profitable. Which I think is an interesting observation. I think that's bound to change. And that's why I'm in this industry, because I feel like this is like the next ChatGPT is going to be a system that can operate in the physical real world and do these simple tasks that seem to require common sense. So like ChatGPT was learning common sense on like the digital scale and robotics is, how do we learn common sense on the physical scale? We need a chatgpt moment. For robotics, the state of the art is very simplistic. The current state of the art are these simple robots that have like suction cups that kind of suck up an object to move it from one place to another. And the reason that they're using suction cups is simply because you don't have to be very precise. You just go over to an object, suck on it like an octopus tentacle to move it, and it doesn't really matter to like detect the exact shape or texture of that object. So that's more of a crutch than the optimal solution. The optimal solution is like a human hand, like human hands are incredibly dexterous. We evolved to be able to manipulate fine grained objects. We underestimate how difficult of a problem this is to solve. We've evolved for billions of years to be able to do these very subtle tasks. And our skin is filled with millions of these very fine grained sensors that are very hard to get into like a skin of a robot. And so I think this is an unsolved problem. The state of the art are these like suction cup like robots. Like even at Boston Dynamics there's this stretch robot which is like kind of the most industrial friendly robot that we have that is this big crane filled with suction cups that pick up a box and put it here. Boston Dynamics is trying to get atlas, which is a humanoid robot, to be able to use a dexterous hand to do these tasks instead. And I think it's a matter of time. I think it will be an AI solution and I think that mechanical engineers will benefit from learning that. It's not like mechanical engineer versus AI or mechanical engineer versus software engineer. You can, as a mechanical engineer go down the path. You can kind of meet in the middle. You have mechanical here, you have software here. Robotics is in the middle. AI can be in the middle. A lot of the top AI engineers that I work with on a day to day basis have a mechanical engineering background. You might think of this as like the thing that will take your job. You can actually instead be the one that joins this AI and ML field and be the one to kind of solve this next frontier, which is like, how do you get robots to. To have common sense and interact with the world in a useful way? Nice takeaway. When we were in school, we took that graduate level robotics course and we tried to learn ros, which had like the steepest, most absurdly painful learning curve ever. Like, I think we dropped out of it after two weeks. And then you eventually took it again and passed it, right? Yeah, it was this classical robotics class where it's like a mix of programmers, computer science majors and electrical engineers. And then like us poor mechanical engineers. And the first homework is like, on an Ubuntu machine. I didn't know what Ubuntu was, I didn't know what Linux was. And it was like, oh, run this program on this operating system that I've never heard of. And I think we were both stunned into a sad silence of like, we have no idea how to even run this program. It's an entirely new language. Right. When a mechanical engineer tries to learn robotics at first, how would you guide them past that? Yeah, well, I think now in the age of large language models, use that to your advantage. I think people now are using ChatGPT to like write code for them, which I think is great. But I think you want to make sure that if you're using ChatGPT to write code because you're not familiar with code, you don't want to replace the learning opportunity to actually learn how to code. There's still an important learning process. The end goal should be, I will learn how to code, and I will learn how to code as A mechanical engineer who wants to get into robotics. And it's not like I will get ChatGPT to code for me having no idea how to actually do that myself. So I think using large language models, using ChatGPT as a learning coach to guide you through these processes. I think if we had that when we were taking the class, it would have been way easier. We could be like, where do I even start? And explain like the basic concepts to me. You had a fairly dedicated path, but you didn't have the most direct path. Right. If you were to do it all over again, or if you were a student now starting in 2025, how would you structure your career to pursue the things that you are most optimistic about? Yeah, I think I took a very meandering path. I knew like nothing going into Cornell about like engineering, so I didn't know until like end of senior year that I was interested in robotics. And I'm kind of jealous when I look at people around me now who like kind of had like a dead set, like focus on robotics from like freshman year and like did everything right and like worked in the top labs and they have a skill set that I can't really compete with. Meanwhile in sophomore junior year I was taking dynamics, fluid mechanics, things that don't direct apply to like what I'm doing now. But I think where I ended up is where I would want to have ended up anyway. So I got lucky that way. And I feel like there is a little bit to be gained by kind of taking this circuitous path because you kind of learn a little bit about these other things that you might not apply on a day to day basis, but could be useful especially in a field like robotics where like so many things apply. So like if I have a little bit more knowledge about fluid mechanics as a roboticist, well, suddenly if I'm now making a robot that's operating in the ocean, I can now bring that expertise that I learned from fluid mechanics to this one application. And so I think that you can always use it to your advantage. And I think taking this circuitous path is a good way to find out what you're interested in. Because not everyone's going to be interested in the same thing and you kind of don't know what sparks your interest until you try it. So I think actually taking the meandering path can be a good way of dabbling in many things until something clicks. That's very encouraged. But it's not too late for me. No, for you it's too late. What's your take on the robotics industry. I feel like robotics, it has proven to be harder than anyone thought it would be. I mean, you go back and you see people like Elon Musk, starting from like 2014, saying that full self driving is going to come the next year. Elon was like the most vocal optimist with robotics, but I think that that mindset and mentality was shared by many in the field. I think part of it was kind of marketing. Like everyone wants to be like, yes, my robot will be ready next year, but I think a lot of it is genuine because everyone was surprised by ChatGPT that it could write poetry all of a sudden. And no one thought it would be ready at that point. Many people thought useful robots that can do things around the house, like laundry or fold your bed, would be an easier problem to solve than generating a painting or things midjourney can do. And so I think it's surprised people how hard robotics is. But I think in this, you know, era now of machine learning, AI and data, we are positioned to potentially see a reawakening of robotics. I think robotics industries have proved super hard to be profitable. You have very few companies that have robots that people want to buy and are useful enough. So like Roomba, a vacuum cleaner is like one of the few examples of like a robot that people will actually buy at this point, which is a very telling case of how difficult robotics is. Everyone now is talking about this flywheel, where in this era of AI and ML, it's kind of just a matter of getting the data. ChatGPT works because it was trained on an Internet scale of data. And so for robotics, a lot of people think now that it's just a matter of finding that Internet scale or that Internet size of data for robots. So what might that look like? It might look like simple robots operating in the real world, like a Roomba or some of these other robots that are now not just only operating in the world, but collecting data that can be used to train future iterations of the robot. So you can imagine this flywheel where you have these relatively simple robots that work well enough that people will buy them, do relatively simple tasks in the house, but collect tons of data, and you can use that data to train the next iteration of robots that are a little bit more useful, a little bit more general, and more people will buy those. And you now generate this self fulfilling flywheel where robots are now self improving in this way because the first generation of robots are making the second generation smarter. And Tesla's kind of doing this with full self driving now, where they're collecting data of humans driving their cars kind of for free. Data is inherently hard to generate and it's hard to get free data unless you're just downloading the Internet, especially for robotics. And so Tesla's found this really useful niche where they can get people to collect data for them and use that to improve the next generation of robotics. So I think robotics is going to become all about data in this age of AI and I think people who can find ways of getting there in with maybe an initial simple robot that's just useful enough for people to buy it and then use that to generate tons and tons of data that can then be used to make the next generation more useful. I think is going to be the future. Is this data or are these models, robot architecture like degree of freedom specific, or will they be generalizable across different robots, say like Roomba to humanoid? Yeah, I think that's a big question. And I think if you look at papers coming out now, there is talk of these foundation models analogous to LLM, which are like kind of these foundational models. That just means that these things generalize across many different systems or environments. And so in robotics they're talking about these foundational robotics models that generalize or were trained cross embodiment. Cross embodiment just means you have many different robotic platforms that you collect data with. And rather than a couple years ago it was all about training specialized robots, collecting data just for one type of robot. But now it's becoming more popular to collect data across many different robotic platforms, many embodiments of robots and use that to train these generalist models. And the idea is some concept that might generalize across many different robots could be learned by training with large amounts of data that span these many different robots. For instance, object permanence, things like this idea that like objects, though they might go out of your view, still exist and come back into your view are things that robots have to learn and are not robot platforms that specific? And you can kind of collect data across many different robots to learn these kind of common sense principles. Thanks so much for bearing with me through doing this at 1am for bailing me out. It's three hours past my bedtime. Yes, but anything for you. Thank you, Jesse. Anything for robots. I'm just trying to replace your job. Specifically yours will be the first to go.

Can MechE’s Survive the AI Revolution? I Asked a Boston Dynamics ML Engineer (ex-MechE) - https://www.youtube.com/watch?v=w2W29zdUGfk

제가 매일 함께 일하는 최고의 AI 엔지니어들 중 많은 분들이 기계공학 배경을 가지고 있습니다. 그래서 실제로 여러분이 AI 분야에 진출해서 다른 사람들의 일자리를 가져올 수 있습니다. 실제로 그 뒤에 있는 수학이 그리 많지 않습니다. 매우 격려적이네요. 저에게는 너무 늦지 않았습니다. 아니요, 당신에게는 너무 늦었습니다. 문제는 흥미로운데, 제가 흥미롭게 전달하는 것을 너무 못한다는 것입니다. 젠장, 신발을 신었으면 좋았을 텐데. 제 발이 보이나요? 아니요. 알겠습니다. 좋습니다. 알겠습니다. 밤 12시 30분입니다. 제시, 자기소개를 해주시겠어요? 저는 제시입니다. 저는 리온처럼 코넬 기계공학과였습니다. 그리고 리온의 범위에서 빠르게 벗어났습니다. 리온은 순수 기계공학 쪽이었고, 저는 로봇공학 프로그래밍 쪽으로 갔습니다. 7년간 프로모션에서 자율주행차 분야에서 일했습니다. 그리고 최근에는 보스턴 다이내믹스에서 휴머노이드 로봇공학을 하고 있습니다. 대학에서 제어 전문가였나요? 학부 때요. 그냥 평범한 기계공학이었습니다. 저는 코넬 자율 범선 팀을 했는데, 그것이 로봇공학에 대한 맛을 주었고, 제가 하고 싶었던 것을 맛보기 시작한 곳이 아이러니하게도 matlab 수업이었습니다. matlab은 매우 기계공학적인 프로그래밍 언어입니다. 저는 그것을 정말 즐겼고, 프로그래밍의 전체 세계가 있다는 것을 깨닫지 못했습니다. 기계공학도로서는 우리가 기계공학이vtt 완전히 다른 것입니다. 하지만 실제로 로봇공학은 이런 멋진 전공으로 기계공학과 프로그래밍을 결합할 수 있게 해줍니다. 다행히 세상에서 아마 제가 당신과 가장 편안할 것입니다, 제시. 저는 가장 편안하지 않습니다. 당신은 최근에 Motional에서 슈퍼 프린시펄, 하이퍼 프린시펄 제어 엔지니어로 일하다가 그만두었죠. 그냥 프린시펄이에요. 그냥 프린시펄이요. 그리고 강화학습 엔지니어가 되었죠. 몇 년 전에 머신러닝이 흥미롭다는 걸 보았고 제 생각에는 당신이 은밀하게 전환 작업을 해왔던 것 같습니다. 그래서 로봇공학은 이런 흥미로운 것으로 여러 다른 전공들을 결합합니다. 컴퓨터 과학, 전기 엔지니어, 기계 엔지니어들이 모두 함께 일하고 있습니다. 많은 산업에서는 자신만의 영역에 고립되기 쉽습니다. 하지만 로봇공학에서는 각 구성요소에 대한 좋은 이해가 필요합니다. 단순히 기계 엔지니어가 될 수는 없어요. 프로그래밍하는 방법을 알고, 전기공학을 조금 아는 기계 엔지니어가 되어야 합니다. 저는 이런 제어 경로에서 시작했는데 제조하거나 부품을 만드는 사람이 되고 싶지 않았습니다. 저는 이 물건을 제어해서 세상에서 유용한 일을 하게 만드는 사람이 되고 싶었습니다. 제어 엔지니어로서 하는 고전적인 일은 진자로 시작하는 것입니다. 아래쪽에서 그냥 흔들리는 진자를 가져와서. 해결하려는 문제는 어떻게 진자를 흔들어서 스스로 균형을 잡게 하느냐입니다. 제어 엔시스템과 액추에이터, 모터를 실제로 유용하게 사용하는 방법은 무엇인가요? 로봇공학에는 매우 다른 산업들이 있습니다. 휴머노이드 로봇이나 자율주행차가 있죠. 표면적으로는 매우 다르게 보이지만, 실제로는 많은 기본 알고리즘들이 유사합니다. 제어는 외부 세계의 센서 입력을 받아서 시스템이 원하는 대로 동작하도록 하는 모터를 어떻게 작동시킬지 결정하는 분야입니다. 예를 들어 진자가 스윙업하고 균형을 잡도록 하는 것처럼요. 하지만 이 경로를 따라가다 보면, 더 복잡한 시스템들, 자율주행차나 휴머노이드 로봇 같은 것들은 더 복잡하고 미묘하다는 것을 깨닫게 됩니다. 단순한 입력과 출력 시스템이 아니라, 주변에 다른 사람들, 다른 보행자들, 상호작용하는 다른 에이전트들이 있습니다. 이러한 시스템들을 구축하면서, 학교에서 배운 일부 고전적인 알고리즘들이 너무 단순하고 순진하다는 것을 깨닫게 되고, 실제로 점점 더 복잡성을 추가해야 합니다. 어느 시점에서 머신러닝의 길로 가게 되고, 머신러닝과 AI가 이러한 더 복잡한 시스템들을 제어하는 방법이며, 카메라와 인식을 센서로 사용하여 주변 세계를 바라보는 것임을 깨닫게 됩니다. 그리고 이제 자율주행차의 액추에이터는 핸들과 가속기, 브레이크가 될 수 있습니다. 어떻게 주변 세계를 바라보고 다른 차량들과 다른 보행자들과 안전하게 상호작용하도록 시스템을 작동시킬 수 있을까요? PID와 같은 간단한 제어 알고리vtt 그리고 점점 더 복잡한 상황들을 마주하게 되면, 결국 최종 해결책은 머신러닝이나 AI라는 것을 깨닫게 됩니다. 뜨거운 이슈나 헤드라인은 기계공학자들에게 이런 생각을 하게 만듭니다. 아, 이 AI가 우리 일자리를 빼앗을 거야. 기계공학자로서, 아, 저건 컴퓨터공학이고, AI고, 다른 분야야 라고 생각할 수 있습니다. 기계공학자로서, 실제로 저는 기계공학 배경을 가진 AI 엔지니어들이 많다는 것을 알게 되었습니다. 그래서 실제로 여러분이 AI로 진출해서 다른 사람들의 일자리를 빼앗을 수 있습니다. 음, 도발적이네요. 모든 기계공학자가 당신처럼 제어나 수학에 능숙한 것은 아닙니다. 저처럼 더 공간적이거나 구멍 뚫린 블록 설계에 능숙한 골치 아픈 사람들은 어떻게 역학과 어려운 수학에 재능이 없다면 제어와 로봇 분야에 진입할 수 있을까요? 네, 이런 분야에 진입하려면 수학이 필요하다는 것은 잘못된 생각입니다. 많은 프로그래밍과 컴퓨터공학자들은 사인과 코사인조차 모릅니다. 로봇공학과 프로그래밍에서 할 수 있는 것들은 매우 다양합니다. 대부분이 이론적이지 않고, 실제로 현대 로봇공학의 많은 부분은 무엇이 효과적인지에 대한 이론적 이해에 기반하지 않습니다. AI는 이제 아무도 왜 작동하는지 모르는 고전적인 예시입니다. 그 뒤에 그다지 많은 이론이 없습니다. 그냥 작동할 뿐입니다. 실제로 그 뒤에 그다지 많은 수학이 없습니다. Pytorch나05:28.620vtt 이론을 정말로 알 필요는 없고 그냥 보면서 작동하는 것을 보고 당신이 관심 있는 문제를 해결하도록 할 수 있습니다. 누군가가 다른 산업의 기계공학자인데 로봇공학에 관심이 있어서 사이드 프로젝트를 하고 싶다고 해봅시다. 로봇공학 분야로 진입하기 위한 의미 있는 사이드 프로젝트의 범위는 어떻게 될까요? 글쎄요, 제가 로봇공학에 대해 좋아하는 한 가지는 반복하고 이런 사이드 프로젝트를 진행하기가 쉽다는 것입니다. 많은 부분이 프로그래밍이거든요. 좋은 시작점은 실제로 물리적인 것을 만들 필요 없이 자신만의 시뮬레이션을 작성하는 것일 수 있습니다. 시뮬레이션을 작성할 수 있거든요. 예를 들어, 저는 자율 범선으로 시작했습니다. 그래서 제 프로젝트는 범선을 만드는 것이 아니라, 범선의 시뮬레이션을 만드는 것이었습니다. 시작할 수 있는 프로젝트로는 메카트로닉스나 로봇공학 프로젝트처럼, 구매할 수 있는 기존 키트들이 많이 있습니다. 로봇공학 키트는 기본 단계를 안내해주기 때문에 좋습니다. 하지만 기본을 배우고 나면, 로봇공학 입문 수업을 몇 개 듣고 나면, 바로 자신만의 것으로 뛰어드는 것이 더 의미 있다고 생각합니다. 그래서 제가 한 일의 예로는 세그웨이처럼 바퀴가 두 개만 있는 균형 로봇들이 있습니다. 이것은 진자 아이디어로 돌아가는 것인데, 진자는 기계공학에서 매우 단순한 시스템입니다. 진자가 있으면, 진자는 자연스럽게 넘어지자체 전력 시스템을 구축하고, 간단히 로봇을 개발할 수 있습니다. 아마도 100줄의 코드만으로 IMU를 읽어서 각도를 감지하고, 모터를 제어해서 로봇을 똑바로 세워둘 수 있죠. 로봇이 이쪽으로 기울기 시작하면, 모터들이 그 아래로 베이스를 운반하면서 자가 안정화 시스템이 되는 것처럼 말이에요. 하지만 저는 여러분이 많은 흥미로운, 상대적으로 간단한 프로젝트들을 생각해낼 수 있다고 봅니다. 순수한 시뮬레이션이든 간단한 하드웨어를 포함하든 여러분 자신의 아이디어를 작업하고 결실을 맺을 수 있습니다. 제어 공학에서 강화학습으로 어떻게 전환하셨나요? 좋은 질문이네요. 모라벡의 역설이라는 유명한 역설이 있는데, 모든 사람들이 처음 자동화될 일자리는 제조업 일자리라고 생각했다는 관찰입니다. 로봇이 공장에서 사람을 대체해서 이런 간단한 작업을 하는 것 말이죠. 여기서 물체를 집어서 저기에 놓는 것처럼요. 사람들은 대체하기 어려운 것은 예술가나 시인처럼 창의성을 요구하는 것이라고 가정했습니다. 모라벡의 역설은 실제로는 이런 창의적인 일들이 자동화되고 있다는 관찰입니다. 그리고 실제로는 우리가 자동화하려고 하는 가장 어려운 것은 이런 간단한 것입니다. 여기 있는 이 물체를 집어서 저기로 옮길 수 있는가? 물리적 세계에서 작동하는 것 말이죠. 제어는 이런 수학적 프레임워크를 제공해주는데 이론적으로 뒷받침됩니다. 예를 들어, 여기서 선택지 중 하이것을 여기로 옮기고 여기에 놓는 최적의 솔루션입니다. 하지만 의미 있게 유용한 상황에서는 이것이 고전적인 제어 방식으로 문제를 공식화하기에는 너무 복잡해집니다. 그리고 강화 학습은 단순히 머신러닝으로 뒷받침되는 제어입니다. 강화 학습은 이 에이전트나 이 로봇이 주변 환경을 관찰하며 작동하는 과정입니다. 이 테이블의 상태와 테이블 위의 일부 객체들을 관찰하고, 이제 행동을 취하여 손을 움직여 이 객체를 여기로 옮기는 것입니다. 이것이 제어 문제입니다. 하지만 더 복잡한 시스템들, 즉 일상 산업에서 볼 수 있는 시스템들과 자동화하려는 시스템 유형들에서는 고전적 제어가 이러한 복잡한 환경에서 종종 어려움을 겪습니다. 그리고 AI를 이 컨트롤러의 두뇌로 사용하는 강화 학습이 이러한 문제들을 해결하는 방법으로 인기를 얻고 있습니다. 픽앤플레이스와 엔드 이펙터는 아마존과 휴머노이드 로봇공학을 괴롭혀온 오랜 문제입니다. 이 문제와 기술들의 최신 기술이나 최첨단 상태는 무엇입니까? 아마존은 단순히 객체나 패키지를 이 상자에서 저 상자로 옮기는 방법을 알아낼 수 있다면 매년 수십억 달러의 수익을 올릴 것입니다. 그리고 이는 해결하기 매우 사소한 문제처럼 보입니다. ChatGPT와 같은 곳곳에서 AI가 나타나고 있지만, 우리는 여전히 객체를 여기서 저기로 옮기는 방법을 알아내지 못하고 있습니다.여기서 충분히 안정적으로 작동하여 이 시스템을 수익성 있게 만들 수 있습니다. 이는 흥미로운 관찰이라고 생각합니다. 이는 바뀔 수밖에 없다고 생각합니다. 그래서 제가 이 업계에 있는 이유입니다. 다음 ChatGPT는 물리적 현실 세계에서 작동하고 상식이 필요해 보이는 간단한 작업들을 수행할 수 있는 시스템이 될 것이라고 생각합니다. ChatGPT가 디지털 규모에서 상식을 학습했듯이, 로봇공학은 물리적 규모에서 상식을 어떻게 학습하느냐의 문제입니다. 로봇공학에 ChatGPT 순간이 필요합니다. 현재 최첨단 기술은 매우 단순합니다. 현재 최첨단 기술은 흡입컵을 가진 단순한 로봇들로, 물체를 빨아들여 한 곳에서 다른 곳으로 이동시킵니다. 그들이 흡입컵을 사용하는 이유는 단순히 매우 정밀할 필요가 없기 때문입니다. 물체 위로 가서 문어 촉수처럼 빨아들여 이동시키면 되고, 물체의 정확한 모양이나 질감을 감지하는 것은 중요하지 않습니다. 이는 최적의 솔루션이라기보다는 임시방편입니다. 최적의 솔루션은 인간의 손과 같은 것입니다. 인간의 손은 믿을 수 없을 정도로 정교합니다. 우리는 세밀한 물체를 조작할 수 있도록 진화했습니다. 우리는 이 문제를 해결하는 것이 얼마나 어려운지 과소평가합니다. 우리는 수십억 년 동안 이런 매우 미묘한 작업을 할 수 있도록 진화했습니다. 우리의 피부는 수백만 개의 매우 세밀한 센서들로 가득 차 있는데, 이는 로봇의 피부에 넣기 매우 어렵습니다. 그래서 저는11:06.910 --> 우리가 가진 가장 산업 친화적인 로봇은 이 큰 크레인에 흡입 컵이 달려서 상자를 집어 올려 여기에 놓는 것입니다. Boston Dynamics는 인간형 로봇인 Atlas가 손재주 있는 손을 사용해서 이런 작업을 할 수 있도록 하려고 합니다. 그리고 저는 이것이 시간 문제라고 생각합니다. AI 솔루션이 될 것이고 기계 엔지니어들이 이를 배우는 것이 도움이 될 것입니다. 이것은 기계 엔지니어 대 AI나 기계 엔지니어 대 소프트웨어 엔지니어가 아닙니다. 기계 엔지니어로서 그 길을 갈 수 있습니다. 중간에서 만날 수 있습니다. 여기에 기계가 있고, 여기에 소프트웨어가 있습니다. 로보틱스는 중간에 있습니다. AI도 중간에 있을 수 있습니다. 제가 일상적으로 함께 일하는 최고의 AI 엔지니어들 중 많은 분들이 기계 공학 배경을 가지고 있습니다. 이것을 당신의 일자리를 빼앗을 것이라고 생각할 수도 있습니다. 하지만 실제로는 이 AI와 ML 분야에 참여해서 이 다음 프론티어를 해결하는 사람이 될 수 있습니다. 즉, 로봇이 상식을 가지고 세상과 유용하게 상호작용하도록 하는 방법 말입니다. 좋은 결론이네요. 우리가 학교에 있을 때, 대학원 수준의 로보틱스 과정을 들었고 ROS를 배우려고 했는데, 가장 가파르고 터무니없이 고통스러운 학습 곡선을 가지고 있었습니다. 2주 후에 포기했던 것 같습니다. 그리고 당신은 결국 다시 들어서 통과했죠, 맞나요? 네, 그것은 고전적인 로보틱스 수업이었는데 프첫 번째 과제가 우분투 머신에서 하는 것이었어요. 우분투가 뭔지도 몰랐고, 리눅스가 뭔지도 몰랐거든요. 그런데 들어본 적도 없는 운영체제에서 이 프로그램을 실행하라는 거예요. 그래서 우리 둘 다 슬픈 침묵에 빠져서 충격을 받았어요. 이 프로그램을 어떻게 실행하는지조차 전혀 모르겠더라고요. 완전히 새로운 언어 같았어요. 맞아요. 기계공학자가 처음 로봇공학을 배우려 할 때, 어떻게 그 단계를 넘어서도록 도와주시겠어요? 네, 지금은 대규모 언어 모델의 시대니까 그걸 활용하면 좋을 것 같아요. 요즘 사람들이 ChatGPT를 사용해서 코드를 작성하게 하는데, 저는 그게 좋다고 생각해요. 하지만 코드에 익숙하지 않아서 ChatGPT를 사용해서 코드를 작성한다면, 실제로 코딩을 배울 수 있는 학습 기회를 대체하지 않도록 해야 해요. 여전히 중요한 학습 과정이 있어요. 최종 목표는 코딩을 배우는 것이어야 하고, 로봇공학에 입문하고 싶은 기계공학자로서 코딩을 배우는 것이어야 해요. 그리고 실제로 어떻게 하는지 전혀 모르면서 ChatGPT에게 코딩을 대신 시키는 것이 아니라요. 그래서 대규모 언어 모델을 사용하고, ChatGPT를 학습 코치로 사용해서 이런 과정들을 안내받는 것이 좋다고 생각해요. 우리가 수업을 들을 때 그런 게 있었다면 훨씬 쉬웠을 거예요. 어디서부터 시작해야 하는지 물어보고 기본 개념들을 설명해달라고 할 수 있었겠죠. 당신은 꽤 헌신적인 길을 자신이 가장 낙관적으로 생각하는 것들을 추구하기 위해 커리어를 바꾸었나요? 네, 저는 매우 구불구불한 길을 걸었다고 생각합니다. 코넬에 들어갈 때 엔지니어링에 대해 아무것도 몰랐어서, 4학년 말이 되어서야 로봇공학에 관심이 있다는 걸 알았습니다. 그리고 지금 주변 사람들을 보면 좀 질투가 납니다. 1학년 때부터 로봇공학에 대해 확고한 목표를 가지고 모든 것을 올바르게 해서 최고의 연구실에서 일하고 제가 경쟁할 수 없는 기술을 가진 사람들 말이에요. 한편 2학년, 3학년 때 저는 역학, 유체역학 같은 것들을 배웠는데, 이것들은 지금 제가 하고 있는 일에 직접적으로 적용되지 않습니다. 하지만 제가 결국 도착한 곳이 어쨌든 제가 도착하고 싶었던 곳이라고 생각합니다. 그래서 그런 면에서 운이 좋았습니다. 그리고 이런 우회적인 길을 택하는 것에도 얻을 수 있는 게 조금 있다고 생각합니다. 왜냐하면 일상적으로는 적용하지 않을 수도 있지만, 특히 로봇공학처럼 많은 것들이 적용되는 분야에서는 유용할 수 있는 다른 것들에 대해 조금씩 배우게 되기 때문입니다. 예를 들어 로봇공학자로서 유체역학에 대해 조금 더 많은 지식을 가지고 있다면, 갑자기 바다에서 작동하는 로봇을 만들게 될 때, 유체역학에서 배운 전문지식을 이 특정 응용에 가져올 수 있습니다. 그래서 항상 자신에게 유리하게 활용할 수 있다고 생각합니다. 그리고 이런 우회적인 길을 택하는 것이 자신이 무엇에 관심이 있는지 알아내는 좋은 방법이라고 생각합니다. 왜그것을. 그래서 실제로 구불구불한 길을 택하는 것이 좋은 방법이 될 수 있다고 생각해요 뭔가 맞아떨어질 때까지 많은 것들을 조금씩 해보는 것 말이에요. 그건 매우 권장됩니다. 하지만 너무 늦지 않았어요 저에게는요. 아니요, 당신에게는 너무 늦었어요. 로봇 산업에 대한 당신의 견해는 어떤가요. 로봇 분야가, 누구나 생각했던 것보다 더 어려운 것으로 증명되었다고 느껴요. 과거를 돌아보면 일론 머스크 같은 사람들이 대략 2014년부터 완전 자율주행이 내년에 나올 것이라고 말했던 것을 볼 수 있어요. 일론은 로봇 분야에서 가장 목소리 큰 낙관론자였지만, 그런 사고방식과 정신력은 이 분야의 많은 사람들이 공유했다고 생각해요. 일부는 마케팅 때문이었다고 생각해요. 모든 사람이 네, 제 로봇이 내년에 준비될 것입니다라고 말하고 싶어하지만, 많은 부분이 진심이었다고 생각해요. 모든 사람이 ChatGPT가 갑자기 시를 쓸 수 있다는 것에 놀랐거든요. 그리고 그 시점에 준비될 것이라고 생각한 사람은 없었어요. 많은 사람들이 집안일을 할 수 있는 유용한 로봇들, 세탁이나 침대 정리 같은 것들이 그림을 그리거나 미드저니가 할 수 있는 것들보다 더 쉬운 문제일 것이라고 생각했어요. 그래서 사람들은 로봇 분야가 얼마나 어려운지에 대해 놀랐다고 생각해요. 하지만 지금 이 머신러닝과 AI, 데이터 시대에 우리는 잠재적으로 로봇 분야의 재각성을 볼 수 있는 위치에 있다고 생각해요. 로봇 산업이16:30.진공청소기는 현재 사람들이 실제로 구매할 몇 안 되는 로봇의 예 중 하나입니다. 이는 로봇공학이 얼마나 어려운지를 보여주는 매우 시사적인 사례입니다. 지금 모든 사람들이 이 플라이휠에 대해 이야기하고 있습니다. AI와 ML의 시대에서 데이터를 얻는 것이 핵심입니다. ChatGPT가 작동하는 이유는 인터넷 규모의 데이터로 훈련되었기 때문입니다. 로봇공학에서도 많은 사람들이 이제 로봇을 위한 인터넷 규모나 인터넷 크기의 데이터를 찾는 것이 문제라고 생각합니다. 그것은 어떤 모습일까요? 룸바나 다른 로봇들처럼 실제 세상에서 작동하는 간단한 로봇들의 모습일 수 있습니다. 이제 이 로봇들은 단순히 세상에서 작동하는 것뿐만 아니라 미래 로봇 반복 훈련에 사용할 수 있는 데이터를 수집하고 있습니다. 사람들이 구매할 정도로 충분히 잘 작동하는 비교적 간단한 로봇들이 있는 플라이휠을 상상할 수 있습니다. 집에서 비교적 간단한 작업을 수행하지만 대량의 데이터를 수집하고, 그 데이터를 사용해서 조금 더 유용하고 조금 더 일반적인 다음 세대 로봇을 훈련시킬 수 있습니다. 그러면 더 많은 사람들이 그것들을 구매하게 됩니다. 이제 1세대 로봇이 2세대를 더 똑똑하게 만들기 때문에 로봇이 이런 방식으로 스스로 개선되는 자기 실현적 플라이휠이 생성됩니다. 테슬라는 지금 완전 자율주행으로 이런 일을 하고 있습니다. 사람들이 자신의 차를 운전하는 데이터를 거의 무료로 수집하고 있습니다그들을 위해 데이터를 수집하고 이를 차세대 로봇 개발에 활용하는 것입니다. 따라서 로봇공학은 이 AI 시대에 모든 것이 데이터에 관한 것이 될 것이라고 생각하며 사람들이 구매할 만큼 유용한 초기 단순 로봇으로 시작해서 그런 방식으로 접근할 수 있는 방법을 찾는 사람들이 엄청난 양의 데이터를 생성하고 그 데이터를 다음 세대를 더 유용하게 만드는 데 사용할 수 있을 것입니다. 이것이 미래라고 생각합니다. 이 데이터나 이러한 모델들이 자유도와 같은 로봇 구조에 특화된 것인지, 아니면 다른 로봇들에 걸쳐 일반화될 수 있는지, 예를 들어 룸바에서 휴머노이드까지? 네, 그것은 큰 질문이라고 생각합니다. 그리고 지금 나오는 논문들을 보면, LLM과 유사한 이러한 기반 모델들에 대한 이야기가 있습니다. 이것들은 일종의 기초적인 모델들입니다. 이는 이러한 것들이 많은 다른 시스템이나 환경에 걸쳐 일반화된다는 것을 의미합니다. 그래서 로봇공학에서는 교차 구현으로 훈련된 이러한 기반 로봇 모델들에 대해 이야기하고 있습니다. 교차 구현은 데이터를 수집하는 다양한 로봇 플랫폼들이 있다는 의미입니다. 그리고 몇 년 전까지는 모든 것이 특화된 로봇 훈련에 관한 것이었고, 한 종류의 로봇만을 위한 데이터 수집이었습니다. 하지만 이제는 다양한 로봇 플랫폼들에 걸쳐 데이터를 수집하고, 다양한 로봇 구현체들과 이를 사용하여 훈련하는 것이 더 인기가 되고 있습니다.19:29.100 --> 19:32이 많은 다양한 로봇들에 걸쳐 있는. 예를 들어, 객체 영속성, 이런 것들 객체들이 시야에서 벗어날 수 있지만 여전히 존재한다는 아이디어 그리고 다시 시야로 돌아온다는 것은 로봇이 학습해야 하는 것이고 특정 로봇 플랫폼에만 해당되지 않는다. 그리고 여러 다양한 로봇들로부터 데이터를 수집해서 이런 상식적 원리들을 학습할 수 있다. 새벽 1시에 이걸 하느라 참아주셔서 정말 감사하고, 구해주셔서 감사합니다. 지금 제 취침시간보다 3시간이나 늦었어요. 네, 하지만 당신을 위해서라면 뭐든지요. 감사합니다, 제시. 로봇을 위해서라면 뭐든지. 저는 그냥 당신의 직업을 대체하려고 하는 거예요. 특히 당신 것이 첫 번째로 사라질 거예요. 구체적으로 당신 것이 첫 번째로 사라질 거예요.