3. Reading list for 2019

One of my new year's resolutions is to read more. I have a habit of not finishing books, so I hope this becomes a fun game that motivates me to finish. I plan to stamp the finished date for each book here and maybe write some thoughts. :D

  1. The Book of Why: The New Science of Cause and Effect

    by Judea Pearl, Dana Mackenzie

  2. The Myth of the Nice Girl: Achieving a Career You Love Without Becoming a Person You Hate

    by Fran Hauser

  3. Code: The Hidden Language of Computer Hardware and Software (Developer Best Practices)

    by Charles Petzold

  4. Homo Deus: A Brief History of Tomorrow

    by Yuval Noah Harari

  5. Scarcity: Why Having Too Little Means So Much

    by Sendhil Mullainathan, Eldar Shafir

  6. The Three-Body Problem (Remembrance of Earth's Past Book 1)

    by Cixin Liu

  7. The Model Thinker: What You Need to Know to Make Data Work for You

    by Scott E. Page

  8. Educated: A Memoir

    by Tara Westover

  9. Factfulness: Ten Reasons We're Wrong About the World--and Why Things Are Better Than You Think

    by Hans Rosling, Anna Rosling Rönnlund

  10. Prediction Machines: The Simple Economics of Artificial Intelligence

    by Ajay Agrawal, Joshua Gans

  11. Pachinko (National Book Award Finalist)

    by Min Jin Lee

  12. The Black Swan: Second Edition: The Impact of the Highly Improbable: With a new section: "On Robustness and Fragility" (Incerto)

    by Nassim Nicholas Taleb

  13. Bad Blood: Secrets and Lies in a Silicon Valley Startup

    by John Carreyrou

  14. Thinking, Fast and Slow

    by Daniel Kahneman

  15. Becoming

    by Michelle Obama

  16. Measure What Matters: How Google, Bono, and the Gates Foundation Rock the World with OKRs

    by John Doerr, Larry Page

  17. When: The Scientific Secrets of Perfect Timing (Random House Large Print)

    by Daniel H. Pink

  18. Stuff Matters: Exploring the Marvelous Materials That Shape Our Man-Made World

    by Mark Miodownik, Sarah Scarlett

  19. Shoe Dog: A Memoir by the Creator of Nike

    by Phil Knight

  20. Reality Is Not What It Seems: The Journey to Quantum Gravity

    by Carlo Rovelli, Simon Carnell (Kindle Edition)

  21. "Surely You're Joking, Mr. Feynman!": Adventures of a Curious Character

    by Richard P. Feynman, Ralph Leighton (Kindle Edition)

  22. The Gene: An Intimate History

    by Siddhartha Mukherjee

  23. The Future of the Mind: The Scientific Quest to Understand, Enhance, and Empower the Mind

    by Michio Kaku

  24. The Soul of a New Machine

    by Tracy Kidder (Paperback)

  25. On Writing: 10th Anniversary Edition: A Memoir of the Craft

    by Stephen King (Paperback)

  26. The Sixth Extinction: An Unnatural History

    by Elizabeth Kolbert (Paperback)

2. Reflecting on My Time at a Korean Startup, and What's Next

Last June, I was fortunate to meet the three co-founders of CMES, a computer-vision company specializing in inspection and scanning. By then, they had already done industrial-automation work with well-known global companies and were expanding into other industrial robotics applications. In August 2018, I happily joined as a robot vision engineer.

1. What I did

In the last few months, I designed and developed software that, given a structured point cloud from an RGB-D image, returns the positional information of detected objects in the camera's coordinate frame. The software can also return the objects in a pre-defined pick-up order. It only needs a JSON file containing a 2-dimensional top view of objects in specific arrangements, with each object or group of objects numbered in picking order. The program compares this with the positional pattern of the detected objects and chooses an appropriate model to define the picking sequence for each identified object or group of nearby objects. My program is used by the main server, which captures an image and sends positional and grasping commands to the robot through a PLC.
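As an illustrative sketch of the pattern-matching step, here is a toy version of assigning a picking order; the JSON schema, field names, and greedy nearest-neighbor matching are my own assumptions, not the actual product code:

```python
import json
import math

# Hypothetical pattern file: a named arrangement with numbered pick slots
# (all field names are illustrative assumptions).
PATTERN_JSON = """
{
  "arrangement": "2x2_grid",
  "picks": [
    {"order": 1, "x": 0.0, "y": 0.0},
    {"order": 2, "x": 1.0, "y": 0.0},
    {"order": 3, "x": 0.0, "y": 1.0},
    {"order": 4, "x": 1.0, "y": 1.0}
  ]
}
"""

def assign_pick_order(detections, pattern):
    """Greedily match each predefined pick slot to the nearest detected object,
    returning the detections sorted into picking order."""
    remaining = list(detections)
    ordered = []
    for slot in sorted(pattern["picks"], key=lambda s: s["order"]):
        nearest = min(remaining,
                      key=lambda d: math.hypot(d[0] - slot["x"], d[1] - slot["y"]))
        remaining.remove(nearest)
        ordered.append(nearest)
    return ordered

pattern = json.loads(PATTERN_JSON)
detections = [(1.05, 0.98), (0.02, -0.01), (0.97, 0.03), (-0.04, 1.02)]
print(assign_pick_order(detections, pattern))  # nearest match to slot 1 comes first
```

A production version would also have to pick which arrangement model applies before matching, which this sketch skips.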

Before building this software, I had built a few prototypes. One was an object detector that returns the location of an object in an image using deep learning techniques; I tried a few open-source methods such as Fast R-CNN and RetinaNet. The other was an object detector built with traditional computer vision methods: I first detected edges with the Hough transform and computed their intersections, then used those intersection points to generate all possible object hypotheses and filtered out improbable ones to localize the correct objects. Defining objects by their edges was possible because each object had a specific geometric shape and the camera's view was normal to its surface. Building these two prototypes helped me gain intuition and identify the challenging parts of creating an object detector.
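The line-intersection step of the classical prototype can be sketched as follows. This is my own illustration, not the original code: lines are assumed to be in the (rho, theta) form that a Hough transform typically returns, and the edge-detection step itself is omitted:

```python
import numpy as np

def intersect(line1, line2, eps=1e-6):
    """Intersection of two lines given in Hough (rho, theta) form,
    i.e. x*cos(theta) + y*sin(theta) = rho.
    Returns None for (near-)parallel lines."""
    rho1, theta1 = line1
    rho2, theta2 = line2
    A = np.array([[np.cos(theta1), np.sin(theta1)],
                  [np.cos(theta2), np.sin(theta2)]])
    if abs(np.linalg.det(A)) < eps:  # parallel lines never intersect
        return None
    x, y = np.linalg.solve(A, np.array([rho1, rho2]))
    return float(x), float(y)

# A horizontal line y = 2 (theta = 90 degrees) and a vertical line x = 3 (theta = 0)
print(intersect((2.0, np.pi / 2), (3.0, 0.0)))  # approximately (3.0, 2.0)
```

Collecting such pairwise intersections over all detected lines gives the candidate corner points from which object hypotheses can be formed.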

Figure 1. The work cell at the customer's logistics center

2. Challenges I faced

I had two main objectives during development. First, my software should function as intended or requested by the client. Second, my code should be readable, maintainable and flexible.

Some challenges I faced were due to a lack of experience with specific libraries. It's difficult to search online for things you don't know that you don't know. I hadn't yet grasped all the available resources, such as classes and functions I could use to write more efficient code. I also wasn't aware of many algorithms that were useful for solving my problems, such as iterative closest point (ICP) and principal component analysis (PCA), until I discussed my solutions with senior engineers. In those cases I had come up with my own solutions, but they were usually less efficient, more complicated to execute, and less robust to variation in the data. Finally, I was applying deep learning, a technology that was relatively new to me. Over the past year I had diligently spent many hours studying the foundations and prototyping from tutorials to become familiar with the concepts, but it was much more challenging to apply them to a real-world scenario and develop software that also meets industrial standards.
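As one example of why PCA was so useful for this kind of work, here is a minimal sketch (my own illustration, not the production code) of estimating an object's dominant orientation from its points:

```python
import numpy as np

def principal_axis(points):
    """Estimate an object's dominant orientation via PCA: the eigenvector
    of the point covariance matrix with the largest eigenvalue."""
    pts = np.asarray(points, dtype=float)
    centered = pts - pts.mean(axis=0)           # center the cluster
    cov = centered.T @ centered / len(pts)      # 2x2 covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)      # eigh: ascending eigenvalues
    return eigvecs[:, np.argmax(eigvals)]       # unit vector along the long axis

# Points lying roughly along the 45-degree diagonal
pts = [(0, 0), (1, 1.1), (2, 1.9), (3, 3.05)]
axis = principal_axis(pts)
print(np.degrees(np.arctan2(axis[1], axis[0])) % 180)  # close to 45 degrees
```

The same idea extends to 3D point clouds, where the principal axes give a rough pose for a grasp candidate.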

Furthermore, since I was writing my software from scratch, I had great flexibility in architecting my code. That also meant I felt more responsibility to start right. I wanted to write clean code that was maintainable and flexible, because I believed a good software engineer should care about others who would later read and modify the codebase. My company also planned to extend my application for other potential customers. Because I had recently switched fields to software engineering and admittedly lacked experience, there were times I was unsure which best practices to follow, or didn't even realize I was following a bad practice until code review.

3. How I overcame my challenges

I spent many hours researching other people's approaches. I wanted to make sure I wasn't reinventing the wheel, and even if I were, I wanted to learn from others. I'm very fortunate to live in a time with plenty of resources and supportive communities online. Helpful references included open-source library documentation and community sites, GitHub repositories and their issues pages, StackOverflow, and the book "Clean Code". I also reached out for discussion when I felt stuck, or when I sensed my approach wasn't the best solution but couldn't devise a better one on my own; that is how I learned about useful algorithms like ICP and PCA. I also engaged with the deep learning community: I followed well-known scientists online, and I attended events such as NVIDIA's recent AI Conference and its Jetson meetup group in Seoul to gain insights and network with machine learning and deep learning practitioners. I even took advantage of job interviews, where I got to meet machine learning scientists and engineers and ask them questions. Interestingly, some of my questions were left unanswered by many practitioners too. Ironically, that helped me work with uncertainty, knowing it wasn't something I faced due to a lack of expertise or experience, but due to the nature of the field at this time.

I also learned a lot through code reviews with senior engineers. We didn't have scheduled code reviews, but I asked for one occasionally. They helped me identify bugs, and they helped me architect and refactor the code for more efficiency and scalability. Additionally, confirmation of the good practices and styles I followed was very encouraging. Overall, code review helped me correct and improve my code while exposing me to knowledge gathered by more experienced engineers.

Figure 2. A mock work cell at the customer's site for intermediate testing before installation at the real site.

4. What I learned

Overall, I think I made the most of my five months at the startup.

Two initial goals I achieved are:

  1. Integrating deep learning techniques into a commercial product for a real-world application (i.e., industrial automation)

  2. Designing and developing fully functioning software in C++

Additional highlights are:

  1. Experience working with clients and external collaborators

  2. Experience working on-site: installation, testing, and debugging

  3. Experience working at a startup, which meant less structure than at bigger companies or teams, a quick pace and late-night work culture, and a range of coding and non-coding responsibilities

5. What I want to improve

First, I'd like to better understand how a computer works and how the operating system works. I also want to spend more time grasping the C and C++ languages, learning to use them more efficiently, and reading more style guides.

Second, I'd like to focus on better understanding the algorithmic side of machine learning and conduct foundational research. To that end, I plan to review my foundations in mathematics and, ultimately, machine learning. Specifically, I'm interested in deep reinforcement learning (DRL) and complementary topics like neuroevolution for robotics applications. In the long term, I'd like to participate in building human-level intelligence, or artificial general intelligence, that will expand the capabilities of robots.

6. Why?

My interest in robotics began in high school with an introduction to a surgical robot, the da Vinci surgical system. Now I see value in robots' everyday applications as well as in industrial and medical ones. The potential benefits robotics can bring to our society inspire me, but those benefits usually require more mobile and smarter robots. A robot should be able to communicate with others, human or machine, sense its environment, reason for itself, and make decisions. Additionally, it should be able to manipulate itself and the objects around it safely and swiftly.

I am especially mesmerized by the mobility of robots. It's thrilling to watch them move and do something useful; robots doing something dexterous have always captivated me. It might be because the movement itself symbolizes all the possibilities intelligent robots hold for our society. That's why topics like deep reinforcement learning for autonomous manipulators interest me.

To summarize, my goal is to make an impact in robotics research and push the capabilities of robots for the benefit of our society - towards a healthier, more productive, and more equal society. I believe machine learning holds one of the keys to expanding robot applications, and I would like to invest my career in this area.

7. What's next?

I feel fortunate to have had this opportunity to contribute and grow as a software engineer. Closing this chapter is bittersweet now that I've gotten to know the team, but I am also excited to start a new one as a slightly more mature engineer armed with newly gained experience. I am joining the Advanced Robotics Lab at LG Electronics as a research engineer. January 21st, 2019 is my first day. :)

1. [Project RoboPainter] Initial Project Idea

[ This post has been moved from the Work page. Last edited: Jan 2018]

What is RoboPainter? RoboPainter is a robot arm capable of drawing and painting shapes. 

Why build RoboPainter? Building RoboPainter combines my passion for robotics, artificial intelligence, and painting. I am also inspired by the annual RoboArt Competition, whose goal is to produce something visually beautiful using robotics. Submissions have been due each April since 2016, and I am preparing for the 2018 competition.

How am I building RoboPainter? I will use the uSwift Pro robot arm with a gripper end-effector to hold a pen or brush, combined with an Intel RealSense camera connected to my Linux box for collecting visual images. I will then implement an off-the-shelf algorithm for vision-based, multi-task manipulation using end-to-end learning from demonstration [1]. Next, I will spend hours training the model on tasks relevant to drawing and coloring shapes. Finally, I will test the algorithm and evaluate its success on those tasks; this may require multiple cycles of modification and re-evaluation. When successful, I plan to build a modular program that applies transfer learning from one robot to another, so that a model learned on one robot can be used to derive a policy for another.

Why this specific method? Groups at CMU, UCF, and UC Berkeley have shown that it is possible to train end-to-end, using only raw images as input, to autonomously accomplish a variety of tasks. There are several challenges to applying end-to-end learning to robotics. First, it is data-hungry, and collecting data for robotics tasks is expensive; since this is a one-person project, it was important for me to find a data-efficient method for training painting tasks. Second, hand-crafting a robust and resilient control strategy across multiple manipulation tasks is extremely difficult. However, [1] demonstrated that, through learning from demonstration (LfD), robots can be trained with a manageable number of training sets. Finally, the code for [1] is available open source to use as a baseline for this project.


Here is the list of relevant concepts. I will briefly describe each concept over time.

Convolutional Neural Net (CNN)

Recurrent Neural Net (RNN)

Long Short-term Memory (LSTM)

Variational Autoencoder (VAE)

Generative Adversarial Network (GAN)

Autoencoding with Learned Similarity (VAE/GAN)

Neural Autoregressive Distribution Estimator (NADE)

Imitation Learning; Learning from Demonstration (LfD)

Observations of an expert produce a set of demonstration trajectories (sequences of states and actions), which are used either to shape a reward or to learn the desired policy directly, so as to reproduce the demonstrated behavior.

Behavioral Cloning

Learning a policy by supervised learning, using demonstration trajectories as ground truth to map states directly to actions.

Inverse Reinforcement Learning

Learning latent rewards or goals from demonstrations, then training a controller under those rewards to obtain the policy.

Visuomotor Learning

 
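The behavioral-cloning idea above can be sketched as plain supervised regression from states to actions. This toy example (a linear policy fit by least squares, my own simplification rather than the method of [1]) recovers a hidden expert policy from demonstration pairs:

```python
import numpy as np

# Toy behavioral cloning: fit a linear policy a = W s by least squares
# on demonstration (state, action) pairs. The "expert" here is a known
# linear policy, an assumption made purely for illustration.
rng = np.random.default_rng(0)
true_W = np.array([[2.0, -1.0]])          # hidden expert policy (assumed)
states = rng.normal(size=(100, 2))        # demonstrated states
actions = states @ true_W.T               # expert's actions for those states

# Supervised "cloning": least-squares fit of W from the demonstrations
W_hat, *_ = np.linalg.lstsq(states, actions, rcond=None)
print(np.round(W_hat.T, 3))  # recovers roughly [[ 2. -1.]]
```

Real behavioral cloning replaces the linear map with a neural network and raw images as states, but the supervised structure is the same.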

References

[1] Rahmatizadeh et al. Vision-based multi-task manipulation for inexpensive robots using end-to-end learning from demonstration. CoRL 2017. [https://goo.gl/XT6jAU]

[2] Larsen et al. Autoencoding beyond pixels using a learned similarity metric. ICML 2016.

[3] Pinto and Gupta. Learning to push by grasping: Using multiple tasks for effective learning. 2016.

[4] Larochelle and Murray. The neural autoregressive distribution estimator. AISTATS 2011.

[5] Pastor et al. Learning and generalization of motor skills by learning from demonstration. IEEE ICRA 2009.

[6] Kingma and Welling. Auto-encoding variational Bayes. ICLR 2014.

[7] Goodfellow et al. Generative adversarial nets. NIPS 2014.