Nowadays, when we talk about robot arms, the application that comes to mind for most people is the industrial robot in manufacturing. Industrial robots can move precisely and carry out their tasks efficiently, but they can only perform fixed, pre-programmed jobs and cannot adapt automatically to a new environment. They have to be monitored at all times to ensure they do not develop mechanical faults that would cause them to stall, which could lead to losses for the company. With advances in technology, however, monitoring robots has become much easier and can even be done remotely: the robots can send feedback on their processes to the operator as text, and in case of a technical hitch the problem can often be corrected remotely without stopping the robot. Still, although the technologies and research behind industrial robot arms are already mature and advanced, these arms mostly perform cyclic, tedious jobs in noisy factories and only respond to commands from specific programs.
With six degrees of freedom, the capabilities of a robot arm should not be limited to simple, repetitive work in factories. Robot arms able to accomplish multiple tasks should also be applied in people's daily lives, assisting with simple household chores such as collecting waste paper and passing a cup of coffee. Achieving that goal requires further study of robotic cognition and human-robot interaction. With these motivations, we plan to design and build a smart home service robot (SHSbot).
More precise and intelligent robotic arms have been, or soon will be, put to use in many fields: assisting patients and people with disabilities in daily life in hospitals, helping doctors perform operations with highly sophisticated medical arms, and sorting garbage more efficiently at waste-processing sites. Even on the space station, robotic arms may help clean up the astronauts' household waste. Intelligent service robots already work in many restaurants, and we believe more intelligent household robots will soon assist people in their daily lives.
To provide people with direct help in daily life, our primary goal is to create a small, affordable robot arm. However, the current civilian robot arm market is dominated by small, cheap, single-function arms; many of them are essentially toys with an advanced mechanical structure. Although their cost is relatively low, these arms lack artificial intelligence and are of limited help in people's daily lives. We therefore decided to design the arm ourselves, so that its control system and mechanical structure satisfy our requirement of serving people while keeping the cost low.
Based on the reference here, our initial idea for the robot's construction and function can be divided into three parts.
The idea of our initial robot is as follows.
Because of limited time and energy, we decided to study only the first part this semester. After several rounds of review, we conceived the blueprint of a Smart Home Service Robot, the SHSbot. It is an intelligent and practical robot arm designed to help people collect garbage on a tabletop, such as waste paper and soda cans, and to keep the tabletop organized.
The SHSbot is designed to sit on a corner of a desk and help people perform trivial tasks within the reach of the robot arm. It obtains environmental information through a camera and then moves the arm to complete the corresponding task according to a human's command. These tasks can include cleaning the desktop, clearing away trash, and passing things around. More specialized applications can be handled by swapping the end effector; for example, welding work in a mechanics laboratory could be completed by replacing the manipulator with welding equipment, like Tony's robotic arm shown on the right.
After comparing the advantages and disadvantages of different development environments, we chose to set up and control the SHSbot using ROS, Arduino, and MoveIt, to do the path planning of the SHSbot in RViz using MoveIt, and to run the simulation in Gazebo together with MoveIt and RViz.
ROS (Robot Operating System) is a collection of software frameworks for robot software development. We completed the Jetson installation and the ROS installation, set up a ROS catkin workspace, created the environment, built SSN, and downloaded the necessary packages.
MoveIt, which provides a platform for developing advanced robotics applications, is software for mobile manipulation, incorporating the latest developments in motion planning, manipulation, kinematics and dynamics, control, and navigation. Our plan was first to combine Gazebo and ros_control into a feasible development platform for the SHSbot, and then to finish the path planning with MoveIt.
Gazebo offers the ability to accurately and efficiently simulate populations of robots in complex and varied environments. After finishing the MoveIt setup, we were able to complete the Gazebo simulation integration: the MoveIt Setup Assistant helps configure the SHSbot to work with Gazebo, but additional steps are still required to successfully run MoveIt in Gazebo.
Object recognition (machine learning): the SHSbot applies computer vision and machine learning (a CNN) to recognize specific objects on the tabletop. The sensing algorithm sends the goal position of each object to the reasoning algorithm, which then plans the path of the robot arm and performs the grasping and movement by controlling the motors.
Robot Structure
Robot System
Robot Learning and Perception
To further simplify the task and make our goals clear and achievable, we define a specific scenario and task here.
In this project, we want to make a robot arm that can clean up our desks automatically. The arm performs pick-and-place motions, and we incorporate computer vision and deep learning to recognize items and detect their positions.
In the experiment, to demonstrate the service we want to achieve, we separate a desk into two areas. As shown in the picture below, some items we use daily are placed in order in area B. When we use them, we usually just put them back on the desk randomly, which is the situation in area A. So, in the experiment, we record the initial position of the items in area B and then place them randomly in area A. Finally, we want the robot arm to recognize the items, locate their positions, pick them up, and place them back in order in area B.
The next step is to organize and divide our project into small, reasonable tasks. Thanks to ROS's topic publication and subscription mechanism, we can develop each part separately and then connect them through communication programs; a minimal sketch of this mechanism follows below. Therefore, following some references we found, we divided the project into three parts, and each group member focuses on just one of them, represented by a different color in the figure. We mainly have four subjects: vision, motion planning, simulation, and control. The vision part can be divided into depth camera usage and calibration, and object detection. MoveIt is used for motion planning. To complete the robot simulation in ROS, we also need to build the robot configuration files and set some physical parameters. Finally, to control a real arm, we need to finish the serial communication and controller setup. In addition, we use 3D printing, laser cutting, and other methods to complete the hardware construction.
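To illustrate how the modules can be developed separately and connected later, here is a minimal sketch of two ROS nodes communicating over a topic. The topic name /shsbot/target_pose and the dummy position are illustration assumptions, not the names used in the actual project.

#!/usr/bin/env python
# Minimal sketch of ROS topic communication between two modules.
# The topic name "/shsbot/target_pose" and the dummy values are assumptions.
import rospy
from geometry_msgs.msg import PointStamped

def vision_node():
    """Stand-in vision module: publishes a detected object position."""
    pub = rospy.Publisher('/shsbot/target_pose', PointStamped, queue_size=10)
    rospy.init_node('vision_node')
    rate = rospy.Rate(1)  # 1 Hz
    while not rospy.is_shutdown():
        msg = PointStamped()
        msg.header.stamp = rospy.Time.now()
        msg.header.frame_id = 'camera_link'
        msg.point.x, msg.point.y, msg.point.z = 0.3, 0.1, 0.05  # dummy position
        pub.publish(msg)
        rate.sleep()

def planning_callback(msg):
    """Stand-in planning module: receives the position and would plan a motion."""
    rospy.loginfo('Received target at (%.2f, %.2f, %.2f)',
                  msg.point.x, msg.point.y, msg.point.z)

def planning_node():
    rospy.init_node('planning_node')
    rospy.Subscriber('/shsbot/target_pose', PointStamped, planning_callback)
    rospy.spin()

if __name__ == '__main__':
    vision_node()  # run planning_node() in a second terminal/process

Because each module only depends on the topic, the vision, planning, and control parts can be written and tested independently and then combined.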
More detailed implementation and development are shown in the Skill Level and Implementation Detail parts.
In conclusion, this is a very complex and ambitious project for one semester. So far, we have finished most of the planned work, but we have not yet made the whole system work end to end. Some problems are hard to solve not only because of theory but also because of multiple hardware bugs. As shown in the picture below, we have not finished the joint calibration of the camera and robot arm, the communication between object detection and MoveIt, or the communication between MoveIt and the Arduino. These parts will be the future work of this project.
What I have done is based on Ubuntu 18.04 and ROS Melodic, on the NVIDIA Jetson Nano platform.
The environment setup is as follows:
This is the foundation of using ROS; we found some tutorials for beginners to get started quickly.
For beginners in robotics, it is not recommended to create the entire model of the robot arm by themselves, even if they have strong 3D modeling skills. Any mistake in the kinematic parameters of the model may hurt the subsequent development of the project and lead to failure when generating the URDF, so we downloaded the assembly file from this website. Model files shared on the website have generally been revised and reused hundreds of times by the creator and other users, which gives our project a good platform for the next stage of development.
If we want to build and use mature deep learning networks, we don’t need a strong mathematical foundation. Undergraduate calculus and linear algebra are sufficient.
The Unified Robot Description Format (URDF) is an XML file format used in ROS to describe all elements of a robot. To use a URDF file in Gazebo, some additional simulation-specific tags must be added so that it works properly with Gazebo. Here is the URDF generation tutorial, and here is another approach that is more efficient and more feasible for mechanical engineering students. We already had the complete 3D model of the SHSbot in SolidWorks; after making some modifications and defining the joints, we exported the assembly to URDF with the SolidWorks to URDF Exporter (this website shows how to use SolidWorks to create the URDF). Here are the two main parts of the model.
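Because mistakes in the exported file are easy to miss, a quick sanity check of the URDF can save time later. The sketch below only lists the links and joints of an exported file; the file name shsbot.urdf is a placeholder for whatever the SolidWorks to URDF Exporter produces.

# Minimal sketch: sanity-check an exported URDF by listing its links and joints.
# "shsbot.urdf" is a placeholder file name.
import xml.etree.ElementTree as ET

def summarize_urdf(path):
    root = ET.parse(path).getroot()           # <robot> element
    links = [l.get('name') for l in root.findall('link')]
    joints = [(j.get('name'), j.get('type'),
               j.find('parent').get('link'),
               j.find('child').get('link'))
              for j in root.findall('joint')]
    print('robot:', root.get('name'))
    print('links:', links)
    for name, jtype, parent, child in joints:
        print('joint %s (%s): %s -> %s' % (name, jtype, parent, child))

if __name__ == '__main__':
    summarize_urdf('shsbot.urdf')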
Computer vision refers to the use of computers to realize the visual function of human beings, namely the perception, recognition and understanding of three-dimensional scenes in the objective world. Therefore, the realization of computer vision not only needs to enable the machine to perceive the geometric information (shape, position, posture, movement, etc.) of the objects in the three-dimensional environment, but also enable the computer to describe, store, recognize and understand them. It can be considered that computer vision is to build models with the help of geometry, physics and learning techniques, and to process data with statistical methods. Compared with image processing, the focus of computer vision is to use the computer to recognize the image, rather than to process the image.
OpenCV is a cross-platform, open-source computer vision library that implements many common algorithms in image processing and computer vision. It is widely used in object recognition, motion tracking, robotics, and human-computer interaction, and has become one of the most powerful research tools in the field of computer vision.
More details can be found here.
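As a small taste of the library, the sketch below loads an image, converts it to grayscale, and runs Canny edge detection; the file name desk.jpg is just a placeholder.

# Minimal OpenCV sketch: grayscale conversion and Canny edge detection.
# "desk.jpg" is a placeholder file name.
import cv2

img = cv2.imread('desk.jpg')                  # BGR image
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)  # single-channel grayscale
edges = cv2.Canny(gray, 50, 150)              # low/high hysteresis thresholds

cv2.imshow('edges', edges)
cv2.waitKey(0)
cv2.destroyAllWindows()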
(1) To use the depth camera and implement computer vision, we need to update and install some dependencies:
Some resources for this step:
(2) The next step is to install the depth camera package. We use the RealSense D435i in this project, so we need to install the RealSense SDK. The installation guide is as follows:
https://github.com/IntelRealSense/realsense-ros
Note: many ROS packages depend on the RealSense SDK, so we need to use Method 2 for the installation.
Using the command:
roslaunch realsense2_camera rs_camera.launch filters:=pointcloud
After subscribing to the three camera topics in RViz, we can see three outputs (RGB image, depth image, and point cloud). In this program, we subscribe to the RGB image for object detection and to the depth image for publishing the object's position information, because in the depth image each pixel carries the depth of that point.
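A minimal sketch of subscribing to the RGB and depth images and reading the depth at one pixel is shown below. The topic names are the usual realsense2_camera defaults and are an assumption; they may need to be adjusted to match the launch file actually used.

#!/usr/bin/env python
# Minimal sketch: subscribe to RGB and depth images and read one depth value.
# Topic names are assumed realsense2_camera defaults; adjust if needed.
import rospy
from sensor_msgs.msg import Image
from cv_bridge import CvBridge

bridge = CvBridge()

def rgb_callback(msg):
    rgb = bridge.imgmsg_to_cv2(msg, desired_encoding='bgr8')
    # rgb is an OpenCV image; pass it to the object detector here.

def depth_callback(msg):
    depth = bridge.imgmsg_to_cv2(msg, desired_encoding='passthrough')
    h, w = depth.shape[:2]
    center = depth[h // 2, w // 2]  # depth value of the center pixel
    rospy.loginfo_throttle(1.0, 'depth at image center: %s', str(center))

if __name__ == '__main__':
    rospy.init_node('depth_reader')
    rospy.Subscriber('/camera/color/image_raw', Image, rgb_callback)
    rospy.Subscriber('/camera/depth/image_rect_raw', Image, depth_callback)
    rospy.spin()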
(1) Introduction
Local image feature description is a basic research problem in computer vision, which plays an important role in finding corresponding points in images and object feature description. Researchers built multiple computer vision techniques to deal with object recognition: SIFT, FAST, SURF, BRIEF, etc. However, traditional computer vision algorithms rely on two steps: feature extraction and classifier judgment. When there are too many categories in the image, the input of the algorithm will become very complex and difficult to classify. In recent years, the deep learning algorithm is very popular and has achieved great success in the field of computer vision. As an end-to-end learning algorithm, deep learning algorithm can automatically learn features from training samples for each specific category of images, greatly improving the efficiency and accuracy of image analysis.
(2) ORK and LineMod Introduction
The LineMod method was proposed by Hinterstoisser in 2011 and mainly addresses real-time detection and localization of 3D objects against a complex background. Using RGB-D information, LineMod can handle texture-less objects without lengthy training. The LineMod feature combines the gradient information of the color image with the surface-normal feature of the object as the basis for template matching.
(3) ORK (Object Recognition Kitchen) is a ROS-integrated object recognition library. We can use it to implement LineMod in ROS.
Website: https://wg-perception.github.io/object_recognition_core/
(4) Implementation
Following the tutorial here: https://programming.vip/docs/ros-kinetics-realsens-d435i-ork-linemod-object-recognition.html
(5) Result
In the experiment, I tried to use the image from the depth camera to detect a coke can and publish its position information. From the result, we can conclude that, compared with the deep learning method, this method does not need many hardware resources, but its detection accuracy is not very high. Specifically, the white region in the result image marks the position detected by the program, but it does not overlap well with the real object, so there is a fairly large error between the real object and the detection result. A minimal sketch of reading the recognized-object topic follows the figures below.
Coke can mesh model
Experiment environment
Detection result of LineMod Method
Topic graph
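When the ORK LineMod pipeline detects an object, it publishes the result as a RecognizedObjectArray message; /recognized_object_array is ORK's default output topic and is assumed here. A minimal sketch of reading the detected pose looks like this:

#!/usr/bin/env python
# Minimal sketch: read poses of objects recognized by the ORK LineMod pipeline.
# "/recognized_object_array" is ORK's default topic; adjust if remapped.
import rospy
from object_recognition_msgs.msg import RecognizedObjectArray

def objects_callback(msg):
    for obj in msg.objects:
        p = obj.pose.pose.pose.position  # PoseWithCovarianceStamped -> Pose
        rospy.loginfo('object %s at (%.3f, %.3f, %.3f), confidence %.2f',
                      obj.type.key, p.x, p.y, p.z, obj.confidence)

if __name__ == '__main__':
    rospy.init_node('ork_listener')
    rospy.Subscriber('/recognized_object_array', RecognizedObjectArray,
                     objects_callback)
    rospy.spin()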
Gazebo provides the ability to accurately and efficiently simulate populations of robots in complex and varied environments. Normally, we install Gazebo using the Ubuntu packages. First, we need to grasp the basic concepts of ROS, which we had already learned through the ROS tutorials. Next, we follow the Gazebo installation tutorial to download the packages and begin the installation, and then install gazebo_ros_pkgs from source (on Ubuntu). We needed to set up a catkin workspace, a folder that lets users modify, build, and install catkin packages; this step is relatively complex, and we followed this particular tutorial to complete the package installation. To test whether Gazebo runs, we use a simple rosrun command after launching roscore if needed. These will be the most frequently used commands:
source ~/catkin_ws/devel/setup.bash
roscore &
rosrun gazebo_ros gazebo
After the Gazebo installation and tests, we could run the path planning simulation in Gazebo within the ROS environment. However, because of the limited computing capability of the Jetson and the many joints, links, and inertias of the SHSbot that must be defined and controlled, the path planning simulation of the SHSbot is not very smooth. We therefore used a simplified version of the SHSbot model; here is the result of the path planning simulation in Gazebo and in RViz with MoveIt.
After finishing the MoveIt setup, we were able to complete the Gazebo simulation integration. The MoveIt Setup Assistant helps configure the SHSbot to work with Gazebo, but additional steps are still required to successfully run MoveIt in Gazebo; here is a reference that includes the specific steps and an explanation of the whole process. These steps are fairly general and apply to many kinds of robot arms for establishing communication between Gazebo and MoveIt. However, the 3D model of the SHSbot is based on armBOT, which has been introduced on our project website, and the armBOT project already contains some incomplete but usable launch and controller files; this incompleteness meant that the communication between the various platforms could not be implemented successfully at first.
By comparing against the reference files, we generated and named the missing controllers and modified several YAML files, creating files such as the moveit_controller_manager.launch file and controllers.gazebo.yaml, along with other adjustments. Here is a screenshot of the result of the path planning simulation of the SHSbot.
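Once the MoveIt configuration loads, a pose goal can also be sent from a script instead of RViz. The sketch below uses the standard moveit_commander Python interface; the planning group name "arm" and the target coordinates are assumptions and must match the group defined in the MoveIt Setup Assistant for the SHSbot.

#!/usr/bin/env python
# Minimal sketch: send a pose goal through moveit_commander.
# The group name "arm" and the target pose are assumptions.
import sys
import rospy
import moveit_commander
from geometry_msgs.msg import Pose

if __name__ == '__main__':
    moveit_commander.roscpp_initialize(sys.argv)
    rospy.init_node('shsbot_pose_goal')

    group = moveit_commander.MoveGroupCommander('arm')

    target = Pose()
    target.position.x = 0.25    # example target in the planning frame (meters)
    target.position.y = 0.0
    target.position.z = 0.15
    target.orientation.w = 1.0  # identity orientation

    group.set_pose_target(target)
    success = group.go(wait=True)   # plan and execute
    group.stop()
    group.clear_pose_targets()
    rospy.loginfo('Motion succeeded: %s', success)
    moveit_commander.roscpp_shutdown()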
A simple package can be built by following this tutorial.
The basic method can be found here.
(1) Basic concepts of deep learning and artificial neural networks
With the continuous progress of deep learning theory and artificial intelligence technology, their development prospects and applications have attracted more and more attention in academia and industry. As one important branch, artificial neural network theory has been widely applied in object recognition, speech recognition, target detection, and autonomous driving.
A single-layer neural network, also known as the perceptron, was invented by the scientist Frank Rosenblatt. A perceptron accepts several binary inputs and produces a binary output. It is modeled on the human brain, where neurons are stimulated by connected nodes and are only activated when a certain threshold is reached.
A standard multilayer perceptron (traditional neural network).
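A perceptron is small enough to write out directly. The sketch below computes a weighted sum of two binary inputs and thresholds it; the weights, bias, and inputs are made-up illustration values.

# Minimal perceptron sketch: weighted sum of binary inputs plus a threshold.
# Weights, bias, and inputs are arbitrary illustration values.
import numpy as np

def perceptron(x, w, b):
    """Return 1 if the weighted sum plus bias crosses zero, else 0."""
    return 1 if np.dot(w, x) + b > 0 else 0

w = np.array([0.6, 0.4])   # weights on the two inputs
b = -0.5                   # bias (negative threshold)

for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, '->', perceptron(np.array(x), w, b))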
(2) Deep learning training process
The core idea of deep learning is to take advantage of the characteristics of its multiple layers and take the output of the previous layer as the input of the next layer, so that each layer can learn a specific feature and iterate through multiple layers until the actual expression required is obtained.
The training process is as follows:
1. Parameter initialization
The parameters, weights and biases, are initialized at the beginning of training. If the weights of all hidden neurons are set to the same value, every hidden neuron has the same influence on the output and receives the same gradient when backpropagation with gradient descent is used. In that case, no matter how many hidden units are set, the final effect is the same and the multi-layer neural network loses its significance. Therefore, the weights should be randomly initialized; the biases, which have no such symmetry problem, can be set to 0.
2. Forward propagation
Forward propagation takes unlabeled data (or the data part of labeled data) and learns from the lowest input layer up to the highest output layer, obtaining the corresponding parameters for each layer. Since labels are not needed, this part can be regarded as an unsupervised training process in which the neural network learns the structure of the data set.
The output layer usually uses the sigmoid activation for binary classification and the softmax activation for multi-class classification. The hidden layers commonly use the ReLU activation, because when the input is positive the gradient of the function is always 1, which greatly improves the speed of gradient-based training of the network. The three functions are written out below.
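For reference, the standard definitions of these activation functions are:

\[ \sigma(z) = \frac{1}{1+e^{-z}}, \qquad \mathrm{softmax}(z)_i = \frac{e^{z_i}}{\sum_j e^{z_j}}, \qquad \mathrm{ReLU}(z) = \max(0, z), \]

and indeed \( \mathrm{ReLU}'(z) = 1 \) for every \( z > 0 \).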
3. Cost calculation
In deep learning, a value is needed to evaluate the quality of the training result; it is obtained from the cost function, written out below.
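For a logistic regression model with sigmoid output \( \hat{y}^{(i)} \) over \( m \) training samples, the standard cross-entropy cost is

\[ J(w, b) = -\frac{1}{m}\sum_{i=1}^{m}\left[ y^{(i)}\log\hat{y}^{(i)} + \left(1-y^{(i)}\right)\log\left(1-\hat{y}^{(i)}\right) \right]. \]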
The purpose of training the logistic regression model is to make accurate predictions, so the total cost given by the cost function should be driven as low as possible, ideally close to 0.
4. Backward propagation
Backward propagation uses the labels of the data for training: the error is propagated from the top layer down, so the network can be adjusted. This step is an effective way to drive the overall network toward the optimal solution and is a supervised training process. Specifically, the weight and bias parameters of each layer are updated with the gradient descent method.
5. Repeat steps 2 to 4 until the cost converges. A minimal sketch of the whole loop follows below.
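As a concrete illustration of steps 1 to 4, here is a minimal NumPy sketch of the training loop for a tiny two-layer network; the layer sizes, learning rate, and toy data are arbitrary illustration values.

# Minimal training-loop sketch: random initialization, forward propagation,
# cross-entropy cost, backpropagation, gradient descent update.
import numpy as np

np.random.seed(0)
X = np.random.rand(2, 200)                              # 2 features, 200 samples
y = (X[0] + X[1] > 1.0).astype(float).reshape(1, -1)    # toy labels

# Step 1: random weights, zero biases
W1, b1 = np.random.randn(8, 2) * 0.1, np.zeros((8, 1))
W2, b2 = np.random.randn(1, 8) * 0.1, np.zeros((1, 1))

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

m, lr = X.shape[1], 0.5

for step in range(2000):
    # Step 2: forward propagation (ReLU hidden layer, sigmoid output)
    Z1 = np.dot(W1, X) + b1
    A1 = np.maximum(0, Z1)
    Z2 = np.dot(W2, A1) + b2
    A2 = sigmoid(Z2)

    # Step 3: cross-entropy cost
    cost = -np.mean(y * np.log(A2) + (1 - y) * np.log(1 - A2))

    # Step 4: backpropagation and gradient descent update
    dZ2 = A2 - y
    dW2 = np.dot(dZ2, A1.T) / m
    db2 = dZ2.mean(axis=1, keepdims=True)
    dZ1 = np.dot(W2.T, dZ2) * (Z1 > 0)
    dW1 = np.dot(dZ1, X.T) / m
    db1 = dZ1.mean(axis=1, keepdims=True)
    W1, b1 = W1 - lr * dW1, b1 - lr * db1
    W2, b2 = W2 - lr * dW2, b2 - lr * db2

print('final cost: %.4f' % cost)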
(3) Introduction to YoloV3+DarkNet Method
YOLO: As an improved CNN-based detector, the YOLO algorithm no longer uses traditional sliding windows to identify features; instead, it divides the original image into non-overlapping grid cells and produces feature maps of matching size through convolution. Each element of the feature map corresponds to one grid cell of the original image and is used to predict the targets whose centers fall in that cell. As the picture shows, the FPS of the YOLO network is very high, which means its response time is much lower than that of other traditional convolutional neural networks, so it is suitable for real-time detection.
Structure and Performance of Yolov3
More details are given in this paper.
https://arxiv.org/abs/1506.02640
Darknet: this package lets us use YOLO in ROS as a real-time object detection system. More details are given on its GitHub page.
(4) Implementation
1. Code download
Github: https://github.com/leggedrobotics/darknet_ros
Command: clone the repository (with its submodules) into the src folder of the catkin workspace.
2. Compile
In the ROS workspace directory, compile the workspace with catkin_make.
At this point the entire project is compiled, and the build checks whether yolov2-tiny.weights and yolov3.weights are available in {catkin_ws}/darknet_ros/darknet_ros/yolo_network_config/weights. The downloaded code does not include these two model files by default, to save space, so the model files automatically start downloading after compilation.
3. Running
(1) Publishing the image topic
Because darknet_ros directly subscribes to a specified image topic, inspects the image, draws the detection boxes, and publishes the corresponding detection topics, we first need a ROS package that can publish an image topic. Here we recommend the official ROS usb_cam driver package, which can publish the images captured by the computer's built-in camera or a connected USB camera as a ROS image topic.
Download the camera driver (the usb_cam package can be installed through apt):
Then publish the camera image topic by running the usb_cam launch file:
You should be able to see the actual image display if there are no errors.
Note: most cameras output MJPEG, so we need to change the pixel format in the launch file accordingly.
(2) Run darknet_ros
darknet_ros is then executed for detection. Before running the detection, the configuration file needs to be changed so that the topic subscribed to by darknet_ros corresponds to the image topic published by usb_cam.
Open the darknet_ros/config/ros.yaml file and change the camera topic under the subscribers section to the topic published by usb_cam.
The result is shown below. We can see that the program detects the object successfully, and the green square, which marks the position of the coke can, is very accurate. Position information of this accuracy will support the robot arm's moving and grasping. However, we can also see that the FPS is very low, which means the program needs a long time to process each image and cannot output its result in time. From the earlier conclusion, we believe this is not a problem of the algorithm itself; the reason is that the hardware is not powerful enough. Because this method has to load and run the full network every time the program starts, it needs substantial hardware resources. Since we only use a Jetson Nano here, this method is not very practical; to use it, one could try a Jetson TX instead. A minimal sketch of reading the detection topic follows the figures below.
Object Detection Result
Topic Graph of this Method
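darknet_ros publishes its detections as darknet_ros_msgs/BoundingBoxes messages, by default on /darknet_ros/bounding_boxes. A minimal sketch of reading them, and of where the pixel center could be combined with the depth image, looks like this (the topic name is the package default and may be remapped in ros.yaml):

#!/usr/bin/env python
# Minimal sketch: read detections published by darknet_ros.
# "/darknet_ros/bounding_boxes" is the package default; adjust if remapped.
import rospy
from darknet_ros_msgs.msg import BoundingBoxes

def boxes_callback(msg):
    for box in msg.bounding_boxes:
        cx = (box.xmin + box.xmax) / 2.0   # pixel center of the detection
        cy = (box.ymin + box.ymax) / 2.0
        rospy.loginfo('%s (%.2f) at pixel (%.0f, %.0f)',
                      box.Class, box.probability, cx, cy)
        # Combined with the depth image, (cx, cy) can be converted to a 3D
        # position for the grasp planner.

if __name__ == '__main__':
    rospy.init_node('detection_listener')
    rospy.Subscriber('/darknet_ros/bounding_boxes', BoundingBoxes, boxes_callback)
    rospy.spin()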
Jiaxun Liu
jiaxun.liu@duke.edu
Yu Zhou
yu.zhou@duke.edu
Shucheng Zhang
shucheng.zhang@duke.edu