Improvements on Object Detection (Faster R-CNN)

Photo by rawpixel on Unsplash

Abstract

In this project, the main goal is to investigate the object detection model architecture to improve the mean Average Precision (mAP). In the past, researchers are dedicated to proposing new ideas in each part of the network architecture to enhance model performance. However, each method is a stand-alone little piece. It remains unclear if we can take advantage of these little pieces to push the model performance. Therefore, we aim to combine these refinements made in each architecture to investigate whether the model performance can be even better. Our modifications are based on Faster R-CNN with Feature Pyramid Network (FPN) to improve the model performance, including backbone, Region Proposal Network, feature maps refinement and RoI head.

Our code repository is located here.

Performance

Evaluate the performance on the PASCAL 2007 dataset


JING-AN TZENG
JING-AN TZENG
Video Processing Algorithm Engineer

My interests include Robotics, Computer Vision and Control System.

Related