Improvements on Object Detection (Faster R-CNN)

Abstract
In this project, the main goal is to investigate the object detection model architecture to improve the mean Average Precision (mAP). In the past, researchers are dedicated to proposing new ideas in each part of the network architecture to enhance model performance. However, each method is a stand-alone little piece. It remains unclear if we can take advantage of these little pieces to push the model performance. Therefore, we aim to combine these refinements made in each architecture to investigate whether the model performance can be even better. Our modifications are based on Faster R-CNN with Feature Pyramid Network (FPN) to improve the model performance, including backbone, Region Proposal Network, feature maps refinement and RoI head.
Our code repository is located here.