paper-reading-group

Progressive Domain Adaptation for Object Detection

Author: Chun-Han Yao, Han-Kai Hsu, Hung-Yu Tseng, Maneesh Singh, Ming-Hsuan Yang, Wei-Chih Hung, Yi-Hsuan Tsai Link: https://arxiv.org/pdf/1910.11319v1.pdf Publishing/Release Date: Oct 24, 2019 Status: Finished

Motivation and basic approach

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Untitled.png

Note: Conventional domain adaptation problem : From source to target.

New approach : intermediate synthetic domain that allows us to gradually solve separate sub-tasks with smaller gaps (shown as L: S→F and L : F→T). In addition, we treat each image in the synthetic domain unequally based on its quality with respect to the target domain, where the size of the yellow triangles stand for their weights (i.e., the closer to target, the higher of the weight).

1) we introduce an intermediate domain in the proposed adaptation framework to achieve progressive feature alignment for object detection

2) we develop a weighted task loss during domain alignment based on the importance of the samples in the intermediate domain.

3) we conduct extensive adaptation experiments under various object detection scenarios and achieve state-of-the-art performance.

Previous Work on object detection

https://arxiv.org/pdf/1409.7495.pdf

DANN

GRL ( Gradient reversal layer )

For training the feature extractor in order to maximize the classification loss of domain predictor, Gradient Reversal layer was place between Feature extractor and domain classifier.

The Gradient Reversal Layer basically acts as an identity function (outputs is same as input) during forward propagation but during back propagation it multiplies its input by -1. — Leads to Gradient ascent.

CycleGAN - https://arxiv.org/abs/1703.10593

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Untitled%201.png

Some results by using CycleGAN:

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Untitled%202.png

Progressive Domain Adaptation

Adaptation in the Feature Space

To align distributions in the feature space, we propose a deep model which consists of two components; a detection network and a discriminator network for feature alignment via adversarial learning.

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Screenshot_(41).png

Intermediate Domain

Adaptation Process

Weighted Supervision

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Screenshot_(47).png

I - synthetic target image, p(T) - probability of being in target

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Screenshot_(48).png

The final weighted loss function -

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Screenshot_(51).png

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Screenshot_(52).png

Implementation details

Datasets

Results

We show a baseline Faster R-CNN result trained on the source data without applying domain adaptation, and a supervised model trained fully on the target domain data (oracle) to illustrate the existing gap between domains.

Cross camera adaptation

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Untitled%204.png

Weather Adaptation

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Untitled%205.png

Adaptation from small to larger dataset

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Untitled%206.png

The result -

Progressive%20Domain%20Adaptation%20for%20Object%20Detection%20478798eacb8b477e92629c95ae306a49/Untitled%207.png

Discussion

To improve precision, could we use a image translation network to generate multiple domains and use a multi-domain discriminator to adapt all domains simultaneously [ Kim et al. ] , and add to this a weighted loss function that considers the distance between the synthetic and target domain?