KpSFR

Abstract

We propose a novel deep learning framework for sports field registration. The typical algorithmic flow for sports field registration involves extracting field-specific features (e.g., corners and lines) from the field image and estimating the homography matrix between a 2D field template and the field image using the extracted features. Unlike previous methods, which strive to extract sparse field features from field images with uniform appearance, we tackle the problem differently. First, we use a grid of uniformly distributed keypoints as our field-specific features, which increases the likelihood of having sufficient field features under various camera poses. Then we formulate keypoint detection as an instance segmentation problem with dynamic filter learning. In our model, the convolution filters are generated dynamically, conditioned on the field image and the associated keypoint identity, which improves the robustness of the predictions. To evaluate our method extensively, we introduce a new soccer dataset, called TS-WorldCup, with detailed field markings on 3812 time-sequence images from 43 videos of the Soccer World Cup 2014 and 2018. The experimental results demonstrate that our method outperforms state-of-the-art methods on the TS-WorldCup dataset in both quantitative and qualitative evaluation.

Overview Video

Methodology

Architecture overview. Our proposed model consists of two parts: a standard encoder-decoder architecture and the keypoints-aware label condition module. Given the field image Iⁱⁿ as input, a symmetric encoder-decoder extracts feature maps from the encoder E and the decoder D. The keypoint-specific controller G then generates the parameters of the dynamic head S from the output feature of the encoder E (green vector) and the keypoint encoding vector K_i (orange vector). The dynamic head S outputs the i-th heatmap H^pred_i. Finally, we employ soft aggregation to merge all the predicted heatmaps {H^pred_i}, i = 1, ..., N, into the final output {M^pred_i}, i = 1, ..., N, and estimate the predicted homography R^pred using DLT and RANSAC.
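The last step, homography estimation with DLT and RANSAC, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`dlt_homography`, `project`, `ransac_homography`), the 3-pixel inlier threshold, the iteration count, and the template-to-image mapping direction are all assumptions chosen for illustration, and the sketch omits the coordinate normalization that a production DLT would normally include.

```python
import numpy as np


def dlt_homography(src, dst):
    """Direct Linear Transform: find H (3x3) with dst ~ H @ src from >= 4 pairs."""
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        # Each correspondence contributes two linear constraints on the 9 entries of H.
        rows.append([-x, -y, -1.0, 0.0, 0.0, 0.0, u * x, u * y, u])
        rows.append([0.0, 0.0, 0.0, -x, -y, -1.0, v * x, v * y, v])
    # The solution is the right singular vector with the smallest singular value.
    _, _, vt = np.linalg.svd(np.asarray(rows))
    h = vt[-1].reshape(3, 3)
    return h / h[2, 2]


def project(h, pts):
    """Apply homography h to an (n, 2) array of points."""
    pts = np.asarray(pts, dtype=float)
    homog = np.hstack([pts, np.ones((len(pts), 1))]) @ h.T
    return homog[:, :2] / homog[:, 2:3]


def ransac_homography(src, dst, thresh=3.0, iters=500, seed=0):
    """RANSAC around DLT; thresh (pixels) and iters are illustrative defaults."""
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(src), dtype=bool)
    # Degenerate (e.g. collinear) samples can produce inf/nan; they simply score zero inliers.
    with np.errstate(divide="ignore", invalid="ignore"):
        for _ in range(iters):
            idx = rng.choice(len(src), size=4, replace=False)
            h = dlt_homography(src[idx], dst[idx])
            err = np.linalg.norm(project(h, src) - dst, axis=1)
            inliers = err < thresh
            if inliers.sum() > best_inliers.sum():
                best_inliers = inliers
    if best_inliers.sum() < 4:
        raise ValueError("RANSAC did not find enough inliers")
    # Refit on all inliers of the best hypothesis.
    return dlt_homography(src[best_inliers], dst[best_inliers]), best_inliers
```

In this sketch, `src` would hold the template coordinates of the detected keypoints and `dst` their predicted image locations, so a call such as `ransac_homography(template_pts, image_pts)` returns the estimated homography together with a boolean inlier mask. Mispredicted keypoints are rejected as outliers before the final refit, which is the reason for pairing DLT with RANSAC.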

Results

Qualitative comparison panels: Chen et al., Nie et al., and Ours.

Sports Field Registration via Keypoints-aware Label Condition
