Kervolutional Neural Networks - Robotics Institute Carnegie Mellon University

Kervolutional Neural Networks

Chen Wang, Jianfei Yang, Lihua Xie, and Junsong Yuan
Conference Paper, Proceedings of (CVPR) Computer Vision and Pattern Recognition, pp. 31 - 40, June, 2019

Abstract

Convolutional neural networks (CNNs) have enabled the state-of-the-art performance in many computer vision tasks. However, little effort has been devoted to establishing convolution in non-linear space. Existing works mainly leverage on the activation layers, which can only provide point-wise non-linearity. To solve this problem, a new operation, kervolution (kernel convolution), is introduced to approximate complex behaviors of human perception systems leveraging on the kernel trick. It generalizes convolution, enhances the model capacity, and captures higher order interactions of features, via patch-wise kernel functions, but without introducing additional parameters. Extensive experiments show that kervolutional neural networks (KNN) achieve higher accuracy and faster convergence than baseline CNN.

BibTeX

@conference{Wang-2019-126298,
author = {Chen Wang and Jianfei Yang and Lihua Xie and Junsong Yuan},
title = {Kervolutional Neural Networks},
booktitle = {Proceedings of (CVPR) Computer Vision and Pattern Recognition},
year = {2019},
month = {June},
pages = {31 - 40},
}