A Comparative Analysis of Gradient Descent-Based Optimization Algorithms on Convolutional Neural Networks

Dogo, E. M.; Afolabi, O. J.; Nwulu, N. I.; Twala, B.; Aigbavboa, C. O.

A Comparative Analysis of Gradient Descent-Based Optimization Algorithms on Convolutional Neural Networks

Files

A Comparative Analysis of Gradient Descent-Based Optimization Algorithms_CNN.pdf (58.48 KB)

Date

2018

Authors

Publisher

IEEE

Abstract

In this paper, we perform a comparative evaluation of seven most commonly used first-order stochastic gradient-based optimization techniques in a simple Convolutional Neural Network (ConvNet) architectural setup. The investigated techniques are the Stochastic Gradient Descent (SGD), with vanilla (vSGD), with momentum (SGDm), with momentum and nesterov (SGDm+n)), Root Mean Square Propagation (RMSProp), Adaptive Moment Estimation (Adam), Adaptive Gradient (AdaGrad), Adaptive Delta (AdaDelta), Adaptive moment estimation Extension based on infinity norm (Adamax) and Nesterov-accelerated Adaptive Moment Estimation (Nadam). We trained the model and evaluated the optimization techniques in terms of convergence speed, accuracy and loss function using three randomly selected publicly available image classification datasets. The overall experimental results obtained show Nadam achieved better performance across the three datasets in comparison to the other optimization techniques, while AdaDelta performed the worst.

Description

2018 International Conference on Computational Techniques, Electronics and Mechanical Systems (CTEMS), Belgaum, India, 2018, pp. 92-99

Keywords

Optimization, Training, Deep learning, Neural networks, Stochastic processes, Convergence, Classification algorithms, Artificial Intelligence, optimizers, performance measures, deep learning, stochastic gradient descent

URI

http://repository.futminna.edu.ng:4000/handle/123456789/1007

Collections

Computer Engineering

Full item page

A Comparative Analysis of Gradient Descent-Based Optimization Algorithms on Convolutional Neural Networks

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By