Neural Networks: Tricks of the Trade

Overview of attention for book

Springer-Verlag Berlin Heidelberg

Altmetric Badge

Book Overview
Altmetric Badge

Chapter 1 Introduction
Altmetric Badge

Chapter 2 Speeding Learning
Altmetric Badge

Chapter 3 Efficient BackProp
Altmetric Badge

Chapter 4 Regularization Techniques to Improve Generalization
Altmetric Badge

Chapter 5 Early Stopping — But When?
Altmetric Badge

Chapter 6 A Simple Trick for Estimating the Weight Decay Parameter
Altmetric Badge

Chapter 7 Controlling the Hyperparameter Search in MacKay’s Bayesian Neural Network Framework
Altmetric Badge

Chapter 8 Adaptive Regularization in Neural Network Modeling
Altmetric Badge

Chapter 9 Large Ensemble Averaging
Altmetric Badge

Chapter 10 Improving Network Models and Algorithmic Tricks
Altmetric Badge

Chapter 11 Square Unit Augmented, Radially Extended, Multilayer Perceptrons
Altmetric Badge

Chapter 12 A Dozen Tricks with Multitask Learning
Altmetric Badge

Chapter 13 Solving the Ill-Conditioning in Neural Network Learning
Altmetric Badge

Chapter 14 Centering Neural Network Gradient Factors
Altmetric Badge

Chapter 15 Avoiding Roundoff Error in Backpropagating Derivatives
Altmetric Badge

Chapter 16 Representing and Incorporating Prior Knowledge in Neural Network Training
Altmetric Badge

Chapter 17 Transformation Invariance in Pattern Recognition – Tangent Distance and Tangent Propagation
Altmetric Badge

Chapter 18 Combining Neural Networks and Context-Driven Search for On-line, Printed Handwriting Recognition in the Newton
Altmetric Badge

Chapter 19 Neural Network Classification and Prior Class Probabilities
Altmetric Badge

Chapter 20 Applying Divide and Conquer to Large Scale Pattern Recognition Tasks
Altmetric Badge

Chapter 21 Tricks for Time Series
Altmetric Badge

Chapter 22 Forecasting the Economy with Neural Nets: A Survey of Challenges and Solutions
Altmetric Badge

Chapter 23 How to Train Neural Networks
Altmetric Badge

Chapter 24 Big Learning and Deep Neural Networks
Altmetric Badge

Chapter 25 Stochastic Gradient Descent Tricks
Altmetric Badge

Chapter 26 Practical Recommendations for Gradient-Based Training of Deep Architectures
Altmetric Badge

Chapter 27 Training Deep and Recurrent Networks with Hessian-Free Optimization
Altmetric Badge

Chapter 28 Implementing Neural Networks Efficiently
Altmetric Badge

Chapter 29 Better Representations: Invariant, Disentangled and Reusable
Altmetric Badge

Chapter 30 Learning Feature Representations with K-Means
Altmetric Badge

Chapter 31 Deep Big Multilayer Perceptrons for Digit Recognition
Altmetric Badge

Chapter 32 A Practical Guide to Training Restricted Boltzmann Machines
Altmetric Badge

Chapter 33 Learning Feature Hierarchies with Centered Deep Boltzmann Machines
Altmetric Badge

Chapter 34 Deep Learning via Semi-supervised Embedding
Altmetric Badge

Chapter 35 Identifying Dynamical Systems for Forecasting and Control
Altmetric Badge

Chapter 36 A Practical Guide to Applying Echo State Networks
Altmetric Badge

Chapter 37 Forecasting with Recurrent Neural Networks: 12 Tricks
Altmetric Badge

Chapter 38 Solving Partially Observable Reinforcement Learning Problems with Recurrent Neural Networks
Altmetric Badge

Chapter 39 10 Steps and Some Tricks to Set up Neural Reinforcement Controllers

Overall attention for this book and its chapters

Altmetric Badge

About this Attention Score

In the top 5% of all research outputs scored by Altmetric
High Attention Score compared to outputs of the same age (97th percentile)
High Attention Score compared to outputs of the same age and source (99th percentile)

Mentioned by

news: 1 news outlet
blogs: 1 blog
policy: 1 policy source
twitter: 15 X users

patent: 22 patents
weibo: 1 weibo user
wikipedia: 22 Wikipedia pages
googleplus: 1 Google+ user

Citations

dimensions_citation: 421 Dimensions

Readers on

mendeley: 388 Mendeley
citeulike: 1 CiteULike

Summary News Blogs Policy documents X Patents Weibo Wikipedia Google+ Dimensions citations

So far, Altmetric has seen 17 X posts from 15 X users, with an upper bound of 839,242 followers.

@roydanroy In here, bottom of p. 422: "stochastic gradient descent directly optimizes the expected risk [..]" https://t.co/XwYwzTp8VB

20 Jun 2023

Reply Repost Favourite

@TaliaRinger I really wish some folks would come together to write a newer edition of this book: https://t.co/523NUSvARo

24 Dec 2021

Reply Repost Favourite

@_joaogui1 @_clashluke @_arohan_ f(x) = 1.7159*tanh (2/3*x) It satisfies the properties: f(1)=1 f''(x) has a maximum at 1. https://t.co/EuPKszk7Bw

05 Jun 2021

Reply Repost Favourite

RT @MLDawn2018: @compthink @emna_amor3 See how years ago, @ylecun came up with a very specific type of Sigmoid to avoid vanishing gradient.…

11 Apr 2021

Reply Repost Favourite

This page shows the most recent X posts that mention this research output.

Click here to find out how to access more activity, including 13 additional X posts.

Neural Networks: Tricks of the Trade

Table of Contents

About this Attention Score

Mentioned by

Citations

Readers on