Keras callbacks guide and code

I decided to look into Keras callbacks.

When you want to run some task at the beginning or end of training, of every epoch, or of every batch, that’s when you need to define your own callback. It’s simple; I just needed to look into the code to find out what I could do with it. In my case, I wanted to compute a ROC AUC score after every training epoch. I had been computing it outside of the fit function, because I train from multiple HDF5 files. Let’s fix that.

Your own callback function can be defined as below:

import keras

class My_Callback(keras.callbacks.Callback):
    def on_train_begin(self, logs={}):
        # initialize containers for anything you want to collect during training
        self.losses = []
        return

    def on_train_end(self, logs={}):
        return

    def on_epoch_begin(self, epoch, logs={}):
        return

    def on_epoch_end(self, epoch, logs={}):
        return

    def on_batch_begin(self, batch, logs={}):
        return

    def on_batch_end(self, batch, logs={}):
        # collect the loss of every trained batch
        self.losses.append(logs.get('loss'))
        return


The code is quite straightforward. This class inherits from keras.callbacks.Callback, which already defines the on_{train, epoch, batch}_{begin, end} methods. What we need to do is override them, and then pass an instance of our callback in the callbacks argument of Keras’s model.fit function.
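
For example, a minimal sketch of wiring it up (model, x_train and y_train are placeholders for your own model and data; nb_epoch is the older Keras spelling used throughout this post):

my_callback = My_Callback()
model.fit(x_train, y_train, batch_size=128, nb_epoch=12, callbacks=[my_callback])
print(my_callback.losses)  # the per-batch losses collected in on_batch_end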

What can we do in each function? First, there are the input arguments – epoch/batch, and logs. What are they? And what properties do we have access to?

on_train_begin(self, logs={})

See an example on Keras’ documentation.

class LossHistory(keras.callbacks.Callback):
    def on_train_begin(self, logs={}):
        self.losses = []

Yes, this is where we need to define some useful properties. In this example, self.losses is added and initialized as an empty list.

logs

There is not much to the input argument here. I think logs will just be an empty dictionary in most cases.


(Pdb) print logs
{}

This is because nothing has happened yet. I’m not sure what happens if we save a model and load it again, though.

DETOUR! Let’s look into self.params and self.model. This applies not only to on_train_begin() but to every callback function.

self.params


(Pdb) self.params
{'verbose': 1, 'nb_epoch': 12, 'batch_size': 128, 'metrics': ['loss', 'acc', 'val_loss', 'val_acc'], 'nb_sample': 60000, 'do_validation': True}

self.params has some useful information about the training configuration.
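
For example, a small sketch that uses this to detect the final epoch (the key names follow the older Keras version shown above; newer versions use 'epochs' instead of 'nb_epoch'):

def on_epoch_end(self, epoch, logs={}):
    # the last epoch has the zero-based index nb_epoch - 1
    if epoch == self.params['nb_epoch'] - 1:
        print('This was the final epoch.')
    return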

self.model


(Pdb) self.model
<keras.models.Sequential object at 0x1063b8e90>

Oops, it seems like we can do something interesting with this in every callback. This is the model instance – the Sequential or functional Model we are currently training. We can get lots of information from it.
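
As one example, a sketch of reading the optimizer’s current learning rate inside a callback (assuming an optimizer that stores lr as a backend variable, as the built-in ones do):

import keras.backend as K

def on_epoch_end(self, epoch, logs={}):
    # the model instance gives access to the optimizer and its variables
    lr = K.get_value(self.model.optimizer.lr)
    print('Learning rate at epoch %d: %f' % (epoch, lr))
    return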

Check out the model page – there are so many properties and functions! A tentative list would be…

self.model.validation_data

(This property is not available yet in on_train_begin; it appears from on_batch_begin onward.)

len(self.model.validation_data) == 3, because validation_data[0] holds the validation inputs (the validation_data you pass to model.fit()), validation_data[1] holds the validation targets, and validation_data[2] is the sample_weight, as below.


(Pdb) self.model.validation_data[0].shape
(10000, 1, 28, 28)

(Pdb) self.model.validation_data[1].shape
(10000, 10)

(Pdb) self.model.validation_data[2].shape
(10000,)

You can use this data to compute your own metric, e.g. roc_auc_score from the scikit-learn package.

self.model.training_data

It shows up in dir(self.model), but I couldn’t access it at any point during training.

self.model.save_weights

You may want to save weights.
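
For example, a minimal sketch that checkpoints the weights after every epoch (the file name pattern is just an illustration; Keras also ships a ModelCheckpoint callback that does this for you):

def on_epoch_end(self, epoch, logs={}):
    # save the current weights to a per-epoch file (hypothetical path)
    self.model.save_weights('weights_epoch_%02d.h5' % epoch)
    return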

on_epoch_begin(self, epoch, logs={})

When the very first epoch begins,


(Pdb) epoch
0
(Pdb) logs
{}

When the second epoch begins,


(Pdb) epoch
1
(Pdb) logs
{}

I.e. logs comes in empty every time, and epoch is the zero-based index of the epoch.

on_epoch_end(self, epoch, logs={})

When the very first epoch ends,


(Pdb) epoch
0
(Pdb) logs
{'acc': 0.13145000000000001, 'loss': 2.3134536211649577, 'val_acc': 0.16389999999999999, 'val_loss': 2.28033113861084}

When the second epoch ends,


(Pdb) epoch
1
(Pdb) logs
{'acc': 0.15653333332538605, 'loss': 2.255207451756795, 'val_acc': 0.185, 'val_loss': 2.2099738941192628}

So, logs only contains the result of the current epoch. That’s why you need to append the results yourself, as below:


def on_epoch_end(self, epoch, logs={}):
    self.losses.append(logs.get('loss'))
    return

on_batch_begin(self, batch, logs={})


(Pdb) batch
0
(Pdb) logs
{'batch': 0, 'size': 128}

batch is, again, the zero-based batch index, and logs carries some information about the batch.

on_batch_end(self, batch, logs={})


(Pdb) batch
0
(Pdb) logs
{'acc': array(0.1015625, dtype=float32), 'loss': array(2.366058349609375, dtype=float32), 'batch': 0, 'size': 128}

After training a batch, logs has a bit more information.

Example

I’d like to compute roc_auc_score for every epoch, as mentioned, and store the results somewhere. As below, where I override all the functions to make the structure clear (redundant though).


import keras
from sklearn.metrics import roc_auc_score

class Histories(keras.callbacks.Callback):
    def on_train_begin(self, logs={}):
        self.aucs = []
        self.losses = []

    def on_train_end(self, logs={}):
        return

    def on_epoch_begin(self, epoch, logs={}):
        return

    def on_epoch_end(self, epoch, logs={}):
        self.losses.append(logs.get('loss'))
        y_pred = self.model.predict(self.model.validation_data[0])
        self.aucs.append(roc_auc_score(self.model.validation_data[1], y_pred))
        return

    def on_batch_begin(self, batch, logs={}):
        return

    def on_batch_end(self, batch, logs={}):
        return
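
To use it, pass an instance to model.fit and read the stored values back afterwards – a sketch with placeholder data names:

histories = Histories()
model.fit(X_train, Y_train, batch_size=128, nb_epoch=12,
          validation_data=(X_test, Y_test), callbacks=[histories])
print(histories.losses)  # one loss per epoch
print(histories.aucs)    # one ROC AUC score per epoch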

The full code is in this repo. Clone it and run mnist_cnn.py.


5 thoughts on “Keras callbacks guide and code”

  1. Thanks for the post, it was very useful.
    There is an issue with the attribute validation_data: it seems that it is not part of the model but of the callback object itself, i.e. self.validation_data.

    I added the 'auc' calculation to the metrics dictionary so it is printed every time an epoch ends. Moreover, you can now add a TensorBoard callback (in the model.fit or model.fit_generator parameters) to visualize this new scalar as a plot.

    import keras
    import sklearn.metrics

    class CustomCallbacks(keras.callbacks.Callback): # create a custom History callback
        def on_train_begin(self, logs={}):
            self.aucs = []
            self.losses = []
            # register the new metric so Keras prints it at the end of each epoch
            self.params['metrics'].append('val_auc')

        def on_train_end(self, logs={}):
            return

        def on_epoch_begin(self, epoch, logs={}):
            return

        def on_epoch_end(self, epoch, logs={}):
            self.losses.append(logs.get('loss'))
            y_pred = self.model.predict(self.validation_data[0])
            auc_epoch = sklearn.metrics.roc_auc_score(self.validation_data[1], y_pred)
            logs['val_auc'] = auc_epoch
            self.aucs.append(auc_epoch)
            return

        def on_batch_begin(self, batch, logs={}):
            return

        def on_batch_end(self, batch, logs={}):
            return


  2. First of all, thanks for your post.
    I’m wondering about something: when you do
    y_pred = self.model.predict(self.model.validation_data[0])

    isn’t this a double computation? If I remember correctly, Keras automatically calculates a val_BLAH version of any metric you request when compiling the model.
    Maybe there is a way to access the “y_pred” that Keras already calculated internally?
    (This becomes relevant with big datasets…)


  3. Thanks so much for this article! It made things much easier to understand. Would you have any examples of subclassing the KerasClassifier and KerasRegressor classes as well? It wasn’t very clear to me from the Keras documentation which parts of the interface need to be redefined and which need to be left alone. Most examples I’ve seen only use the separate “build_fn” – I couldn’t find any example of inheriting from those interfaces.


  4. Also, for defining your own keyword parameters, you can add an __init__ block, as in…

    from keras.callbacks import Callback

    class MyCallback(Callback):
        def __init__(self, my_param='Hello'): # define your keywords/params here
            super(MyCallback, self).__init__()
            self.my_param = my_param

        def on_train_begin(self, logs={}):
            print("on_train_begin: my_param = ", self.my_param)

        def on_epoch_end(self, epoch, logs={}):
            print("on_epoch_end: my_param = ", self.my_param)

    mycallback = MyCallback(my_param='Howdy')

    model.fit(X_train, Y_train, batch_size=batch_size, epochs=epochs, callbacks=[mycallback])

