keras STFT layers

I started implementing new keras layers at keras_STFT_layer repo.

What are these?

With these layers, you wouldn’t need to pre-compute and store STFT/Melgram/CQT in your hard drive. A new pipeline would be…

  • Store audio files as it is,
    • or perhaps decode them into raw wave (PCM) and store them in npy or hdf.
  • Start training!

 

The code would be

model = keras.Sequential()
specgram = Spectrogram(n_dft=512, n_hop=128, input_shape=(len_src, 1))
model.add(specgram)
model.add(BatchNormalization(axis=time_axis)) # recommended

Would it be faster?

I will find out 🙂

How’s the quality of the conversion?

screen-shot-2016-09-13-at-14-58-43

screen-shot-2016-09-13-at-14-58-36

More info

Stay tuned to the keras_STFT_layer repo, there are code, ipython files, etc.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s