12 Comments

  1. Dinesh Vadhia says:

    Hi! I don’t see how the pre-trained One Million Song model was trained? Thanks

    Like

    1. keunwoochoi says:

      Hi, what do you mean by ‘how’?

      Like

      1. Dinesh Vadhia says:

        In a previous post it said that the model was trained on ~29s audio from the OMS dataset. Is that correct and if so, I couldn’t find the code? Thanks!

        Like

      2. keunwoochoi says:

        Training is not the part of code. I used MSD dataset.

        Like

      3. Dinesh Vadhia says:

        Ah, ok. That is what I was wondering ie. how did you train the model using the MSD dataset.

        Like

      4. keunwoochoi says:

        Please elaborate more? I still can’t get the point of your question.

        Like

      5. Dinesh Vadhia says:

        Will send email otherwise will go round and round.

        Like

  2. K R Srinidhi says:

    Hi,

    I downloaded GTZAN Music genre dataset from http://marsyasweb.appspot.com/download/data_sets/?_sm_au_=i7HSSSWqdVMd13T7.
    I converted the GTZAN dataset from 22050hz to 16000 hz sampling rate using sox. (ex: sox inputfile.wav -b16 -r16000 out.wav)
    When I ran the example tagging script with audio files from GTZAN/rock directory, most of the predictions are showing it as jazz.
    What am I doing wrong? (Using CRNN with Theano)

    regards
    Srinidhi

    Like

  3. K R Srinidhi says:

    Hi,

    I downloaded GTZAN Music genre dataset from http://marsyasweb.appspot.com/download/data_sets/?_sm_au_=i7HSSSWqdVMd13T7.
    I converted the GTZAN dataset from 22050hz to 12000 hz sampling rate using sox. (ex: sox inputfile.wav -b16 -r12000 out.wav)
    When I ran the example tagging script with audio files from GTZAN/rock directory, most of the predictions are showing it as jazz.
    What am I doing wrong?

    regards
    Srinidhi

    Like

    1. keunwoochoi says:

      I’d recommend you to use it as a feature extractor and add a classifier on the top of it, rather than use the result as it is.

      Like

    2. K R Srinidhi says:

      So you recommend me to build a new trained model with my training data and then test it against GTZAN dataset.
      Why the uploaded pretrained weights are giving wrong results with GTZAN dataset.
      Thanks
      Srinidhi

      Like

  4. keunwoochoi says:

    Yes, I tested it with a similar network. It will get you 70-80% of accuracy. It is not the problem of gtzan. The current CRNN weights are kinda weird, it makes sense with AUC evaluation scheme though. (AUC is not about top-K prediction.) I’m planning to update it.

    Like

Leave a Comment