Last month, I gave an invited talk at NYU. I somehow ended up talking and talking and talking for 100 minutes. You’d better watch it on YouTube since I put quite a few timestamps there. This talk is based on my previous talk, which was done in Korean, 2021 Nov. In case you’re interested in…More
In the middle of PhD programs: internship, academia vs industry, etc.
Just my two cents to someone who emailed me – but the question is pretty general. There are clear pros and cons in academia/industry. Some are about the work itself – how you spend your time of a day, or a year, and what you’ll get to have learned and not after a few years.…More
Music Classification: Beyond Supervised Learning, Towards Real-world Applications 📕
https://music-classification.github.io/tutorial/ I wrote this book – Music Classification: Beyond Supervised Learning, Towards Real-world Applications with Minz Won and Janne Spijkervet. We used this in our ISMIR 2021 tutorial session and will keep updating the book. When you google “music classification deep learning”, you get nothing but replications of the nice and brief blog posts. Those…More
My ISMIR 2021 Submission and its reviews
As I did 2 years ago on my DrummerNet paper, I’m open-sourcing the reviews my submission received. I did it back then, and I’m doing it again now, since when I had no paper in ISMIR, I was always very curious about how ISMIR review is being done. By not making this information available, I…More
slightly better research code – avoid hard-coded values
Imagine you need to crop the first 10 second of a waveform. This can be improved by like this. Of course it does the same thing. But this is better because.. Now you know the meaning of this magic number 160000 . And this means that.. Now ANYONE would know the meaning of 160000. Because…More
Tensorflow2 Keras – Custom loss function and metric classes for multi task learning
It is well known that we can use a masking loss for missing-label data, which happens a lot in multi-task learning (example). But how about metrics? Without a similar modification, the keras.metric classes and functions would get you some numbers, but they won’t be quite accurate. No worries though, they can be modified as in…More
Kapre doc → kapre.readthedocs.io
Recently, I put some effort to improve it. Now it supports Tensorflow 2.0. Please enjoy! https://kapre.readthedocs.io/en/latest/More
Some choices I’ve made and why
Only occasionally though, I’ve been asked those classic questions like “So how did you start your career?”, “What motivated you to start a PhD course?”, etc., and somehow I ended up promising that I’ll write a post about it. So, here we go. Disclaimer: I’ll be only straightforward, simple, and dumb. Bachelor: EE My tutor…More
Q&A: How to transcribe rap songs
… I want to understand what they are rapping about … I want to ask if it is indeed possible to transcribe rap songs? I have vocals extracted from the songs and tried to use Google speech2text API for it but the results look very random and bad. I am given the impression that transcribing…More
ICASSP 2020 papers and summaries
Let me reuse my tweets 🙂 https://t.co/ZABBXEDS1c "Improving Universal Sound Separation Using Sound Classification". Used a pre-trained net to extract an embedding that conditions a separation model. Nice work! Turned out it's the same first author (@ETzinis) of the paper above. — Keunwoo Choi (@keunwoochoi) May 18, 2020 https://t.co/hgfCBMnRSU The structure of separate formant mask…More