My recent talks

Last month, I gave an invited talk at NYU. I somehow ended up talking and talking and talking for 100 minutes. You’d better watch it on YouTube since I put quite a few timestamps there. This talk is based on my previous talk, which was done in Korean, 2021 Nov. In case you’re interested in…More


My ISMIR 2021 Submission and its reviews

As I did 2 years ago on my DrummerNet paper, I’m open-sourcing the reviews my submission received. I did it back then, and I’m doing it again now, since when I had no paper in ISMIR, I was always very curious about how ISMIR review is being done. By not making this information available, I…More

slightly better research code – avoid hard-coded values

Imagine you need to crop the first 10 second of a waveform. This can be improved by like this. Of course it does the same thing. But this is better because.. Now you know the meaning of this magic number 160000 . And this means that.. Now ANYONE would know the meaning of 160000. Because…More

Some choices I’ve made and why

Only occasionally though, I’ve been asked those classic questions like “So how did you start your career?”, “What motivated you to start a PhD course?”, etc., and somehow I ended up promising that I’ll write a post about it. So, here we go. Disclaimer: I’ll be only straightforward, simple, and dumb. Bachelor: EE My tutor…More

Q&A: How to transcribe rap songs

… I want to understand what they are rapping about … I want to ask if it is indeed possible to transcribe rap songs? I have vocals extracted from the songs and tried to use Google speech2text API for it but the results look very random and bad. I am given the impression that transcribing…More

ICASSP 2020 papers and summaries

Let me reuse my tweets 🙂 "Improving Universal Sound Separation Using Sound Classification". Used a pre-trained net to extract an embedding that conditions a separation model. Nice work! Turned out it's the same first author (@ETzinis) of the paper above. — Keunwoo Choi (@keunwoochoi) May 18, 2020 The structure of separate formant mask…More