Unity Muse at the Microsoft Build Conference

Since April 2023, I’ve been working on Muse – Unity’s in-Editor AI assistant for developers. I was one of the first to join this team, and it’s been quite the ride so far, working with some excellent colleagues and navigating several challenges along the way! Our product was recently featured as a success story at Microsoft’s Build conference. For this, a camera crew visited us in Copenhagen on behalf of Microsoft to create a promotional video, parts of which were shown during a live demo of Muse at the conference. I had the privilege of being a part of this video, and got to talk about the product and share some of my contributions to it!

There’s a lot of exciting stuff coming up for Muse, and I’m really looking forward to being a part of the future of this product!

Attending the 20th International Society for Music Information Retrieval Conference (ISMIR 2019)

It’s just been confirmed that four of us from Moodagent – Reinier de Valk, Pierre Lafitte, Tomas Gajarsky and I – will be attending ISMIR 2019 in Delft (The Netherlands). This year, two of my colleagues from Moodagent will be presenting their work at ISMIR:

  1. Reinier will be presenting his paper, titled “JosquIntab: A Dataset for Content-based Computational Analysis of Music in Lute Tablature” in the main conference.
  2. Tomas will be presenting his paper, titled “Reinforcement Learning Recommender System for Modelling Listening Sessions” in the Late-breaking session of the conference.

Do stop by these posters to learn more about these interesting topics!

Remote Talk at Event Organised by Music Tech Community – India

I was invited by the Music Tech Community – India (MTC – India) to deliver a talk on the 29th of December, 2018 in Bengaluru. The theme of the event was Machine Learning for Art & Music Generation, where my work at Jukedeck fit in perfectly alongside that of the other speakers at the event.

I happened to be on holiday in beautiful Mararikulam in Kerala around then, but I really didn’t want to miss this opportunity to speak, so we decided to make it a remote talk that I delivered via Skype. Thanks to the excellent organisers – Albin Correya, Manaswi Mishra and Siddharth Bharadwaj – the talk went off smoothly and was apparently well received. Other speakers during the event were Harshit Agarwal, and two of the organisers themselves – Albin Correya and Manaswi Mishra.

Oral Presentation at the 19th International Society for Music Information Retrieval Conference

A few months following the acceptance of our paper at ISMIR 2018, I attended the conference in Paris with several of my colleagues from Jukedeck. We had a fairly large presence there, dwarfed (as far as I can tell) only by a larger one from Spotify. The conference was organised very well and everything went off smoothly. It was great to be back in the beautiful city after my last visit nearly 8 years ago!

I was particularly pleased by the new format for presenting accepted papers at this ISMIR, wherein each paper was given both an oral and a poster presentation slot, thus removing the traditional distinction between oral and poster papers that exists at conferences. In the case of our paper on StructureNet, I made the oral presentation and my colleagues and co-authors – Gabriele and Marco – made the poster presentation. Fortunately, this year ISMIR was streamed live and the videos were later stored on YouTube, so I’m able to share the video of my presentation with you. It’s only a 4-minute presentation so do check it out! Each time I passed by our poster, it appeared to be receiving a lot of attention, which was of course great! With help from members of my team, I also prepared a blog post on StructureNet which was published recently on the Jukedeck R&D Team’s Medium page. I urge you to give it a read if you’re curious what the paper is all about. Here’s a picture of the Jukedeck team at ISMIR:

The Jukedeck Team at ISMIR 2018 – (from left-to-right) Ben, Reinier, Gabriele, Matt, me, Katerina and Marco.

I also signed up to play in this year’s ISMIR jam session organised by Uri Nieto from Pandora! If I remember correctly, it’s something that started in 2014 and has been getting more popular by the year. As anticipated, the jam session was a success and a lot of fun, with music ranging from AI-composed folk tunes to Jazz, Blues, Rock and Heavy Metal. I played two songs with my fellow attendees – Blackest Eyes by Porcupine Tree and Plush by Stone Temple Pilots. My friend Juanjo shared a recording of the first song with me in which I played bass.

As always, ISMIR this year provided a great opportunity to make new acquaintances, and meet old friends and colleagues. As it turns out quite a few of my friends from the Music Informatics Research Group (MIRG) at City, University of London showed up this time and it was great to catch up with them.

The MIRG at ISMIR 2018: (from left-to-right, back-to-front) Shahar, me, Daniel, Tillman, Andreas, Radha and Reinier.

And to top it all off, my master’s thesis supervisor Hendrik Purwins managed to make it to the conference on the last day, giving me the opportunity to get this one selfie with him and Tillman (my PhD thesis supervisor).

Tillman, me and Hendrik at the conference venue.

(Automated) Curriculum Learning

I’ve lately spent some time reading about Curriculum Learning and experimenting with the algorithms described in two of the papers in this domain:

Bengio, Y., Louradour, J., Collobert, R., & Weston, J. (2009, June). Curriculum learning. In Proceedings of the 26th annual international conference on machine learning (pp. 41-48). ACM.

Graves, A., Bellemare, M. G., Menick, J., Munos, R., & Kavukcuoglu, K. (2017). Automated Curriculum Learning for Neural Networks. arXiv preprint arXiv:1704.03003.

The first of these is important because its empirical results in support of Curriculum Learning revived researchers’ interest in the technique. The second is one of the more recently proposed approaches to Curriculum Learning, which I thought would be interesting to understand in greater depth.

I’ve summarised my thoughts on these in a short presentation. I hope to share my code and results not too long from now as well.

Invited Talks at the International Institute of Information Technology – Bangalore and Robert Bosch

I’m currently on a break from work at Jukedeck until the 22nd of September, and visiting friends and old colleagues in Bangalore for a few days. On coming to know of my visit to Bangalore, my past mentors invited me to give talks at their respective organisations – the International Institute of Information Technology – Bangalore, and Robert Bosch. Today I presented the work I did on sequence modelling in music, RBMs and Recurrent RBMs during my PhD to the staff and students at the International Institute of Information Technology – Bangalore (IIIT-B). And next Monday (the 18th of September, 2017) it will be more or less the same talk at Robert Bosch.

Here is a copy of the slides for those presentations.

Breaking Down the Differentiable Neural Computer

Ever since first working with Recurrent Neural Networks (RNNs) for predicting musical sequences during my PhD, I have been fascinated by these models and have tried to keep up with exciting developments in the connectionist machine learning research surrounding them. One of these, for me, has been the emergence of RNNs that are augmented by a dedicated memory unit. The idea was notably illustrated as the Neural Turing Machine (NTM) in an arXiv submission by Alex Graves and colleagues from Google DeepMind. This early work, which gathered a fair deal of acclaim in the community, has since been followed up by a publication in the prestigious journal Nature that introduces a more evolved variant of the NTM known as the Differentiable Neural Computer (DNC). During the past couple of weeks, I managed to spend some time learning about the NTM and the DNC and prepared a little slide-show (with Google Slides) containing my observations to share with others.

So here is the link to the slides, and I hope some of you who read it benefit from it! Please let me know if you find anything that needs to be corrected in it. I would appreciate that!

Jukedeck @ The Science Museum Lates

I had the opportunity to join my colleagues at Jukedeck – Patrick, Lydia, Eliza, Matt, Katerina and Gabriele – at the Science Museum Lates last night. For those of you that are unfamiliar with the concept, Lates are adults-only, after-hours theme nights that take place in The Science Museum (in London) on the last Wednesday of every month. They are attended by various organisations that would like to showcase their work relating to a chosen theme, as well as an audience that is keen on learning more about the science and technology underlying the theme. On the last day of August 2016, it was Jukedeck’s turn to show off its awesome technology at the museum and some of us volunteered to tag along.

Lydia and me (in the background) explaining what Jukedeck and its technology are about to curious visitors at our stall.

The museum was packed with visitors, and it was great to see so many people interested in our technology! I hardly had the time to go grab some dinner amidst the constant stream of people wanting to listen to our music and know more about the underlying algorithms. For me, as someone who does the research and writes the code that generates our music, it was incredibly rewarding to see first-hand the appreciation people had for our work. It’s, in many ways, like having a poster presentation at a conference but with a non-technical audience. I enjoyed it very much. In the future, I’ll try my best not to let such opportunities pass. And I look forward to attending the event myself as a spectator! If you happen to be in London around the time this event is on, I highly recommend attending it if you’re interested in science and technology.

Oral Presentation at the 28th International Joint Conference on Neural Networks

My paper was accepted for oral presentation at the 28th International Joint Conference on Neural Networks, held in the picturesque town of Killarney in Ireland. The title of the paper is quite a mouthful – “Discriminative Learning and Inference in the Recurrent Temporal RBM for Melody Modelling” – and its abstract is the following:

“We are interested in modelling musical pitch sequences in melodies in the symbolic form. The task here is to learn a model to predict the probability distribution over the various possible values of pitch of the next note in a melody, given those leading up to it. For this task, we propose the Recurrent Temporal Discriminative Restricted Boltzmann Machine (RTDRBM). It is obtained by carrying out discriminative learning and inference as put forward in the Discriminative RBM (DRBM), in a temporal setting by incorporating the recurrent structure of the Recurrent Temporal RBM (RTRBM). The model is evaluated on the cross entropy of its predictions using a corpus containing 8 datasets of folk and chorale melodies, and compared with n-grams and other standard connectionist models. Results show that the RTDRBM has a better predictive performance than the rest of the models, and that the improvement is statistically significant.”
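
As a rough illustration of the evaluation measure mentioned in the abstract, the sketch below computes the mean cross entropy (in bits) of a model's next-pitch predictions. It is not the code used in the paper: the function name, the toy pitch alphabet and the probabilities are all made up for illustration.

```python
import math


def cross_entropy_bits(predictions, targets):
    """Mean negative log2-probability assigned to the pitches that actually occurred.

    predictions: list of dicts mapping pitch -> predicted probability
    targets: list of the pitches that actually followed each context
    """
    if len(predictions) != len(targets):
        raise ValueError("need exactly one prediction per target")
    total = 0.0
    for dist, pitch in zip(predictions, targets):
        total += -math.log2(dist[pitch])
    return total / len(targets)


# Hypothetical predictions over a toy alphabet of three MIDI pitches.
predictions = [
    {60: 0.5, 62: 0.25, 64: 0.25},
    {60: 0.25, 62: 0.5, 64: 0.25},
]
targets = [60, 64]
score = cross_entropy_bits(predictions, targets)
```

A lower value means the model assigned higher probability to the notes that actually came next, which is why it serves as a measure of predictive performance.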

I presented the paper in the session on Recurrent Neural Networks. The model that we proposed in the paper – the RTDRBM – was the first original Machine Learning contribution of my PhD. And it was a pleasure to collaborate with my friend and colleague Son Tran on this work. He presented a second paper at the conference, titled “Efficient Representation Ranking for Transfer Learning”.

With Son and my supervisor Artur after my presentation.

Yet again a conference has taken me to a place in the world that I probably would’ve never visited otherwise! This doesn’t at all mean that the visit wasn’t worthwhile. The lush green Irish landscape, the charming town of Killarney and the abounding nature around it, and a friendly and welcoming hostel all made this a very memorable trip! Unfortunately, I had a sore throat and a fever during much of my stay so I chose Irish coffee over a pint of Guinness (which I heard tastes much better in Ireland) when I had the chance. I regret this, but maybe that’s another reason to visit Ireland once again sometime!

On one of my healthier days in Killarney.

Oral Presentation at the 15th International Society for Music Information Retrieval Conference

We had two papers accepted at the 15th International Society for Music Information Retrieval Conference (ISMIR). Given the fantastic experience I had at ISMIR the year before, I was super-excited to travel to Taipei to attend the conference. The first of these papers is titled, “Multiple Viewpoint Melodic Prediction with Fixed-Context Neural Networks” and is in some ways a continuation of my work from the previous ISMIR conference. The abstract of the paper is as follows:

“The multiple viewpoints representation is an event-based representation of symbolic music data which offers a means for the analysis and generation of notated music. Previous work using this representation has predominantly relied on n-gram and variable order Markov models for music sequence modelling. Recently the efficacy of a class of distributed models, namely restricted Boltzmann machines, was demonstrated for this purpose. In this paper, we demonstrate the use of two neural network models which use fixed-length sequences of various viewpoint types as input to predict the pitch of the next note in the sequence. The predictive performance of each of these models is comparable to that of models previously evaluated on the same task. We then combine the predictions of individual models using an entropy-weighted combination scheme to improve the overall prediction performance, and compare this with the predictions of a single equivalent model which takes as input all the viewpoint types of each of the individual models in the combination.”
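
The abstract mentions an entropy-weighted combination scheme without spelling it out. The sketch below shows one common form of the idea, in which each model's predicted distribution is weighted by the inverse of its relative entropy, so that more confident models count for more. The exact weighting used in the paper may differ; the function names, the bias parameter `b` and the toy distributions are assumptions made for illustration.

```python
import math


def relative_entropy(dist):
    """Shannon entropy of dist divided by the maximum entropy for its alphabet size."""
    h = -sum(p * math.log2(p) for p in dist.values() if p > 0)
    h_max = math.log2(len(dist))
    return h / h_max if h_max > 0 else 1.0


def combine(dists, b=1.0, eps=1e-12):
    """Weighted arithmetic mean of distributions over the same pitch alphabet.

    Each weight is relative_entropy(dist) ** -b, so low-entropy (confident)
    predictions receive larger weights. eps guards against division by zero
    when a model puts all of its probability on a single pitch.
    """
    weights = [max(relative_entropy(d), eps) ** -b for d in dists]
    total = sum(weights)
    pitches = dists[0].keys()
    return {
        p: sum(w * d[p] for w, d in zip(weights, dists)) / total
        for p in pitches
    }


# Two hypothetical models predicting the next pitch over a toy alphabet.
confident = {60: 0.9, 62: 0.05, 64: 0.05}
uncertain = {60: 1 / 3, 62: 1 / 3, 64: 1 / 3}
combined = combine([confident, uncertain])
```

Here the combined distribution leans towards the confident model's prediction more than a plain average of the two would.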

The paper was presented as a poster. The second paper, titled “An RNN-based Music Language Model for Improving Automatic Music Transcription”, is based on very interesting work I did in collaboration with Siddharth Sigtia and Emmanouil Benetos on automatic transcription of polyphonic music. Siddharth presented it as another poster.

I have to note that this year’s ISMIR organisation was fantastic! Everything from the review process and the information on the website to the venue, the assistance at the venue, and the banquet was very well managed and executed by the organisers. The most interesting part of the conference for me was the keynote lecture, titled “Sound and Music Computing for Exercise and (Re-)habilitation” by Prof. Ye Wang, in which he described the potential in music to serve as a means to rehabilitate and improve the quality of life of individuals with different ailments, and illustrated this with the help of a few projects his group at the National University of Singapore has been working on. It was a very inspiring talk, and I really admire Dr. Wang’s statement regarding the often overlooked direct impact of research and published work on society, which has been the cornerstone of these projects. I have lately taken an interest in Music Therapy and have been going through some literature to see if my own work on music modelling can in some way be applied to achieve therapeutic goals. There were some interesting late-breaking sessions as well that I took part in, including the very successful one organised by my supervisor Tillman on Big Data and Music where I was taking notes during the discussion.

And finally, as is always the case when I attend a conference, I did take some time off in Taipei and its surrounding areas. On one evening, I joined some friends and colleagues to go see the tallest building in the city – Taipei 101.

Jan and I with Taipei 101 in the background (Photo Courtesy: Marius Miron)

On another day, a couple of us planned a day-trip to a nearby village called Jiufen where we checked out some temples, the market and the old Japanese mining village on top of a hill.

The gang that went on a day-trip to Jiufen, led by our lovely host Kailie (first from the right).

And on another day, I joined my buddy Marius on a sightseeing round to see some local museums, Shilin night market, Chiang Kai Shek Memorial, and other places before eventually taking the long flight back to London.

Taipei was fantastic, and I’d be up for another visit anytime! Last but not least, the hospitality of Fun Taipei hostel made the whole trip a little better each day.