IMC: Difference between revisions
Line 8: | Line 8: | ||
More ideas: | More ideas: | ||
[[Neural Guide]] | |||
Revision as of 17:36, 4 November 2016
Contact: Liselot de Jonge <liselot.dejonge@imc.nl>
2017 proposals
Confirmed:
More ideas:
24000 words per second
Ben Catterall : ben.catterall@imc.com
Many fields in machine learning, from image recognition to machine translation have recently received a tremendous boost using deep learning. Deep architectures have sparked a renewed interest in artificial intelligence, and resulted in a lot of cool applications. It has also arrived together with a new wave of peer-reviewed research, where people share and publish all of their code online. Most big companies race to provide their pre-trained models online for free. In this project we will focus on automatic video captioning, and the aim is to build a prototype system for a real-time captioning system, using already published research. The resulting product could be used by visually impaired people, or to create automatic tags on instagram.
Feedback:
Quite a lot of our students enjoy playing around with pre-trained deep neural nets, so this is certainly feasible. However, it seems at present like a one-person project, and would have to be expanded to suit work from a team. We already have some projects planned for this year that involve training a deep net, but this involves some ingenuity in identifying suitably labelled training data, as well as getting hold of a machine suitable for running the GPU-intensive deep learning frameworks that are currently popular.
Do you want to provide a reference to the specific piece of published research that you thought might be applied, and we can see where we might go from there?
Response:
The image captioning is from Andrej Karpathy:
http://cs.stanford.edu/people/karpathy/deepimagesent/
I think in order to create an interesting application from this research, we can extend the problem to include audio captions through a mobile application. The students will have to:
- Write a server application that accepts images, and produces captions - Convert captions into audio files - Write a mobile application which shoots images and uploads to server - Replays the audio that comes back from the server
I believe this is already not a one person project.
If you want we can further extend the project to include training of custom captions as well. This can be used to train an 'audio guide' for a particular location for example. We can even think about creating a meta-trainer which receives images and captions for a particular building(museum, or even the computer lab), and automatically forms an audio guide. But personally, I think we are entering a dangerous area, where the project becomes prone to failure.
More:
My understanding is that it’s relatively straightforward to run the Karpathy libraries on novel images using one of their trained models. I agree that it would be much more of a challenge to collect a sufficiently large dataset (and computational resources) to train your own model.
As you say, there is far more chance of failure if doing something other than image labelling, which is already a research topic rather than a design project. Nevertheless, we are taking a chance on this with the project brief I sent you - and I’m hoping that looking at text rather than image encoding will make that one feasible. I got one of our machine learning profs to check it out for sanity.
I would have thought that converting the captions into audio is also an API-calling job, rather than involving any substantial software engineering. I’ve used Festival for this: http://www.cstr.ed.ac.uk/projects/festival/
I think the issue with both deep learning image recognition techniques and speech synthesis is that in both cases, there are fairly good libraries and APIs that allow you to create acceptable results with a few lines of Python, while any improvement over the off-the-shelf libraries is way beyond the capability of undergrads.
Neural Networks in FPGA
Taylan.Toygarlar@imc.com
We are entering a new era in computing, where the services provided are becoming more and more dependent on machine learning and artificial intelligence, with deep learning at the forefront of this new gold-rush. The introduction of deep learning architectures were only possible due to the use of GPUs, and they have done a great job training more and more complex models. However, FPGAs offer huge benefits over GPUs in terms of power savings, which is crucial when web-scale applications are considered. There are many exciting developments in this field, but unfortunately, contrary to GPUs, there are close to no public tooling available for FPGAs. We would like to develop a code generator, which takes as input a commonly used neural-network definition file, and spits out fpga code. This project could potentially help fuel the next generation systems for which all of us interact with daily.
Feedback:
I've discussed with a couple of the staff who teach our hardware course, and they confirm that the dev boards the students use do have FPGA that might be used here.
I wasn't suggesting online learning - rather that they might see how fast they can train a simple image classifier.
Our research team have recently implemented something along these lines. Here are some relevant papers:
Neural nets in custom hardware: http://www.cl.cam.ac.uk/~atm26/papers/fccm2012-bluehive.pdf
Custom hardware v vector processing http://www.cl.cam.ac.uk/~atm26/pubs/FPL2013-BlueVec.pdf
Video and background http://www.cl.cam.ac.uk/research/comparch/research/bimpa.html
An undergrad group could perhaps work with their code, rather than build something from scratch?
Shelved for 2017: Tamagotchi Brief
Jan Kis : jan.kis@imc.com
(‘Creative’) Pokemon Go took the world by storm. Now it’s your turn! Remember those cute, egg-shaped devices with the dancing critters you all played with when you were younger? Well, this task will involve the exciting opportunity of creating an interactive online Tamagotchi world! Your aim is to entice users to explore the fascinating concept of optimal parameter selection, critical to many real world problems such as Machine Learning and automated trading, through their Tamagotchi’s. Develop a mobile application allowing users to view their Tamagotchi and exchange limited resources with each other by agreeing a fair value for their exchange. Once acquired, users should be able to modify the appearance of their Tamagotchi using these resources. Users will thus need to carefully optimize their basket of resources to build their ultimate Tamagotchi. They want to recreate that incredible wig they saw at last night’s bop but only have two bundles of cloth? Well, they trade six spindles of thread for two bundles of cloth, apply some suspect sowing skills and ta dah!
(Formal) Optimal selection of parameters is an important aspect of many exciting real world problems from Machine Learning to automated trading in world markets. This project aims to get users to explore this concept. The task is to build an interactive online mobile application to enable users to view and build their ultimate Tamagotchi. Users will need to exchange limited resources with each other by coming to an agreement about a fair value. These resources can then be used to alter the appearance of their Tamagotchi. Consequently, users will need to carefully select their optimal basket of resources.
Feedback:
Thanks for your “Tamagotchi” suggestion for the Cambridge design project course. Sorry that I’ve left yours to last – I have been discussing other projects with your colleagues that were either more obvious or more problematic.
My initial reaction was that current undergraduates might be unlikely to remember Tamagotchi, so an alternative creature focus might have been necessary. Pokemon would be the obvious choice.
We do already have a design brief for next year that shares some similarities with your proposal, so if we did do something in this area, I’d like to reduce surface resemblance: https://wiki.cam.ac.uk/cl-design-projects/Learn_to_be_an_Alien
I noted the market-making aspect of your more formal version. I found this reminiscent of other market-making projects we have done in the past, such as these two:
https://wiki.cam.ac.uk/cl-design-projects/Scrobble_Exchange:_A_massively_multiplayer_game
https://wiki.cam.ac.uk/cl-design-projects/AI_racing_market
We don’t yet have a “market” proposal for this year, so could perhaps think of a Pokemon Go variant that included this aspect?
2016 projects
earlier ideas
Maksym Korotkiy Maksym Korotkiy <Maksym.Korotkiy@imc.nl>
Prototype a 2D visualization for an execution of a genetic algorithm (GA) applied to a multi-dimensional search problem. The visualization should provide insights into all stages of GA (selection, crossover, mutation) as well as into evolution of candidate solutions. We can assume that number of dimensions is between 10 and 50, number of candidate solutions (population size) is 500 and number of generations is around 100. The visualization should make it easy to understand internal workings of GA and to show an impact of different selection and crossover strategies or mutation rates. Students can use any general GA implementation and can apply it to any multi-dimensional search problem.
Taylan Toygarlar <Taylan.Toygarlar@imc.nl>
Radmilo Racic <radmilo.racic@imc.nl>
Visualization techniques for large set of financial markets data This project develops techniques for visualizing multiple data sets of financial data, including ticker states of global futures and significant stocks, bonds and currencies. The end goal is not only to unearth hidden relationships and correlations between global markets but also to convey trader sentiment and pin point market moving trades. As global market landscape is quite complex and correlated, we will be using Oculus Rift as the principal display tool. Students will be provided with data from Eurostoxx, DAX, CAC, KOSPI, Nikkei, ES, EUR/USD, T-Note, GBL, etc.