Multi-lingual SMS: Difference between revisions

From Computer Laboratory Group Design Projects
Jump to navigationJump to search
No edit summary
No edit summary
Line 4: Line 4:


Services such as Google Translate are a useful tool for global languages like Mandarin or French, but many people in remote rural areas contribute to public life in their own local languages. It's not necessary to translate these, but it's useful for UNICEF and other international agencies to know what general concerns these people share with neighbouring regions. Cambridge organisation Africa's Voices has datasets of SMS messages collected from different African countries, in local languages, addressing shared health and human rights issues. Your goal is to create a visual browser, perhaps in the style of word clouds, that would allow international policy teams to explore topic maps bringing multiple languages together. The LDA algorithm might be a useful approach to topic map construction.
Services such as Google Translate are a useful tool for global languages like Mandarin or French, but many people in remote rural areas contribute to public life in their own local languages. It's not necessary to translate these, but it's useful for UNICEF and other international agencies to know what general concerns these people share with neighbouring regions. Cambridge organisation Africa's Voices has datasets of SMS messages collected from different African countries, in local languages, addressing shared health and human rights issues. Your goal is to create a visual browser, perhaps in the style of word clouds, that would allow international policy teams to explore topic maps bringing multiple languages together. The LDA algorithm might be a useful approach to topic map construction.
Old version:
The popularity of radio in rural areas, combined with high penetration of mobile phones, enables African citizens to contribute their opinions to radio discussions via SMS. International agencies such as UNICEF could use this data to design and monitor development initiatives. Cambridge organisation Africa's Voices has datasets of SMS messages collected from different African countries, in local languages, addressing health and human rights issues. Your task is to create a multi-lingual browser that analyses and presents these themes, for example as an interactive map or semantic cloud using LDA topic models. The goal is to create a tool that would help international teams hold policy discussions that relate and recognise ideas relating to important issues (keeping in mind that some important concepts in a particular society might have no exact translation).

Revision as of 15:02, 18 November 2014

Client: Robert Catherall, ARM <Robert.Catherall@arm.com>

New version to clarify that this is not a machine translation project:

Services such as Google Translate are a useful tool for global languages like Mandarin or French, but many people in remote rural areas contribute to public life in their own local languages. It's not necessary to translate these, but it's useful for UNICEF and other international agencies to know what general concerns these people share with neighbouring regions. Cambridge organisation Africa's Voices has datasets of SMS messages collected from different African countries, in local languages, addressing shared health and human rights issues. Your goal is to create a visual browser, perhaps in the style of word clouds, that would allow international policy teams to explore topic maps bringing multiple languages together. The LDA algorithm might be a useful approach to topic map construction.