Sony Research has inked a partnership to help test and finetune the Southeast Asian Languages in One Network (SEA-LION) artificial intelligence (AI) model, focusing on Indian languages.
The AI arm of Sony Research will work with AI Singapore (AISG) responsible for the development of SEA-LION, to plug gaps in ensuring the large language model (LLM) stands up well on the global landscape, representing the region’s populations and languages. The partners said in a statement Tuesday their research collaboration will involve LLMs under the SEA-LION umbrella, all of which are pre-trained and instruct-tuned specifically on Southeast Asian cultures and languages.
Sony will exchange best practices on LLM development and research methodologies, as well as the application of its research in speech generation, content analysis, and recognition.
Also: AI leaders urged to integrate local data models for diversity’s sake
IBM and Google are among other industry players drawn into finetuning the regional LLM, including making it available for developers to build customized AI applications.
These technologies can be used in AI-powered electronics products spanning various sectors, such as games, movies, and music, and games, Sony said.
Its interactive entertainment unit has filed a patent for a “harassment detection apparatus” that includes an input unit built to receive biometric data and with capabilities to generate, based on biometric data, emotion data associated with users, according to an April 2024 publication on World Intellectual Property Organization’s PatentScope search platform.
With the system, Sony hopes to be able to detect and mitigate communications between individuals in multi-player games or virtual reality experiences that are malicious, such as harassment. Tapping machine learning and AI models, the system can detect biometric data such as speech and determine a player’s emotional state, for instance, through sounds such as sobbing and screaming. These may be used to identify victims of harassment within the shared environment, according to the filing.
How I used ChatGPT to scan 170k lines of code in seconds and save me hours of detective work