Notice: Function _load_textdomain_just_in_time was called incorrectly. Translation loading for the betterdocs domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /data/user/htdocs/wp-includes/functions.php on line 6114

Notice: Function _load_textdomain_just_in_time was called incorrectly. Translation loading for the jnews-view-counter domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /data/user/htdocs/wp-includes/functions.php on line 6114

Notice: Function _load_textdomain_just_in_time was called incorrectly. Translation loading for the wp-statistics domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /data/user/htdocs/wp-includes/functions.php on line 6114

Notice: Function _load_textdomain_just_in_time was called incorrectly. Translation loading for the wpdiscuz domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /data/user/htdocs/wp-includes/functions.php on line 6114

Notice: 函数 _load_textdomain_just_in_time 的调用方法不正确。 jnews 域的翻译加载触发过早。这通常表示插件或主题中的某些代码运行过早。翻译应在 init 操作或之后加载。请查阅调试 WordPress来获取更多信息。（这个消息是在 6.7.0 版本添加的。） in /data/user/htdocs/wp-includes/functions.php on line 6114

Notice: 函数 _load_textdomain_just_in_time 的调用方法不正确。 jnews-like 域的翻译加载触发过早。这通常表示插件或主题中的某些代码运行过早。翻译应在 init 操作或之后加载。请查阅调试 WordPress来获取更多信息。（这个消息是在 6.7.0 版本添加的。） in /data/user/htdocs/wp-includes/functions.php on line 6114

Warning: Cannot modify header information - headers already sent by (output started at /data/user/htdocs/wp-includes/functions.php:6114) in /data/user/htdocs/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /data/user/htdocs/wp-includes/functions.php:6114) in /data/user/htdocs/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /data/user/htdocs/wp-includes/functions.php:6114) in /data/user/htdocs/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /data/user/htdocs/wp-includes/functions.php:6114) in /data/user/htdocs/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /data/user/htdocs/wp-includes/functions.php:6114) in /data/user/htdocs/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /data/user/htdocs/wp-includes/functions.php:6114) in /data/user/htdocs/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /data/user/htdocs/wp-includes/functions.php:6114) in /data/user/htdocs/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /data/user/htdocs/wp-includes/functions.php:6114) in /data/user/htdocs/wp-includes/rest-api/class-wp-rest-server.php on line 1893
{"id":36047,"date":"2024-11-13T11:10:10","date_gmt":"2024-11-13T03:10:10","guid":{"rendered":"https:\/\/linguaresources.com\/?p=36047"},"modified":"2024-11-13T11:10:10","modified_gmt":"2024-11-13T03:10:10","slug":"cohere-releases-aya-expanse-multilingual-ai-models","status":"publish","type":"post","link":"https:\/\/linguaresources.com\/?p=36047","title":{"rendered":"Cohere Releases \u201cAya Expanse\u201d Multilingual AI Models"},"content":{"rendered":"\n

Cohere for AI<\/a>, the research arm of the language artificial intelligence (AI)<\/a> company, has introduced two new large language models (LLMs) \u2014 Aya Expanse 8B and 32B \u2014 as part of its ongoing project aimed at closing language divides in foundational AI datasets and models. The Aya Expanse models provide researchers access to advanced AI capabilities across 23 languages, including Arabic, Chinese, French, and Hindi.<\/p>\n\n\n\n

\u201cBuilding on more than two years of open science research, Aya Expanse offers significant performance advances, setting a new state-of-the-art for multilingual LLMs,\u201d the Cohere website<\/a> states. \u201cThis includes a series of breakthroughs in data arbitrage, preference training for performance and safety, and model merging.\u201d<\/p>\n\n\n\n

According to a SiliconANGLE<\/em> article<\/a>, the two Aya Expanse models were launched with open weights on hosting sites Hugging Face<\/a> and Kaggle<\/a>, and they used \u201cseveral new core research innovations\u201d to achieve high performance, including \u201csynthetic data and human feedback in late-term training.\u201d<\/p>\n\n\n\n

In a blog post<\/a>, Cohere claims that Aya Expanse 32B outperforms models like Google<\/a>\u2019s Gemma 2 27B and Meta<\/a>\u2019s Llama 3.1 70B. For lower-parameter options, Aya Expanse 8B also demonstrated advantages over other similar-sized models like Gemma 2 9B and Llama 3.1 8B. \u201cThe improvements in Aya Expanse are the result of a sustained focus on expanding how AI serves languages around the world by rethinking the core building blocks of machine learning breakthroughs,\u201d the blog post states.<\/p>\n\n\n\n

According to a VentureBeat<\/em> article<\/a> by Emilia David, the Aya initiative attempts to solve the problem of research being done on LLMs that don\u2019t perform well in languages other than English. \u201cMany LLMs eventually become available in other languages, especially for widely spoken languages, but there is difficulty in finding data to train models with the different languages,\u201d David writes. \u201cIt can also be difficult to accurately benchmark the performance of models in different languages because of the quality of translations.\u201d<\/p>\n\n\n\n

Aya, derived from the Twi language term for \u201cfern,\u201d has grown into one of the world\u2019s largest open-source multilingual projects, featuring over 513 million data points curated across 101 languages and 250 language ambassadors worldwide. This collaborative approach allows Aya\u2019s datasets to expand research opportunities in regions where non-English AI resources remain limited.<\/p>\n","protected":false},"excerpt":{"rendered":"