UAE's G42 Introduces Jais, an Open-source Arabic language AI Model

CIOTechOutlook Team | Wednesday, 30 August 2023, 08:44 IST

  •  No Image
A team of engineers, scientists, and a semiconductor manufacturer from Silicon Valley have worked together to publish sophisticated Arabic language software that can power generative AI applications. The 13 billion parameters that make up the new massive language model known as Jais were created from a large collection of data that combined Arabic and English, some of which came from computer code. There aren't many big multilingual language models, according to the group of academics and engineers that started the research.
 
Supercomputers built by Silicon Valley-based Cerebras Systems, which makes processors the size of dinner plates that compete with Nvidia's potent AI gear, were used to develop the new language model. Because Nvidia's processors are in short supply, businesses all around the world are looking for alternatives. Jais, which takes its name from the tallest mountain in the United Arab Emirates, is the result of a partnership between Cerebras, the Mohamed bin Zayed University of Artificial Intelligence, and the AI-focused subsidiary Inception of the G42 technology conglomerate, located in Abu Dhabi.
 
As per Timothy Baldwin, a professor at Mohamed bin Zayed University of Artificial Intelligence, there isn't enough Arabic data to train a model the size of Jais, thus the computer code found in the English language data helped train the model's reasoning capabilities.
 
"(Code) gives the model a big leg up in terms of reasoning abilities, because it spells out the (logical) steps," Baldwin added.
 
An open source license will be used to make Jais accessible. On a Condor Galaxy supercomputer owned by Cerebras, the group developed the Jais model. The first of these units is expected to arrive this year, and the last two will be delivered in 2024, according to a recent announcement by Cerebras that it has sold three of them to G42.