The AI company Cerebras Systems has announced that it has trained and is releasing a series of seven GPT-based large language models (LLMs) for open use by the research community.
First, because Cerebras is sharing the models, weights, and training recipe under the industry-standard Apache 2.0 license, the entire AI community now has a known-good base on which to build customized models for specific use cases and domains, without paying for access to the OpenAI APIs.
Second, Cerebras was able to train these models quite easily, especially when compared with the staffing required to create and distribute a large model across a super-cluster of accelerators such as GPUs or Google TPUs. In fact, Cerebras claims it assigned only a single engineer to train these models, compared with some 35 engineers OpenAI used to build the distributed model to run across its GPU infrastructure. Cerebras wins for simplicity of development and deployment.
According to Cerebras, “Typically a multi-month undertaking, this work was completed in a few weeks thanks to the incredible speed of Cerebras CS-2 systems comprising Andromeda, and the ability of their Weight Streaming architecture to eliminate the pain of distributed compute. These results demonstrate that Cerebras’ systems can train the largest and most complex AI workloads today.”
Finally, Cerebras GPT is the first family of GPT models that are compute-efficient at every model size.