November 15, 2023

Multiverse Computing Launches CompactifAI to Streamline LLMs to Reduce Energy Use and Compute Costs

Thumbnail

SAN SEBASTIÁN, SPAIN, November 15, 2023 /EINPresswire.com/ -- Multiverse Computing, a global leader in value-based quantum computing solutions, today announced CompactifAI, a solution for the high compute cost of machine learning algorithms. This new tool for managing the significant energy demands required to train and run large language models (LLMs) like ChatGPT and Bard is designed to reduce development costs and make it easier to integrate these models into more digital services.

CompactifAI uses tensor networks to reduce the number of parameters in a model, reducing its overall size as well as shrinking memory and storage space requirements. Retraining has the potential to be faster as well since users can add new data to an existing model that has already been compressed to generate an updated model that retains the benefits of compression while preserving the quality of results.

CompactifAI is designed to reduce energy requirements at multiple points in the lifecycle of an LLM, including the training phase, general operations and retraining. Multiverse’s new software also reduces the overall footprint of the models, making them more portable and easier to run at the edge in applications such as autonomous vehicles and remote production facilities. A video showing how CompactifAI works is available here.

“This new tool will lift a major barrier to the growth of large language models across industries: the sheer size of these algorithms and data sets and the energy required to run them,” said Enrique Lizaso Olmos, CEO of Multiverse Computing. “CompactifAI also opens the door to new LLM uses cases for on premise, on the edge and other applications where there is no dedicated cloud connection.”

A new study of AI energy consumption found that by 2027 worldwide AI-related electricity consumption each year could match the annual energy use of countries such as the Netherlands, Argentina and Sweden.[1,2] The research conclusions are based on estimations of the amount of energy needed to run the most popular servers that power LLMs. The research also considered increased usage as the models become integrated into popular search engines and other elements of internet infrastructure.

LLMs already consume a significant amount of energy in both the training phase and in daily operations. A University of Washington researcher estimates that a single LLM could require up to 10 gigawatts of power during the training phase, equal to the energy use of more than 1,000 U.S. households annually. To respond to hundreds of millions of queries each day, those same models could use up to 1 gigawatt of power each day, which is roughly equal to the daily energy usage rate of 33,000 US households.

"LLMs demand significant energy, compute power, and memory resources during their training process and during their lifecycle,” said Rafael San Juan, Global Innovation Manager at Iberdrola, one of the world’s largest electric utility providers and Multiverse customer. “An innovative solution capable of optimizing use of resources holds the potential to revolutionize the landscape, minimizing the impact this has on the electric grid and enabling the true scaling of LLM solutions.”

The software offers low, medium and high options for compressing a model, depending upon the application requirements for a particular LLM. Multiverse expects AI developers to be the initial users for this software-as-a-service platform.

Originally used in the study of condensed matter physics, a tensor network is a visual language that describes complex systems. This visual language makes it easier to understand how each component in the system interacts with all the others and to predict the results of those interactions. Extracting information from tensor networks is relatively straightforward, providing another benefit to this approach to computation.

Multiverse Computing’s Chief Science Officer Roman Orus was one of the first researchers to study tensor networks during a research fellowship at the University of Queensland in 2006. As a co-founder of the company, he applied this knowledge in one of Multiverse’s first projects: using tensor networks for portfolio optimization with a multinational bank.

More recently, researchers have used tensor networks as models for machine learning architecture and to compress the layers in neural networks. Tensor networks can be mapped directly to quantum circuits as well. Researchers expect tensor networks to serve as a bridge between today’s noisy quantum computers and the fault tolerant machines of the future.

Full article here