Posted by z3d in ArtInt

Google has been developing the Gemini large language model (LLM) over the past eight months and recently gave a small group of companies access to an early version.

The conversational, genAI tool is by far Google’s most powerful, according to the company, and it could be a serious challenger to other LLMs such as Meta’s Llama 2 and OpenAI’s GPT-4.

“This new era of models represents one of the biggest science and engineering efforts we’ve undertaken as a company,” Google CEO Sundar Pichai wrote in a blog post.

The new LLM is capable of multiple methods of input, such as photos, audio, and video, or what’s known as a multimodal model. The standard approach to creating multimodal models typically involved training separate components for different modalities and then stitching them together.

1

Comments

You must log in or register to comment.

There's nothing here…