Uncategorized

We still need the Next Big Leap in Artificial Intelligence

Claude 3.5 Sonnet: AI Innovation Meets Machine Learning: Anthropic’s New Model of Artificial Intelligence for Biological and Computer Science

Michael Gerstenhaber, head of product at Anthropic, says the company’s new Claude 3.5 Sonnet model is larger than its predecessor but draws much of its new competence from innovations in training. For example, the model was given feedback designed to improve its logical reasoning skills. Claude 3.5 Sonnet scored better than models from OpenAI,Google, and Facebook on popular artificial intelligence tests, including a graduate level test of expertise in biology, physics, and chemistry. The changes are a matter of percentage points. This latest progress in AI might not be revolutionary but it is fast-paced: Anthropic only announced its previous generation of models three months ago. If you look at intelligence change at a fast pace, you will realize how fast we are moving.

Anthropic has a new feature called Artifacts. With Artifacts, you’ll be able to see and interact with the results of your Claude requests: if you ask the model to design something for you, it can now show you what it looks like and let you edit it right in the app. If Claude writes you an email, you can edit the email in the Claude app instead of having to copy it to a text editor. It’s a small feature, but a clever one — these AI tools need to become more than simple chatbots, and features like Artifacts just give the app more to do.

What is it that actually means? Claude 3.5 Sonnet will be better at writing and interpreting code, handling multistep operations, interpreting charts and graphs, and typing text from images according to Anthropic. Claude is better at understanding humor and is now more able to write in a more human way.

GPT-4: OpenAI’s next generation of large langauge solutions to problems the world wouldn’t have known before – and how it had gotten worse

When OpenAI announced GPT-4, its latest large language model, last March, it sent shockwaves through the tech world. It was clearly more capable of solving problems than had been seen before.