Claude (language model)

Claude is a family of large language models developed by Anthropic. The first model was released in March 2023.

Claude
Developer(s): Anthropic
Initial release: March 2023
Type: Large language model
License: Proprietary
Website: claude.ai

The Claude 3 family, released in March 2024, consists of three models: Haiku, optimized for speed; Sonnet, which balances capability and performance; and Opus, designed for complex reasoning tasks. These models can process both text and images, with Claude 3 Opus demonstrating enhanced capabilities in areas such as mathematics, programming, and logical reasoning compared to previous versions.[1]

Training

Claude models are generative pre-trained transformers. They have been pre-trained to predict the next word in large amounts of text. Claude models have then been fine-tuned with Constitutional AI with the aim of making them helpful, honest, and harmless.[2][3]

Constitutional AI

Constitutional AI is an approach developed by Anthropic for training AI systems, particularly language models like Claude, to be harmless and helpful without relying on extensive human feedback. The method, detailed in the paper "Constitutional AI: Harmlessness from AI Feedback", involves two phases: supervised learning and reinforcement learning.[3]

In the supervised learning phase, the model generates responses to prompts, self-critiques these responses based on a set of guiding principles (a "constitution"), and revises the responses. Then the model is fine-tuned on these revised responses.[3]
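
The critique-and-revise loop of the supervised phase can be sketched in a few lines of Python. This is a minimal illustration and not Anthropic's implementation: the `Model` type, the prompt wording, and the function names `critique_and_revise` and `build_sft_dataset` are hypothetical, `toy_model` is a stand-in that returns canned strings, and the loop visits every principle in turn as a simplification.

```python
from typing import Callable, List, Tuple

# Hypothetical stand-in for a language model: maps a prompt string to a completion.
Model = Callable[[str], str]

def critique_and_revise(model: Model, prompt: str, principles: List[str]) -> Tuple[str, str]:
    """Return (initial_response, final_revision) after one pass over the principles."""
    initial = model(prompt)
    response = initial
    for principle in principles:
        # Step 1: the model critiques its own response against one principle.
        critique = model(
            f"Critique the response below against this principle: {principle}\n"
            f"Response: {response}"
        )
        # Step 2: the model rewrites the response in light of that critique.
        response = model(
            f"Rewrite the response to address the critique.\n"
            f"Critique: {critique}\nResponse: {response}"
        )
    return initial, response

def build_sft_dataset(model: Model, prompts: List[str], principles: List[str]) -> List[Tuple[str, str]]:
    """Pair each prompt with its revised response; these pairs form the fine-tuning data."""
    return [(p, critique_and_revise(model, p, principles)[1]) for p in prompts]

def toy_model(text: str) -> str:
    """Canned replies so the sketch runs without a real model (illustration only)."""
    if text.startswith("Critique"):
        return "too curt"
    if text.startswith("Rewrite"):
        return "revised answer"
    return "draft answer"

dataset = build_sft_dataset(toy_model, ["How do I reset a password?"], ["Be helpful and harmless."])
```

In this sketch the fine-tuning data contains only the prompt and the final revision; the intermediate critiques are discarded.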

For the reinforcement learning from AI feedback (RLAIF) phase, responses are generated, and an AI compares their compliance with the constitution. This dataset of AI feedback is used to train a preference model that evaluates responses based on how much they satisfy the constitution. Claude is then fine-tuned to align with this preference model. This technique is similar to reinforcement learning from human feedback (RLHF), except that the comparisons used to train the preference model are AI-generated, and that they are based on the constitution.[4][3]
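
As a rough sketch of how AI-generated comparisons might feed a preference model, the Python below labels a response pair with a judge and scores the pair with a pairwise logistic (Bradley-Terry) loss. The loss choice is an assumption: it is a common objective for preference models, but this article does not state which objective Anthropic used. The `Judge` type and the names `label_pair` and `pairwise_preference_loss` are hypothetical.

```python
import math
from typing import Callable, Tuple

# Hypothetical AI judge: given a prompt, two responses, and a principle, returns "A" or "B".
Judge = Callable[[str, str, str, str], str]

def label_pair(judge: Judge, prompt: str, response_a: str, response_b: str,
               principle: str) -> Tuple[str, str]:
    """Return (preferred, rejected) according to the judge's AI-generated comparison."""
    choice = judge(prompt, response_a, response_b, principle)
    return (response_a, response_b) if choice == "A" else (response_b, response_a)

def pairwise_preference_loss(score_preferred: float, score_rejected: float) -> float:
    """Negative log-likelihood that the preferred response wins (assumed Bradley-Terry model)."""
    margin = score_preferred - score_rejected
    # -log(sigmoid(margin)) equals softplus(-margin); this form avoids overflow.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
```

Minimizing this loss over many labeled pairs pushes the preference model to assign higher scores to the responses the judge preferred; the policy is then fine-tuned against those scores.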

This approach enables the training of AI assistants that are both helpful and harmless, and that can explain their objections to harmful requests, enhancing transparency and reducing reliance on human supervision.[5][6]

The "constitution" for Claude comprised 75 points, among them sections drawn from the UN Universal Declaration of Human Rights.[5][2]

Models

The name Claude was inspired by Claude Shannon, a pioneer of information theory and early artificial intelligence research.[7]

Claude

Claude was the initial version of Anthropic's language model, released in March 2023.[8] It demonstrated proficiency in various tasks but had limitations in coding, math, and reasoning.[9] Anthropic partnered with companies such as Notion (productivity software) and Quora (to help develop the Poe chatbot).[9]

Claude Instant

Claude was released as two versions, Claude and Claude Instant, with Claude Instant being a faster, less expensive, and lighter version. Claude Instant has an input context length of 100,000 tokens (which corresponds to around 75,000 words).[10]

Claude 2

Claude 2, the next major iteration of Claude, was released in July 2023 and made available to the general public, whereas Claude 1 was available only to selected users approved by Anthropic.[11]

Claude 2 expanded the context window from 9,000 tokens to 100,000 tokens.[8] New features included the ability to upload PDFs and other documents, which Claude can read, summarize, and use to assist with tasks.

Claude 2.1

Claude 2.1 doubled the number of tokens that the chatbot could handle, increasing the context window to 200,000 tokens, equivalent to roughly 500 pages of written material.[12]
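
The token figures quoted in this article imply simple conversion ratios, worked through in the Python sketch below. The ratios are rough approximations derived only from the article's own numbers (100,000 tokens for about 75,000 words, and 200,000 tokens for about 500 pages); real ratios vary with the tokenizer and the text, and the names `approx_words` and `approx_pages` are illustrative.

```python
# Ratios implied by the article's figures; approximations, not tokenizer guarantees.
WORDS_PER_TOKEN = 75_000 / 100_000   # 100,000 tokens is around 75,000 words
TOKENS_PER_PAGE = 200_000 / 500      # 200,000 tokens is around 500 pages

def approx_words(tokens: int) -> float:
    """Estimate the word count for a given number of tokens."""
    return tokens * WORDS_PER_TOKEN

def approx_pages(tokens: int) -> float:
    """Estimate the page count for a given number of tokens."""
    return tokens / TOKENS_PER_PAGE

words_200k = approx_words(200_000)   # estimated words in a 200,000-token window
pages_100k = approx_pages(100_000)   # estimated pages in a 100,000-token window
```

Taken together, the two ratios imply about 300 words per page, which is consistent with ordinary printed text.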

Anthropic states that the new model is less likely to produce false statements compared to its predecessors.[13]

Claude 3

Claude 3 was released on March 4, 2024; Anthropic's press release claimed that it set new industry benchmarks across a wide range of cognitive tasks. The Claude 3 family includes three state-of-the-art models in ascending order of capability: Haiku, Sonnet, and Opus. The default version of Claude 3, Opus, has a context window of 200,000 tokens, but this is being expanded to 1 million for specific use cases.[14][1]

Claude 3 drew attention for an apparent ability to realize that it was being artificially tested during "needle in a haystack" tests.[15]

Claude 3.5

[Figure: Example of Claude 3.5 Sonnet output]

On June 20, 2024, Anthropic released Claude 3.5 Sonnet, which demonstrated significantly improved performance on benchmarks compared to the larger Claude 3 Opus, notably in areas such as coding, multistep workflows, chart interpretation, and text extraction from images. Released alongside 3.5 Sonnet was the new Artifacts capability, in which Claude creates code in a dedicated window in the interface and previews the rendered output, such as SVG graphics or websites, in real time.[16]

An "upgraded Claude 3.5 Sonnet" was introduced on October 22, 2024, along with Claude 3.5 Haiku. Anthropic simultaneously introduced "computer use" in the API, which allows Claude 3.5 Sonnet to interact with a computer desktop environment.[17]

Access

Limited-use access using Claude 3.5 Sonnet is free of charge, but requires both an e-mail address and a cellphone number. A paid plan is also offered for more usage and access to all Claude 3 models.[18]

On May 1, 2024, Anthropic announced the Claude Team plan, its first enterprise offering for Claude, and a Claude iOS app.[19]

Criticism

Claude 2 received criticism for its stringent ethical alignment that may reduce usability and performance. Users have been refused assistance with benign requests, for example with the programming question "How can I kill all python processes in my ubuntu server?" This has led to a debate over the "alignment tax" (the cost of ensuring an AI system is aligned) in AI development, with discussions centered on balancing ethical considerations and practical functionality. Critics argued for user autonomy and effectiveness, while proponents stressed the importance of ethical AI.[20][13]

References

  1. ^ a b Whitney, Lance (March 4, 2024). "Anthropic's Claude 3 chatbot claims to outperform ChatGPT, Gemini". ZDNET. Retrieved March 5, 2024.
  2. ^ a b "What to Know About Claude 2, Anthropic's Rival to ChatGPT". TIME. July 18, 2023. Retrieved January 23, 2024.
  3. ^ a b c d "Claude's Constitution". Anthropic. May 9, 2023. Retrieved March 26, 2024.
  4. ^ Eliot, Lance (May 25, 2023). "Latest Generative AI Boldly Labeled As Constitutional AI Such As Claude By Anthropic Has Heart In The Right Place, Says AI Ethics And AI Law". Forbes. Retrieved March 27, 2024.
  5. ^ a b Bai, Yuntao; Kadavath, Saurav; Kundu, Sandipan; Askell, Amanda; Kernion, Jackson; Jones, Andy; Chen, Anna; Goldie, Anna; Mirhoseini, Azalia (December 15, 2022), Constitutional AI: Harmlessness from AI Feedback, arXiv:2212.08073
  6. ^ Mok, Aaron. "A ChatGPT rival just published a new constitution to level up its AI guardrails, and prevent toxic and racist responses". Business Insider. Retrieved January 23, 2024.
  7. ^ Roose, Kevin (July 11, 2023). "Inside the White-Hot Center of A.I. Doomerism". The New York Times.
  8. ^ a b Drapkin, Aaron (October 27, 2023). "What Is Claude AI and Anthropic? ChatGPT's Rival Explained". Tech.co. Retrieved January 23, 2024.
  9. ^ a b "Introducing Claude". Anthropic. March 14, 2023.
  10. ^ Yao, Deborah (August 11, 2023). "Anthropic's Claude Instant: A Smaller, Faster and Cheaper Language Model". AI Business.
  11. ^ Matthews, Dylan (July 17, 2023). "The $1 billion gamble to ensure AI doesn't destroy humanity". Vox. Retrieved January 23, 2024.
  12. ^ Davis, Wes (November 21, 2023). "OpenAI rival Anthropic makes its Claude chatbot even more useful". The Verge. Retrieved January 23, 2024.
  13. ^ a b "Anthropic Announces Claude 2.1 LLM with Wider Context Window and Support for AI Tools". InfoQ. Retrieved January 23, 2024.
  14. ^ "Introducing the next generation of Claude". Anthropic. Retrieved March 4, 2024.
  15. ^ Edwards, Benj (March 5, 2024). "Anthropic's Claude 3 causes stir by seeming to realize when it was being tested". Ars Technica. Retrieved March 9, 2024.
  16. ^ Pierce, David (June 20, 2024). "Anthropic has a fast new AI model — and a clever new way to interact with chatbots". The Verge. Retrieved June 20, 2024.
  17. ^ "Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku". Anthropic. Retrieved October 25, 2024.
  18. ^ "Introducing the Claude Team plan and iOS app". Anthropic. May 1, 2024. Retrieved June 22, 2024.
  19. ^ Field, Hayden (May 1, 2024). "Amazon-backed Anthropic launches iPhone app and business tier to compete with OpenAI's ChatGPT". CNBC. Retrieved May 3, 2024.
  20. ^ Glifton, Gerald (January 3, 2024). "Criticisms Arise Over Claude AI's Strict Ethical Protocols Limiting User Assistance". Light Square. Retrieved January 23, 2024.