Safety Measures and Concerns
Anthropic has implemented several safety features to address potential risks:
- No training on user screenshots or prompts
- Restricted web access during training
- Built-in classifiers to prevent high-risk actions
- 30-day retention of screenshots for safety monitoring
- Ability to restrict access to specific websites and features
- Pre-deployment testing by U.S. and U.K. AI Safety Institutes