The Berkeley Artificial Intelligence Research Blog
- Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)
- Repurposing Protein Folding Models for Generation with Latent Diffusion
- Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment
- Virtual Personas for Language Models via an Anthology of Backstories
- Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination
- How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark
- Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark!
- TinyAgent: Function Calling at the Edge
- Modeling Extremely Large Images with xT
- 2024 BAIR Graduate Directory
- The Shift from Models to Compound AI Systems
- Ghostbuster: Detecting Text Ghostwritten by Large Language Models
- Asymmetric Certified Robustness via Feature-Convex Neural Networks
- Goal Representations for Instruction Following
- Rethinking the Role of PPO in RLHF
- Training Diffusion Models with <br> Reinforcement Learning
- On the Stepwise Nature of <br> Self-Supervised Learning
- Generating 3D Molecular Conformers via Equivariant Coarse-Graining and Aggregated Attention