Algorithmic-based Guardrails: External guardrail models and alignment methods
Posted on Mon 28 July 2025 in ml-memorization
You've probably heard the term "guardrails" at some point in discussions of security or safety in AI systems such as LLMs or multi-modal models (i.e. models that accept and produce multiple modalities, such as speech, images, video, and text).
Are you a visual learner? There's a YouTube video for …