DeepSeek's Refusals: How Do the Guardrails of Language Models Compare?

03-02-2025

Following the recent release of DeepSeek, the AIDA team has revised its work “Large Language Models Reflect the Ideology of Their Creators”, which explores whether widely used LLMs exhibit ideological biases, possibly shaped by the worldview of their creators.

Read the full blog post on the AIDA team website.

Want to know more?

Visit our team page and learn more about team AIDA.
