DeepSeek's Refusals: How Do the Guardrails of Language Models Compare?

03-02-2025

With the recent release of DeepSeek, the AIDA team has revised their work “Large Language Models Reflect the Ideology of Their Creators”, which aims to explore whether widely-used LLMs exhibit certain ideological biases, possibly shaped by the worldview of their creators.

Read the full blog post on the AIDA team website.

Want to know more?
Visit our team page and learn more about team AIDA.
AIDA
Copyright © 2025 IDLab. All rights reserved.