/moe/



「late February」

Samu /人◕ ‿‿ ◕人\
147 replies omitted.
Rei !p8eYCadcMo
>>>/@Asmiirin/2027132341722771494
Rei !p8eYCadcMo
stay safe whoa-kun
Samu /人◕ ‿‿ ◕人\
we fight the witches at dawn
Samu /人◕ ‿‿ ◕人\
>>>/watch?v=UqpttJ4SFXU
duuude
extended version of sis puella magica
Cinema
Samu /人◕ ‿‿ ◕人\
> Previous work assumed refusal behavior to be encoded as a single direction in the model's latent space; e.g., computed as the difference between the centroids of harmful and harmless prompt representations. However, emerging evidence suggests that concepts in LLMs often appear to be encoded as a low-dimensional manifold embedded in the high-dimensional latent space. Just like numbers and days of week are encoded in circles or helices, in recent advanced neural networks like GPT-OSS refusals are becoming ingrained in complex multi-directional clusters and one-directional ablation is not enough to get rid of the refusal reasoning.

uhh sure if you say so
moon i need an adult
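
For anyone else squinting at that quote: below is a rough toy sketch of the single-direction version it describes, i.e. take the difference between the centroids of "harmful" and "harmless" representations, normalise it, and project that direction out of a hidden state. The function names (centroid, refusal_direction, ablate) and the 3-dimensional vectors are made up for illustration and are not from the quoted source or any real library. The multi-directional case the quote talks about would presumably need more than this one projection.

```python
import math

def centroid(vectors):
    """Mean vector of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def refusal_direction(harmful, harmless):
    """Unit vector from the harmless centroid to the harmful centroid."""
    diff = [a - b for a, b in zip(centroid(harmful), centroid(harmless))]
    norm = math.sqrt(sum(x * x for x in diff))
    return [x / norm for x in diff]

def ablate(h, direction):
    """Remove the component of h that lies along a unit direction."""
    dot = sum(a * b for a, b in zip(h, direction))
    return [a - dot * b for a, b in zip(h, direction)]

# Toy, hand-written "representations" (hypothetical data, 3 dimensions).
harmful = [[2.0, 1.0, 0.0], [4.0, 1.0, 0.0]]
harmless = [[0.0, 1.0, 0.0], [0.0, 1.0, 0.0]]

r = refusal_direction(harmful, harmless)
h_clean = ablate([5.0, 2.0, 1.0], r)

# After ablation the hidden state has no component left along r.
residual = sum(a * b for a, b in zip(h_clean, r))
```

This only zeroes out one direction; if the behaviour really lives in a cluster of several directions, the leftover components would be untouched by it.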

Mahou Shoujo Marsh-chan