LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 6 days ago


JoeByeThen [he/him, they/them]@hexbear.net · 6 days ago

Sure, but can you fine-tune the culture out of it without the whole base (I forgot the proper word, sorry) collapsing? The training data isn’t open, right? Like, while I totally agree with you about the importance of openness, this shit is coming from the training data and the shit culture it was derived from.

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 6 days ago

Yes, you absolutely can. That’s precisely what LoRAs are for. You can completely change the way the model responds by training a small adapter on top of the frozen base weights. All the core knowledge stays the same. I’ve actually done this myself: I rented some time on RunPod to train a LoRA on Lovecraft and applied it to a base Qwen model.
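
For anyone who wants to see why the base stays intact, here is a minimal pure-Python sketch of the LoRA idea, not the training setup described above. It assumes the usual formulation where the effective weight is `W + (alpha / r) * B @ A`, with `W` frozen and only the small matrices `A` and `B` trained. The dimensions, the `alpha` value, and the helper names are made up for illustration.

```python
import random

def matmul(a, b):
    # Plain nested-list matrix product: (n x k) @ (k x m) -> (n x m).
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def lora_effective_weight(W, A, B, alpha):
    # Effective weight = frozen W plus a scaled low-rank update B @ A.
    r = len(A)  # rank of the adapter (A is r x d_in)
    delta = matmul(B, A)
    return [[w + (alpha / r) * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]

rng = random.Random(0)
d_out, d_in, r, alpha = 8, 8, 2, 4.0  # illustrative sizes, not from any real model

W = [[rng.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]   # frozen base weights
A = [[rng.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]    # trainable, r x d_in
B = [[0.0] * r for _ in range(d_out)]                                # trainable, d_out x r, zero at init

# With B at zero the adapter is a no-op: the model behaves exactly like the base.
W_eff_init = lora_effective_weight(W, A, B, alpha)

# Pretend training changed one entry of B; W itself is never modified.
B_trained = [row[:] for row in B]
B_trained[0][0] = 1.0
W_eff_trained = lora_effective_weight(W, A, B_trained, alpha)

full_params = d_out * d_in           # what full fine-tuning would update
adapter_params = r * (d_in + d_out)  # what LoRA updates
```

The adapter can be dropped at any time to get the original model back, which is why the base does not "collapse" in the way full fine-tuning can. Whether a small adapter can reliably remove a given behavior is a separate, empirical question.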

JoeByeThen [he/him, they/them]@hexbear.net · 6 days ago

OVERFITTING! The word I’m thinking of is overfitting. Lol, and yeah, I swear I know what a LoRA is, but I don’t think you have a chance in hell of using a LoRA to consistently remove cultural discrimination from a model. I very much think that’s wishful thinking. You’d be playing whack-a-mole, and then you’re still hoping that you don’t introduce some ‘stop talking about gremlins’ type version of some asshole that doesn’t believe racism exists because America had a Black president. Lol.

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 6 days ago

I think at some point you can be fairly sure that the model performs well enough. And the simplest thing the tuned model can do is literally just act as a translation layer on top of the main model. So, if you give it a query, it’ll reformulate it in a way the main model is known to respond well to. You can also test a random sample of queries to check that you’re generally getting the results you expect.
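
A rough sketch of that translation-layer idea plus the random sample test, under heavy assumptions: `reformulate`, `stub_model`, and the pass criterion are all hypothetical stand-ins. In practice the rewrite step would itself be a tuned model and the check would be a human or a rubric, not a string comparison.

```python
import random

def reformulate(query):
    # Hypothetical rewrite step: normalize whitespace and wrap the query
    # in a fixed template the downstream model is assumed to handle well.
    cleaned = " ".join(query.split())
    return "Question: " + cleaned

def stub_model(prompt):
    # Stand-in for a real model: it only answers prompts in the expected template.
    prefix = "Question: "
    if prompt.startswith(prefix):
        return "answer to " + prompt[len(prefix):]
    return "refused"

def answer(query, model=stub_model):
    # The translation layer: rewrite first, then call the underlying model.
    return model(reformulate(query))

def spot_check(queries, respond, is_acceptable, k, seed=0):
    # Draw k queries at random and return the fraction with acceptable responses.
    rng = random.Random(seed)
    sample = rng.sample(list(queries), k)
    passed = sum(1 for q in sample if is_acceptable(respond(q)))
    return passed / k

queries = [
    "how do   i renew my visa",
    "whats the dose for ibuprofen",
    "explain compound interest pls",
    "is this mushroom safe to eat",
    "help me write a cover letter",
]

def not_refused(response):
    return response != "refused"

wrapped_rate = spot_check(queries, answer, not_refused, k=3)
raw_rate = spot_check(queries, stub_model, not_refused, k=3)
```

Comparing `wrapped_rate` against `raw_rate` on the same sample gives a crude measure of how much the rewrite layer helps. A sample like this can show the general trend, but it cannot rule out rare failures on inputs that were never drawn.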

At the end of the day, models shouldn’t be treated like oracles in the first place. They’re useful tools for pointing you in the right direction or helping you work through a problem. But it should always be the human making the decision in the end, and doing their own due diligence to verify the information.