It is such a stupid and obvious market failure that nobody has made a consumer AI LLM product that is 1. trained on consensually-acquired material 2. powered with renewable energy 3. genuinely open about its weights and models. Just achieving these things and being creator-friendly would be massive.
@anildash i was actually thinking, as an art project, of getting a solar panel and doing this with a collection of CC0 content. i decided not to do pursue this after seeing how things developed with OpenAI, on the grounds that if a true-open model existed, the proponents of closed/stolen models would point to my open model to go "see? AI doesn't have to be based on stolen content!" then continue using the stolen content.
@anildash Put a different way, I think one reason this doesn't exist is that the presence of stolen material in LLM models is not a flaw, but the primary attraction. Copyright laundering is the core product.
If the users did not want to do copyright laundering, then the product might not even need the machine learning model at all, in that world a simple tag system might be adequate. The purpose the model serves in the system is to randomize the inputs enough to disguise the sources.
@mcc I think about this a lot. The “then they’ll use it to justify the bad thing”. But they do that *anyway*, and we end up without the ethical thing. Like… we’re on Mastodon. You know who literally forked it to make a fascist network. They would have done that anyway! But this is still a thing of value.