I don’t use Google, but DuckDuckGo Search Assist often shows summaries with links that don’t at all support what the summary says, but I often still forget to fact check it. I might start using the no AI version of the site because I’m concerned I might accidentally register something important as factual at some point that isn’t.
91% accuracy is the kind of thing that may sound good… hey! It’s an A minus! But it’s actually completely, totally unacceptable. Imagine if the turn signal wand on your car operated with 91% accuracy. About one in every ten times it would light up the wrong direction. How many accidents are we causing? A lot.
Whether 91% accuracy is acceptable depends on how unacceptable the 9% inaccuracy is. If 91% of the information in your term paper is correct you’ll probably get a decent grade, but if you only kill 91% of cancer cells the surviving 9% will grow a treatment-resistant tumor and you’ll probably die. This makes percentages essentially useless - more important is how badly wrong the worst wrong result is.
This is why we should ban cars outright. Go back to writing on paper. I can stick a pen in my ass and make a cute drawing of a cat. In fact, I might be able to eat a cat and defecate it later, to make it more realistic. And that’s what we need to be; realistic.
(This comment is about AI data centers)
I make this “comment” every once and a while because I called someone out on how their post made little sense by parodying it, and now I just do this.
Awesome news
Removing all the emotion from this, the specific problem with these AI overviews is how Google presents them to you.
Everybody with some sense knows AI’s can excrete total hogwash and it’s answers need to be fact checked down to the most minute detail. Some people take what they get from AI’s as gospel anyway, but that is a them problem.
But Google a: calls these summaries, and b: presents them as the top search result. Both of these things come with a greater than normal degree of implied factuality.
Someone techincally minded will know it’s still AI an subject to the same scrutiny but the population at large simply does not, because they entered a search query in a google search box and aren’t willingly and deliberately talking to an AI.
Everybody with some sense knows AI’s can excrete total hogwash and it’s answers need to be fact checked down to the most minute detail. Some people take what they get from AI’s as gospel anyway, but that is a them problem.
The vast majority of people have less than zero idea of what an AI chatbot is. They’re purposefully marketed to ensure it makes as little sense as possible to anyone who doesn’t do a good bit of research.
People hear or read ‘artificial intelligence’, and picture the batcomputer or C3-P0, which were, for the most part, always 100% correct.
But Google a: calls these summaries, and b: presents them as the top search result.
Also c: gives it a colorful, more pleasant looking window to manipulate users into subconsciously prioritizing it over ‘plain, ugly’ results below.
It still seems like letting people off the hook for media literacy is a knee jerk reaction. Since the dawn of Google, the VAST majority of people who use it have just treated the first result as gospel. I don’t know if scraping that same content and putting it in the same place with the word “Summary” above it is materially that different.
The core problem here is still individuals not taking accountability for their own education. I would actually argue that holding Google to the standard of somehow being “arbiters of truth” is even more dangerous. No one should trust any information presented to them by an entity that has vested financial interests in influencing consumer behavior.
And that’s a problem too. If everything ai says should be fact checked, the burden is shifted back to the consumer making the convenience or ‘productivity’ aspect virtually nonexistent. Such a cop out. Here’s your answer! *be sure to fact check. Okay so google it basically? Why the ai then? Just stupid
Here is an example… Apparently Zelensky was a US president at one point and we all missed it.

That’s actually a really good summary of the issue. It’s the tacitly implied authenticity and “goodness of match” that being the top result implies that shifts the balance.
If they’d put a “generate AI summary of search” button to display the AI result, I the they’d be on firmer ground.
Can we apply the same logic and principle to self driving cars now please and hold the owners of the proprietary software fully and properly responsible for every poor judgement, traffic violation, accident injury and death that happens in self drive mode.
What needs to happen is:
-
an enforceable certification process, like part of FMVSS, to state “this vehicle is certified as L[0-5] self driving per https://www.nhtsa.gov/sites/nhtsa.gov/files/2022-05/Level-of-Automation-052522-tag.pdf”. Put the certification on the window sticker. Have it reported to insurance.
-
Levels 0-2 cannot be advertised as self driving, even though there may be hands-free driving capability in some limited cases. The driver remains fully responsible and liable (no change to current liability rules essentially)
-
Levels 3 and 4 will be required to have shared liability with the driver and manufacturer in all conditions where the vehicle is in control of itself. This includes roads that the vehicle should be able to navigate autonomously and the driver has requested it to, but it is not for any reason.
-
Level 5 would place liability on the manufacturer solely, as there is no indicated driver in this case. This is the only one that can be advertised as “self-driving”.
I have little assurance in anything FMVSS certified. Motorcycle helmets with the DOT sticker(a legal requirement for on-road use in the US) are suppose to meet FMVSS 218.
When the NHTSA did a random test 42% failed. How is that possible you might ask. They let the manufacturer conduct the testing and self-certify.
Would work IF you add the proviso that levels 3-4 cannot be advertised or implied to operate on their own initiative.
-
Yes, but at the same time can we stop marketing as “self driving cars” normal cars with a somewhat sophisticated cruise control, like Teslas, and stop pretending their “super full self driving unsupervised for realsies plus plus” is a “self drive mode”?
Limited liability stops the owners being responsible. It’s the executives running the company that should go to jail.
“This slop is not available in your country.”
VPN getting set to Germany if that happens.
Based.
All the arguments of “AI doesn’t impact copyright because it creates derivative content” were bound to lead here. You can’t (or at least shouldn’t be able to) have it both ways.
I was thinking the same thing.
An AI output is EITHER an original work (either as a wholly original work or as a derivative of another work), or it’s not (and is thus a republication of an existing work).
If it’s a republication, then Google owes a ton of copyright fees and the original publisher of whatever bit of training data got regurgitated is liable. If it’s an original / derivative work, then Google owes nobody anything, but is responsible for whatever the AI outputs.
For example if I write somewhere ‘It’s 100% safe to mix ammonia and chlorine, it gets stains out super fast!’ (note- DON’T do this, it’s toxic), I’m the author of that statement so if someone does that and dies I’ve got partial responsibility for that death.
Same thing with Google.
For example if I write somewhere ‘It’s 100% safe to mix ammonia and chlorine, it gets stains out super fast!’ (note- DON’T do this, it’s toxic), I’m the author of that statement so if someone does that and dies I’ve got partial responsibility for that death.
Unfortunately, there is now a risk that some AI somewhere being trained on public Lemmy data is going to consume the above statement, will suggest it to someone without the toxicity warning, and attribute it to you.
Sadly this is happening.
Example one- Q: How many USB ports does my computer have? A: kill yourself.
Example two- Q: how should I deal with depression? A: jump off the Golden Gate Bridge.
Example three- Q: Should I run with scissors? A: Yup!
Let’s not forget a healthy diet includes eating rocks.
Or that you should drink urine to pass kidney stones.
Hey, your examples are missing /u/fucksmith’s pizza recipe from reddit.
Fun fact: Methyl cellulose is actually used as a form of glue, but is also used in food. It’s derived from cellulose. This also happens to be the stuff those anti-vegan fanatics are referring to, when they claim that vegan alternatives have “wallpaper paste” in them (while ignoring that it’s also used in some meat products and ice cream).
I thought wallpaper glue was made of cornstarch…
There are probably plenty of different products with different recipes out there. Might also depend on what’s readily available in the region.
For example if I write somewhere ‘It’s 100% safe to mix ammonia and chlorine, it gets stains out super fast!’ (note- DON’T do this, it’s toxic), I’m the author of that statement so if someone does that and dies I’ve got partial responsibility for that death.
Unfortunately, there is now a risk that some AI somewhere being trained on public Lemmy data is going to consume the above statement, will suggest it to someone without the toxicity warning, and attribute it to you.
Or us, since we’ve both quotes them now.
For example if I write somewhere ‘It’s 100% safe to mix ammonia and chlorine, it gets stains out super fast!’ (note- DON’T do this, it’s toxic), I’m the author of that statement so if someone does that and dies I’ve got partial responsibility for that death.
Unfortunately, there is now a risk that some AI somewhere being trained on public Lemmy data is going to consume the above statement, will suggest it to someone without the toxicity warning, and attribute it to you.
Or us, since we’ve both quotes them now.
I am Spartacus
His name is Robert Paulson.
Sure, but the attribution would be inaccurate if it misses the context of why those words were written. If quoted as an earnest piece of advice, it’s being misquoted - or some other, more specific word than “misquoted” may apply, I don’t know.
Legally? Probably. But that really wasn’t the point. The point was more that without suitable controls in place AIs are able to consume all sorts of bad data and potentially attribute it to you (or me, or whomever) while leaving out important context.
It won’t matter if some AI consumed your message and gave someone the advise to inappropriately mix harmful chemicals, attributed it to you, and they wound up hurting themselves or someone else. They might still blame you, and may not care that there was missing context.
Note that that’s not intended as any sort of criticism of you or your post, more that we’ve entered a wild-west of AI development, and we as content producers may not be entirely safe. We’ve already seen AIs recommend people try adding Elmer’s glue to pizza sauce based on joke posts online. It might only be a matter of time before a child or youth gets hurt — and an upset parent may not care about the semantics of whether or not you were correctly attributed or not.
But your honor I really wanna
If you add in “and here’s your check, sir” at the end this method actually works in the USA.
It’s all
moneychecks and balan… Ahh
Ayyy
Excellent. Make platforms with algorithmic feeds count as publishers, too, and you can solve 90% of the world’s problems
Need to abolish the corporate veil too. Those parasitic fucks can buy liability insurance.
What do you mean by “algorithmic”? Something like a recommendation engine?
Basically anything where the platform prioritises some things over others, rather than just giving you the posts/videos/whatever in order from the people you’ve subscribed to. Recommendation engines would be one example
To add, what happens is it prioritizes engagement (comments the most, then likes, then dislikes). The problem with that is this ends up prioritizing ragebait, preaching to the choir, disinformation, etc. since that is how you get engagement.
Hahahahahahahahahahahhahahahahaa! YES!
A regular search engine just points to outside websites. But AI overviews generate “independent, new, and substantive statements” by evaluating and combining content from various third-party sites. And only Google can check those statements, the court said, “at least by comparing the underlying third-party websites with its own statements based on them.”
Honestly this is all the reasoning you need to infer that Google should be liable. Google alone has editorial control over the summary their AI generates, not the outside sources used to generate these statements, ergo Google should be held liable for that.
At the hearing, Google argued that users could check the linked sources themselves to verify whether the AI summary was correct. Users generally knew “that information generated with AI should not be blindly trusted,” the company claimed.
… And you know that’s true when the best Google could muster as a defence is to say that people shouldn’t be blindly trusting the AI, which ironically means even Google thinks their AI is full of shit.
But unfortunately for Google, not only does the court not buy that defence, but it would appear that’s contrary to how most people use the feature.
The ruling may also have international reach, according to the court.
I seriously hope so. Its about time companies started taking proper liability for the actions of their LLMs.
And if Google complains that it’s on the pieces of info they got from 3th parties that were wrong and name them, then the 3th parties are able to request compensation for using that info.
Exactly. Can’t have it both ways.
If Google want to claim the liability falls with the source’s its pulling from, then it should be taking explicit permission to cite these sources and be paying them.
Otherwise it’s an AI-powered editorial, and that’s on Google.
Though personally I’d be happy with the entire system being scrapped, as it only serves to fuck over small publishers and people’s ability to search for and be critical of information.
Every “AI” company should be (reasonably) liable for what their tools say. How is this even a novel idea?
If a newspaper accidentaly prints false information, they have to publish a correction and might pay a fine. But if they print a front page article about making a pipe bomb, then the editor would probably get sentenced.
This approach would be perfectly fine with LLMs. I understand the nature of the technology, but if they cannot guarantee the quality of the output, then the product is just not ready yet, and we are currently doing unpaid testing.
Except that this will fuck over small companies. Because if we follow this reasoning, the next step is to debate what is AI and what’s not. And the poor folk lose that battle because of legal fees.
I mean, hey, tell me how an automated summary is not AI. Argue that. Give me a clear legal standard… Easy to hand wave, hard to get right.
I fail to understand why it should be bad for small companies.
In my experience most small companies don’t have public AI summaries. And even if they do i still think it’s their obligation to check what they make public.
In the not so distant future just about every site will have AI summarization or QnA as a core part.
Instead of searching through endless documentation you ask AI to trawl and give you the answer. This is undeniably useful. But if they give the wrong answer once and suddenly become liable, that’s a potential risk.
How is it “undeniably useful” if it has the potential of giving wrong answers?
Also and perhaps more importantly, are these the lengths people go to avoid reading? If so, we are doomed.
Not everyone enjoys reading documentation. We don’t need to be defensive about this. We already have search that can trawl through a well maintained site.
AI can not only go through the documentation but also translate it to layman and point to the sources.
If it gives the wrong answer 1 in every thousand results, it is still undeniably useful. You shouldn’t blindly trust AI is common place knowledge. And it’s no different than doing a Google search for something and some times clicking into a result that is bad. The fact that that possibility exists doesn’t change the fact google is "undeniably " useful.
I’m going to be straightforward with you and say that if someone doesn’t want to read documentation, they shouldn’t be doing the job the documentation is for.
I’ve been bitten by AI summarizing documentation so many times, these days I refuse to use it for that purpose anymore. It’s just not worth it. It creates a loop where it wants to try things that don’t work, walk back, try something else, repeat, and spend $10 worth of tokens in the process.
You say that I shouldn’t blindly trust AI like I shouldn’t blindly trust Google results. The difference is that AI is presented as an authoritative source in itself. Hell, most of the time LLMs don’t link sources unless explicitly asked for. And here’s the thing, if I have to go and read the actual sources, it isn’t doing anything significantly more time efficient than just text search, but it is doing it at ten times the cost.
I’m going to be straightforward with you and say that if someone doesn’t want to read documentation, they shouldn’t be doing the job the documentation is for.
That’s way too black and white. It’s a time and convenience thing. Let’s say i want to troubleshoot something on my motherboard that caused my pc to stop working. I really do not want to be reading through a 300 page manual from my phone (because my pc is not working). Search may turn up 10-20 relevant results that id have to scroll through.
And AI could take my query and do the work for me. Give me the link to the result they think is most relevant as well as explain it in more layman way than the manual.
I’m technical so I could do this without AI. But let’s take a less technical person. Now they can follow along and try as well.
The function is good. Its arguably the best path forward. The issue is accuracy and cost.
But something does not need to be accurate 100% of the time if we are all aware of it. If we wait for something to be 100% perfect nothing would ever progress.
And cost should only concern us on the environmental side. We should absolutely force them to fix that side of it. But price wise? Im really confused why the internet continues to bring up cost. I honestly dont care how much it costs a trillion dollar company to provide a service to us if I dont have to pay. Google, Youtube, Amazon, Netflix, all operated in the red for years and years. I don’t remember public discourse being omg how is Google going to afford to keep giving us nonshitty search.
A potential risk that any company implementing an AI for something as simple as a Q&A should be aware of prior to doing that.
If they don’t want the liability, then just don’t use AI for public facing functions. Its not difficult.
In the not so distant future just about every site will have AI summarization or QnA as a core part.
Hopefully not, and this ruling goes some way to ensuring sense prevails. It’s a little different if the LLM providing the “AI” summarization has been trained exclusively on the contents of the site; that ensures that only the work of the site authors is used in generating the summary, which means it’s their words, and also probably less likely to hallucinate.
Instead of searching through endless documentation you ask AI to trawl and give you the answer. This is undeniably useful.
I deny it. The results of an LLM being used to answer a question are far too often wrong to ever be trusted. Sometimes the errors are obvious, much more often they are subtle and harder to spot, but delivered with certainty none-the-less. This ruling ensures that the ones providing the LLM summary are held liable, in the same way they would be if a human wrote the same summary.
But if they give the wrong answer once and suddenly become liable, that’s a potential risk.
Correct, and that is as it should be. Apply the same logic to a human written piece and you will see that.
It’s very obviously not even the tiniest bit useful and, in fact, is simply a huge liability that could be done safer and cheaper by a person.
In the not so distant future you use your own personal AI to do the trawling and if that thing gets it wrong that’s on you or the company that made it.
There’s a difference between you making use of a tool and you publishing the results of that tool.
Why should a vendor be able to make false claims about a product with impunity?
I have a feeling that the megacorporation’s AI generating false statements about smaller businesses that effectively drives customership away from them harms a lot more small businesses way more than AI-powered businesses being held to account for what their AI states as fact publically.
(And that’s not even counting the harm Google’s AI summaries are already doing to small publishers by driving traffic away from the very websites its using as source material.)
If a company doesn’t want the liability associated with a rogue agent making false statements, then I’ve got news for you - they don’t have to use AI. Literally nobody is forcing small private businesses to use AI for anything.
And what @MyButSmellsBat@feddit.org has said is entirely true. Most small companies won’t have an AI, and those that do should still be held accountable for their AI’s public statements.
Does it use an LLM to generate the summary. Yes or no. This is a binary. It is incredibly easy to define. It’s almost laughable that you think this is a problem.
what small companies are giving users of their search engine ai overviews? why would you argue ai summaries are not ai? what the muddy waters is going on here…?
for liability it doesn’t matter whether small company has written the summary themselves or generated it. they already had liability for what they say.
I literally laughed out loud reading the headline. Good shit, hopefully the Find Out season will carry on at this kinda pace. Probably won’t, but it’d be nice to see.
A German regional court has ruled that Google is directly liable for false claims in its AI-generated search overviews.
Unfortunately, the regional court is the lowest court stage. This will climb up until highest German court and after this to the EU court, I expect to see.
Being then a „Grundsatzurteil“ that is leading all courts in Germany. Our legal system isn’t case driven.
Unfortunately, the regional court is the lowest court stage
No. Landgericht is the second stage already.
It’s Germany, they’ll just find a way to blame Brussels and throw more money at the US as an apology.
While this is a solid ruling and establishes great precedent, it’s in Germany and so likely will only eventually apply to the EU. It would be cool to see a similar decision from a US court.
If Google wants to stick with its AI push, I can’t imagine they would want to keep training 2 different models; especially if one of them could land them in more hot water down the road. While it would eventually apply to the EU, I can imagine the rollout would be global. Similar to how Apple was forced by the EU to ditch their proprietary connector for USB-C: instead of having an EU & North American model; they just adapted USB-C across all their devices
I imagine the play will be to not offer AI anything in Germany and fan the flames of “we’re being left behind in technology”* paranoia amongst politicians until they legislate a special carve out.
* Not my opinion, but it is a thing that the briefcase class likes to say to each other to give them an excuse to dome off big tech CEOs.
This is more important that just AI overviews and establishes that companies are responsible for the editorializing they do. That’s much more important in algorithmic suggestions, which drive people into doing things they never would have done otherwise (see: Trump voters).
I hope this stands through all instances and is applied generously, because so many people have been fucked up beyond recognition by following the trail of the algorithm, especially on social and video sites.
Iv always compared it to a library vs news station.
A library collects and helps distribute information. But none of it is their own. While a news station driectly reports on and creates the information that is later catalogued.
Its why in theory a reporter and new source should be held to a very high standard, while a library could in theory be full of bad, false, or other wise misleading information.
A library can’t actually do anything about it realistically on a grand scale. Sure they can ban or bar repeated known offenders. But it’s a cat and mouse game. Same as a search engine. They can stop indexing people who are problems, but they have no real way to know ahead of time till it becomes a problem.
Ai on the other hand is reporting and generating direct sources by its own actions. Its no longer just indexing.
Nor should the library do something! They’re in the job of archiving. The news people, that’s reporting, and it better be done accurately and conscientiously.
Nor should the library do something!
They should and are doing something about it, because libraries are being flooded with digital slop books that they end up having to pay the distributor for when a patron checks it out.
404Media had some stories last year I think where librarians were removing books from the catalog to save money for real authors and to cut down on the number of complaints received about the obvious slop the books contained.
Yes! They are responsible. They’re not quoting, they are hallucinating crap they think someone else wrote somewhere.