this post was submitted on 23 May 2024
7 points (100.0% liked)

Technology

58061 readers
31 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Archive link: https://archive.ph/GtA4Q

The complete destruction of Google Search via forced AI adoption and the carnage it is wreaking on the internet is deeply depressing, but there are bright spots. For example, as the prophecy foretold, we are learning exactly what Google is paying Reddit $60 million annually for. And that is to confidently serve its customers ideas like, to make cheese stick on a pizza, “you can also add about 1/8 cup of non-toxic glue” to pizza sauce, which comes directly from the mind of a Reddit user who calls themselves “Fucksmith” and posted about putting glue on pizza 11 years ago.

A joke that people made when Google and Reddit announced their data sharing agreement was that Google’s AI would become dumber and/or “poisoned” by scraping various Reddit shitposts and would eventually regurgitate them to the internet. (This is the same joke people made about AI scraping Tumblr). Giving people the verbatim wisdom of Fucksmith as a legitimate answer to a basic cooking question shows that Google’s AI is actually being poisoned by random shit people say on the internet.

Because Google is one of the largest companies on Earth and operates with near impunity and because its stock continues to skyrocket behind the exciting news that AI will continue to be shoved into every aspect of all of its products until morale improves, it is looking like the user experience for the foreseeable future will be one where searches are random mishmashes of Reddit shitposts, actual information, and hallucinations. Sundar Pichai will continue to use his own product and say “this is good.”

top 34 comments
sorted by: hot top controversial new old
[–] [email protected] 3 points 3 months ago (2 children)

Here's Google suggesting suicide!

[–] [email protected] 4 points 3 months ago (1 children)

I want a whole Lemmy subreddit ( community? ) of the AI overviews gone wild like this, it's funny af

[–] [email protected] 2 points 3 months ago

You should make one. I'd sub immediately

[–] [email protected] 0 points 3 months ago (1 children)

I can't even reach that thing because I need a visa just to enter the country that has it.

[–] [email protected] 1 points 3 months ago* (last edited 3 months ago) (1 children)

My guy, Google pays Reddit $60 Million/year for this. $60Million.

I remember I once got told, years ago that I was stupid for saying "Data is the new Oil" and now look! Do you know what I could do if I had $60Million in my bank right now? And Google isn't the only one! Companies the world over are paying out the nose for user-generated content and business is booming! If I'm an oil well, it's time my oil came with a price tag. I was a Reddit user for YEARS! Almost since the beginning of Reddit! I made some of the training data that Google and others are using! Where's my cut of that $60M?

[–] [email protected] 1 points 3 months ago

That picture will forever haunt me in my dreams.

[–] [email protected] 1 points 3 months ago (2 children)

Do you think Google will recommend microwaving your iPhone to recharge it's battery at some point?

[–] [email protected] 1 points 3 months ago

I notice their AI answers are off for that question. I bet it was already a thing.

[–] [email protected] 1 points 3 months ago (1 children)

Yeah but that actually works tho

[–] [email protected] 1 points 3 months ago (1 children)
[–] [email protected] 0 points 3 months ago (1 children)

Man, you really can’t beat homemade artisanal misinformation

[–] [email protected] 1 points 3 months ago

There’s an old adage in computing which really applies here:

Garbage in, garbage out.

[–] [email protected] 1 points 3 months ago

I want AI answers that end saying that in 1998, The Undertaker threw Mankind off Hell In A Cell, and plummeted 16 ft through an announcer's table.

[–] [email protected] 1 points 3 months ago (1 children)

I've used an LLM that provides references for most things it says, and it really ruined a lot of the magic when I saw the answer was basically copied verbatim from those sources with a little rewording to mash it together. I can't imagine trusting an LLM that doesn't do this now.

[–] [email protected] 0 points 3 months ago (1 children)
[–] [email protected] 1 points 3 months ago (1 children)

Kagi's FastGPT. It's handy for quick answers to questions I'd normally punch in a search engine with the same ability to vet the sources.

[–] [email protected] 1 points 3 months ago

I'd hate to defend an llm, but Kagi FastGPT explicitly works by rewording search sources through an llm. It's not actually a stand alone llm, that's why it's able to cite it's sources.

[–] [email protected] 1 points 3 months ago

They also highlight the fact that Google’s AI is not a magical fountain of new knowledge, it is reassembled content from things humans posted in the past indiscriminately scraped from the internet and (sometimes) remixed to look like something plausibly new and “intelligent.”

This. "AI" isn't coming up with new information on its own. The current state of "AI" is a drooling moron, plagiarizing any random scrap of information it sees in a desperate attempt to seem smart. The people promoting AI are scammers.

[–] [email protected] 1 points 3 months ago

I once said that the current "AI" is just a excel spread sheet with a few billion rows, from what all of the answer gets interpolated from...

[–] [email protected] 0 points 3 months ago (1 children)

oh gods what happens when the ai discovers the poop knife

[–] [email protected] 1 points 3 months ago (2 children)

Or the cumbox. Or that kid who broke his arms. Or that dog, Colby I think? No wonder AI always wants to exterminate humanity in sci-fi.

[–] [email protected] 1 points 3 months ago

Hey Google, I like space movies. Please describe the Swamps of Dagobah.

[–] [email protected] 0 points 3 months ago (1 children)

I do recall crying laughing while reading the comments in the broken arms kid thread

[–] [email protected] 1 points 3 months ago

I thought it was hilarious how redditors fell for some guys bait/fetish post. Iirc the guy admitted to making it all up in some dm’s

[–] [email protected] 0 points 3 months ago (1 children)

So, basically shitposting poisons AI training. Good to know 👍

[–] [email protected] 1 points 3 months ago* (last edited 3 months ago)

Wanted to like, but 69 likes at this time

Edit: oh hey, this posted 3 times lol that's a new one. Sorry for the spam there

[–] [email protected] 0 points 3 months ago* (last edited 3 months ago) (2 children)

Is this real though? Does ChatGPT just literally take whole snippets of texts like that? I thought it used some aggregate or probability based on the whole corpus of text it was trained on.

[–] [email protected] 1 points 3 months ago

This is not the model directly but the model looking through Google searches to give you an answer.

[–] [email protected] 1 points 3 months ago

It does, but the thing with the probability is that it doesn't always pick the most likely next bit of text, it basically rolls dice and picks maybe the second or third or in rare cases hundredth most likely continuation. This chaotic behaviour is part of what makes it feel "intelligent" and why it's possible to reroll responses to the same prompt.

[–] [email protected] 0 points 3 months ago* (last edited 3 months ago) (1 children)

I've been trying out SearX and I'm really starting to like it. It reminds me of early Internet search results before Google started added crap to theirs. There's currently 82 Instances to choose from, here

https://searx.space/

[–] [email protected] 0 points 3 months ago (1 children)

it literally just proxies/aggregates google/bing search results tho?

[–] [email protected] 1 points 3 months ago (1 children)

So does pretty much every search engine. Running your own web crawler requires a staggering amount of resources.

Mojeek is one you can check out if that's what you're looking for, but it's index is noticeably constrained compared to other search engines. They just don't have the compute power or bandwidth to maintain an up to date index of the entire web.

[–] [email protected] 1 points 3 months ago

we're working on it 😉 slow and steady and all that; we also fixed a bug with recrawl recently that should be improving things