Back

Google stops letting sites like Forbes rule search for "Best CBD Gummies"

175 points6 hoursarstechnica.com
NelsonMinar3 hours ago

I'm confused about how or why this is a new policy. My memory is inside Google we were discussing this risk back in 2003, probably earlier. Search quality was on it. I just assumed they'd lost the arms race, or that the parasites' ranking was justified for other reasons that were hard to tease apart. What are they doing new now?

I think often about Mahalo, the sleazy shovel content that was spamming the web back in 2007. Google shut that down somewhat fast, although it did take several years. These days with AI and more aggressive spammers it's a losing battle. The real problem is the financial incentives that make this kind of spamming profitable in the first place.

My tiny little blog gets about 3 requests a week for someone to "pay me to run a guest article". Going rate is $50-$200 and again, my blog is tiny.

dawnerd1 hour ago

Seriously, they tackled this years ago with the panda update to kill off all the how to and similar seo spam. It's like after around that time they just stopped caring at all and let the best X sites take over.

nurumaik3 hours ago

Just manually review top K websites and ban such garbage?

Sometimes dumb, bruteforce and biased solution can work way better than any automation you can come up with

stackghost32 minutes ago

>My memory is inside Google we were discussing this risk back in 2003, probably earlier.

Yeah but that was before they hired the incompetent grifter Prabhakar Raghavan and eventually made him head of Search.

mjr003 hours ago

> I'm confused about how or why this is a new policy.

My best guess is it's because they finally have a real competitor in ChatGPT.

> The real problem is the financial incentives that make this kind of spamming profitable in the first place.

Yeah, but the financial incentives exist on both ends. There's a gross symbiotic relationship between Google and SEO spammers, because Google also owns the ad network the spammers put on their page. If Google puts ad-laden SEO blogspam as the top result and a user clicks it, the user sees a bunch of ads from Google. Everyone wins: Google, the SEO spammers, and advertisers. Well, everyone except the user, but who cares about them?

My guess/hope is that ChatGPT has made someone who actually cares about the quality of search results actually step in and say things have gone too far.

NelsonMinar3 hours ago

You're totally right about that symbiotic relationship. We were aware of that risk in the early days when AdSense launched, we saw some very innovative and gross exploitation and created some policies to rein it in. But ultimately if Google makes a buck coming and going, they will do that.

Wasn't there a big story last year in the wake of the DOJ antitrust investigation about Google manipulating search quality to boost ad revenue? I can't put my hands on a reference now, in part because Google is so bad at search these days I can't find anything more than a few months old.

resoluteteeth2 hours ago

> My best guess is it's because they finally have a real competitor in ChatGPT.

My guess is it's because a bunch of articles about it have been posted to hn recently.

chipsrafferty3 hours ago

Because ChatGPT is dependent on good search when it searches the web? Or because it completes with Google when it provides a good answer without searching? Or what do you mean specifically?

ryandvm3 hours ago

I would say the latter. For software dev questions, my Google searches and Stack Overflow visits have fallen off a cliff since I started paying for ChatGPT.

Ironically, I probably would have paid the same amount to Google for ad-free, old-style (accurate) Google searches, but no, they just wanted to keep cranking that ad dial up every year so that ship has sailed.

At this point, I'm enjoying watching the old guard of search scrambling to find a life jacket.

nurumaik2 hours ago

> Google wins

Define "wins". From what happening right now, it seems that google may lose much more than it earned by aligning with seo spammers

Maybe they need to start locking employee stock options for 100 years to prevent them optimizing short-term gains?

johnnyanmac2 hours ago

Google's only taking the greedy approach. Spam sites on top, spam sites use google adsense, people click spam sites, they click google ads.

It works great, until it doesn't. But that's a problem for the next CEO.

Scoundreller7 minutes ago

iunno, I used to rank pretty well for things in my country like "$company's tech support number". Unlike $company, my page had a nice clean URL like whatever.tld/Tech-Support-Number-for-$company and I'd just list of their phone numbers with a few paragraphs about how $company is shit. Meanwhile $company's page was whatever.tld/234897234982-029823749823742-2340823492 and 3 pages down was a phone number.

For ISP ones, I recommended people print a copy so they can call if they can't get on the internet, which kinda backfired when a major ISP changed their tech support number (!) and it redirected to a toll-free squatter's sex chat line.

Turns out $company really hates it when you call their call (cost) centres.

I had maybe 50 pages for our different oligopolies and averaged $500/month revenue on adsense, so GOOG's cut was $250/month.

Today, for one $company, the first 9 results are different pages from $company.tld, each unhelpful with a phone number in their own way, and they don't run adsense!

klabb32 hours ago

> My best guess is it's because they finally have a real competitor in ChatGPT.

Bingo. I always chuckle when people here say Google has lost it, and become incompetent. Well, they all make the mistake of assuming that they’re trying but failing, rather than that it’s deliberate simply due to boring economics.

Now look at how quickly decades-long problems, so big they have an entire cottage industry built around it, suddenly be cleaned up. Incompetence? Nah.

Of course, this does nothing to convince regulators and not even average HN user that innovation is harmed by these dominant players. Someone’s gotta think of the poor mega-corps.

lumost1 hour ago

Searching for python documentation was the worst, geeks for geeks and others would get the top slot for reskinning the pypi docs with ads.

The entire thing was so blatant and obvious that I assumed Google did not care due to ad revenue.

When ChatGPT launched search, you could immediately skip over all the crap. It made search nice again.

lern_too_spel2 hours ago

Because it became an embarrassing news story (https://larslofgren.com/forbes-marketplace/, also mentioned in this article). They would have lazily left it unfixed if everybody weren't laughing at them.

itsoktocry4 hours ago

What's hilarious is when people boast about being "in Forbes" like it's the magazine from 20 years ago, and not this parasitic SEO operation that publishes garbage on anything.

pimlottc3 hours ago

Well, of course; “Forbes” still carries prestige and cachet. If it didn’t, this scam wouldn’t work.

maeil2 hours ago

> If it didn’t, this scam wouldn’t work.

It would, as this scam relies on the domain authority, regardless of their "offline authority".

dangus3 hours ago

I took advantage of this in business school. A lot of my professors considered Forbes a reputable business magazine. It was amazing, I could easily cite a source for just about anything I wanted to say.

lotsofpulp3 hours ago

Maybe the professors knew and were taking advantage to look the other way to make their job easier.

dangus2 hours ago

haha maybe, I got my A’s and my piece of paper, and now I’m qualified to cut costs at your local private equity firm!

Ancalagon5 hours ago

Hopefully this is a step in the right direction. Google's search results have gotten so bad - seems like even some of the simplest searches are just packed with AI generated and SEO garbage. I don't even want SearchGPT do take over the search market space because I'm almost sure it will still be garbage. Just bring back the google from 5-10 years ago please :(.

paxys4 hours ago

> Just bring back the google from 5-10 years ago please

What you really want is the internet from 5-10 years ago (really even longer than that), and that's not coming back.

Terr_20 minutes ago

A close approximation might be a search-mode which penalizes results based on how many ads they have and how much of the page they cover...

kevinmchugh4 hours ago

Google search degraded in usefulness before the panda update, when spammers had filled the web with low quality content designed to exploit Google's algorithms. Google improved their search to punish the content farms, and people were happy with that search for many years.

firecall3 hours ago

Agreed. That’s really my thoughts.

The internet or rather the crawlable WWW itself has changed.

The WWW content itself is no longer what it was.

chipsrafferty3 hours ago

Do you think the proliferation of web frameworks makes it harder to crawl?

thwarted1 hour ago

Probably has some influence. Because everything needs to be an app, implemented as an SPA, rather than just a web page. Hypertext is dead.

vasco4 hours ago

I'm fine with this internet but the previous search.

NewJazz4 hours ago

Exactly. In a constantly changing world, you need constantly changing policy to achieve the same outcomes. Even then you probably won't replicate the past universe perfectly.

dpkirchner4 hours ago

I want the Internet from 5-10 years ago before Google incentivized this much SEO garbage. It wasn't awesome then but it was a lot better.

fragmede3 hours ago

What I really want is me from 5-10 years ago. When can we get a pill that will do that?

grugagag3 hours ago

You could get there somewhat. Do you exercise? Ready to put in some effort?

firecall3 hours ago

You could probably take some pills that will make you think you are! LOL

I feel like testosterone therapy should be more readily available….

viewtransform4 hours ago

append ' before:2023' to your google searches.

0xbadcafebee5 hours ago

You can't go back to the way things were. The world moves forward and changes, and we have to adapt to it.

Web search has always been an extremely messy solution to many problems. Think about the premise: type in anything, and somehow it will read your mind, intuit who you are and what you really wanted, find the exact thing amid the morass of the whole web, and then give it to you?

That's impossible. So it uses tricks to make it seem like it worked. It uses information about you to refine results. It uses curated, human-edited search and result heuristics for the most common or difficult search queries. It uses a giant corups of data, and shows you things that are like what you wanted.

You don't notice that it isn't giving you the best result, because there are so many mediocre-but-acceptable results to look at. And it doesn't have to work perfectly every time, because we can "sift through" results and "refine" our search. Often we are flooded with results that are targeted at us, rather than what we want, because, remember: Google is an advertising company, and the entire Web is now a shopping mall, where either you're being sold-to, or you're just being sold.

You will get results, and they will sort-of seem like what you wanted, so you will just sort of sigh and accept it. Because what other option is there?

There are more intelligent, more accurate, more safe, ways to solve the problems people have, that are not "a search engine". It's time we start implementing them.

jerry19794 hours ago

> You can't go back to the way things were.

> type in anything, and somehow it will read your mind

I think we can go back to the way things were, which had nothing to do with mind reading. In the past, you could type in word, and google would offer 10 million results, and you could page through each of them. That was very powerful, and google does not do that today.

Ferret744648 minutes ago

I don't think you know what you are asking. Do you really want 10 million pages of results, of which 99.999...% are SEO spam for Viagra et al, and on average you will need to browser ~9 million pages of results to find something that's actually "relevant"?

wbl4 hours ago

I was in high school 15 years ago and Google absolutely read minds to conclude Briney Spears was not a search for pickles but rather a pop artist. This was significant enough for them to come to go talk about it.

PhasmaFelis4 hours ago

> You don't notice that it isn't giving you the best result

That's fine. It's always been fine. I don't need Google to read my mind and fulfill my dreams.

The problem isn't that they're not divinely perfect. The problem is that they used to be good enough, and now they're not.

> There are more intelligent, more accurate, more safe, ways to solve the problems people have, that are not "a search engine". It's time we start implementing them.

What solutions are there that fulfill all the use cases of a search engine, while definitively not being a search engine? An AI chatbot that gives me synopses of the same websites that I was searching for does not count.

bgun4 hours ago

Not sure why you’re being downvoted, this is a pointed analysis of why crawl-based search is insufficient for an Internet of our current scale. There is no corporate-curated algorithm that is up to the task, especially when the primary purpose is to profit from advertising.

jfengel4 hours ago

Google is remarkably effective at handling the scale. It doesn't seem up for handling the sheer army dedicated to misleading it. Especially now that they've been given tools for automating crap generation.

Ironically, Google itself was a key developer of that tech.

If there is any solution it would seem to involve removing the incentive to merely look at your page. That problem seems remarkably stubborn.

the_snooze4 hours ago

>There is no corporate-curated algorithm that is up to the task, especially when the primary purpose is to profit from advertising.

I think this is the root cause of the problem. Google can easily put a big dent in this problem by allowing users to create their own importable/exportable filters and support the dissemination of something like "EasyList for search results." But that kills their golden goose of advertising influence.

+1
warkdarrior3 hours ago
conorcleary4 hours ago

In before your comment ends up in QuickBooks search results for the rest of time :-)

gerdesj3 hours ago

"seems like even some of the simplest searches are just packed with AI generated and SEO garbage"

I'll give you a concrete example of that and it is a right old pain.

Let's try upgrading Debian Bullseye to Bookworm. Search "upgrade debian bullseye to bookworm" - first hit from DDG is: https://www.debian.org/releases/bookworm/amd64/release-notes... - YES - Debian documentation, staid, verbose, stolid and correct.

Now let's try to upgrade a Raspberry Pi from Bullseye to Bookworm: Search "upgrade raspberry pi bullseye to bookworm". First hit: https://raspberrytips.com/upgrade-raspberry-pi-os-bookworm/.

There are loads more hits like the above and they are nearly all wrong. The RPI distribution is based on Debian Linux but has a few differences. Between those two versions of Debian, RPi changed things in /boot quite dramatically and failing to do that, you will end up with a weird chimera - I created several of these beasts until I fixed them: https://blog.scheib.me/2024/04/14/upgrade-raspberry-bullseye...

In this case it may actually be a blog matching the template of the AI clones! However, they do all look very similar.

shombaboor3 hours ago

I was searching for a uniquely named company by exact name (think: verizon), and it was 80% of the way down the results page. Google knew exactly what I wanted to see and flooded my screen with alternatives who had paid them.

JumpCrisscross4 hours ago

> bring back the google from 5-10 years ago

Given Kagi's abysmal adoption rates, it's clear that good search isn't worth it for most people.

rurp3 hours ago

I tried Kagi but just didn't see notably better results than other search engines. Maybe if I spent more time on the power user tools, or if Kagi offered more of a trial period I would have, but adding yet another monthly subscription is a high bar for me and what I saw didn't clear it.

These days my default assumption is that any SAAS product will get worse and more expensive over time, so it has to be pretty good to justify reworking my online habits around, given that I don't know how long I'll keep using it. Hopefully Kagi will be the exception to that rule, but I wouldn't bet on it.

pesus4 hours ago

That subscription fee is just too big of an obstacle in a time when everything has a subscription and is still often degrading in quality. Seems like an unsolvable chicken and egg scenario though, since relying on advertising to make it free would just result in the same issues as everything else.

deprecative3 hours ago

It's quite literally this. It costs more than free and people don't want that. We're poor and poorer and everyone is overburdened by subscriptions for everything. I get that HN is in a rich bubble but most folks can't afford rent, food, and a search engine.

1vuio0pswjnm73 hours ago

I still search Google and other search engines from the command line. There is no "AI" garbage in the results. The way HN commenters refer to Google search in this thread, one might conclude it is not possible anymore to search the web without a popular browser running Javascript (which is a prerequisite for this "AI" stuff). That conclusion would be incorrect. It is still possible; I am still doing it every day.

1vuio0pswjnm720 minutes ago

This is how I do it

1. Make HTTP

2. Send HTTP via TCP

3. Perform text processing on the response body (I create own SERP instead of using Google's)

Personally, I use multiple programs, some I wrote myself in C, to perform these individual steps, connected by UNIX pipes and the shortest, simplest possible Bourne shell scripting

However there are countless ways to perform these steps in wide variety of programming languages; there is no need for UNIX or shell scripting, it is purely personal preference

chipsrafferty3 hours ago

How do you do that?

walterbell4 hours ago

September thread on Forbes Marketplace and "parasite SEO" (300 comments), https://news.ycombinator.com/item?id=41590466

bhouston3 hours ago
walterbell3 hours ago

As others noted, this has been an issue for years. What prompted Google to act now, using a manual override that was supposedly not feasible in the era of algorithms? Did the viral articles by Lars provide ammunition for teams within Google?

simonw52 minutes ago

An interesting intellectual exercise is to think about how a search engine could provide the best possible answer (from a user satisfaction perspective) to a query like "best CBD gummies".

A lot of people have a significant financial incentive to win at that search query.

What would the perfect top search result for that look like?

It would probably be an article written by professional writers in a trustworthy publication with a strong ethics policy, provably followed over the years, concerning whether they accept payment for promoting specific products in supposedly impartial reviews.

If you can figure out how to algorithmically detect that kind of content you could build a pretty great search engine!

ameliaquining36 minutes ago

Are we assuming that this search engine is only used by a few nerds, or is the idea to build something that remains good even if it gets popular enough that webmasters have strong financial incentives to game it like they currently do with Google Search? Because the latter sounds like a much, much harder problem, and in particular like it probably requires huge financial resources in order to win the ongoing cat-and-mouse game, if that's even possible.

johnfn43 minutes ago

I think I'd be pretty happy if Consumer Reports was on the top for queries like these (if they had the relevant data, of course). I think they follow your criteria pretty closely.

batch1246 minutes ago

I think it'd maybe a query for the best gummies would be based on reviews from users, but I guess that's the point. Having something understand what one means by best is hard.

johnfn42 minutes ago

Hm, I think that Amazon shows that just user votes might not be sufficient - e.g. because users can be paid off to give 5 star reviews, which bias the results.

rjrdi38dbbdb2 hours ago

One area where Google search is terribly broken is porn.

If your search for some specific term "$foo", nearly every result is just 'search site $bar for "$foo"', taking you to the site's search page, regardless of whether $foo is actually found on the site.

buryat1 hour ago

how many people do you think search for $foo and the opposite?

rjrdi38dbbdb48 minutes ago

What do you mean by the opposite?

The search behavior is the same whether $foo is a popular generic term or something niche.

scarface_742 hours ago

On the other hand, when I use to search for anything near the city I lived in, porn showed up way too frequently

https://en.m.wikipedia.org/wiki/Cumming,_Georgia

rjrdi38dbbdb44 minutes ago

Are locals particularly attached to that name?

Seems like a referendum for a name change would be a good idea, considering the current typical usage of that term.

akira25015 hours ago

An entire department was just rendered useless. I genuinely don't feel bad.

kasey_junk5 hours ago

Was probably a whole company right? Pretty good argument that Forbes the traditional media property and Forbes the seo giant are 2 different things: https://larslofgren.com/forbes-marketplace/

fakedang4 hours ago

And Forbes the SEO giant was going to buy out the legacy media property.

_DeadFred_5 hours ago

It was funny watching the warrior whatever site back in the day when Panda came along. Love when these people get their horrible business models kneecapped.

Now let's make corporate stock manipulation illegal again and ban corporate stock buybacks. Talk about a purely manipulative business strategy.

red_trumpet5 hours ago

What's the problem with stock buybacks?

_DeadFred_3 hours ago

They are nothing but direct stock manipulation that was 'legalized' at the same time where executive compensation was moved from salary to... stock, so that you end up with a quasi-legal (stock manipulation by executives is supposed to be illegal) corrupt incentives system.

+1
quickthrowman3 hours ago
notyourwork5 hours ago

Nothing directly, it just sounds bad at face value.

+2
_DeadFred_3 hours ago
scarface_742 hours ago

The company is giving money back to shareholders. What exactly is wrong with that?

readyplayernull5 hours ago

I wouldn't be surprised of the whole thing being automated.

kiru_io5 hours ago

Google should vibe out others as well. If I search now "Best CBD Gummies", the first few results are: vice.com independent.co.uk healthline.com observer.com

How is Forbes worse than any of those shallow comparison pages?

glial5 hours ago

FWIW, Kagi is a little better but not much. First results are:

* cornbreadhemp.com * forbes.com * healthline.com

al_borland4 hours ago

Kagi does let the user adjust the rankings of these sites if they don't want them coming up. While it would be nice to have this done proactively for the link farms, at least the user does have some control.

Forbes did make it on the blocked and lowered leaderboards.

https://kagi.com/stats?stat=leaderboard

zappchance3 hours ago

DDG gives me:

forbes.com healthline.com medicalnewstoday.com fortune.com

But I usually use site:reddit.com in my searches anyway so it doesn't matter to my personal habits.

vasco4 hours ago

Probably all ran by the same people too.

al_borland4 hours ago

I assume the threat to their business posed by OpenAI (and others) is what is getting them to start addressing these long standing issues. I'm glad they're doing it, but upset that they let users suffer with sub-par results for so many years.

warkdarrior3 hours ago

How does OpenAI's searchGPT rank Forbes Advisor/Marketplace?

al_borland49 minutes ago

Good point. I just asked ChatGPT for the best CBD gummies, then asked what sources it used for the list. This was the first thing it said…

> Consumer reviews from trusted websites like Healthline, CNET, and Forbes Health that provide in-depth reviews and rankings based on effectiveness, ingredients, and customer feedback.

So the LLMs are now giving us affiliate link garbage, but we can’t easily see that was the source, and the affiliate links don’t even work. Everyone loses in this scenario.

PaulHoule4 hours ago

AMEN!

I remember this guy

http://www.seobook.com/blog

pointing out the line between what you can get away with with SEO and what you can't get away with and what you can't get away with is making Google look stupid.

PhasmaFelis4 hours ago

Not for more than five years or so, at least.

AlienRobot4 hours ago

After scrolling a bit I looked at my scrollbar and I was scared.

Yes, this is /blog. The whole blog.

xnx4 hours ago

Google's official statement on the change: https://developers.google.com/search/blog/2024/11/site-reput...

That page gives a good hint at how opaque these paid placements can be to an outsider like Google. Really tough to prevent too much collateral damage when going after bad actors like Forbes. Glad Google is working on it though.

jsemrau2 hours ago

Google starts getting into the way too much. Recently I searched for German football clubs that "fan rivalries" and Google refused that and only gave me results for "fan friendships" .

tayo422 hours ago

I just tried this(German football fan rivalries) and only got one thing with friendly and it was a reddit post. The ai response on top was on topic too

fwip28 minutes ago

It sounds like they're just deranking the blog spam posts. In my opinion, Google should derank the whole domain/brand. If you purposely put your name on garbage, we should put your name in the trash.

bhartzer4 hours ago

Google should have done this 5-10 years ago.

urbandw311er3 hours ago

Site that has historically abused its monopoly to game search profits clamps down on others doing exactly the same thing.

scarface_742 hours ago

When I search for “best pet insurance”, US News & World Reports still shows up 5th.

When I search for “Best Delta credit card”, CNBC shows up 6th

dmead4 hours ago

What was the website they had to delist like 10 years ago that was like this? I'm drawing a blank.

wslh5 hours ago

My feeling as a Google user since the beginning is that the search engine doesn't matter anymore in terms of quality. That is why I wonder how Google supposely discover their own "bugs" so late.

yapyap2 hours ago

one lawsuit too late

burnt-resistor4 hours ago

[flagged]