Post by radact » Thu Jun 13, 2024 1:32 pm

OC2.3.0.2
Not directly an issue with OC I think, but an issue with Google not indexing the site properly. I'm getting lots of errors like:

Code: Select all

Alternate page with proper canonical tag
These pages aren't indexed or served on Google

Code: Select all

https://www.anotherworld.com.au/shop/macpaw-cleanmymac?sort=p.model&order=ASC&limit=100
https://www.anotherworld.com.au/shop/powerline?sort=rating&order=DESC
https://www.anotherworld.com.au/shop/internal-hard-drives?sort=p.model&order=ASC&limit=100
https://www.anotherworld.com.au/shop/reelplaytv-starter-kit?tag=Reelplay
I guess it's because it's using the various 'search' options to pick weird urls for products that are listed. Is there any way to stop Google from doing this? I've got over 35000 bad pages for a site that has < 1500 items!!

I've already added the extension that creates SEO friendly url's as you can see from the examples above... I think it's text after the ? that is the problem.

Any assistance or advice appreciated.

New member

Posts

Joined
Fri Nov 25, 2016 11:36 am

Post by ADD Creative » Thu Jun 13, 2024 5:08 pm

Although not actually an error, as the pages are not indexed, it does waste crawl budget.
https://support.google.com/webmasters/a ... onical_tag

As well as finding the URLs on the search page. They can also be from the sort by and show options on the category pages. Google seems to find the URLs in the code even though they are not actually links. There used to be URL Parameters tool where you could configure what to ignore, but that's not longer an option.

You could block and URLs with the parameters in robots.txt. There is an example in the later versions, without the search and tag parameters.
https://github.com/opencart/opencart/bl ... robots.txt

However, doing that can cause worse issues if any of the URLs do get indexed, as Google can't see the canonical tags.

www.add-creative.co.uk


Guru Member

Posts

Joined
Sat Jan 14, 2012 1:02 am
Location - United Kingdom

Post by radact » Thu Jun 13, 2024 7:31 pm

I can give that a try at least, add them to the robots.txt file. Might try a couple at a time or so and see what happens.
At least can be undone easily enough.

Code: Select all

Disallow: /*?sort=
Disallow: /*?limit=
Disallow: /*?tag=
I've added these for now, as they are most common types to see what affect it has.

Thanks for the tip.

New member

Posts

Joined
Fri Nov 25, 2016 11:36 am
Who is online

Users browsing this forum: No registered users and 46 guests