The third iteration of Doug Bolden's various thoughts and musings.

Category: Dougiverse

The Blogger Canonical (?m=1) Issue Revisited

If you want to just see an explanation of the issue, you can skip to THE TECHNICAL ISSUE, below. First, I get to rant a bit and give some context.

When I first returned to blogging after eight years, it was not with a traditional blog: it was with The Doug Alone PROLOGUE. It was a place for me to post notes and recaps about the solo RPG stuff I was doing.[1] Only there was a problem. I actually mentioned it on my final post on that blog. Google more or less refused to index it.

It looks like it did at least briefly index a single page and then wiped it later.

Even though the blog was primarily meant as a play journal, there were elements that I wanted people to find. Only there was a primary error that kept showing up by way of explanation:

I had a vague notion of what that meant but the more I looked into it, the more I found posts by people insisting it was not an error. It was intended. It’s not up to Google to SEO for you. Maybe your blog isn’t worthy. Here’s a reddit thread with most of those things said from just a few months ago.

However, after Noism Games published a post noting that their Blogger/Blogspot traffic had just plummeted, I felt curious and looked again.

Doug Is Right: The Blogger Canonical Edition

Here’s the tl;dr: I am right. The SEO experts are wrong on this one. Neener neener.

I knew I was roughly correct. I’ve worked with a lot of different web platforms over the years and am well aware that Google is a fickle beast when it comes to promoting something (say, a one-off post about carpet beetles) over things that are more core to your blog identity (such as old posts about a variety of horror movies). However, months of Google flat out ignoring a blog with unique content was not consistent. At least a few pages would have passed The Algorithm.

Those more in the know about the technical side probably already understood it. I had an idea of the cause, just not why Blogger/Blogspot in particular was being hit by it. Had I cared more, I would probably have put it together earlier. Would I have still moved blogs? Oh yes. I like having my own space to play.

The Technical Issue

What’s the issue?

Webpages can have canonical tags. They are not required. A canonical tag just tells Google (and other search engine type things) which URL is the one you want indexed for that content. If you are on a platform where the same content might show up at more than one address, you can use it to say that this is the correct page.

EXAMPLE: You have a cooking blog. You have a set of pages with different recipes and other pages that include snippets of those recipes and you don’t want Google to send folks to the pages with only the snippets (such as a category page or a front page that shows the most recent). You prefer your recipes to be front and center. You put the canonical tag on those pages.

In the specific case of Blogger/Blogspot, there’s a bit of code that basically tells each new page to have a tag on the post itself:

<b:include data='blog' name='all-head-content'/>

One aspect of this is to drop a simple line that gives the URL and says “this one, Google” in the <HEAD>:

<link href='https://dougalone.blogspot.com/2025/09/beginning-to-migrate-some-content-to.html' rel='canonical'/>

And that should be well and good except for a technical glitch on Google’s side. It does not scan the blog like a person on a home computer will. It scans largely as a mobile device. And Blogger/Blogspot, a GOOGLE PRODUCT, tries to be helpful by serving up a ?m=1 version of the page. Old themes did not have a native mobile version. Newer ones do, but the artifact from Ye Olde Times is still there.

Which means that Google gets a link like this for the page linked above:

https://dougalone.blogspot.com/2025/09/beginning-to-migrate-some-content-to.html?m=1

You can likely see where this is going. If you click on it, it is identical to the previous page, except the rel='canonical' is not pointing to that link. It points to the .html version, not the .html?m=1 version.

This means for every Blogger/Blogspot page scanned, Google sees a page constantly serving up alternate pages and because the ?m=1 keeps persisting, it constantly fails to find the canonical pages.
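For the code-minded, here is a minimal Python sketch of that mismatch. It is an illustration only, not how Googlebot actually works: the CanonicalFinder class and canonical_of helper are made-up names, and the page source is a trimmed-down stand-in that keeps just the canonical line shown above.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collects the href of the first <link rel='canonical'> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        # Self-closing tags like <link .../> also arrive here by default.
        if tag != "link" or self.canonical is not None:
            return
        attr_map = dict(attrs)
        rel_values = (attr_map.get("rel") or "").lower().split()
        if "canonical" in rel_values:
            self.canonical = attr_map.get("href")


def canonical_of(html):
    """Return the canonical URL declared in the page, or None if there is none."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


POST = "https://dougalone.blogspot.com/2025/09/beginning-to-migrate-some-content-to.html"
# Assumed, simplified page source: both addresses serve this same <head>.
PAGE = "<html><head><link href='" + POST + "' rel='canonical'/></head><body></body></html>"

for crawled in (POST, POST + "?m=1"):
    verdict = "matches" if canonical_of(PAGE) == crawled else "differs from"
    print(crawled, verdict, "the canonical URL")
```

The plain .html address matches its own canonical tag. The ?m=1 address serves the very same tag, so the address that was crawled and the address the tag names never line up.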

What’s the Fix?

Unfortunately, the two primary fixes are both on Google engineers and since this has been brewing for a few years, I have no idea if they will fix it. Hopefully so, because Blogger/Blogspot is a nice all-in-one blog for people who don’t want to fiddle too hard and just want to get their content out there.

FIX #1 would be for Google to not treat ?x=y as wholly different pages at least in the case of mobile pages where the canonical link has identical content. I appreciate there are lots of cases where it is different content, but there should be a way to prevent that.

FIX #2 would be for Blogger/Blogspot to stop appending the ?m=1 to mobile pages. There are better ways to handle that. That feels like an artifact from 2010 era internet. Back when you had completely separate mobile sites. Ah, I remember those days unfondly.
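As a rough idea of what FIX #1 could amount to, here is a small Python sketch that treats the ?m=1 address and the plain address as the same page by dropping the m parameter before comparing. The function name strip_mobile_flag is hypothetical, and nothing here reflects how Google's own URL handling is actually built. It only shows the normalization step.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def strip_mobile_flag(url):
    """Return url without any m=... query parameter; other parameters are kept."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != "m"
    ]
    # urlencode may re-encode the remaining values; fine for a comparison sketch.
    return urlunsplit(parts._replace(query=urlencode(kept)))


POST = "https://dougalone.blogspot.com/2025/09/beginning-to-migrate-some-content-to.html"

# After normalization, the mobile address and the canonical address are equal.
print(strip_mobile_flag(POST + "?m=1") == POST)
```

Only the m parameter is removed, so an address that legitimately differs by some other parameter would still count as a different page.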

What can we do as users of the product? I’m not sure. If you look, there are suggestions for JavaScript workarounds. I am attempting to use the script at this page. Go gently into that night and double check before you use it, yourself.

I also did try updating my robots.txt file to tell Google to ignore ?m=1 pages. Will it work? I don’t know. I’m not precisely holding my breath. If I remember to check in a couple of months and it has worked, I’ll let you know.

User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /search
Disallow: /share-widget
Disallow: /*?m=1
Allow: /

Sitemap: https://dougalone.blogspot.com/sitemap.xml

Obviously, if you want to use that you want to change the final line to be whatever your blog’s address is. I’ve seen variations of that across multiple posts so I don’t know where it originated. Apparently older Blogger blogs had a baked in robots.txt but mine didn’t. I had to add it whole cloth.
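If you want to sanity check what those rules should do before waiting on Google, here is a hand-rolled Python sketch. It assumes the commonly documented matching behavior (a * wildcard, an optional trailing $, and the longest matching pattern winning, with Allow winning a tie). The names rule_to_regex and is_allowed are made up for this example, and real crawlers may differ in the details.

```python
import re


def rule_to_regex(path_pattern):
    """Translate a robots.txt path pattern ('*' wildcard, optional trailing '$') to a regex."""
    anchored = path_pattern.endswith("$")
    body = path_pattern[:-1] if anchored else path_pattern
    regex = ".*".join(re.escape(piece) for piece in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules):
    """rules is a list of (directive, pattern) pairs; the longest matching pattern wins."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        if not pattern:  # an empty Disallow blocks nothing
            continue
        if rule_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            # Prefer the longer pattern; on a tie, prefer Allow.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


# The "User-agent: *" group from the robots.txt above.
RULES_FOR_STAR = [
    ("disallow", "/search"),
    ("disallow", "/share-widget"),
    ("disallow", "/*?m=1"),
    ("allow", "/"),
]

POST_PATH = "/2025/09/beginning-to-migrate-some-content-to.html"
print(is_allowed(POST_PATH, RULES_FOR_STAR))
print(is_allowed(POST_PATH + "?m=1", RULES_FOR_STAR))
```

Under those assumptions the plain post path stays crawlable and the ?m=1 variant is blocked, which is the intent of the added Disallow line. Whether that actually helps with indexing is a separate question.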

Let’s see what the outcome of this double approach might be.

NOTE: It is possible that Google will eventually scan it via a non-mobile-first scanner and make all this a non-issue. Just 16-months seems like a fair time to run a test.

  1. There is a paradox of solo play where a lot of folks, myself included, have a strong urge to share it with someone. The initial idea was not a blog. I thought about streaming some stuff on YouTube. Since I ended up figuring out a lot of mistakes, tweaking a lot of notions, and so forth, I am glad I went for a format that did not involve me just sitting there confused and sweaty on camera.

A Day in the Life #17671: Cool Snail, “How Dare You, Wordle!?,” and Website Whoopsie

First off, while out fixing our doorbell this morning and cursing the curse of tiny screws, I saw this cool looking snail:

How Dare You, Wordle!?

Second, how dare you, Wordle!? It is October, for goodness’ sake…

Click to see the actual problem.

And yes, I partially posted that just to see if I can work out the mechanics of a “spoiler” type image. It should “enlarge” to the unspoilered image. If it doesn’t, I might just remove this whole section.

Just in case you don’t want to do that, here’s my explanation (click to expose spoiler): 2v Zo oj336s oS0o35, o3 2 J0o vEZ56Z5h vE0v W0szS vES J3RI J3pYI zS blxx4 05I h3v S9mZvSI 0z3pv Zv. n3JSkSR, V3RIYS jY0sSI WS YZ6S 0 M33Y!

EDIT: I’m going to leave it like that for now, where instead of opening into a lightbox it opens another tab with the unspoilered version. I really don’t think it’s worth it, but I’ll give it a think about how to do it better without needing plugins.

Big Old Whoops on the Website Backend

I have been cleaning up a good bit of the backend of my old wyrmis.com website, and today I was chunking out a few hundred-ish tiny files from the file structure that were no longer used. They were, at least potentially, a security risk [at worst] and just a hindrance to sort through [at best].

I ran what I took to be a basic rm -rf * type command, but apparently the software handles it in a more complicated way than that (and even has the option to move the files into a local recycle bin).

This was treated as me {moving | uploading | downloading} a lot of files and triggered an automatic kick. The software got booted from the server and I am at least on a temporary ban. My assumption here was very wrong, see UPDATE below.

I can still log on to the server through other means, and the website seems completely unimpacted. Now I’m waiting an hour or two to see if it clears up on its own or I might have to contact someone for some technical support.

While I can still edit the website in a number of ways, I had a nice workflow going.

I’ll spend the time, instead, building up a tool that might help me to semi-automate some of the process of fixing hundreds of HTML files and then like, get back to house repairs, instead.

Now I am off to take some photos of damaged bookcases. The shipper broke them and wants more photos as proof, despite my already sending them a number of photos.

[2 hours later] Update to Website Woes

Turns out my previous assumption was just plain wrong. The [re]moving of a large number of files might have exacerbated the situation by making it harder to tell what was happening, but the actual culprit was that after I backed out of the directory that had the files, I thought it was sending me {Doug's Directory} while instead it was sending me {The Directory ABOVE Doug's Directory} and then, because the program I was using had the option to recall last directory, it entered into a loop where it kept trying to enter a forbidden zone.

I figured this out when, on a whim, I tried to enter directly into a sub-directory, which failed and then kicked me back out into the directory I was supposed to be in, and I realized what was wrong.

The reason it wasn’t immediately obvious before was because when it kicked me out it essentially prevented me from even seeing what sort of directory I was in. It was sort of like finding a NO ENTRY sign in the middle of a very dark room and having to guess your location. Only every time you re-enter the room, a helpful guide runs up and escorts you to the same place you were just told not to enter.

Hello, is this thing on?

It is nice to talk to you again, Space Pilgrims.

The very last post I made to the old version of Dickens of a Blog was “I, This Thinking Thing”. That was August 2016. That means it has been over nine years since I’ve made a real post under that branding.

Today, I went through and created a new [possibly temporary] front page to the wyrmis.com site that looks a bit like this:

It mostly directs people to here, to The Doug Alone and to the [still very much so being finalized] Doug Talks Weird. Those two and this site are the new “Dougiverse” [pronounced “Dougie Verse”].

While Doug Alone has been brewing for over a year now, and Doug Talks Weird dates back to something like 2014 YouTube videos, I have spent a good amount of the past two weeks sorting and trying to rebuild my online identity so that I can start posting and sharing things without relying on “more traditional” social media. A strange sentence to type.

So Many Words to Say

I reached a point those nine years ago where I wanted to shut up for a minute. Then, around two to three years later, I kind of wanted to take it back. However, the time it would take to rescue the old blog (from younger-Doug’s rambles as much as younger-Doug’s hand-coded functions that had been left behind by something like ten years on a changing web) always made me shy away. I would post online here or there, share pictures here or there, but mostly I just withdrew.

However, I am at a time again where I would like to just have a spot to ramble. So this blog is here, now. It is not a replacement of the old one. It is more a continuation in a way that is a bit more responsive, a bit less intensive (I would sometimes have to go into the Python back-end of the old one and custom tweak things to keep posts working and had to remember dozens of custom commands, tools, and pieces), and hopefully a bit more reader-friendly without so many baked-in Dougisms.

It Will Take Time

That being said, it will probably be a week or two at least before the page even looks like it is going to look. I’m going to try and not sweat it too much.

As for today, I have just spent five hours getting everything set up to hit the point I can post this. I am an hour behind eating lunch and still need to do my daily work out and shower first. Well, maybe not first. I’ll figure it out.

Hopefully, I’ll see you soon.

–Doug Bolden
