For a while, I thought I was losing it.
Anyone who sells photos online, via POD or direct, knows the importance of keywords – they’re the bedrock of SEO. And if you use a POD or two in addition to your own site, you don’t want to be hammering in keywords more than once, so you embed them in your JPGs as IPTC data, and the PODs parse them out. Well, the good ones do.
Recently I’ve been going back through all my photos and improving the keywording, which in many cases was bad. For one thing, somewhere along the line I’d used a keywording tool that helpfully alphabetized them, which is a disaster because some sites treat the first keywords as the important ones for searches. Others were stuffed with keywords from back when I did microstock; for PODs you just want keywords that describe the image, not possible uses for it.
Editing all my keywords, on hundreds of photos, seemed like a good COVID-19 lockdown project. I mostly use an old program called PixiShot, although you can also do it right in Windows File Explorer via the Details pane, or via various other applications (most of which are poor tools for the job). I’d do a bunch, then take a break and come back later – and after a while, realize that my earlier changes were gone, and the keywords were back to what they were originally. This is the “thought I was losing it” part.
A lot of confusion, aggravation, and thrashing around in image editing applications ensued, as I tried to figure out what was happening. I’ll skip that part.
Eventually I caught the gremlin in the act. I’d just added keywords to a photo, moved on to another, when out of the corner of my eye I saw the thumbnail of the first photo ‘blink’. I realized some other process had just modified that file, and Pixishot detected the change and updated its thumbnail. But what was that process? The answer turned out to be OneDrive, Microsoft’s cloud storage, which I use as backup for my photos.
OneDrive monitors the folders you select and automatically syncs with the MS cloud. When you change a file it’s quickly detected and uploaded. What was happening was that OneDrive uploaded my changed file, and then, a minute or so later, downloaded a copy of that file in which IPTC changes had been reverted to whatever was previously on OneDrive. Once I realized this was happening, I could see the action in real time, in OneDrive’s activity display. Changed file detected…. uploaded…. wait a bit…. same file downloaded with IPTC as it used to be.
Why would this happen? I suspect a combination of 2 things. OneDrive is using the files’ metadata for its own purposes, maybe something to do with it’s “auto tagging” of photos. And in doing so, it’s not properly preserving the rest of the metadata. Unwanted feature, meet bug.
Of course, I Googled this extensively, hoping to find an explanation, a workaround, or at least others in the same boat. I didn’t come up with much, but found enough dark stories to confirm that something’s going on. One guy uploaded thousands of old family photos and found out OneDrive had changed all the “Date Taken” fields, destroying the chronology, and he was not happy. Here’s the link:
At that point I gave up on OneDrive as my photo backup. I’ve verified several times that this is happening, using just File Explorer to modify the keywords, then watching OneDrive revert them a minute later. Whatever is going on, it might be a bug, it might be a feature, or some toxic amalgam of the two. It might involve the way applications lock files in use. It might relate to something I’m doing with IPTC, but I have no clue what that might be. I’ve used several well known metadata editors to look at my IPTC, and there’s nothing wrong with it; PODs and other sites parse it fine. And yes I’ve used Phil Harvey’s legendary ExifTool, and it says I’m fine.
I’ve spent way too much time on this, and I don’t want to live in fear of finding hours of work undone. I can live without OneDrive.
I’d really like to hear from anyone who’s encountered this, or heard about it; or maybe not encountered it, despite using OneDrive for photos with IPTC data.