ccie, vcp ccvp, rhce, giac, gcih, cisco, netscreen, netscaler, juniper, f5, security, virtualization, vmware

OpenStack Nerd, CCIE, DevOps Junkie

Changing the world, one person at a time

OpenStack Nerd, CCIE, DevOps Junkie header image 2

Fixing UCS Config Failures due to local disk config requirements

July 30th, 2010 · 6 Comments · CISCO, Cisco UCS, Cisco Unified Computing System, ESXi, vmware

Boring title huh? Well, this isn’t what I wanted to be dealing with on a Friday afternoon either. My plan was to set up a PXE boot environment to create scripted installations of ESXi 4.1 on UCS. However when I shelled into the UCS lab at work, I was made aware of a little bug.

The Problem

Apparently when the UCS pod’s firmware was updated to 1.3(1c) an interesting bug was encountered. What happens is when you associate your service profile to a server which has had another service profile associated with it previously you a config failure error stating that the local disk config you are applying is destructive to the physical server and violates a requirement.

I headed over to the local disk policies section of UCSM to see what was going on. Once I got there I noticed a new check box labeled “Protect Configuration”.

It’s a lab, which means that change controls should be followed but rarely are. Following that mantra and went ahead and unchecked the “Protect Configuration” box on one of the commonly used local disk policies and saved my changed. A little background on HOW I am shelled into this lab. I am shelled in via a VMware View server that has component installed on blades 1 and 2 of this chassis. And, yes the local disk configuration that I made a change to was used by these profiles.

A little while later, after all the servers went through a reboot and settings application cycle and a coworker kicked a View server in the butt I was back in. The first thing I did was try to apply my service profile. I got the same error message.

I went back to the local disk config profile section to see if maybe the change hadn’t applied. But, when I edited the profile, it showed that the local disk config should not be in a protected configuration.

However when I went into a service profile to change the same local disk policy, you will notice that this same local disk config was still showing up as a protected configuration.

The Solution

When you encounter a bug, the first thing you do if the obvious isn’t working is check the release notes. This bug did show up int he 1.3 release notes, with a tag of “no workaround”. I don’t believe in not having workarounds so proceeded to bang my head up against a wall trying time consuming things like rediscovering servers, recovering bios’s, creating custom firmware packs and more.

After all that failed, I tried the simple things (you like the reverse order?). My excuse was that I assumed that TAC who had created the bug had already tried this. I guess I should assume less often…

The simple thing, that I should have done from the start was create a brand new local disk profile. I named this profile FIX-PERSIST-ANY, choosing to use any local disk config and expressly making it non-persistent. I then applied the same local disk policy that was on previously to my service profile, and then once applied changed it to FIX-PERSIST-ANY.

At that point I whatever flag that had gotten stuck in the UCSM database got unstuck and I was able to get my service profile applied.

Moral of the story

Just because a bug shows no workaround, it doesn’t mean you can’t fix it. It just means that the engineer who submitted the bug wasn’t able to.

Now I can finally get around to what I was originally trying to do today, getting scripted PXE installs of ESXi 4.1 working.

Tags: ···········

6 responses so far ↓

  • 1 Sudheer // Mar 4, 2013 at 8:53 pm

    Thanks Colin , I was facing the exact problem, your workaround worked for me.

  • 2 Danny // May 23, 2014 at 5:58 am

    Thanks Colin , still worked for me in 2014 with a similar problem.

  • 3 John Dufour // Jul 17, 2014 at 1:46 pm

    WOW, what a pain in the ****. Thanks for the post, I had the same problem. All I did was create a new local disk profile and the error went away. Hope there is a future perm fix for this.

  • 4 Ibrahim // Aug 1, 2014 at 3:50 am

    Thanks for the post, I have two UCSM and I have configured NO Raid on the old UCS environment but the New UCS environment had Protected configuration selected which was stoping me to proceed.
    Regards,
    Ibrahim

  • 5 T-Bone // Nov 19, 2014 at 10:15 am

    Thank you for this post! Was driving me up a wall. I deselected ‘Protected’ and the fault disappeared.

  • 6 Kai Rogal // Nov 20, 2014 at 9:17 am

    Great posting, I was looking for a solution quiet a while already.

    Regards,
    Kai

Leave a Comment