Hello,
I know this post might go unanswered but it has been a very long 28 hour day already and I am becoming quite worried and frustrated.
I have a VM with 100GB disk, 200GB provisioned size on an ESXI 5 server. It had one snapshot which had a 16GB size. I, now know, made the mistake of letting this snapshot run for 2 months. When I tried to commit this snapshot to the parent disk, this action is now going on for 16+ hours.
There is disk activity, the log get written every so often, and I see that it is going though a cycle of writing to two files:
000001-delta.vmdk and 000002-delta.vmdk
Around 1.5GB gets written on to number one, then it goes to number 2 and does it all over again, it has done this cycle at least 3 times already. The progress bar on teh vSphere Client shows 85% completed. It gains a few % points on every cycle.
Some more information, there is high CPU usage, has been pretty high since the process started. I had to shutdown two other VMs for the night because they were being affected by the high CPU usage. There is enough space on the HD, around 140GB free.
Here are some of the last lines written to the log:
SnapshotVMXConsolidateOnlineCB: Starting combine of scsi0:0 /vmfs/volumes/4dc2024d-f7148cb4-d760-9c8e9916a9fa/Ubuntu Server 10 - Zimbra/Ubuntu Server 10 - Zimbra-000002.vmdk. 2 links, starting from 1
2012-06-22T05:54:22.056Z| vcpu-0| SnapshotVMXConsolidateOnlineCB: Created thread 10 for scsi0:0
2012-06-22T05:54:22.079Z| SnapshotVMXCombiner| DISKLIB-LIB : Upward Combine 2 links at 1. Need 0 MB of free space (163155 MB available)
2012-06-22T05:54:22.079Z| SnapshotVMXCombiner| DISKLIB-DDB : "longContentID" = "9af46be9fbc154aa1ea42132f1c38dbc" (was "76214bdf1195fd40dac07a99a21e7b84")
2012-06-22T05:54:22.330Z| vcpu-1| HBACommon: First write on scsi0:0.fileName='/vmfs/volumes/4dc2024d-f7148cb4-d760-9c8e9916a9fa/Ubuntu Server 10 - Zimbra/Ubuntu Server 10 - Zimbra-000002.vmdk'
2012-06-22T05:54:22.330Z| vcpu-1| DISKLIB-DDB : "longContentID" = "edebe0011137a528aad6303433d05e3c" (was "9af46be9fbc154aa1ea42132f1c38dbc")
2012-06-22T05:54:22.444Z| vcpu-1| DISKLIB-CHAIN : DiskChainUpdateContentID: old=0xf1c38dbc, new=0x33d05e3c (edebe0011137a528aad6303433d05e3c)
2012-06-22T05:55:22.906Z| vmx| GuestRpcSendTimedOut: message to toolbox-dnd timed out.
2012-06-22T06:38:30.376Z| vcpu-0| VIDE: (0x170) Switching active drive from 0 to 1 while command 0xa0 is in progress (status 0x58).
2012-06-22T06:38:30.377Z| vcpu-0| VIDE: IDE1:0: Aborting command a0, I/O was not in progress
2012-06-22T06:38:31.284Z| vcpu-0| VIDE: Already in check condition 02 3a 00
2012-06-22T06:38:31.302Z| vcpu-0| VIDE: Already in check condition 02 3a 00
2012-06-22T07:50:53.884Z| vcpu-0| VIDE: (0x170) Switching active drive from 0 to 1 while command 0xa0 is in progress (status 0x58).
2012-06-22T07:50:53.884Z| vcpu-0| VIDE: IDE1:0: Aborting command a0, I/O was not in progress
2012-06-22T07:50:54.874Z| vcpu-0| VIDE: Already in check condition 02 3a 00
2012-06-22T07:50:54.875Z| vcpu-0| VIDE: Already in check condition 02 3a 00
2012-06-22T09:12:33.985Z| vcpu-5| VIDE: (0x170) Switching active drive from 0 to 1 while command 0xa0 is in progress (status 0x58).
2012-06-22T09:12:33.985Z| vcpu-5| VIDE: IDE1:0: Aborting command a0, I/O was not in progress
2012-06-22T09:12:34.846Z| vcpu-5| VIDE: Already in check condition 02 3a 00
2012-06-22T09:12:37.112Z| vcpu-5| VIDE: Already in check condition 02 3a 00
2012-06-22T09:12:49.794Z| vcpu-5| VIDE: (0x170) Switching active drive from 0 to 1 while command 0xa0 is in progress (status 0x58).
2012-06-22T09:12:49.794Z| vcpu-5| VIDE: IDE1:0: Aborting command a0, I/O was not in progress
2012-06-22T09:12:50.618Z| vcpu-5| VIDE: Already in check condition 02 3a 00
2012-06-22T09:12:50.619Z| vcpu-5| VIDE: Already in check condition 02 3a 00
2012-06-22T09:15:20.652Z| vcpu-3| VIDE: (0x170) Switching active drive from 0 to 1 while command 0xa0 is in progress (status 0x58).
2012-06-22T09:15:20.652Z| vcpu-3| VIDE: IDE1:0: Aborting command a0, I/O was not in progress
2012-06-22T09:15:49.177Z| vcpu-3| VIDE: Already in check condition 02 3a 00
2012-06-22T09:15:49.262Z| vcpu-3| VIDE: Already in check condition 02 3a 00
2012-06-22T09:29:25.838Z| vcpu-0| VIDE: (0x170) Switching active drive from 0 to 1 while command 0xa0 is in progress (status 0x58).
2012-06-22T09:29:25.838Z| vcpu-0| VIDE: IDE1:0: Aborting command a0, I/O was not in progress
2012-06-22T09:29:36.733Z| vcpu-0| VIDE: Already in check condition 02 3a 00
2012-06-22T09:29:36.793Z| vcpu-0| VIDE: Already in check condition 02 3a 00
Any advice will be greatly appreciated. Let me know if anyone has gone through this ordeal and how they got out of this mess.
Thank you in advance.
GPB