Greetings,<br>
We had a case where a server with external disks had a bad problem. Somehow the power cord for the disks got pulled out and the disk stopped responding (of course).<br>
Oddly enough, the kernel kept running, at least for the time being, and the box responded to pings, snmp traffic, etc.<br>
Our mgmt. software never knew there was anything amiss with the box.<br>
Now that the post mortum is in, managers want to know if we can know when the disk stops responding on any given system.<br>
This is a dilemma. Any shell program that runs periodically will not suffice -- it won't be able to read the script, and even if it does run (as in constantly memory resident) what would a script do? In all probability it would pend waiting for disk I/O and never respond.<br>
Has anyone ever worked out a problem like this before?
We had a case where a server with external disks had a bad problem. Somehow the power cord for the disks got pulled out and the disk stopped responding (of course).<br>
Oddly enough, the kernel kept running, at least for the time being, and the box responded to pings, snmp traffic, etc.<br>
Our mgmt. software never knew there was anything amiss with the box.<br>
Now that the post mortum is in, managers want to know if we can know when the disk stops responding on any given system.<br>
This is a dilemma. Any shell program that runs periodically will not suffice -- it won't be able to read the script, and even if it does run (as in constantly memory resident) what would a script do? In all probability it would pend waiting for disk I/O and never respond.<br>
Has anyone ever worked out a problem like this before?