Howdy folks --- after a long day we've gotten this resolved! yay! A very long day; the below has been posted to a specialized mailing list for the law office system we use but the story and solution, symptoms and such are there.
mark
Evening/Afternoon,
Last night I got a call from the law firm I worked at and still contract with regarding Prolaw and their server systems. The problem seemed simple: could not logon to the server at the console or via Terminal Services. Several hours of troubleshooting last night with me 2,700 miles away and the person in the office having similar results.
I VPN'd into the office network and RDP'd to a user's workstation with the 2003 Admin tools installed (didn't need them just the normal XP ones would work), no event log errors, nothing really out of shape in services, I went ahead and disabled the Prolaw Exchange agent becuase there were a few hundred PID entries within about 1 minute and there were a number of items in the application log regarding the event sync and transactions or something so that seemed like a place to start; no go.
Next said ok, lets disable the AV systems and see, so hadthe person at the office boot the server to safe mode, disable the av on the server and reboot, still no go. AT this point I e-mailed a real expert (read true IT professional with MCSE style credentials and such over at Unif of Pacific where I wnet to school) to see if they'd go look this weekend on site becuase I was stumped. Left the servers attempting to boot but users could login.
When I say attempting to boot, the servers (a SBS 2k3 w/ Exchange and a 2003 Standard with MS-SQL SP3a) hung at "Applying computer settings", attempted login through TS/RDP and hung at "Applying personal settings". We could telnet into both servers and execute commands at the command line and had limited RPC connectivity from XP workstations connected to another computer. The servers would still "serv", users could login, access files and run prolaw. The SQL Server agent was stuck in a start state as well as the Retrospect Helper services (system backup software -
This morning users logged in OK, although we still had the prolaw agent disabled (we have had a lot of problems with it). Still out of ideas and googling and searching on Tek-Tips.com (btw, it apears that people are having issues with Symanted AV corp v. 10 so be advised) I posted to a yahoo group dealing with Prolaw about if anybody else was having trouble and gave them my AOL screennames and MSN messenger account asing them to contact me, a few minutes later I got an e-mail back on on the way out the door I ahd the person call my cell phone while I drove to meet sombody for lunch. I spoke with somebody from: reinischmackenzielaw.com in Portland who was having the same problems.
His inital research indicated this may be a DDOS attack on port 3389 (RDP/TS) and to close the port from direct internet connectivity. Did that, restarted servers during lunch, same problem.
About an hour later, begin doing more Troubleshooting and discover that the APC agent and APC server on both servers is hung on start up. E-mailed the person I spoke with and he was on the phone with APC who was aware of the problem (though they still charged him $250 for the call) and he had been on the phone with Microsoft. The solution he had was: boot your server to safe mode, disable the APC services, reboot to normal, uninstall the APC systems and install the latest version (
At CLG we have 2 servers which take a very long time to shut down (especially with exchange) and with the state they were in we couldn't do a normal power down procedure, we had to hold power button in for 10 seconds (the person there) which is not the ideal way to do things at all!
As an alternative to that since I could get into the servers via Telnet and have access to the command line, we used the following two commands:
Code:
taskkill /IM apcserver.exe /f
taskkill /IM apcagent.exe /f
Immediately on doing these the pending logins for TS logged in, and the consoles presented thier logon screens. We promptly set the APC services in services manager to disabled and uninstalled the APC agents. We are waiting until tomorrow morning and going to reboot again just to ensure that this was the problem before we install the newer ones.
A possible alternative to this method and if you have control over your group policies would be to set the policy to disable the APC services if you can and then reboot and hope your servers goto update their policy -- I have nto tested this scenario, it just makes sense if you have a lot of servers and are using GPO's to help control your services states.
These problems started 48-72 hours ago after a "spontaneous reboot" as it was described to me by the person in the office; I suspect that the servers are set (not a setting I maintain on servers usually just for this reason) to auto update themselves from windows update and something required a restart which caused thei APC disruption.
Mark Lappin
Contractor to Calone Law Group, LLp in Stockton, CA