SOS requires restart
Moderator: SourceGear
SOS requires restart
SOS Version: 4.1.2
VSS: 6.0d (Build 31222)
OS: Windows 2003 w/sp1
Hardware: 1x3.4Ghz Xeon w/o HT; 1 GB of RAM; 3 x72 GB raid 5
So I've been searching for help with this thing. Just when the it looks like the thread is going somewhere, it ends without any conclusion.
What have I done so far to troubleshoot this .... headache?
First I rebuild this server from scratch.
Then I loaded VSS 6.0d (Build 31222) and install SOS 4.1.2. Made sure that the second NIC is disabled. Made sure HT is disabled.
Then my users would do their thing, and next thing I know, they have to restart the SOS service because the users can't log in. It just hangs there. Nothing happens. Once they restart the service, about 1/2 a day later, the cycle starts all over again. There are nothing on the event logson the server. Enabled verbose messaging on the SOS server and I can't see anything.
I hardly see any CPU utilization nor do I see any memory used. The SOS service lingers at about 64 MB. It doesn't want any more or any less.
I tried to troubleshoot this thing. At first I was convinced that it was the IT guy who build this server incorrectly because he had version 3.5.3 on a Windows 2003 server. You know that MSJVM issue. But now, I don't think that is the case.
Or do we need to have a dedicated SOS server for each of the developers just so that we have something stable. I need some help with this issue. Having these developers half way around the world is not helping with my sleep.
Any ideas?
Thanks
VSS: 6.0d (Build 31222)
OS: Windows 2003 w/sp1
Hardware: 1x3.4Ghz Xeon w/o HT; 1 GB of RAM; 3 x72 GB raid 5
So I've been searching for help with this thing. Just when the it looks like the thread is going somewhere, it ends without any conclusion.
What have I done so far to troubleshoot this .... headache?
First I rebuild this server from scratch.
Then I loaded VSS 6.0d (Build 31222) and install SOS 4.1.2. Made sure that the second NIC is disabled. Made sure HT is disabled.
Then my users would do their thing, and next thing I know, they have to restart the SOS service because the users can't log in. It just hangs there. Nothing happens. Once they restart the service, about 1/2 a day later, the cycle starts all over again. There are nothing on the event logson the server. Enabled verbose messaging on the SOS server and I can't see anything.
I hardly see any CPU utilization nor do I see any memory used. The SOS service lingers at about 64 MB. It doesn't want any more or any less.
I tried to troubleshoot this thing. At first I was convinced that it was the IT guy who build this server incorrectly because he had version 3.5.3 on a Windows 2003 server. You know that MSJVM issue. But now, I don't think that is the case.
Or do we need to have a dedicated SOS server for each of the developers just so that we have something stable. I need some help with this issue. Having these developers half way around the world is not helping with my sleep.
Any ideas?
Thanks
This sounds like a crash in the VSS automation component, since restarting the SOS Server gets things running again. The version of the VSS Client you're using on the SOS Server machine isn't one with a known issue, but we haven't extensively tested it with SOS. Is there another ssapi.dll on that machine? Perhaps the wrong one is registered:
http://support.sourcegear.com/viewtopic.php?t=255
You could try the Microsoft Hotfix version instead:
http://download.sourcegear.com/files/vss_60c_hotfix.zip
If these suggestions don't work, set SOS logging to Verbose and send me a copy of the log.txt file the next time SOS hangs.
http://support.sourcegear.com/viewtopic.php?t=255
You could try the Microsoft Hotfix version instead:
http://download.sourcegear.com/files/vss_60c_hotfix.zip
If these suggestions don't work, set SOS logging to Verbose and send me a copy of the log.txt file the next time SOS hangs.
Linda Bauer
SourceGear
Technical Support Manager
SourceGear
Technical Support Manager
I've done what you had asked......
Here's my log file...
Thanks
Thanks
- Attachments
-
- log.txt
- (635.18 KiB) Downloaded 898 times
There are many errors in the log that point to network problems outside of SOS:
If the server hangs are being caused by a crash of the VSS automation component, I would suggest changing the version used by the SOS Server. We have not done extensive testing of the version you are using.
However, we do know that the Microsoft Hotfix version is one of the more stable versions to use with SOS. I had suggested trying this earlier -- are you using this now and still having issues?
http://download.sourcegear.com/files/vss_60c_hotfix.zip
If this doesn't help, the next things to try are:
Run Analyze on your VSS database to make sure database inconsistencies are not causing the VSS automation component and the SOS Server to hang:
http://support.sourcegear.com/viewtopic.php?t=50
Use -f and then -c until no more errors are reported.
Since you have 65 users, you also may want to consider splitting the load among two or more SOS Servers. You can install the SOS Server on one or more machines and have each SOS Server serve a certain number of users.
We have seen this error when there is a concurrency crash:2/21/2006 8:10:35 AM - 1: ParseStream created a null message!
2/21/2006 8:17:45 AM - 2: Error processing client request: at System.Net.Sockets.Socket.Receive(Byte[] buffer, Int32 offset, Int32 size, SocketFlags socketFlags)
at System.Net.Sockets.Socket.Receive(Byte[] buffer, Int32 size, SocketFlags socketFlags)
at ClassicService.ProtocolMessage.ParseStream(Socket socket, Crypto crypto)
at ClassicService.Client.GetMessage()
2/21/2006 1:33:00 PM - The requested name is valid, but no data of the requested type was found
So it's possible there are network errors as well as issues with the VSS Automation Component.2/21/2006 8:37:12 PM - 22: Exception: A blocking operation was interrupted by a call to WSACancelBlockingCall
If the server hangs are being caused by a crash of the VSS automation component, I would suggest changing the version used by the SOS Server. We have not done extensive testing of the version you are using.
However, we do know that the Microsoft Hotfix version is one of the more stable versions to use with SOS. I had suggested trying this earlier -- are you using this now and still having issues?
http://download.sourcegear.com/files/vss_60c_hotfix.zip
If this doesn't help, the next things to try are:
Run Analyze on your VSS database to make sure database inconsistencies are not causing the VSS automation component and the SOS Server to hang:
http://support.sourcegear.com/viewtopic.php?t=50
Use -f and then -c until no more errors are reported.
Since you have 65 users, you also may want to consider splitting the load among two or more SOS Servers. You can install the SOS Server on one or more machines and have each SOS Server serve a certain number of users.
Linda Bauer
SourceGear
Technical Support Manager
SourceGear
Technical Support Manager
found the fix
Just so that anyone who has database on Novell server, you need to add "sharable" attribute to rights.dat file.
No more restarts since
No more restarts since