SQL Replication - New subscriber timeout *sometimes*
-
Wednesday, December 07, 2011 7:11 AM
Hello.
I have a new replication architecture: SQL Server 2008 (Windows Server 2008 64-bit) is the publisher, and the subscribers run SQL Server 2008 Express.
Synchronization goes through Web Synchronization (replisapi.dll) on another server (Windows Server 2008 64-bit) in the same datacenter as the SQL Server 2008 publisher. I did not enable FTP for replication because it was difficult with the firewall and the company's internal policy.
Everything works well, except that for the past few weeks some new laptops (subscribers) get an error the first time they sync.
What is strange is that after retrying again and again (an automatic sync is configured on the subscriber), it can eventually work, without changing anything on the subscriber or the publisher.
There are not many subscribers, maybe 20, and they don't all sync every minute at the same time (though it can happen).
The error is the following:
On the subscriber I get: SQLServer.Replication.ComErrorException (0x80004005): the connection with the server was terminated abnormally (at MergeSynchronizationAgent.Run).
I checked websync.log on the IIS server and found these strange error lines:
pListener , 2011/12/05 16:54:23.513, 1044, 2755, S1, ERROR: ErrNo = 0x80070715, ErrSrc = <null>, ErrType = 10, ErrStr = The process could not write the response message due to OS error 995.
CHttpListener , 2011/12/05 16:54:23.513, 1044, 1271, S1, ERROR: ErrNo = 0x80070715, ErrSrc = <null>, ErrType = 10, ErrStr = The processing of the response message failed.
I have no more information on the subscriber, even with LogLevel enabled. Replication Monitor shows nothing more either: it just adds the subscriber, starts the sync (only 30 sec), and then nothing more. The error occurs after a few minutes.
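For anyone trying to read the two websync.log lines quoted above, here is a minimal Python sketch that pulls out the main fields and splits the HRESULT into its parts. The line layout is assumed only from the lines quoted in this thread, and `parse_websync_error` and `decode_hresult` are hypothetical helper names, not part of any SQL Server tooling. For 0x80070715 it gives facility 7 (Win32) and code 1813. The "OS error 995" in the message text is the Windows code ERROR_OPERATION_ABORTED, which means an I/O operation was aborted.

```python
import re

# Layout assumed from the websync.log lines quoted in this thread.
LINE_RE = re.compile(
    r"^(?P<source>\w+)\s*,\s*"
    r"(?P<timestamp>\d{4}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}\.\d{3}),"
    r".*?ERROR: ErrNo = (?P<errno>0x[0-9A-Fa-f]+),"
    r".*?ErrStr = (?P<errstr>.*)$"
)


def decode_hresult(value):
    """Split a 32-bit HRESULT into severity, facility and code."""
    return {
        "severity": (value >> 31) & 0x1,    # 1 means failure
        "facility": (value >> 16) & 0x7FF,  # 7 is FACILITY_WIN32
        "code": value & 0xFFFF,             # low 16 bits
    }


def parse_websync_error(line):
    """Return the parsed fields of one ERROR line, or None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    errstr = match.group("errstr")
    os_error = re.search(r"OS error (\d+)", errstr)
    return {
        "source": match.group("source"),
        "timestamp": match.group("timestamp"),
        "hresult": decode_hresult(int(match.group("errno"), 16)),
        "os_error": int(os_error.group(1)) if os_error else None,
        "message": errstr,
    }
```

Running every line of websync.log through `parse_websync_error` and grouping by timestamp makes it easier to line up the server-side errors with the time of the failure on the subscriber.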
The SQL Merge agent profile is the default one.
My question is: could there be an option on IIS (Web Synchronization), such as a timeout, that needs to be set?
Could the fact that FTP is not configured be causing the problem with the first sync?
Best Regards,
Olivier
All Replies
-
Wednesday, December 07, 2011 7:55 AM Moderator
Olivier,
I used to get these types of errors via Web Synchronization although I'm unsure if you're facing the same issue.
My issue is covered here: http://support.microsoft.com/kb/942712
Are you seeing any Faulting application w3wp.exe events in the IIS logs?
I was able to work around my issue by limiting @max_concurrent_merge to 1, and was able to completely resolve the issue by applying the latest service pack.
Hope this helps.
www.sqlrepl.com
-
Wednesday, December 07, 2011 9:15 AM
Hello Brandon.
I also have this kind of error, but in another datacenter with Windows 2003 and SQL 2005; there I do get IIS problems like this (a message box and an entry in the Event Viewer).
But that is not the case in this new datacenter, and there is nothing in the Event Viewer about IIS problems on the sync server.
Also, setting max_concurrent_merge to 1 is not really a good option. In the coming months I may have 100 subscribers that may sync every minute. What happens if one subscriber has not synced for 3 weeks? Its sync may take an hour, and all the others would be unable to sync during that time? No, we can't do that.
-
Monday, January 16, 2012 10:05 AM
Hmm, no reply, no info about that :(
I still don't know whether not using FTP with merge replication has an influence or not.
-
Wednesday, January 18, 2012 2:25 PM
Browsing MSDN, I found that FTP can be used, but only between the database in the datacenter and the computer running IIS in the datacenter. It also says that this is not recommended and that not using FTP is better.
Hmm, OK, so I can do a full merge replication without this FTP setting.
But in that case, what could be the solution to my problem, where "some" subscribers cannot complete a first full sync and I get errors?
I really need some help, because this problem has existed for a month now and I have no clue how to debug or solve it.
Any idea, clue, comment, or feedback (or just a smile to say that you are with me ;) ) would be good.
-
Friday, January 20, 2012 11:20 AM
New info.
It seems that customers have the trouble when they use the company intranet (it's a big company). Over a normal internet connection there is no problem.
Error on WebSync (replisapi.dll)
CHttpListener , 2012/01/20 11:07:15.994, 3028, 2755, S1, ERROR: ErrNo = 0x80070715, ErrSrc = <null>, ErrType = 10, ErrStr = The process could not write the response message due to OS error 995.
CHttpListener , 2012/01/20 11:07:15.994, 3028, 1271, S1, ERROR: ErrNo = 0x80070715, ErrSrc = <null>, ErrType = 10, ErrStr = The processing of the response message failed.
Error on Subscriber (RMO) with DebugLevel set to 3
2012-01-20 11:07:01.058 Downloaded snapshot file 'user_job_description_240.cft'.
2012-01-20 11:07:01.165 OLE DB Subscriber '%pcName%': {call sys.sp_MSadd_merge_history90 (?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?)}
2012-01-20 11:07:01.173 Downloaded snapshot file 'job_descriptio5154d564_241.sch'.
2012-01-20 11:07:01.304 OLE DB Subscriber '%pcName%': {call sys.sp_MSadd_merge_history90 (?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?)}
2012-01-20 11:07:01.313 Downloaded snapshot file 'job_descriptio5154d564_241.cft'.
2012-01-20 11:07:01.459 OLE DB Subscriber '%pcName%': {call sys.sp_MSadd_merge_history90 (?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?)}
2012-01-20 11:07:01.468 Downloaded snapshot file 'db_modify_242.sch'.
2012-01-20 11:07:01.579 OLE DB Subscriber '%pcName%': {call sys.sp_MSadd_merge_history90 (?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?)}
2012-01-20 11:07:01.589 Downloaded snapshot file 'db_modify_242.cft'.
2012-01-20 11:07:01.691 OLE DB Subscriber '%pcName%': {call sys.sp_MSadd_merge_history90 (?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?)}
2012-01-20 11:07:01.705 Downloaded snapshot file 'MSmerge_tombstone.bcp'.
2012-01-20 11:07:16.109 OLE DB Subscriber '%pcName%': {call sys.sp_MSadd_merge_history90 (?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?)}
2012-01-20 11:07:16.121 The connection with the server was terminated abnormally
I have no idea... Could there be a fix to apply to WebSync (replisapi.dll)? A bug? Something to change in IIS? Something to change in the agent profile (the default one)?
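One way to read the subscriber log above is to look for the longest pause between consecutive entries. The Python sketch below does that. It assumes every entry starts with a `YYYY-MM-DD HH:MM:SS.mmm` timestamp, as in the excerpt, and `parse_entries` and `largest_gap` are hypothetical helper names. On the excerpt, the longest pause is about 14.4 seconds, between the download of 'MSmerge_tombstone.bcp' at 11:07:01.705 and the next entry at 11:07:16.109, right before the connection is terminated. That falls in the same second-range as the 11:07:15.994 error in websync.log.

```python
import re
from datetime import datetime

# Timestamp prefix assumed from the agent log excerpt in this thread.
TS_RE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3})\s+(.*)$")


def parse_entries(lines):
    """Return (timestamp, message) pairs for lines that start with a timestamp."""
    entries = []
    for line in lines:
        match = TS_RE.match(line.strip())
        if match:
            stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S.%f")
            entries.append((stamp, match.group(2)))
    return entries


def largest_gap(lines):
    """Return (seconds, message_before, message_after) for the longest pause.

    Returns None when fewer than two timestamped entries are found.
    """
    entries = parse_entries(lines)
    best = None
    for (t0, msg0), (t1, msg1) in zip(entries, entries[1:]):
        gap = (t1 - t0).total_seconds()
        if best is None or gap > best[0]:
            best = (gap, msg0, msg1)
    return best
```

Calling `largest_gap` on the lines of a saved agent log and comparing the result across a failing intranet sync and a working internet sync may show whether the stall always happens on the same file.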
-
Thursday, June 07, 2012 1:50 PM
Hi Olivier,
I have experienced similar problems with snapshots. My issue was that we use web sync to send the snapshot files to the client, and the snapshot files were too big to load into the web server's memory, which caused the web process to crash.
I think the crash happens because replisapi.dll loads each snapshot file into memory (I watched this in taskmgr) before sending it to the client. Some of our snapshot files were 5 GB or more.
Using FTP or a share produced the same results in my case, because FTP or a share only changes how the snapshot file is sent to the web server, not how it is sent to the client.
I applied SQL SP3 to our production DB servers and also updated replisapi.dll (from SQL SP3) on the web servers, and unfortunately had no luck. The only way I could give the client the snapshot was to use the -altsnapshotfolder parameter on the client merge job and hand them the files manually to apply locally.
Please let me know if you have discovered a better solution, because I would really like to remove my manual intervention :)

