Strange cluster behavior: two nodes are fighting over the Windows cluster, MSDTC, and SQL cluster IP addresses.

  • Question

  • Very strange; we never had this problem before.

    I checked the system log and found: The node lost communication with cluster node 'NEOFISNODE1' on network 'Private'.

    The node lost communication with cluster node 'NEOFISNODE1' on network 'Public'.

    In other words, the two nodes lost contact with each other.


    If you haven't all the things you want, be grateful for the things you don't have that you didn't want.
    December 14, 2011 10:12

Answers

All replies

  • Cluster node NEOFISNODE1 was removed from the active server cluster membership. Cluster service may have been stopped on the node, the node may have failed, or the node may have lost communication with the other active server cluster nodes.
    December 14, 2011 10:14
  • You have to use static IP addresses for everything on the cluster. Double-check whether anything gets its IP address from DHCP, and fix it if you find any.
    December 14, 2011 13:59
  • Of course, everything on our cluster uses static IP addresses. Also, I traced the MAC address that was contending for the IP addresses and found that it belongs to the other node in this cluster. In other words, the two nodes are fighting each other over the three IP addresses.
    December 14, 2011 23:57
  • Also, in Cluster Administrator I see that the Public NIC is in mixed mode (Public & Private). Should it be changed to public-only mode?
    December 15, 2011 0:07
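  • Here is my guess; please correct me if I am wrong.

    Suppose the currently active node is A and the standby node is B.

    When A and B lose contact, the resources fail over to B. But then B suddenly detects that A is alive again, and at that point A tries to grab the resources back. That is how the IP address conflict arises.

    I am not sure whether this is correct.
    December 15, 2011 1:32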
  • The public link should carry both public connections and heartbeat, so the cluster still has a heartbeat if the private link goes down. It looks like the nodes lost communication in this case and both tried to start the cluster resources, which caused the address conflict. You have to find out why the nodes lost communication.
    December 15, 2011 2:27
  • Does losing communication simply mean there was a network problem, i.e. the nodes could not ping each other?
    December 15, 2011 2:29
  • Possibly, but you have to confirm it. Check the Windows event logs and the cluster log file.
    December 15, 2011 2:30
  • Ugh, it happened again at noon today...
    December 15, 2011 5:31
  • I looked at the cluster log, but I cannot really make sense of it.

     

    00000d68.00000da8::2011/12/14-17:23:03.775 INFO [Qfs] GetDiskFreeSpaceEx Q:\MSCS\, status 0

    00000d68.0000c750::2011/12/14-17:23:45.042 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer

    00000d68.0000c750::2011/12/14-17:23:45.042 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\compaq\LOCALS~1\Temp\, CLS, 559 => C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS22F.tmp, status 0

    00000d68.0000c750::2011/12/14-17:23:45.042 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS22F.tmp, status 0

    00000d68.0000c750::2011/12/14-17:23:45.058 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS22F.tmp, status 0

    00000d68.0000c750::2011/12/14-17:23:45.058 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2

    00000d68.0000c750::2011/12/14-17:23:45.058 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS22F.tmp to file Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT

    00000d68.0000c750::2011/12/14-17:23:45.058 INFO [Qfs] QfsCreateDirectory Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d, status 183

    00000d68.0000c750::2011/12/14-17:23:45.058 INFO [Qfs] QfsOpenFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS22F.tmp => 3, 3e4f960 status 0

    00000d68.0000c750::2011/12/14-17:23:45.058 INFO [Qfs] QfsOpenFile Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT => 2, 3e4f8d0 status 183

    00000d68.0000c750::2011/12/14-17:23:45.058 INFO [Qfs] ReadFile 948 (regf) 32768 16384, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:23:45.058 INFO [Qfs] WriteFile 16bc (regf) 16384, status 0 (0=>0)

    00000d68.0000c750::2011/12/14-17:23:45.058 INFO [Qfs] ReadFile 948 (regf) 32768 0, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:23:45.073 INFO [Qfs] QfsFlushBuffers 16bc, status 0

    00000d68.0000c750::2011/12/14-17:23:45.073 INFO [Qfs] QfsCloseHandle 948, status 0

    00000d68.0000c750::2011/12/14-17:23:45.073 INFO [Qfs] QfsCloseHandle 16bc, status 0

    00000d68.0000c750::2011/12/14-17:23:45.073 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS22F.tmp, status 0

    00000d68.0000c750::2011/12/14-17:24:45.059 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer

    00000d68.0000c750::2011/12/14-17:24:45.059 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\compaq\LOCALS~1\Temp\, CLS, 560 => C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS230.tmp, status 0

    00000d68.0000c750::2011/12/14-17:24:45.059 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS230.tmp, status 0

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS230.tmp, status 0

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS230.tmp to file Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [Qfs] QfsCreateDirectory Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d, status 183

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [Qfs] QfsOpenFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS230.tmp => 3, 3e4f960 status 0

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [Qfs] QfsOpenFile Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT => 2, 3e4f8d0 status 183

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [Qfs] ReadFile 16bc (regf) 32768 16384, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [Qfs] WriteFile 948 (regf) 16384, status 0 (0=>0)

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [Qfs] ReadFile 16bc (regf) 32768 0, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [Qfs] QfsFlushBuffers 948, status 0

    00000d68.0000c750::2011/12/14-17:24:45.075 INFO [Qfs] QfsCloseHandle 16bc, status 0

    00000d68.0000c750::2011/12/14-17:24:45.090 INFO [Qfs] QfsCloseHandle 948, status 0

    00000d68.0000c750::2011/12/14-17:24:45.090 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS230.tmp, status 0

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\compaq\LOCALS~1\Temp\, CLS, 561 => C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS231.tmp, status 0

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS231.tmp, status 0

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS231.tmp, status 0

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS231.tmp to file Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [Qfs] QfsCreateDirectory Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d, status 183

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [Qfs] QfsOpenFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS231.tmp => 3, 3e4f960 status 0

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [Qfs] QfsOpenFile Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT => 2, 3e4f8d0 status 183

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [Qfs] ReadFile 948 (regf) 32768 16384, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [Qfs] WriteFile 16bc (regf) 16384, status 0 (0=>0)

    00000d68.0000c750::2011/12/14-17:25:45.076 INFO [Qfs] ReadFile 948 (regf) 32768 0, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:25:45.092 INFO [Qfs] QfsFlushBuffers 16bc, status 0

    00000d68.0000c750::2011/12/14-17:25:45.092 INFO [Qfs] QfsCloseHandle 948, status 0

    00000d68.0000c750::2011/12/14-17:25:45.092 INFO [Qfs] QfsCloseHandle 16bc, status 0

    00000d68.0000c750::2011/12/14-17:25:45.092 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS231.tmp, status 0

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\compaq\LOCALS~1\Temp\, CLS, 562 => C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS232.tmp, status 0

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS232.tmp, status 0

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS232.tmp, status 0

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS232.tmp to file Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [Qfs] QfsCreateDirectory Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d, status 183

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [Qfs] QfsOpenFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS232.tmp => 3, 3e4f960 status 0

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [Qfs] QfsOpenFile Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT => 2, 3e4f8d0 status 183

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [Qfs] ReadFile 778 (regf) 32768 16384, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [Qfs] WriteFile 1384 (regf) 16384, status 0 (0=>0)

    00000d68.0000c750::2011/12/14-17:26:45.093 INFO [Qfs] ReadFile 778 (regf) 32768 0, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:26:45.109 INFO [Qfs] QfsFlushBuffers 1384, status 0

    00000d68.0000c750::2011/12/14-17:26:45.109 INFO [Qfs] QfsCloseHandle 778, status 0

    00000d68.0000c750::2011/12/14-17:26:45.125 INFO [Qfs] QfsCloseHandle 1384, status 0

    00000d68.0000c750::2011/12/14-17:26:45.125 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS232.tmp, status 0

    00000d68.0000c750::2011/12/14-17:27:45.095 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer

    00000d68.0000c750::2011/12/14-17:27:45.095 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\compaq\LOCALS~1\Temp\, CLS, 563 => C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS233.tmp, status 0

    00000d68.0000c750::2011/12/14-17:27:45.095 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS233.tmp, status 0

    00000d68.0000c750::2011/12/14-17:27:45.111 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS233.tmp, status 0

    00000d68.0000c750::2011/12/14-17:27:45.111 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2

    00000d68.0000c750::2011/12/14-17:27:45.111 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS233.tmp to file Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT

    00000d68.0000c750::2011/12/14-17:27:45.111 INFO [Qfs] QfsCreateDirectory Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d, status 183

    00000d68.0000c750::2011/12/14-17:27:45.111 INFO [Qfs] QfsOpenFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS233.tmp => 3, 3e4f960 status 0

    00000d68.0000c750::2011/12/14-17:27:45.111 INFO [Qfs] QfsOpenFile Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT => 2, 3e4f8d0 status 183

    00000d68.0000c750::2011/12/14-17:27:45.111 INFO [Qfs] ReadFile 16bc (regf) 32768 16384, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:27:45.111 INFO [Qfs] WriteFile 948 (regf) 16384, status 0 (0=>0)

    00000d68.0000c750::2011/12/14-17:27:45.111 INFO [Qfs] ReadFile 16bc (regf) 32768 0, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:27:45.126 INFO [Qfs] QfsFlushBuffers 948, status 0

    00000d68.0000c750::2011/12/14-17:27:45.126 INFO [Qfs] QfsCloseHandle 16bc, status 0

    00000d68.0000c750::2011/12/14-17:27:45.126 INFO [Qfs] QfsCloseHandle 948, status 0

    00000d68.0000c750::2011/12/14-17:27:45.126 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS233.tmp, status 0

    00000d68.00000da8::2011/12/14-17:28:03.783 INFO [Qfs] GetDiskFreeSpaceEx Q:\MSCS\, status 0

    00000d68.0000c750::2011/12/14-17:28:45.112 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer

    00000d68.0000c750::2011/12/14-17:28:45.112 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\compaq\LOCALS~1\Temp\, CLS, 564 => C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS234.tmp, status 0

    00000d68.0000c750::2011/12/14-17:28:45.112 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS234.tmp, status 0

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS234.tmp, status 0

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS234.tmp to file Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [Qfs] QfsCreateDirectory Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d, status 183

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [Qfs] QfsOpenFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS234.tmp => 3, 3e4f960 status 0

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [Qfs] QfsOpenFile Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT => 2, 3e4f8d0 status 183

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [Qfs] ReadFile 948 (regf) 32768 16384, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [Qfs] WriteFile 16bc (regf) 16384, status 0 (0=>0)

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [Qfs] ReadFile 948 (regf) 32768 0, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [Qfs] QfsFlushBuffers 16bc, status 0

    00000d68.0000c750::2011/12/14-17:28:45.128 INFO [Qfs] QfsCloseHandle 948, status 0

    00000d68.0000c750::2011/12/14-17:28:45.143 INFO [Qfs] QfsCloseHandle 16bc, status 0

    00000d68.0000c750::2011/12/14-17:28:45.143 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS234.tmp, status 0

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\compaq\LOCALS~1\Temp\, CLS, 565 => C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS235.tmp, status 0

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS235.tmp, status 0

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS235.tmp, status 0

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS235.tmp to file Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [Qfs] QfsCreateDirectory Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d, status 183

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [Qfs] QfsOpenFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS235.tmp => 3, 3e4f960 status 0

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [Qfs] QfsOpenFile Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT => 2, 3e4f8d0 status 183

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [Qfs] ReadFile 16bc (regf) 32768 16384, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [Qfs] WriteFile 948 (regf) 16384, status 0 (0=>0)

    00000d68.0000c750::2011/12/14-17:29:45.129 INFO [Qfs] ReadFile 16bc (regf) 32768 0, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:29:45.145 INFO [Qfs] QfsFlushBuffers 948, status 0

    00000d68.0000c750::2011/12/14-17:29:45.145 INFO [Qfs] QfsCloseHandle 16bc, status 0

    00000d68.0000c750::2011/12/14-17:29:45.161 INFO [Qfs] QfsCloseHandle 948, status 0

    00000d68.0000c750::2011/12/14-17:29:45.161 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS235.tmp, status 0

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [DM]DmpCheckpointTimerCb- taking a checkpoint

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogReset entry...

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogpReset entry...

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] QfsGetTempFileName Q:\MSCS\, tquolog, 566 => Q:\MSCS\tqu236.tmp, status 0

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogpCreate : Entry 

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] QfsOpenFile Q:\MSCS\tqu236.tmp => 4, 3ccf540 status 183

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogpMountLog : Entry pLog=0x04438830

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] QfsGetFileSize 784 0

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogpMountLog::Quorumlog File size=0x00000000

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogpInitLog : Entry pLog=0x04438830

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] QfsSetEndOfFile 784 32768, Status 0

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogpAppendPage : Writing 1024 bytes to disk at offset 0x00000000

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] WriteFile 784 (CLOG) 1024, status 0 (0=>0)

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] QfsFlushBuffers 784, status 0

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogpInitLog : NextLsn=0x00000408 FileAlloc=0x00000800 ActivePageOffset=0x00000400

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogpCreate : Exit with success

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogGetLastChkPoint:: Entry

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] ReadFile df8 (....) 1024 0, (0=>0) 4599608 status 997

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] QfsMapFileAndCheckSum Q:\MSCS\chkDD1.tmp, compatibility 1, ret 1 status 87

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] QfsMapFileAndCheckSum Q:\MSCS\chkDD1.tmp, compatibility 0, ret 0 status 0

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogGetLastChkPoint - Succeeded with normal CheckSum. Stored=57614, Retrieved=57614, MNS Compatible Retrieved=0

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogGetLastChkPoint: ChkPt File Q:\MSCS\chkDD1.tmp ChkPtSeq=3537 ChkPtLsn=0x00000408 Checksum=57614

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogGetLastChkPoint exit, returning 0x00000000

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [LM] LogCheckPoint entry

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] QfsGetTempFileName Q:\MSCS\, chkpt, 567 => Q:\MSCS\chk237.tmp, status 0

    00000d68.00000da8::2011/12/14-17:30:21.740 INFO [Qfs] QfsDeleteFile Q:\MSCS\chk237.tmp, status 0

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsRegSaveKey Q:\MSCS\chk237.tmp, status 0

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [DM] DmpGetSnapShotCb: DmpGetDatabase returned 0x00000000

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsGetTempFileName Q:\MSCS\, chkpt, 3537 => Q:\MSCS\chkDD1.tmp, status 0

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] DmpGetSnapshotCb: Checkpoint file name=Q:\MSCS\chkDD1.tmp Seq#=3537

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsMoveFileEx Q:\MSCS\chk237.tmp=>Q:\MSCS\chkDD1.tmp

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsIsOnline => 0, Status 1169

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsMapFileAndCheckSum Q:\MSCS\chkDD1.tmp, compatibility 0, ret 0 status 0

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] LogCheckPoint: ChkPtFile=Q:\MSCS\chkDD1.tmp Chkpt Trid=3537 CheckSum=57614

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] LogFlush : pLog=0x04438830 writing the 1024 bytes for active page at offset 0x00000400

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] WriteFile 784 (....) 1024, status 0 (0=>0)

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsFlushBuffers 784, status 0

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] LogCheckPoint: EndChkpt written. EndChkPtLsn =0x00000438 ChkPt Seq=3537 ChkPt FileName=Q:\MSCS\chkDD1.tmp

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] ReadFile 784 (....) 1024 0, (0=>0) 4438888 status 997

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] LogpCheckpoint : Writing 1024 bytes to disk at offset 0x00000000

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] WriteFile 784 (CLOG) 1024, status 997 (0=>0)

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsFlushBuffers 784, status 0

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsFlushBuffers 784, status 0

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] LogCheckPoint Exit

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] LogGetLastChkPoint:: Entry

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] ReadFile 784 (....) 1024 0, (0=>0) 4438888 status 997

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsMapFileAndCheckSum Q:\MSCS\chkDD1.tmp, compatibility 1, ret 1 status 87

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [Qfs] QfsMapFileAndCheckSum Q:\MSCS\chkDD1.tmp, compatibility 0, ret 0 status 0

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] LogGetLastChkPoint - Succeeded with normal CheckSum. Stored=57614, Retrieved=57614, MNS Compatible Retrieved=0

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] LogGetLastChkPoint: ChkPt File Q:\MSCS\chkDD1.tmp ChkPtSeq=3537 ChkPtLsn=0x00000408 Checksum=57614

    00000d68.00000da8::2011/12/14-17:30:21.755 INFO [LM] LogGetLastChkPoint exit, returning 0x00000000

    00000d68.00000da8::2011/12/14-17:30:21.771 INFO [Qfs] QfsCloseHandle df8, status 0

    00000d68.00000da8::2011/12/14-17:30:21.771 INFO [Qfs] QfsCloseHandle 784, status 0

    00000d68.00000da8::2011/12/14-17:30:21.771 INFO [Qfs] QfsMoveFileEx Q:\MSCS\tqu236.tmp=>Q:\MSCS\quolog.log

    00000d68.00000da8::2011/12/14-17:30:21.771 INFO [Qfs] QfsOpenFile Q:\MSCS\quolog.log => 4, 3ccf5d0 status 183

    00000d68.00000da8::2011/12/14-17:30:21.771 INFO [LM] LogpReset exit, returning 0x00000000

    00000d68.00000da8::2011/12/14-17:30:21.771 INFO [LM] LogReset exit, returning 0x00000000

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\compaq\LOCALS~1\Temp\, CLS, 568 => C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS238.tmp, status 0

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS238.tmp, status 0

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS238.tmp, status 0

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS238.tmp to file Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [Qfs] QfsCreateDirectory Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d, status 183

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [Qfs] QfsOpenFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS238.tmp => 3, 3e4f960 status 0

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [Qfs] QfsOpenFile Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT => 2, 3e4f8d0 status 183

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [Qfs] ReadFile 758 (regf) 32768 16384, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [Qfs] WriteFile df8 (regf) 16384, status 0 (0=>0)

    00000d68.0000c750::2011/12/14-17:30:45.147 INFO [Qfs] ReadFile 758 (regf) 32768 0, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:30:45.162 INFO [Qfs] QfsFlushBuffers df8, status 0

    00000d68.0000c750::2011/12/14-17:30:45.162 INFO [Qfs] QfsCloseHandle 758, status 0

    00000d68.0000c750::2011/12/14-17:30:45.178 INFO [Qfs] QfsCloseHandle df8, status 0

    00000d68.0000c750::2011/12/14-17:30:45.178 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS238.tmp, status 0

    00000d68.0000c750::2011/12/14-17:31:45.148 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer

    00000d68.0000c750::2011/12/14-17:31:45.148 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\compaq\LOCALS~1\Temp\, CLS, 569 => C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS239.tmp, status 0

    00000d68.0000c750::2011/12/14-17:31:45.148 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS239.tmp, status 0

    00000d68.0000c750::2011/12/14-17:31:45.164 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS239.tmp, status 0

    00000d68.0000c750::2011/12/14-17:31:45.164 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2

    00000d68.0000c750::2011/12/14-17:31:45.164 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS239.tmp to file Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT

    00000d68.0000c750::2011/12/14-17:31:45.164 INFO [Qfs] QfsCreateDirectory Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d, status 183

    00000d68.0000c750::2011/12/14-17:31:45.164 INFO [Qfs] QfsOpenFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS239.tmp => 3, 3e4f960 status 0

    00000d68.0000c750::2011/12/14-17:31:45.164 INFO [Qfs] QfsOpenFile Q:\MSCS\2bc951e1-0ce0-4179-8d86-9bb11474c57d\00000002.CPT => 2, 3e4f8d0 status 183

    00000d68.0000c750::2011/12/14-17:31:45.164 INFO [Qfs] ReadFile e00 (regf) 32768 16384, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:31:45.164 INFO [Qfs] WriteFile 16bc (regf) 16384, status 0 (0=>0)

    00000d68.0000c750::2011/12/14-17:31:45.164 INFO [Qfs] ReadFile e00 (regf) 32768 0, (0=>0) 0 status 0

    00000d68.0000c750::2011/12/14-17:31:45.179 INFO [Qfs] QfsFlushBuffers 16bc, status 0

    00000d68.0000c750::2011/12/14-17:31:45.179 INFO [Qfs] QfsCloseHandle e00, status 0

    00000d68.0000c750::2011/12/14-17:31:45.179 INFO [Qfs] QfsCloseHandle 16bc, status 0

    00000d68.0000c750::2011/12/14-17:31:45.179 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\compaq\LOCALS~1\Temp\CLS239.tmp, status 0

    00000d68.0000c750::2011/12/14-17:32:45.165 INFO [CP] CppRegNotifyThread checkpointing key SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQLServer to id 2 due to timer


    December 15, 2011 6:54
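Reading a dump like the one above by eye is tedious: every line shown is an INFO entry from the [Qfs], [CP], [LM], or [DM] components. Below is a minimal sketch of a filter, assuming each line follows the `pid.tid::timestamp LEVEL [component] message` layout seen in the excerpt. The function names and the choice of which components to treat as routine are illustrative assumptions, not part of any cluster tooling.

```python
import re
from collections import Counter

# Layout inferred from the excerpt, e.g.
# 00000d68.0000c750::2011/12/14-17:23:45.058 INFO [CP] CpSaveData: ...
LINE_RE = re.compile(
    r"^(?P<pid>[0-9a-f]{8})\.(?P<tid>[0-9a-f]{8})::"
    r"(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s+"
    r"(?P<level>[A-Z]+)\s+"
    r"\[(?P<component>[^\]]+)\]\s*(?P<message>.*)$"
)


def parse_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    m = LINE_RE.match(line.strip())
    return m.groupdict() if m else None


def summarize(lines):
    """Count parsed entries per (level, component) pair."""
    counts = Counter()
    for line in lines:
        entry = parse_line(line)
        if entry:
            counts[(entry["level"], entry["component"])] += 1
    return counts


def non_routine(lines, routine=("Qfs", "CP", "LM", "DM")):
    """Yield entries that are not INFO, or whose component is outside `routine`.

    The `routine` tuple is an assumption based on the components that
    dominate the excerpt; adjust it for a real log.
    """
    for line in lines:
        entry = parse_line(line)
        if entry and (entry["level"] != "INFO" or entry["component"] not in routine):
            yield entry


# Three lines copied from the excerpt above.
SAMPLE = [
    r"00000d68.00000da8::2011/12/14-17:23:03.775 INFO [Qfs] GetDiskFreeSpaceEx Q:\MSCS\, status 0",
    r"00000d68.0000c750::2011/12/14-17:23:45.058 INFO [CP] CpSaveData: checkpointing data id 2 to quorum node 2",
    r"00000d68.00000da8::2011/12/14-17:30:21.740 INFO [DM]DmpCheckpointTimerCb- taking a checkpoint",
]

if __name__ == "__main__":
    print(summarize(SAMPLE))
    print(list(non_routine(SAMPLE)))
```

Run against the full cluster.log from both nodes, a filter like this leaves only the entries worth reading around the time of the conflict.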
  • What time did you get the conflict? Double-check the system time on all nodes and make sure they are the same.
    December 15, 2011 14:41
  • Yes. The IP conflict I found occurred on December 14 between 5:23 PM and 5:30 PM, before 5:31 PM.

    That is the time span covered by the log above.

    This information comes from cluster.log.

    On both nodes, all I found were the "lost communication" messages.
    December 16, 2011 0:00
  • Did you check the system clock on all nodes? Are there any errors in the Windows system event log?
    December 16, 2011 4:12
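  • Let me paste the records from the system event viewer.

    Some background first: .40 is the SQL virtual IP, .38 is the Windows cluster IP, and .39 is the MSDTC IP. Drive R: holds the data and drive Q: is the quorum disk.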

    The system detected an address conflict for IP address 192.168.123.40 with the system having network hardware address 00:21:5E:C4:E3:76. Network operations on this system may be disrupted as a result.

    The system detected an address conflict for IP address 192.168.123.38 with the system having network hardware address 00:21:5E:C4:E3:76. Network operations on this system may be disrupted as a result.

    The node lost communication with cluster node 'NEOFISNODE2' on network 'Private'.

    Cluster service is requesting a bus reset for device \Device\ClusDisk0.

    The driver for device \Device\RaidPort1 performed a bus reset upon request.

    The system detected an address conflict for IP address 192.168.123.39 with the system having network hardware address 00:21:5E:C4:E3:76. Network operations on this system may be disrupted as a result.

    Reservation of cluster disk 'Disk R:' has been lost. Please check your system and disk configuration.

    Reservation of cluster disk 'Disk Q:' has been lost. Please check your system and disk configuration.

     

    {Delayed Write Failed} Windows was unable to save all the data for the file . The data has been lost. This error may be caused by a failure of your computer hardware or network connection. Please try to save this file elsewhere.

     

    The system failed to flush data to the transaction log. Corruption may occur.

     

    and other similar errors.
    December 16, 2011 8:28
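The address-conflict messages above can be grouped by the offending hardware address to check whether a single NIC is claiming all three virtual IPs. This is a small sketch, assuming the event text has been exported as plain strings. The regular expression is derived only from the wording quoted in this post, and the role labels come from the poster's own description.

```python
import re
from collections import defaultdict

# Pattern based solely on the message wording quoted in the post.
CONFLICT_RE = re.compile(
    r"address conflict for IP address (?P<ip>\d{1,3}(?:\.\d{1,3}){3}) "
    r"with the system having network hardware address "
    r"(?P<mac>[0-9A-Fa-f]{2}(?::[0-9A-Fa-f]{2}){5})"
)

# Roles as described by the poster.
ROLES = {
    "192.168.123.40": "SQL virtual IP",
    "192.168.123.38": "Windows cluster IP",
    "192.168.123.39": "MSDTC IP",
}


def conflicts_by_mac(messages):
    """Map each conflicting MAC address to the sorted list of IPs it claimed."""
    found = defaultdict(set)
    for msg in messages:
        m = CONFLICT_RE.search(msg)
        if m:
            found[m.group("mac").upper()].add(m.group("ip"))
    return {mac: sorted(ips) for mac, ips in found.items()}


# Messages copied from the post (trailing sentence omitted for brevity).
EVENTS = [
    "The system detected an address conflict for IP address 192.168.123.40 "
    "with the system having network hardware address 00:21:5E:C4:E3:76.",
    "The system detected an address conflict for IP address 192.168.123.38 "
    "with the system having network hardware address 00:21:5E:C4:E3:76.",
    "The node lost communication with cluster node 'NEOFISNODE2' on network 'Private'.",
    "The system detected an address conflict for IP address 192.168.123.39 "
    "with the system having network hardware address 00:21:5E:C4:E3:76.",
]

if __name__ == "__main__":
    for mac, ips in conflicts_by_mac(EVENTS).items():
        print(mac, [(ip, ROLES.get(ip, "unknown")) for ip in ips])
```

If every conflict maps to one MAC address and that address belongs to the other cluster node, the pattern is consistent with both nodes bringing the same IP resources online at once.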
  • Did you check the system clock on all nodes? Do they have the same date and time?
    December 16, 2011 14:28
  • Sorry, I forgot to reply to you, rmiao. The clocks on the nodes all match.
    December 17, 2011 12:57
  • Find out which machine has the NIC with MAC address 00:21:5E:C4:E3:76. Is it one of the cluster nodes?
    December 17, 2011 17:33
  • Yes, it is the MAC address of one of the nodes.
    December 18, 2011 2:07
  • Back to basics: find out why the two nodes lost communication. You can set up Netmon between them.
    December 18, 2011 18:29
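Alongside a packet capture, one low-tech way to pin down when the nodes stopped hearing each other is to record a timestamp for every reply received from the peer and then look for gaps. The sketch below covers only the gap-finding step. How the timestamps are collected is left open, and the function name and the 5-second threshold are illustrative assumptions, not cluster defaults.

```python
from datetime import datetime, timedelta


def find_gaps(reply_times, max_interval=timedelta(seconds=5)):
    """Return (last_seen, next_seen) pairs where the silence exceeded max_interval.

    `reply_times` is any iterable of datetime objects, one per reply received
    from the peer node. The 5-second default is an arbitrary example value.
    """
    ordered = sorted(reply_times)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b - a > max_interval]


if __name__ == "__main__":
    # Hypothetical timestamps, for illustration only.
    start = datetime(2011, 12, 14, 17, 23, 0)
    seen = [start + timedelta(seconds=s) for s in (0, 1, 2, 30, 31)]
    for last_seen, next_seen in find_gaps(seen):
        print("no replies between", last_seen, "and", next_seen)
```

Any gaps found this way can then be lined up against the timestamps of the "lost communication" and address-conflict events on both nodes.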