I have to report a scaling problem when changing instance count in a web role from 1 to 2.
When I do this instance count change my service stops responding for a period of about 5 mins, starting about 30 secs after the change and until the second instance comes online (about 4-5 mins later). After that, although
I have 2 instances online (I can check this through logging) only the 2nd one is answering - like the 1st instance is taken off the load balancer. I've checked that the 2 instances still communicate through internal endpoints, but the 1st is
not receiving external requests.
After that, when I change back to 1 instance, after a while I see both instances answering (for about 30 secs) - the 1st is activated in load balancer again, and then normally the 2nd instance goes offline and the 1st is online again.
If I want 2 instances activated after this change I have to reboot the 1st instance through the management portal to see them both answering. Everything is ok then.
This is a really strange behaviour and I have an offline period about 5-10 minutes and after that only one working instance - although I am charged for 2 and the azure portal reports 2 active instances with status 'ready'. I've noticed this happening last week,
I believe it is a bug of the system and I can say that this was not happenning some time before. I use this schedule - changing from 1 to 2 instances - 2-3 months with no problems, but last period I am experiencing this problem.
Thank you for your attention.