Last 10 posts, sorted by last updated
Order posts by limited to posts

MAINTENANCE Assumed Completed Broadband
AFFECTING
Broadband
STARTED
Nov 19, 09:30 AM (1¾ days ago)
DESCRIPTION

The work outlined below will start from Saturday 23rd November.

Background:

Our FireBrick team has been working on the 'hang' problem that we faced with the LNSs earlier in the year.

The nature of the problem has made investigating the problem very time consuming as it is extremely difficult to reproduce. However, we do believe that a plausible cause has been identified, and code changes have been made to mitigate the problem.

We have been testing this new code, both in our test lab and on a few select A&A routers, for over two months. During this time the new code has not caused the hardware to hang, where older versions of the code did.

Our next step is to run the new code on our LNSs, the ones our customers connect to for their broadband connections.

We plan to do this slowly, out of hours and in a couple of phases.

We believe the cause of the hang is related to how memory is initially allocated for the tasks the FireBrick will be performing, this means that if the hardware is going to hang then this will most likely happen over the first couple of days (or first couple of hours).

Stage one:

We plan to upgrade only one of our LNSs at first. We will move broadband connections on to it in the early hours of the morning and then move them back off a few hours later. This means that during the day, customers will be on the normal set of LNSs.

Then, each night, over the course of two weeks, the LNS will be power cycled and we will move an increasing number of connections over, until it is at the point of taking twice the amount of connections that we'd normally run on an LNS. (We normally run LNSs at around 40% capacity, so twice the number of connections is not a problem.)

Stage two:

Once we have confirmed that the hang is not happening, the second phase would be to run customer connections on the upgraded for a few days at a time.

We will go through a cycle of: move connections off, reboot the LNS, move connections on, wait a few days. Repeat. We will do this with an increasing number of connections until it's at the point of taking a normal amount of connections.

More information and to opt out:

So as to minimise impact to customers, the work of moving connections off and on will happen overnight between 1AM and 5AM.

As mentioned, this phase of upgrading involved only one LNS being upgraded. This will be the one named 'i.gormless'. The connections that will be moved on to 'i.gormless' will be those currently on the LNS named 'h.gormless'. if you are currently on 'h.gormless' (as seen on the top/left) of your line quality graph and want to opt out, then please email support.

Once this phase has been completed, we will review and plan the next stages.


MAINTENANCE Assumed Completed authoritative DNS
AFFECTING
authoritative DNS
STARTED
Jun 03, 10:35 AM (5½ months ago)
DESCRIPTION

This is only relevant to customers who run their own authoritative DNS servers and use our secondary-dns.co.uk as an additional nameserver.

Overview: We run a "secondary" DNS service for customers where they run the master DNS server and we are secondary slaves. We have a project underway that involves migrating all our authoritative DNS services to a new platform. As part of this we are needing to disable some of the automation we do for adding and updating the customer's master IP address automatically.

The change: From June 17th, If you run your own master DNS server for your domain(s) and secondary-dns.co.uk is a slave, if you change the IP address of your master you will need to contact support@aa.net.uk to request us to update our side.

We have more information about our Authoritative DNS project on our Support Site: https://support.aa.net.uk/New_Authoritive_DNS


MINOR Closed VoIP
AFFECTED
VoIP
STARTED
Nov 20, 10:35 AM (19¾ hours ago)
CLOSED
Nov 20, 11:30 AM (19 hours ago)
DESCRIPTION
One of our VoIP carriers is reporting that they are aware of some issues that may be impacting calls – engineers are investigating and more information will follow as soon as possible.
Resolution: One of our carriers had a problem affecting calls - this would have affected some inbound calls to some numbers that we host.

Our carrier reports: Following on from our earlier issues we are now confident that we have isolated the problem but the underlying cause is still being investigated. Our initial findings are, as you can expect, that the failover systems did not meet our expectations in this event and that itself will be taken on as a project by our team to ensure greater resilience going forward.


MINOR Closed General
AFFECTED
General
STARTED
Nov 11, 03:50 PM (9½ days ago)
CLOSED
Nov 11, 04:04 PM (9½ days ago)
DESCRIPTION
We're investigating internet routing problems affecting some services.
Resolution: service restored at around 16:04

MAJOR Closed General Internet Access
AFFECTED
General Internet Access
STARTED
Oct 23, 01:29 PM (28¾ days ago)
CLOSED
Oct 23, 03:00 PM (28½ days ago)
DESCRIPTION
We are aware of problem affecting some services. Investigations are underway.
Resolution: We do apologise to customers affected by the problems this afternoon. Please feel free to email support@aa.net.uk for a 'Reason for Outage'.

MAINTENANCE Completed LNS
AFFECTING
LNS
STARTED
Oct 23, 03:00 AM (29 days ago)
CLOSED
Oct 23, 03:51 AM (29 days ago)
DESCRIPTION
Due to the problems on Tuesday afternoon we have customers unevenly spread across our pool of LNSs. We will move some customers in the early hours of Wednesday morning. These customers will see a drop and reconnect of their service.
Resolution: This work has been completed.

MAJOR Closed Broadband
AFFECTED
Broadband
STARTED
Oct 22, 02:46 PM (29½ days ago)
CLOSED
Oct 22, 03:20 PM (29½ days ago)
DESCRIPTION
We''re investigating disconnections of broadband lines.
Resolution: Three of our eight live LNSs lost power causing customers on those to drop their connection. Many customers reconnected fairly quickly, but due to the number of LNSs out of action normal fail-over was hampered and needed manual intervention to re-route traffic. Further investigations will take place.

MINOR Closed L2TP
AFFECTED
L2TP
STARTED
Oct 16, 03:00 PM (1 month ago)
CLOSED
Oct 16, 04:21 PM (1 month ago)
DESCRIPTION
Some of our L2TP customers have experienced a couple of drops this afternoon. We're investigating the cause.
Resolution: We expect the service to be stable now.

MAINTENANCE Completed Control Pages
AFFECTING
Control Pages
STARTED
Oct 16, 02:00 PM (1 month ago)
CLOSED
Oct 16, 04:18 PM (1 month ago)
DESCRIPTION
We're performing a database migration which will mean the display of broadband usage stats, VoIP Call records (CDRs) and SMS logs will be unavailable on one control pages on the afternoon of 16 October.
Resolution: This work has been reverted.

MINOR Closed Data SIMs
AFFECTED
Data SIMs
STARTED
Oct 11, 02:30 PM (1¼ months ago)
CLOSED
Oct 11, 03:00 PM (1¼ months ago)
DESCRIPTION
Since 14:30 we're seeing packet loss on Data SIMs. We're investigating this with our carrier as we're not seeing loss elsewhere.
Resolution: Our carrier reports that they were experiencing minor service degradation on our mobile data SIM service between 14:30 and 15:00 today. The team has successfully tracked down the source and has completed corrective work on the service.