Jenkins and Nexus stability have been quite poor recently. The root cause is a storage device that is beyond its capacity. The EF Releng team has cleared the roadblocks that were blocking the migration to a more stable, robust and scalable storage, and has already migrated 21 of the 240 instances.
This issue is to communicate that the team is aware of the issues, and to communicate progress.
To set expectations: while this is good progress, the amount of low-hanging fruits is limited and will result in (a lot slower) progress towards the end. The storage device will be a huge bottleneck during the migration of larger instances.
11
Frederic Gurrchanged the descriptionCompare with previous version
Not surprisingly, two of the biggest CI instances (JakartaEE-TCK and Glassfish) were clearly identified to cause a lot of load and traffic on our storage server, whenever excessive builds are started. There might be others as well. The plan is to move those two to CephFS ASAP.
The releng team has crossed the 35% mark, and some larger instances have moved over, leading to better stability for all instances.
Please migrate repo.eclipse.org with priority
This is the plan; there are specific challenges to contend with. At this time, stability has much improved already, and the team is not even halfway done.
Thanks for your patience.
Frederic Gurrchanged the descriptionCompare with previous version
They can't and should not need to, since it's a transparent change. In some cases, we ask the project to support clean up tasks before the migration, though.
We're seeing some weird errors in the Eclipse ESCET builds we've not seen before, and we're wondering if it is due to not yet having been moved, or maybe because we were moved and this caused unexpected changes somehow.