Would hardware RAID hit the same problem? The Terramaster
looks like it is hardware RAID.
Hardware RAID for ESP partitions requires the UEFI
firmware to understand how to talk to the hardware RAID
device. If the UEFI doesn't have the right driver, it will
try to talk to the hardware RAID as a raw device in a
different way, and it all goes to pot. [You would think that
server hardware would be built to do this out of the box,
but I have had a couple where hardware RAID was only
supported for non-UEFI boot or required the ESP to be on a
separate drive.]
I encountered a strange problem installing
AlmaLinux 9.1 on a RAID1
(MDADM) with the current configuration:
- /boot/efi on md125
- swap on md126
- / on md127
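
A quick way to confirm a layout like the one above is to read /proc/mdstat. The snippet below is only a sketch: it parses a made-up sample (the block counts, member names, and metadata versions are assumptions, not taken from the reporter's machine) and handles only the common "active raidN" header form. The metadata version matters for the ESP because, as far as I know, a version 1.0 superblock sits at the end of the member device, so the firmware sees what looks like a plain FAT partition.

```python
import re

# Hypothetical /proc/mdstat contents; sizes, member names and metadata
# versions are invented for illustration only.
SAMPLE = """\
Personalities : [raid1]
md125 : active raid1 sdb1[1] sda1[0]
      614336 blocks super 1.0 [2/2] [UU]

md126 : active raid1 sdb2[1] sda2[0]
      8380416 blocks super 1.2 [2/2] [UU]

md127 : active raid1 sdb3[1] sda3[0]
      240000000 blocks super 1.2 [2/2] [UU]

unused devices: <none>
"""

# "md125 : active raid1 sdb1[1] sda1[0]"
HEADER = re.compile(r"^(md\d+) : (\S+) (\S+) (.*)$")
# "      614336 blocks super 1.0 [2/2] [UU]" ("super X" may be absent)
STATUS = re.compile(
    r"^\s+\d+ blocks(?: super (\S+))? .*\[(\d+)/(\d+)\] \[([U_]+)\]"
)


def parse_mdstat(text):
    """Return {array_name: info} for each md array found in the text."""
    arrays = {}
    current = None
    for line in text.splitlines():
        header = HEADER.match(line)
        if header:
            name, state, level, members = header.groups()
            current = {
                "state": state,
                "level": level,
                # Strip the "[N]" role index and any "(F)"/"(S)" suffix.
                "members": sorted(
                    re.sub(r"\[\d+\].*", "", dev) for dev in members.split()
                ),
                "metadata": None,
                "degraded": None,
            }
            arrays[name] = current
            continue
        status = STATUS.match(line)
        if status and current is not None:
            current["metadata"] = status.group(1)  # None if not printed
            # Degraded when fewer devices are active than configured.
            current["degraded"] = status.group(2) != status.group(3)
    return arrays


if __name__ == "__main__":
    for name, info in parse_mdstat(SAMPLE).items():
        print(name, info["level"], info["metadata"], info["members"],
              "degraded" if info["degraded"] else "clean")
```

On a real system, pass the contents of /proc/mdstat instead of SAMPLE, e.g. parse_mdstat(open("/proc/mdstat").read()).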
My limited understanding is that RAID on EFI
has been something of a hack as the backing store
that EFI uses is a slightly modified VFAT. What
happens is that there is some code to 'clone' the
data across but it isn't really RAID1. My guess is
that something in the 9.1 kernel broke that hack.
Could you try the CentOS Stream 9 kernel (you can
install that on your existing Alma or Rocky
system) and see if the problem still occurs? If it
does, then it is a bug that needs to be tracked
upstream at bugzilla.redhat.com,
and if it doesn't, then it should be fixed
in an upcoming kernel. You could then continue to
use the CS9 kernel until a fixed kernel lands in
Alma/Rocky 9.
The disks are 2
MLC-type SSDs.
After the installation, if I reboot the system I
get:
"md: md125 stopped" (it is printed many times like
in a loop)
alternated with:
"systemd-shutdown[1]: Not all MD devices stopped,
1 left
Stopping MD Devices
Stopping MD /dev/md125 (9:125)"
and the system hangs on this loop until I cut the
power.
I encountered this issue on my workstation with an
Asus Prime Z490-A /
i9-10850K.
I tried another workstation that runs on an Asus
Prime Z370-A / i7
8700K, to rule out a bad SATA controller or bad
cables.
The problem also occurs on the second
workstation.
I tried to replicate this using the 9.0 ISO. The
problem does not occur
until I update to 9.1.
I also tried 8.7. No problem there.
I also tried RockyLinux 9.1 and got the same
problem, but with different
messages:
"block device autoconfig is deprecated and will
removed"
alternated with:
"blkdev_get_no_open: 270 callbacks suppressed."
To stop the machine I need to cut the power.
I also tried Debian 11.5 without problems.
So it seems that the problem is 9.1 related. At the moment
I can't test the same
with RHEL 9.1, but the problem will probably occur
there too.
Is there a way to fix this, or should I wait for an
update?