
Re: md(adm) ... Re: Next meeting July 26th 2020, Tomorrow!



Yeah, I'm running late.  

I have achieved caffeination and will be connecting soon.  

It will take me some time to get the system up before I 
can start on the md-raid.  

Thomas

On Sun, Jul 26, 2020 at 3:33 AM Michael Paoli <Michael.Paoli@cal.berkeley.edu> wrote:
> From: "tom r lopes" <tomrlopes@gmail.com>
> Subject: Next meeting July 26th 2020, Tomorrow!
> Date: Sat, 25 Jul 2020 14:00:49 -0700

> 4th Sunday virtual meeting 11 am
>
> meet.jit.si/berkeleylug
>
> (no typo this time :-)
>
> I'm hoping to work on a file server running on a sbc.
> Plan was to work on this last week for the PI meeting but
> I couldn't find the SATA hat for my NanoPi.  Now I have it.
> So I will install Armbian and add two 1TB and combine them
> in md-raid.
>
> Hope to see you there,
>
> Thomas

Let me know if you need any md(adm) assistance.
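For the two-drive setup quoted above, the basic shape would be roughly
this (device names hypothetical):

  # two-member raid1 from the two 1TB drives, then a filesystem atop
  mdadm --create /dev/md0 --level=1 --raid-devices=2 /dev/sda1 /dev/sdb1
  mkfs.ext4 /dev/md0
  mdadm --detail --scan >> /etc/mdadm/mdadm.conf    # so it assembles at boot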

I quite recently had need/occasion to snag a copy of (the very top bit):
file (large, about 5GiB) on
filesystem on
LVM LV on VG on PV on
partition on
VM raw format disk image file on
filesystem on
md raid1 on
(pair of) LVM LV on (each their own) VG on PV on
(pair of) partitions (one each) on
2 physical drives on
physical host
and without network (only virtual console) access to the VM.

The topmost bit being a file on a filesystem within a Virtual Machine (VM),
where that VM's drive storage was the aforementioned VM raw format disk
image file, and I needed to snag a copy of that topmost referenced (and
large - ~5GiB) file from within the VM - with no network (only virtual
serial console) access to the VM.  And, "of course", to make it more
interesting, it had to be consistent/recoverable, and conflict with neither
the ongoing use of the VM nor the physical host, and all while the VM
and physical host remained up and running.

So, among other bits, to do that, I took an LV snapshot of the lowest
level LV, which gave a point-in-time snapshot of one of the two md raid1
constituent member devices under the lowest raid1 shown in that stack.
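Roughly like so (all VG/LV names here are hypothetical, purely for
illustration):

  # point-in-time snapshot of the LV backing one raid1 member;
  # size the snapshot's COW space per expected write activity on the origin
  lvcreate --snapshot --name member_snap --size 10G /dev/vg_host0/lv_md_member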
"Of course" that immediately has UUID conflict potential - so wiped that
metadata to eliminate that hazard, then to be able to make use of the
data, took that snapshot, and turned it into an md raid1 device - being
careful to use the same metadata format - notably so it would be same
size of earlier metadata and not stomp on any data that would be within
the md device at the md device level.  Also, to make it the same(ish),
and not complain about missing device, created it as md raid1 ... but
with single member device and configured for just one device.  Once that
was done, had recoverable (point-in-time snapshot from live) filesystem.
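Along these lines (the metadata version shown is just a placeholder - use
whatever the original array used, so the superblock and data offsets line
up and nothing inside the member gets overwritten):

  # wipe the copied md superblock so its UUID can't collide with the original
  mdadm --zero-superblock /dev/vg_host0/member_snap
  # wrap it in a new single-member raid1, matching the original metadata version
  mdadm --create /dev/md100 --level=1 --raid-devices=1 --force \
        --metadata=1.2 /dev/vg_host0/member_snap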
Again to thwart potential conflicts, I changed the UUID of that filesystem,
then mounted it nosuid,nodev.  It needed to be mounted rw, due to some
bits further up the chain needing a teensy bit 'o write to metadata.
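E.g., assuming an ext-family filesystem (if the journal needs recovery,
an e2fsck -f pass may be wanted before the UUID change):

  # fresh UUID so it can't collide with the still-in-use original
  tune2fs -U random /dev/md100
  mkdir -p /mnt/outer
  mount -o rw,nosuid,nodev /dev/md100 /mnt/outer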
Then once that was mounted, losetup and partx -a to get to the
applicable partition within the VM's disk image file on that filesystem.
Was then able to bring (activate) the VG from that PV onto the physical
host (were the UUID and/or VG name conflicting with any on the physical
host, there would've been some other steps needed too).
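Something like this (image path and guest VG name hypothetical;
vgimportclone being the tool had names/UUIDs collided):

  # expose the raw disk image as a block device, plus its partitions
  losetup -f --show /mnt/outer/images/vm-disk.img    # prints e.g. /dev/loop0
  partx -a /dev/loop0
  # find and activate the guest's VG from that PV
  pvscan
  vgchange -ay vg_guest0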
From there, mounted the filesystem provided by that LV ro(,nosuid,nodev)
(but with the device under it again rw - needed, as the filesystem state
was recoverable but not clean).  Was then able to access and copy the
desired file from that filesystem - now seen, via snapshot and some
metadata mucking about, on the physical host, whereas before it was
effectively only accessible on the VM - and all that with the VM and
physical host still up and running throughout.
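Final stretch, names again hypothetical (an ro ext mount will still replay
the journal, hence the writable device underneath):

  mkdir -p /mnt/inner
  mount -o ro,nosuid,nodev /dev/vg_guest0/lv_root /mnt/inner
  cp -p /mnt/inner/path/to/the-large-file /var/tmp/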

Yeah, I didn't design it like that.  That's the way some particular
vendor's "appliance" devices structure things and manage their VMs on
the device.

Had another occasion some while back, to fix rather a mess on quite the
same type of device.  There were two physical hard drives ... lots of RAID-1.
So far so good.  But, no backups ("oops").  And, one of the two hard
drives had failed long ago ("oops"), and not been replaced ("oops").
And now the one hard drive that wasn't totally dead was giving
hard errors - notably unrecoverable read errors on a particular sector
... uh oh.

Well, the vendor and their support, and the appliance were too
stupid(/smart?) to be able to fix/recover that mess.  But I didn't give
up so easily.  I drilled all the way down to isolate exactly
where the failed sector was, and exactly what it was/wasn't being used
by.  Turned out it wasn't holding any data proper, but just
recoverable/rewritable metadata - or allocated but not used data.
So, I did an operation to rewrite that wee bit 'o data.
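In rough outline, something like this (device and sector number
hypothetical, 512-byte logical sectors assumed; hdparm --read-sector /
--write-sector can do much the same per-sector):

  # confirm exactly which LBA won't read
  dd if=/dev/sdb of=/dev/null bs=512 skip=1234567 count=1 iflag=direct
  # once sure nothing irreplaceable lives there, rewrite just that sector;
  # on the write the drive remaps it from its spare pool
  dd if=/dev/zero of=/dev/sdb bs=512 seek=1234567 count=1 oflag=direct conv=fsync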
The drive, being "smart enough" - since that was an unrecoverable read
sector but it then got a write operation - automagically remapped the
sector and wrote it out.  At that point the drive was operational (enough)
again - could read the entire drive with no read errors - and was then able
to successfully mirror to a good replacement for the other failed drive
(before that, all such attempts had failed, notably due to the hard read
error).  Anyway, successfully and fully recovered what the vendor's
appliance and the vendor's support could not recover, where they were
saying it would have to be reinstalled from scratch.  Oh, and also,
after the successful remirroring, got the drive that was having the
sector hard read error replaced too, then remirrored onto that
replacement drive, thus ending up fully recovered onto two newly replaced
good drives.  Not the first time I've recovered RAID-1 where the problems
were only discovered when the 2nd drive started failing, after the 1st
drive had long since totally died and not been replaced.  "Of course" it's
highly preferable to not get into such situations ... have good (and
validated) backups, and replace failed drives in redundant arrays as soon
as feasible - especially before things start to hard fail without
redundancy.
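The remirroring itself being the easy part (hypothetical device names;
partition the replacement to match first):

  # copy the partition layout from the surviving drive to the replacement
  sfdisk -d /dev/sda | sfdisk /dev/sdb
  # add the new member to each degraded raid1 and let it resync
  mdadm /dev/md0 --add /dev/sdb1
  cat /proc/mdstat    # watch the rebuild progress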

--
You received this message because you are subscribed to the Google Groups "BerkeleyLUG" group.
To unsubscribe from this group and stop receiving emails from it, send an email to berkeleylug+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/berkeleylug/20200726033304.13885iwu7mlx9cdc%40webmail.rawbw.com.
