Rich Freeman on 19 Dec 2016 08:17:12 -0800
|
[Date Prev] [Date Next] [Thread Prev] [Thread Next] [Date Index] [Thread Index]
Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
|
- From: Rich Freeman <r-plug@thefreemanclan.net>
- To: "Philadelphia Linux User's Group Discussion List" <plug@lists.phillylinux.org>
- Subject: Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- Date: Mon, 19 Dec 2016 11:17:07 -0500
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to; bh=aLPze3HTRZHOU60UuIYKgvtumhL65t8iuBrOgCDM24A=; b=onPC+qQt9L6bhHQ5hwxPJaAQTWbi2t5m0BB+RbihfQu4HxTTLvKcYt2vkTd9z+jYSr 2WXzvAXd71e/jH6hlxifGPDCD+oShKlY817GgwcIjspwCesBn53R4zYin2Q0NZMStHt3 iMf7y24pp6XwTIyMch6Zk1V+HmWT0DLMlhULVDQb2Qb2wmY+HzU7D2TQfwzZbnr+mF1A E/tjORDUfZwtbisqb+486LH8Ywu06+nNm1AC4hYs6y3vOcayFR9uYjjyL4tZhbOVojso GXhA6PTTKTfQRibgX7fYjtj4HgBkQEkeB3B9qiM1bawvJqdlFXmNUX9jJzesvFEtBsH+ +LwQ==
- Reply-to: Philadelphia Linux User's Group Discussion List <plug@lists.phillylinux.org>
- Sender: "plug" <plug-bounces@lists.phillylinux.org>
On Mon, Dec 19, 2016 at 11:11 AM, Doug Stewart <zamoose@gmail.com> wrote:
> The problem with data is that, even at the fattest pipe speeds, the fastest
> transit method is still overnighting HDDs via FedEx. We used to get DNA
> sequences from Tufts, Johns Hopkins, etc. via this method when I was at
> CHOP. Transfer time via Internet2 connections: ~1 month. Via FedEx: 2 days.
>
How long ago was that? A human genome is only 4gigabases, with 2 bits
per base (before compression). Granted, I hear some plants are just
insane but a lot of that is duplicative.
1GB isn't THAT much data to transfer, and that is before compression.
Now, if it is all stored as ASCII files with 1 character per base and
maybe 10-20% overhead with things like line numbers and such then I
could see it expanding, but that is still only a 4-5x expansion in
size.
So, maybe a human genome that is 10-20x oversampled (you're sending
raw contigs and not the assembled result) and poorly encoded you're
talking about a day of downloading.
Unless you're talking about 1998 and your network admin doesn't want
you using more than 20kb/s of bandwidth...
--
Rich
___________________________________________________________________________
Philadelphia Linux Users Group -- http://www.phillylinux.org
Announcements - http://lists.phillylinux.org/mailman/listinfo/plug-announce
General Discussion -- http://lists.phillylinux.org/mailman/listinfo/plug
- References:
- [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: Rich Kulawiec <rsk@gsp.org>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: Rich Freeman <r-plug@thefreemanclan.net>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: LeRoy <ldc@lrcressy.com>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: "Eric H. Johnson" <ejohnson@camalytics.com>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: Alex Ruijie Fang <frjalex@temple.edu>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: Rich Freeman <r-plug@thefreemanclan.net>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: Paul Walker <pjwalker76@gmail.com>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: "Eric H. Johnson" <ejohnson@camalytics.com>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: "Keith C. Perry" <kperry@daotechnologies.com>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: Rich Freeman <r-plug@thefreemanclan.net>
- Re: [PLUG] Wanted: volunteers with bandwidth, storage, coding skills to help save climate data
- From: Doug Stewart <zamoose@gmail.com>