[Federated-fs] Conf call 6/5/2008
Paul Lemahieu
LeMahieu_Paul at emc.com
Wed Jun 11 10:40:53 PDT 2008
As for why, it's to have a global namespace that spans heterogeneous
file servers, and have a common configuration/management for that. A
Linux box or AIX or a NAS vendor could all be exposing the same /fedfs/
home, and handing out referrals to users.
--Paul
On 2008-Jun-11, at 10:23, Everhart, Craig wrote:
> OK: why? Also, what do we do with the hard questions (synchronizing
> multiple data sources, doing permissions correctly)?
>
>
> ________________________________
>
> From: LeMahieu, Paul [mailto:LeMahieu_Paul at emc.com]
> Sent: Wednesday, June 11, 2008 1:19 PM
> To: Everhart, Craig; Robert Thurlow
> Cc: Ellard, Daniel; federated-fs at sdsc.edu
> Subject: Re: [Federated-fs] Conf call 6/5/2008
>
>
> Yes, it's independent of anything with DNS. When I say "top-of-
> tree", I'm talking about storing all the configuration of the
> namespace (tens of thousands of junction points and their paths in
> the logical namespace). For example, if there is /fedfs/home/bob,
> there is an entry mapping /fedfs/home/bob to bob's share on some
> physical file server. We'd be storing a pseudo file system
> representing the top-of tree in the NSDB.
>
> --Paul
>
>
> On 08/6/10 20:11, "Everhart, Craig" <Craig.Everhart at netapp.com>
> wrote:
>
>
>
> Is this "top of tree" thought independent of the DNS-based
> lookup? Why
> wouldn't that simply *be* the top level, leading to, as you say,
> real
> file systems? Is this an intermediate level?
>
> I can read all your text substituting "DNS" for "NSDB" and get a
> working
> (and specified and nearly existing) result. What am I missing?
> Do I
> need to invent yet another replicated global service?
>
> Craig
>
> > -----Original Message-----
> > From: Paul Lemahieu [mailto:LeMahieu_Paul at emc.com]
> > Sent: Tuesday, June 10, 2008 7:53 PM
> > To: Robert Thurlow
> > Cc: Everhart, Craig; federated-fs at sdsc.edu; Ellard, Daniel
> > Subject: Re: [Federated-fs] Conf call 6/5/2008
> >
> > Robert,
> >
> > This is very much the motivation. It's a couple of key things:
> >
> > * A standard to facilitate the administration of a
> > federated global namespace
> > * The ability to federate different file servers so they
> > all expose the same global namespace
> >
> > A few things about how I would see this:
> >
> > * I always thought of this as a "top-of-tree" namespace.
> > In other words, at some point the global namespace ends and
> > you hit real file systems. and the global namespace ends.
> > Perhaps it is possible to manage junction points at arbitrary
> > locations in the global tree, I just hadn't really considered it.
> > * Changes to the global namespace are made by
> > administrators, and made via the NSDB. The global namespace
> > is not frequently changing.
> > Participating file servers reflect those changes later in a
> > loosely- coupled manner.
> > * This does not invalidate the existing federated-fs
> > work. It would be essentially an additional database in the
> > NSDB, mapping paths to FSNs.
> > * It is assumed that the NSDB takes care of replicating this
> data.
> >
> > The big difference from what you describe below and my view
> > is whether we have an NSDB that reflects the namespace
> > created on the file server (the query model you describe
> > below), or whether the NSDB is authoritative for the
> > namespace and the file servers reflect that namespace
> > configuration (my description, where the NSDB is authoritative).
> >
> > --Paul
> >
> > On 2008-Jun-09, at 15:36, Robert Thurlow wrote:
> > > Everhart, Craig wrote:
> > >> I *totally* agree with Dan's bias on this. It's a surprise
> to me
> > >> that others thought that there was a "namespace" that existed
> > >> independently of the file systems that make it up.
> > >>
> > >> What is the relationship between path data that exists both
> in a
> > >> fileset (in a file server--Dan's #1 case) and in the NSDB
> > and (as in
> > >> Paul's addendum to Dan's #3) all the instances of the parent
> path
> > >> data that are subsidiary filesets? Is there some
> > authoritative copy
> > >> with the others just hints? What is the replication protocol
> by
> > >> which path data is propagated? How consistent does it have
> to be?
> > >>
> > >> If we were stumbling thinking about the constraints on fileset
> > >> replication and consistency, why would this be a simple answer?
> > >> Right
> > >> now it's an unmotivated, underspecified, and unconstrained
> > additional
> > >> criterion for any implementation. (And them's its good
> > points...:-})
> > >>
> > >> If others (Paul? Renu?) feel like this needs to be changed,
> could
> > >> they do it with a more fleshed-out proposal?
> > >
> > > I don't want to keep a copy of all of the mapping data (with or
> > > without authority) in the NSDB, either. But I do wonder if the
> > > motivation is something like this:
> > >
> > > I want an admin application that can show me the whole
> > namespace. I
> > > want to be able to browse it like a Google map, going up/down,
> > > left/right in the global directory tree. I'd like to see
> > which nodes
> > > are junctions, and to be able to click on them and see the
> details
> > > about their replicas. In the fullness of time, I'd like to
> > be able to
> > > click on a point in the namespace and select an option to make
> that
> > > point a separate, replicated filesystem, and to adjust the
> > > characteristics of existing filesystems.
> > >
> > > Now, if all we have is a way to drill down from the top of the
> > > namespace for any particular path, this could be bloody
> > hard. Having
> > > more information would help. An alternate question is,
> > where could we
> > > get more info?
> > >
> > > One suggestion is that the protocol permit a query to an NFS
> server
> > > participating in the namespace, to list the junctions in a
> > particular
> > > filesystem. If we could start at the top, find the Nth level
> > > filesystems and ask them what junctions point outside of the
> > > filesystem, we could enumerate the namespace far quicker. We
> would
> > > still have to send packets all over the place - "find the
> root" DNS
> > > queries, NSDB lookups, NFS accesses and whatever we use to
> > enumerate
> > > junctions in a filesystem - but I have always expected that.
> > >
> > > Does this shed light or kick up dust? :-)
> > >
> > > Rob T
> > >
> >
> >
>
>
>
>
More information about the Federated-fs
mailing list