Re: [thredds] nco as a web service

To: thredds@xxxxxxxxxxxxxxxx
Subject: Re: [thredds] nco as a web service
From: Robert Casey <rob@xxxxxxxxxxxxxxxxxxx>
Date: Mon, 2 Jul 2012 09:25:52 -0700

Hope a seismology-side perspective can help in this interesting thread (don't mean to keep perpetuating this, but....)

IRIS DMC is using both GET and POST style web services, applying each approach as is suitable to the use case. For many of the functions that require just some level of parameterization and flags, we use GET-style URLs. For simple transformation workflows with a predefined order of precedence, meaning you just fill in the params, GET-style also works well. ( see http://www.iris.edu/ws/timeseries )

However, we had one use case early on that a colleague identified as very important: the ability to have multi-row data request specifications that could consist of hundreds or thousands of lines. In seismology, we tend to ask for data from specific sensors, each with different time ranges specific to earthquake signal arrivals. Using a GET-style URL simply would not be feasible, so we go with a POST style text submission using a simple columnar format. This allows users to access gigabytes of data at a time. ( see http://www.iris.edu/ws/bulkdataselect ).
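
A multi-row request of this kind can be sketched in Python as below. This is only an illustration: the endpoint path, the column order (network, station, location, channel, start, end) and the sample rows are assumptions made for the example and are not taken from the bulkdataselect documentation. The request is built but not sent.

```python
from urllib.request import Request

# Hypothetical endpoint and column layout, for illustration only.
SERVICE_URL = "http://www.iris.edu/ws/bulkdataselect/query"


def build_bulk_request(rows):
    """Return a POST Request whose body holds one space-separated row per selection."""
    lines = [" ".join(row) for row in rows]
    body = ("\n".join(lines) + "\n").encode("ascii")
    return Request(
        SERVICE_URL,
        data=body,
        headers={"Content-Type": "text/plain"},
        method="POST",
    )


# Each row names one sensor channel and its own time window (made-up values).
rows = [
    ("IU", "ANMO", "00", "BHZ", "2010-02-27T06:30:00", "2010-02-27T10:30:00"),
    ("IU", "COLA", "00", "BHZ", "2010-02-27T06:35:00", "2010-02-27T10:35:00"),
]
request = build_bulk_request(rows)
# urllib.request.urlopen(request) would submit the request; it is omitted here.
```

The body grows by one line per selection, so thousands of rows remain a single request, which is the property a GET URL cannot offer.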

On the question of client accessibility for POST-style service requests, you can easily use custom built code, but you can also use generic HTTP utilities such as curl and wget. We have a help page that demonstrates this and it's very simple. ( http://www.iris.edu/ws/doc/bulkdataselect_help.htm ).

Finally, you can enable powerful use of GET and POST services for your target audience by building sample utilities and libraries for different languages. This saves the developer from having to code to the protocol; they simply code to the function or API. ( http://www.iris.edu/ws/wsclients/ ).

The conclusion we reached here at the DMC is to start with the most common use cases, figure out standard, consistent naming for service paths, parameters, and service returns, and start building. Though most of our services align to data access at this stage, and workflows are client-driven (using cache tag IDs for handoff to the next service), I think you'll find that GET and POST style calls to web services have their place when it comes to server-side processing and both approaches are quite accessible to the general scientist.


        -Rob



On Jul 2, 2012, at 9:11 AM, Dennis Heimbigner wrote:

Since there is not a lot of traffic on the thredds
mailing list, I see no reason to move the discussion.

IMO DAP2 queries are not usable for specifying server-side operations.
There is too much missing.
DAP4 is a possibility although I am not sanguine on the prospects
given the way it is evolving.
John Caron's CDMremote seems like a better candidate.
Visad also would be promising for its data model, but
as far as I know, there is no remote access protocol
using visad.

=Dennis Heimbigner
Unidata


Doug Lindholm wrote:
Hi,
If people are interested in reviving the OPeNDAP server-side functions working group, I'd help out. If nothing else, could we revive that mailing list instead of perpetuating this "nco" thread on the thredds list?

In addition to the KISS principle, I'm also a big fan of standards. OPeNDAP (with DAP2) has already cracked that nut, so that is where I would prefer to start. I think there is a lot of low hanging fruit (the first 20% of the 80%) that would be trivial for service providers to implement once we have a standard syntax. I believe in an evolutionary design approach based on use cases. That approach may lead to less stable APIs in the beginning, but it tends to be much more fruitful and usable than "big design up front". It sounds like many of us already have an API that "works", and plenty of use cases, so I think we could make a lot of progress evolving some common APIs.
Doug
On 7/2/12 8:25 AM, Roland Schweitzer wrote:
Hi,

I've been following the conversation...  A couple of comments in
general, then to a couple specific to this message.

Years ago at the OPeNDAP developers meeting I made a plea for the
community to help define a syntax for server-side functions.  We formed
a working group and had essentially this entire conversation
(application specific syntax vs a functional language, GET vs POST,
synchronous vs asynchronous, and so on).  We even wrote the conversation
down in the Wiki
(http://docs.opendap.org/index.php/Server-side_Functions).  The date on
the document is 2007.  In the end, we couldn't agree on the right
approach, got tired and stopped working on the problem.  I don't know
what lesson is to be learned from that experience except that I'm
probably not the right person to lead the effort to form a consensus on
the right approach.

Our product F-TDS will always allow transformations to be defined using
Ferret syntax.  However, if there is a consensus on a functional
language, I would be thrilled to implement it for F-TDS.

As for Ben's comments on forming the URL, the idea when we built F-TDS
was that an ordinary Ferret user would be able to key in simple
transformations in their desktop clients.  Instead of opening a data
set, they could open the data set with a new variable defined that
was a transformation of existing variables.  However, the reality of
stuffing the Ferret syntax into the URL is ugly and complicated.  The
Ferret scripting language wasn't designed for transmission on the URL,
so all kinds of syntax that is significant to Ferret is also significant
to HTTP clients.  Therefore things have to be very carefully encoded,
and even then we had to make some extensions to the Ferret syntax to
make it work.  So a functional language that was part of the DAP spec
and URL safe would be a big win.  Some folks in our group think it's
still viable for folks to type Ferret syntax into URLs, but I don't.

However, our web application (LAS) uses the F-TDS syntax to do all of
the transformations users request from the Web UI (average, sum, min,
max for now) and to automatically request that a variable be re-gridded
when there is a request to compute the difference between two
variables.  All the fussy preparation of the URL is handled by software
and this has been a big win for us.  LAS is faster and more capable
because of this.  To make sure this works universally, LAS will "wrap"
a remote data source in a local F-TDS URL so it can make the same
transformation requests of remote data, albeit without the significant
performance win of doing the transformation local to the data.  So, if
we develop a common functional language we would jump on that straight
away -- both to implement the functions it defines in F-TDS and to allow
LAS to make requests from remote data with the language.

Roland

On Mon, Jul 2, 2012 at 8:17 AM, Ben Domenico <bendomenico@xxxxxxxxx
<mailto:bendomenico@xxxxxxxxx>> wrote:

   Hi all,

   Just a quick note to emphasize a "use case" that I am especially
   interested in.  That is the case where an end user wants to invoke a
   server side process from within an html document.   Being able to
   specify the process in a URL makes this possible.
   On the other hand, having the user construct the URL by hand is not
   practical.   Roy's approach allows the user to set up the process
   interactively using a browser-based client and then offers the
   resulting URL for the user to embed in a document.
   From the user viewpoint, this combination is very powerful, but
   I'm not sure how much it limits the complexity of the process that
   can be specified.

   -- Ben


   On Sun, Jul 1, 2012 at 11:55 PM, Tom Kunicki <tkunicki@xxxxxxxx
   <mailto:tkunicki@xxxxxxxx>> wrote:


       On (C) I definitely concur.  I am not against simplicity and
       HTTP GET requests.  I just want to make sure that the approach
       is discussed and that one doesn't fall into the trap of
       believing HTTP GET is a panacea of simplicity.  These URLs that
       have been posted are pretty complex and aren't the kinds of
       things that anyone but expert users will be crafting by hand.
       There will be a client implementation in front of them and
       they will need to be updated if the server processing API behind
       them changes.  In this case, the client implementation will have
       to change in tandem with the server side processing API.  This
       will be true regardless of whether the request is GET, POST,
       PUT, etc.  One benefit of GET is an embeddable link; to my
       knowledge this isn't easily done with POST or PUT.

       Our group uses WPS.  We ran into holes in some implementations
       and in the specification, so we made a choice to join the
       WPS 2.0 SWG.
       There are advantages to the WPS specification.  Implementations
       can list a set of supported operations and processes using the
       GetCapabilities request (a GET or POST, we use GET).  Each
       process can be queried for its API including supported inputs
       and outputs (name, mime-type and schema if xml) using a
       DescribeProcess request (GET or POST, we use GET).  If you know
       the arguments and types you can parse the DescribeProcess
       response and automatically generate a UI.  We have implemented
       this in JavaScript for our Web-based brokering services.  There
       are python clients as well as an Arc plugin in-progress
       (completed?) by ESRI and 52n, also a QGIS plugin among others.
       Processes can be executed with an Execute request (a GET or
       POST request, we use POST).  POST for us because we deal with
       some pretty complex inputs (WFS calls with server side geometry
       filtering by reference to a GET or POST request; or Base64
       encoded shapefiles sent in-line).  These would bump us into some
       URL length restrictions we have dealt with in the past.  We don't
       have to use these complex inputs but since WPS offers this
       flexibility we are happy to leverage it.  When we execute
       processes we have the options to execute them synchronously or
       asynchronously (and an implementation can control these options
       by advertising them per process).  We can query the executing
       process for its completion state (POST, don't know if GET is
       possible as I haven't looked into it).  We can request
       execution results in-line with the response or by reference.
       We provide inputs to WPS calls as the results of other WPS
       calls.  WPS processing implementations can be complex or simple.
       Given our use cases we made an architectural decision to
       leverage some of the more advanced components of the
       specification.  We've developed some complex processing that
       does some cool and useful things that we are able to leverage in
       other projects and share with other groups.  With our processing
       endpoints we can add a process and have it automatically be
       displayed in our UIs.  One of the benefits of WPS was that
       processing end-points became self-documenting.
       Now, the WPS execute by GET is pretty tricky as it requires
       double URL encoding.  We are happy using POST and didn't delve
       too much into GET.  If there was a need and someone wanted to
       look at this with me (ahem, Roy?) I would be more than happy to
       submit some change requests to simplify the specification for
       some use cases.  In my experience with the OGC standards almost
       everything can be done with GET; it's when you get into the
       outlying use cases that you have to represent your requests
       with POST.
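
Why a nested reference ends up encoded twice can be shown with a small Python sketch. It does not reproduce the WPS Execute encoding rules; the inner WFS URL is a made-up placeholder, and the point is only that a URL carried inside another URL's parameter value must survive two rounds of decoding.

```python
from urllib.parse import quote, unquote

# Hypothetical URL that would be passed by reference as a process input.
inner_url = "http://example.org/wfs?service=WFS&request=GetFeature&typeName=demo:layer"

# First pass: make the inner URL safe to embed as a single value.
once = quote(inner_url, safe="")

# Second pass: the value is embedded again inside the outer query string,
# so every '%' from the first pass becomes '%25'.
twice = quote(once, safe="")

# The receiving side has to undo both passes to recover the original URL.
recovered = unquote(unquote(twice))
```

With a POST body the inner reference can sit in the payload as-is, which avoids this bookkeeping.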
       WPS is an OGC specification.  I think the last 2 words of the
       previous sentence instantly turn people off.  But there's some
       real value to the work that's been done.  We've used it as a
       thin wrapper on process execution.  Our initial cut at
       processing involved using simple GET-based services.  We found
       we had to generate a whole suite of utility/supporting GET-based
       services relying on clients to perform operations with correct
       ordering.  The architecture was becoming difficult to maintain
       and document.  A large number of tasks have now been implemented
       with the OGC standards suite and available standards
       implementations.  This has saved our group a lot of development
       time and in turn taxpayer dollars.

       Tom Kunicki
       Center for Integrated Data Analytics
       U.S. Geological Survey
       8505 Research Way
       Middleton, WI  53562



       On Jul 1, 2012, at 11:34 PM, Gerry Creager wrote:

        > Roy,
        >
        > That's a good explanation, and one I can live with.  However,
        > I also agree with Jeff's later comments, that A) in general, the
        > same interpreter can handle GET and POST, and B) file uploads
        > can't happen with a GET.
        >
        > And, most important: C) KISS is a good mantra.
        >
        > I'll sit back and listen to the debate again.
        >
        > gerry
        >
        > On Sun, Jul 1, 2012 at 3:13 PM, Roy Mendelssohn
        > <roy.mendelssohn@xxxxxxxx> wrote:
        > BTW - a discussion we have been having around these parts is
        > can you do enough in the way of server-side functions without a
        > POST (ie the URL defines the function).  That is why I would
        > like to hear more from people who are running F-TDS and GDS -
        > how many requests do they get for server side functions, what is
        > the usual response time and download for these requests, how
        > large are the usual expressions?  And then contrast it with a
        > WPS or WCPS approach.  I clearly believe in one approach, but
        > I would welcome people who are using some of these other
        > approaches to describe what they have done, the benefits of
        > doing things that way, and what it means for a client.
        >
        > Thanks,
        >
        > -Roy
        >
        > On Jul 1, 2012, at 11:25 AM, Dennis Heimbigner wrote:
        >
        > > Roy-
        > >
        > > > ... One comment.  I think you misunderstood my point about
        > > > Matlab and R.  I am not interested in Matlab specific
        > > > implementations.  The point was because the URL completely
        > > > defines the request, I can implement scripts in any application
        > > > that can send an URL and receive a file in terms of functions
        > > > built-in to that application - that is my clients do not break as
        > > > the application or operating system change.
        > >
        > > Not quite sure I understand. This phrase "...receive a file in
        > > terms of functions built-in to that application" sounds
        > > like you are creating an association between functions defined
        > > on the client side and functions defined on the server side.
        > > Can you elaborate?
        > >
        > > > Why I strongly prefer, if it is at all reasonable, services that
        > > > only use GET, not POST.
        > >
        > > Again, that is only possible if you keep your requests
        > > short enough to not violate the URL length restrictions.
        > >
        > > =Dennis Heimbigner
        > > Unidata
        > >
        > >
        > >
        > > Roy Mendelssohn wrote:
        > >> Hi Dennis:
        > >> Thanks.  One comment.  I think you misunderstood my point
        > >> about Matlab and R.  I am not interested in Matlab specific
        > >> implementations.  The point was because the URL completely
        > >> defines the request, I can implement scripts in any application
        > >> that can send an URL and receive a file in terms of functions
        > >> built-in to that application - that is my clients do not break
        > >> as the application or operating system change.
        > >> While I understand why this occurred, a few years ago we
        > >> had straight OPeNDAP implementations.  We had a lot of users
        > >> using scripts we developed for Matlab, running under Windows.
        > >> Due to updates in both Windows and Matlab, the OPeNDAP files
        > >> for Windows stopped working (at least for Matlab).  We had a lot
        > >> of users that were left stranded and stranded for quite a long
        > >> time.  Developing and maintaining clients, particularly clients
        > >> that are working within an application for which you have to
        > >> write code, very quickly becomes a non-trivial exercise.
        > >> Since we switched to a service where the URL completely
        > >> defines the request, our Matlab and R scripts have survived
        > >> quite nicely any number of updates both to the applications
        > >> themselves and to the operating systems.  That is because the
        > >> clients now only use functions built into the applications.
        > >> Why I strongly prefer, if it is at all reasonable,
        > >> services that only use GET, not POST.
        > >> -Roy
        > >> On Jun 28, 2012, at 1:03 PM, Dennis Heimbigner wrote:
        > >>>> I am old and slow, but suppose I am in OPeNDAP, are you proposing
        > >>>> to separate say constraint expressions and server-side function
        > >>>> requests basically the same (ie I just scan what is after each
        > >>>> comma) or do you propose some method that signifies in the URL
        > >>>> that what follows is an expression?  In F-TDS and GDS the form of
        > >>>> the URL is:
        > >>> First, I am proposing to subsume DAP constraints.
        > >>> Second, I am proposing, like DAP, to put the expressions
        > >>> in the query part of the URL (i.e. after the '?').
        > >>>
        > >>>> http://machine:port/thredds/dodsC/dataset_expr_{dataset2,dataset3,...}{expression1;expression2;...}.URLsuffix?constraint
        > >>> So, I would rewrite this as something more-or-less like this:
        > >>> http://machine.../dataset?expression1,expression2,...
        > >>> Where the expressions would include the references to dataset2, dataset3,
        > >>> and the constraint.
        > >>>
        > >>>> BTW, the reason I have asked about the experience of people who
        > >>>> are using F-TDS and GDS on whether synchronous requests can cover
        > >>>> the large majority of cases, is because I am very partial to
        > >>>> systems where the URL completely defines the request, and hence
        > >>>> essentially use GET as the verb.
        > >>> The synchronous/asynchronous issue is, for me, a separable issue.
        > >>> I should note that GET has a limit on the size of URLs, so
        > >>> there needs to be ways to deal with that. Two possibilities are
        > >>> 1) use POST or PUT, or 2) provide a way to upload a long expression
        > >>> in parts using multiple GETs.
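
The first of the two possibilities above (fall back to POST when the URL gets too long) can be sketched in Python. The 2000-character threshold, the example.org endpoint, and the idea that the server would accept the same expression list in a POST body are all assumptions made for this illustration; real limits vary by client, proxy and server.

```python
from urllib.parse import quote
from urllib.request import Request

# Assumed limit, for illustration only.
MAX_URL_LENGTH = 2000


def make_request(base_url, expressions):
    """Use GET when the full URL fits under the limit, otherwise POST the query."""
    query = quote(",".join(expressions), safe=",()[]:")
    url = base_url + "?" + query
    if len(url) <= MAX_URL_LENGTH:
        return Request(url, method="GET")
    # Hypothetical fallback: send the same expression list as the request body.
    return Request(base_url, data=query.encode("ascii"), method="POST")


short_request = make_request("http://example.org/dataset", ["a"])
long_request = make_request("http://example.org/dataset", ["x" * 3000])
```

Only the short request stays an embeddable link; the long one trades that property for unlimited expression size.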
        > >>>
        > >>>> The reason for this is long
        > >>>> experience, where client code has broken with changes in
        > >>>> operating system and/or application, fixes were slow in coming,
        > >>>> so many users were left with nothing working.  In a system where
        > >>>> the URL completely defined the request, say ERDDAP, in Matlab:
        > >>>>
        > >>>>>> link='http://coastwatch.pfeg.noaa.gov/erddap/griddap/erdBAsstamday.mat?sst[(2010-01-16T12:00:00Z):1:(2010-01-16T12:00:00Z)][(0.0):1:(0.0)][(30):1:(50.0)][(220):1:(240.0)]';
        > >>>>>> F=urlwrite(link,'cwatch.mat');
        > >>>>
        > >>>> Will get the related file, and the entire command is in Matlab,
        > >>>> no extra code required.  The same in R is:
        > >>>>
        > >>>>>> download.file(url="http://coastwatch.pfeg.noaa.gov/erddap/griddap/erdBAsstamday.nc?sst[(2010-01-16T12:00:00Z):1:(2010-01-16T12:00:00Z)][(0.0):1:(0.0)][(30):1:(50.0)][(220):1:(240.0)]",
        > >>>>>>        destfile="AGssta.nc",mode='wb')
        > >>>>
        > >>>> again, "download.file" is an R command.
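
The same request can be assembled from its parts in Python, which makes the structure of the URL explicit. This is a sketch that rebuilds only the URL quoted above; the helper name griddap_url is made up for the example, and the download call is left as a comment so the block runs without network access.

```python
def griddap_url(server, dataset, file_type, variable, ranges):
    """Assemble a griddap-style URL from (start, stride, stop) index ranges."""
    subset = "".join(f"[({start}):{stride}:({stop})]" for start, stride, stop in ranges)
    return f"{server}/griddap/{dataset}.{file_type}?{variable}{subset}"


url = griddap_url(
    "http://coastwatch.pfeg.noaa.gov/erddap",
    "erdBAsstamday",
    "nc",
    "sst",
    [
        ("2010-01-16T12:00:00Z", 1, "2010-01-16T12:00:00Z"),  # time
        ("0.0", 1, "0.0"),                                    # altitude
        ("30", 1, "50.0"),                                    # latitude
        ("220", 1, "240.0"),                                  # longitude
    ],
)
# urllib.request.urlretrieve(url, "AGssta.nc") would fetch the file using only
# the Python standard library, in the same spirit as the Matlab and R one-liners.
```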
        > >>> I think that we do not want to be R/MATLAB specific
        > >>> in a proposal to put stuff in URLs. I would rather
        > >>> propose to allow uploading of R/MATLAB scripts to serve
        > >>> as additional, user-defined functions.
        > >>>
        > >>>> I would prefer to
        > >>>> maintain this simplicity and cover 80% of the cases if possible,
        > >>>> than cover the rest but where more complex, application specific
        > >>>> code would have to be developed and maintained.
        > >>> Agreed. However my assumption is that the output of any function that
        > >>> is not assigned to a single-assignment variable will be returned as part
        > >>> of the response; but other ways of specifying this are possible within
        > >>> the functional framework I am proposing.
        > >>>
        > >>> =Dennis Heimbigner
        > >>> Unidata
        > >> **********************
        > >> "The contents of this message do not reflect any position
        > >> of the U.S. Government or NOAA."
        > >> **********************
        > >> Roy Mendelssohn
        > >> Supervisory Operations Research Analyst
        > >> NOAA/NMFS
        > >> Environmental Research Division
        > >> Southwest Fisheries Science Center
        > >> 1352 Lighthouse Avenue
        > >> Pacific Grove, CA 93950-2097
        > >> e-mail: Roy.Mendelssohn@xxxxxxxx (Note new e-mail address)
        > >> voice: (831)-648-9029
        > >> fax: (831)-648-8440
        > >> www: http://www.pfeg.noaa.gov/
        > >> "Old age and treachery will overcome youth and skill."
        > >> "From those who have been given much, much will be expected"
        > >> "the arc of the moral universe is long, but it bends
        > >> toward justice" -MLK Jr.
        > _______________________________________________
        > thredds mailing list
        > thredds@xxxxxxxxxxxxxxxx <mailto:thredds@xxxxxxxxxxxxxxxx>
        > For list information or to unsubscribe,  visit:
       http://www.unidata.ucar.edu/mailing_lists/
