Re: [ferret_users] memory strategies for handling large computational requests

To: ferret_users@xxxxxxxx

Subject: Re: [ferret_users] memory strategies for handling large computational requests

From: "Ansley C. Manke" <ansley.b.manke@xxxxxxxx>

Date: Mon, 27 Nov 2017 16:36:30 -0800

Arc-authentication-results: i=2; mx.google.com; dkim=pass header.i=@noaa-gov.20150623.gappssmtp.com header.s=20150623 header.b=NgJ2Yxej; spf=pass (google.com: domain of ansley.b.manke@xxxxxxxx designates 209.85.220.41 as permitted sender) smtp.mailfrom=ansley.b.manke@xxxxxxxx

Arc-authentication-results: i=1; mx.google.com; dkim=pass header.i=@noaa-gov.20150623.gappssmtp.com header.s=20150623 header.b=NgJ2Yxej; spf=pass (google.com: domain of ansley.b.manke@xxxxxxxx designates 209.85.220.41 as permitted sender) smtp.mailfrom=ansley.b.manke@xxxxxxxx

Arc-message-signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-archive:list-help:list-post:list-id:mailing-list:precedence :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:to:subject:arc-authentication-results :arc-message-signature:dkim-signature:arc-authentication-results; bh=wcJ99t2aORx506I2SMgxlA7GFnGSphaA0wU994Q0oYQ=; b=A2/xIuWjVB6iDepC/M46/sTZowm6sOt7vvZ6WubaZWaWZ73BvM+12aU6ZqwRC6hDBR m5G508tNdMGHu1aweXa3L2V7SFaEnnVsQpWi+agF9KNO+H6mDX5WxiLu8A3fD8ESfVNi 0sDf/lE8/mRTrKhtaBk8X1hWWcb7Qi8UM5OqQkad2nbYpDrHCkcftm8AJTTUn66ULqS/ mgMfDN878zjqYQQvG00FiYOpmsZ2CfPSelqBAMQrRCNK1+j9RqRuCzxrtYoHKymHML8H 3etiry4fMdyFWwhsu2ojHzfdNhq6wDBz4lSSuPhzjOcx7qUw1snPN4NAhL9upZTHR26i mU9A==

Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:to:subject:dkim-signature :arc-authentication-results; bh=wcJ99t2aORx506I2SMgxlA7GFnGSphaA0wU994Q0oYQ=; b=L2rBxrdd/Pc0L2Nkbx9Oc/554oVJWLjMKHOCmxXNrOt3VwF65Dmhx5m00VFjvYDjpa txCPXilzsZr4dQeYAZ/1kXOrutWzoIWALh5pT2Paq/MRSS0i63xL7YN55wUMoIFngWrY Pt9pz2FB4LSoSR65sOWClEay/ZGGxYWBM/YWu/8fMrAO2DCwAFdEUd5Q0NL39UHbhbQD TUYzJfHnXH2l2BD42ie5SXm3FFKrkPrFS4aGVstXaH9LQEgIMH/Gmo6tBS0dLeLZ7NJL 3c8KdG44agfTRB8hAz95l+t5GVLhsFCpYx69T6WY2Jq0wabM0luejQVsfr91OAoX6Jnx PnIg==

Arc-seal: i=2; a=rsa-sha256; t=1511829393; cv=pass; d=google.com; s=arc-20160816; b=DiXLB4JVWBEfxhH7nGhL1iuHY+KynZ+39/crEhMAtw+Zaf9F/DyNPiQvZoX6OB9x2m rxZE0a5yHxo7tANjC6a5lWmlg5UMIH2GQ7QqaDw8duUDWWA+SnXa7K2CEjETQs/P/O6S yweHpmxQMEv0VVDTmzQcbKM8LtPdtWfgoZnSH78x0o2NgYtFfafIudtiNUBQwh1Mqymp C5hwhns/xFj4ROIU3L1NYDprn2X9oMZBHJlxR0n0asxFenzQjG9kL5EDOVUxKxJEl60T LnqJSwfooRMsNv7ur7YrjcNplprSxpDk4cNdSJ/3E8TohoTeL9GQJjuN570zRFPyXnw+ wy0A==

Arc-seal: i=1; a=rsa-sha256; t=1511829392; cv=none; d=google.com; s=arc-20160816; b=pzFdnHalf/X8qPQD8BY5gTr7SKOhfm1XbEQU/L5z4KR1exMS0UdGVeMbgwb+VNySHG QzS+w92We1WGAXB2H2/VmWS4r9jPo3F20ROwr5I6CpklwJN3jKcUWlxdmPQYwKqrS3gD BtI4Kz9NXvkNXUJUPSlrRt3vlvJRzebscLK52vlblNik4JEHai90rOmLmncF0UO5s/lk UN/V34ZeC2I/Valeps7AItAOkw27NvYj+BP4JxC+QIUQ7ROr4zXPiKIiu1lMdposeCbR EW4mCVqBLLz5eydIE4cyv/HH7i7GRrCWy7kpjpPynLBCLPF/7eiaGDNFlGeQjUAH2TNI E5SQ==

Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=noaa-gov.20150623.gappssmtp.com; s=20150623; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-language:x-original-sender :x-original-authentication-results:precedence:mailing-list:list-id :list-post:list-help:list-archive; bh=wcJ99t2aORx506I2SMgxlA7GFnGSphaA0wU994Q0oYQ=; b=sblpc76brqKCctUz8aZTJwc2de4MMD5T0UExi5Kj9WBEukZ7x4T+JgQnqI042mgTsz 4McXV7I8uGC4uU6ebRFK9IGzKzpnh3O4lBkS0lkjCNTMSpb4U1eGw/vwU0Afk6In3x3I /t0qsgj0ZFO8/SnEC8y0UpuNhS/YrJQ3SQycEzi6mLirCgLLrsftqFyaCjMsKjsk40Mo 0hv+iv1gdvUejpEm7ySXGJHZfIIuRjum2oqJ03OqdL3oGi3BP8lwvRPmPkL/uHj8Lx8d CE9cs6r4ejCxaZOUU/T1p04jv0ndAz+O7IoBeewJ2medaOjBxq88ver+kz35qTozh7Go Xf7w==

In-reply-to: <400769719.1005202.1511348224834.JavaMail.zimbra@lsce.ipsl.fr>

List-archive: <https://groups.google.com/a/noaa.gov/group/ferret_users/>

List-help: <https://support.google.com/a/noaa.gov/bin/topic.py?topic=25838>, <mailto:ferret_users+help@noaa.gov>

List-id: <ferret_users.noaa.gov>

List-post: <https://groups.google.com/a/noaa.gov/group/ferret_users/post>, <mailto:ferret_users@noaa.gov>

Mailing-list: list ferret_users@xxxxxxxx; contact ferret_users+owners@xxxxxxxx

References: <400769719.1005202.1511348224834.JavaMail.zimbra@lsce.ipsl.fr>

Sender: owner-ferret_users@xxxxxxxx

User-agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.4.0

Hi Patrick,
Ferret does not need to load the whole grid into memory to do these operations. Ferret will try to break up the calculation itself, particularly with the memory-use enhancements in v7.2. For the operations you're using ferret will break up this computation. Not all computations can be broken up, either because of the nature of the operations themselves, or because we haven't implemented all combinations of operations. For instance function calls are not broken up. If that is the case for your computations, then writing a file will likely be necessary. Doing the operations in pieces in the T direction and appending is a good option. You can also append in other directions, as long as you first save the entire grid, containing say, missing data, and then use APPEND/K= /J= /I= to overwrite the file as the actual results are computed. See a few small examples of that in chapter 10 of the Ferret users Guide, examples 4, 4a, and 5.

I'm going to do an example with your operations in some detail, since few of us have explored this.

yes? use https://vesg.ipsl.upmc.fr/thredds/dodsC/IPSLFS/brocksce/tmp/CM6012.1-pi-ttop-02_23200101_30091231_1M_transpir.ncyes? show data     currently SET data sets:    1> https://vesg.ipsl.upmc.fr/thredds/dodsC/IPSLFS/brocksce/tmp/CM6012.1-pi-ttop-02_23200101_30091231_1M_transpir.nc (default) name     title                             I         J         K         L TIME_CENTERED          Time axis                        ...       ...       ...       1:8280 TIME_CENTERED_BOUNDS                                           1:2       ...       ...       1:8280 TRANSPIR Transpiration                    1:144     1:143     1:13      1:8280 AREAS    Mesh areas                       1:144     1:143     ...       ... CONTFRAC Continental fraction             1:144     1:143     ...       ... (I made a local dataset with the same size grid for testing, as reading lots of data from the thredds server is somewhat slow.)

Diagnostic mode lists information as Ferret runs, noting when it puts tasks on its operation stack, reads data, computes transformations, and does "gathering". The memory management in Ferret v7.2 breaks up computations into parts, executes those parts while saving partial results and finally "finalizes" by putting the results together. Earlier versions of Ferret also break computations up into pieces to reduce the amount of data that must be read in; in fact earlier Ferret versions do this particular thing in exactly the same way - the example below works with any old Ferret version. V7.2 would also break up the computation along the axes being compressed if that was the only way to shorten the computation.

I'll put in some comments here in orange,

First a shorter example, compute the result on L=1:360. yes? let var=TRANSPIR[k=@sum, x=@ave, y=@ave]yes? set memory/size=200yes? set mode diagnostic ! this is not needed, but lets us see Ferret memory management in actionyes? save/clobber/file=file1.nc/L=1:360 var[l=@sbx:120] getgrid EX#1     C: 5 dset:   1 I:      1      1 J:    1    1 K:    1    1 L:      1      1 M:    1    1 N:    1    1 getgrid VAR      C: 7 dset:   1 I:      1      1 J:    1    1 K:    1    1 L:      1      1 M:    1    1 N:    1    1 allocate dynamic grid GBC3            LON       LAT       VEGET1    TIME_COUNT allocate dynamic grid GBC3            LON       LAT       VEGET1    TIME_COUNT
Here Ferret is setting up to get data for L=1:420, to be able to correctly return the boxcar smoother var[L@SBX:120] on L=1:360, and sets up "gathering" to return the averaging and sum requestsstrip limits reconciliation : EX#1 eval    EX#1     C: 5 dset:   1 I:      1    144 J:    1   90 K:    1   13 L:      1    360 strip --> VAR[L=1:360@SBX:120,D=1] eval    VAR      C: 8 dset:   1 I:      1    144 J:    1   90 K:    1   13 L:      1    420 strip gathering TRANSPIR on T axis:        1      420 dset:   1           420=request     100000000=availableMem strip --> TRANSPIR[Z=0.5:13.5@SUM,D=1] strip --> TRANSPIR[Y=90S:90N@AV4,D=1] It will read and compute the XY averages and Z sum for each subset in L: strip --> TRANSPIR[Z=0.5:13.5@SUM,D=1] strip --> TRANSPIR[Y=90S:90N@AV4,D=1] reading TRANSPIR M: 17 dset:   1 I:      1    144 J:    1   90 K:    1   13 L:      1    138 doing --> TRANSPIR[Y=90S:90N@AV4,D=1] final --> TRANSPIR[Y=90S:90N@AV4,D=1] doing --> TRANSPIR[Z=0.5:13.5@SUM,D=1] doing gathering TRANSPIR on T axis:        1      138 dset:   1           138=request      99999724=availableMem strip --> TRANSPIR[Z=0.5:13.5@SUM,D=1] strip --> TRANSPIR[Y=90S:90N@AV4,D=1] reading TRANSPIR M: 13 dset:   1 I:      1    144 J:    1   90 K:    1   13 L:    139    276 doing --> TRANSPIR[Y=90S:90N@AV4,D=1] final --> TRANSPIR[Y=90S:90N@AV4,D=1] doing --> TRANSPIR[Z=0.5:13.5@SUM,D=1] doing gathering TRANSPIR on T axis:      139      276 dset:   1           138=request      99999304=availableMem strip --> TRANSPIR[Z=0.5:13.5@SUM,D=1] strip --> TRANSPIR[Y=90S:90N@AV4,D=1] reading TRANSPIR M: 10 dset:   1 I:      1    144 J:    1   90 K:    1   13 L:    277    414 doing --> TRANSPIR[Y=90S:90N@AV4,D=1] final --> TRANSPIR[Y=90S:90N@AV4,D=1] doing --> TRANSPIR[Z=0.5:13.5@SUM,D=1] doing gathering TRANSPIR on T axis:      277      414 dset:   1           138=request      99999304=availableMem strip --> TRANSPIR[Z=0.5:13.5@SUM,D=1] strip --> TRANSPIR[Y=90S:90N@AV4,D=1] reading TRANSPIR M: 7 dset:   1 I:      1    144 J:    1   90 K:    1   13 L:    415    420 doing --> TRANSPIR[Y=90S:90N@AV4,D=1] final --> TRANSPIR[Y=90S:90N@AV4,D=1] doing --> TRANSPIR[Z=0.5:13.5@SUM,D=1] doing gathering TRANSPIR on T axis:      415      420 dset:   1             6=request      99999568=availableMem And finally Ferret does the @SBX on the entire time series of XY averaged, Z summed data, returningVAR[L=1:360@SBX:120] doing --> VAR[L=1:360@SBX:120,D=1] LISTing to file file1.nc yes?Now do the whole set, using a larger memory setting. It does perhaps 12-15 different reads.
yes? set mem/siz=400yes? save/clobber/file=file2.nc   var[l=@sbx:120]... doing --> VAR[T=01-JAN-232018:00:31-DEC-300918:00@SBX:120,D=1]
LISTing to file file2.nc

Verify that the operations on a subset of the data, computed in different chunks, matches what is computed for the entire dataset.
yes? cancel data/allyes? cancel var/allyes? use file1.nc, file2.ncyes? list/l=60:70 var[d=2] - var[d=1]             VARIABLE : VAR[D=file2] - VAR[D=file1]             SUBSET   : 11 points (TIME)             CALENDAR : NOLEAP 16-DEC-2324 12 / 60:    .... 16-JAN-2325 12 / 61: 0.0000 15-FEB-2325 00 / 62: 0.0000 16-MAR-2325 12 / 63: 0.0000 16-APR-2325 00 / 64: 0.0000 16-MAY-2325 12 / 65: 0.0000 16-JUN-2325 00 / 66: 0.0000 16-JUL-2325 12 / 67: 0.0000 16-AUG-2325 12 / 68: 0.0000 16-SEP-2325 00 / 69: 0.0000 16-OCT-2325 12 / 70: 0.0000

etc.

On 11/22/2017 2:57 AM, Patrick Brockmann wrote:

Hi ferreters,

I would like to plot time series with quite huge file (8.3G) from a variable XYZT (144x143x13x8280).

I have worked with last 7.2 ferret release and tried different increases of memory without success.

I always get **ERROR: request exceeds memory setting

Next step would be as suggested from doc (http://ferret.pmel.noaa.gov/Ferret/documentation/users-guide/computing-environment/MEMORY-USE)

to break up my request into fragments.

Is it the best solution in my case ?

My ressource is available from

https://vesg.ipsl.upmc.fr/thredds/catalog/IPSLFS/brocksce/tmp/catalog.html?dataset=DatasetScanIPSLFS/brocksce/tmp/CM6012.1-pi-ttop-02_23200101_30091231_1M_transpir.nc

Typical code lines are:

yes? use CM6012.1-pi-ttop-02_23200101_30091231_1M_transpir.nc

yes? let var=TRANSPIR[k=@sum, x=@ave, y=@ave]
yes? plot var[l=@sbx:120]

! the following pass because I have limites time range (1:1200)

yes? plot var[l=1:1200@sbx:120]

Any help welcome.

Regards

Patrick

--
Data Analysis and Visualization Engineer
LSCE/IPSL, CEA-CNRS-UVSQ laboratory
LSCE - Climate and Environment Sciences Laboratory
IPSL - Institut Pierre Simon Laplace
--