[Thread Prev][Thread Next][Index]

Re: [ferret_users] R-square problem



Hi everyone,
Ana sent me her data, with P and Q.  These are variables that look quite well correlated but with one being of larger magnitude than the other. This is causing a loss of numeric accuracy; Ferret only operates in single-precision, and so the variations in the data of smaller magnitude are overwhelmed by the larger numbers.

The result is improved with:
yes? use pq
yes? set var/name=q_in q  ! I am going to re-define q
yes? let q = q_in/100

yes? go regresst
yes? stat rsquared

I'm asking some of our data-analysis experts to chime in:  What more can we do to work with this data?  I'm attaching the netCDF file.
yes? can data/all; use pq.nc
yes? stat p
             VAR_1
             LONGITUDE: 118.8E to 61.2W
             LATITUDE: 36.2S to 16.2N
             Z:  N/A
             TIME: 01-JUN-1958 00:00 to 01-JUN-2002 00:00
             DATA SET: ./pq.nc
 
 Total # of data points: 798336 (72*21*1*528)
 # flagged as bad  data: 0
 Minimum value: 1000
 Maximum value: 1031.3
 Mean    value: 1012.9 (unweighted average)
 Standard deviation: 3.723

yes? stat q
             VAR_2
             LONGITUDE: 118.8E to 58.8W
             LATITUDE: 36.2S to 16.2N
             Z:  N/A
             TIME: 01-JUN-1958 00:00 to 01-JUN-2002 00:00
             DATA SET: ./pq.nc
 
 Total # of data points: 809424 (73*21*1*528)
 # flagged as bad  data: 0
 Minimum value: 100000
 Maximum value: 103278
 Mean    value: 101288 (unweighted average)
 Standard deviation: 394.51

yes? set var/name=q_in q
yes? let q = q_in/100
yes? go regresst
...
yes? stat rsquare
 
             (PQVAR*PQVAR) / (PVAR*QVAR)
             LONGITUDE: 118.8E to 61.2W
             LATITUDE: 36.2S to 16.2N
             Z:  N/A
             TIME: 01-JUN-1958 00:00 to 01-JUN-2002 00:00
             DATA SET: ./pq.nc
 
 Total # of data points: 1512 (72*21*1*1)
 # flagged as bad  data: 0
 Minimum value: 0.27778
 Maximum value: 1.0833
 Mean    value: 0.8884 (unweighted average)
 Standard deviation: 0.1003



Ana Redondo wrote:
Re: [ferret_users] R-square problem Hi Ansley,

I am attaching you the file with the data
Thank you very much!!
Ana


On 20/10/09 10:30 AM, "Ansley Manke" <Ansley.B.Manke@xxxxxxxx> wrote:

Hi Ana,
Would it be possible for you to send me the data you are using?  If you do the following in Ferret, then you will produce a file that I should be able to test this with:

yes? Let P=var_1
  yes? Let Q=var_2
  yes? save/file=pq.nc p,q

and then just attach the netcdf file to an email.  This is not a large amount of data, and these files email just fine.  We can figure out what's happening and then report back to the Ferret group.

I'm at the end of my day, and I see you're in Australia. This is always the hard part about working with others around the world, but maybe within another day we can get this figured out!

Ansley



Ana Redondo wrote:
Re: [ferret_users] R-square problem Hi Ansley,
 
Yes the values of Rsquare are larger than 1:
 
 
yes? stat rsquare
 Total # of data points: 1512 (72*21*1*1)
 # flagged as bad  data: 0
 Minimum value: 0.34722
  Maximum value: 1.2
 
Mean    value: 0.89056 (unweighted average)
 Standard deviation: 0.10312
 
Any ideas??
 
Thanks
 
Ana
 
 
On 20/10/09 2:16 AM, "Ansley Manke" <Ansley.B.Manke@xxxxxxxx> wrote:
 
  
Hi Ana,
Yes, the values of rsquare should be between 0 and 1.   If you look at the result of
 
   
yes? STAT rsquare
 
are the values really larger than 1?  I wonder if the automatically chosen color fill levels in the FILL command are making the color bar extend higher than 1 even though the values of the data field are not actually larger than 1.
 
Another possibility is that the accuracy of numeric calculations is putting values slightly larger than 1.  Here too the STAT command might tell us more.
 
Ansley
 
 
Ana Redondo wrote:
  
R-square problem Hi,
 
I am doing a linear regression between two independent variables in order to find out the R-square. This is how I am doing it:
  yes? Let P=var_1
  yes? Let Q=var_2
  yes? Go regresst
  yes? show var
  yes? fill rsquare
 
But when filling the R-square (and I also list some values to test) I realized that some values are greater than 1!!!!!!!!  Isn’t it R-square the coefficient of determination? If so it should be 0<R^2<1, what I am doing wrong???
 
 
Thanks
 
Ana
 

 

Attachment: pq.nc
Description: Binary data


[Thread Prev][Thread Next][Index]

Contact Us
Dept of Commerce / NOAA / OAR / PMEL / TMAP

Privacy Policy | Disclaimer | Accessibility Statement