Difference between revisions of "Filling Out the Field Statistics Class"

From The SBN Wiki
Jump to navigation Jump to search
(Creation)
 
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
 
The '''''<Field_Statistics_Class>''''' groups together a number of statistics calculated across all values that occur in the file for the associated field.
 
The '''''<Field_Statistics_Class>''''' groups together a number of statistics calculated across all values that occur in the file for the associated field.
  
{| class="wikitable" style="background-color: thistle"
+
{| class="wikitable" style="background-color: yellow"
| '''''N.B.:''''' ''What constitutes the "values that occur in the file for the associated field" is ambiguous when this class is used in a ''Field'' within a ''Group_Field''.  If you must use both grouped fields and field statistics, use the ''<description>'' attribute in the ''<Field_Statistics_Class>'' to define specifically what values are included.''
+
| '''''N.B.:''''' ''What constitutes the "values that occur in the file for the associated field" is ambiguous when this class is used in a ''Field'' within a ''Group_Field''.  If you must use both grouped fields and field statistics, use the ''<description>'' attribute in the ''<Field_Statistics_Class>'' to define specifically what values are included in the calculations.''
 
|}
 
|}
 
 
For additional explanation, see the PDS4 ''Standards Reference'', or contact your PDS node consultant.
 
For additional explanation, see the PDS4 ''Standards Reference'', or contact your PDS node consultant.
  
Line 15: Line 14:
 
''OPTIONAL''
 
''OPTIONAL''
  
If you need to refer to this particular set of ''Field_Statistics'' from elsewhere in the label, you can git it an identifier here.  If you stick to something that looks like a variable name from a typical programming language, you should be OK syntactically.
+
If you need to refer to this particular set of ''Field_Statistics'' from elsewhere in the label, you can give it an identifier here.  If you stick to something that looks like a variable name from a typical programming language, you should be OK syntactically.
 +
 
 +
{| class="wikitable" style="background-color: yellow"
 +
| '''''Note:''''' The following fields refer to values "physically stored" in the data object.  While this is not clearly defined in the data dictionary, we take this to mean that any bit mask ''is'' applied to the bytes read from the file, but no scaling factor or offset is applied prior to performing the calculation involved in deriving the statistical value.  Also, all flag values defined in any associated ''<Special_Constants>'' are ignored and excluded from the calculations.
 +
|}
  
 
== <maximum> ==
 
== <maximum> ==
Line 21: Line 24:
 
''OPTIONAL''
 
''OPTIONAL''
  
This is the maximum value that is physically stored in the file for the associated field. Any flag values defined in a ''Special_Constants'' class are ignored.  This value is the maximum '''''before''''' any scaling factor or offset is applied.
+
This is the maximum value that is physically stored in the file for the associated field.
 
 
{| class="wikitable" style="background-color: thistle"
 
| ''I don't understand why there is no ''<maximum_scaled_value>'' in this class, as there is for the equivalent ''<Object_Statistics>'' class used in ''Arrays''.''
 
|}
 
  
 
== <minimum> ==
 
== <minimum> ==
Line 31: Line 30:
 
''OPTIONAL''
 
''OPTIONAL''
  
 +
This is the minimum value that is physically stored in the file for the associated field.
  
This is the minimum value that is physically stored in the file for the associated field. Any flag values defined in a ''Special_Constants'' class are ignored.  This value is the minimum '''''before''''' any scaling factor or offset is applied.
+
== <mean> ==
  
{| class="wikitable" style="background-color: thistle"
 
| ''I don't understand why there is no ''<minimum_scaled_value>'' in this class, as there is for the equivalent ''<Object_Statistics>'' class used in ''Arrays''.''
 
|}
 
== <mean> ==
 
 
''OPTIONAL''
 
''OPTIONAL''
This is the arithmetic mean of the values stored in the file (that is, no scaling and no offsets applied), excluding defined flag values.
+
 
 +
This is the arithmetic mean of the values physically stored in the file.
 +
 
 
== <standard_deviation> ==
 
== <standard_deviation> ==
 +
 
''OPTIONAL''
 
''OPTIONAL''
This is the statistical standard deviation of the values stored in the file (that is, no scaling and no offsets applied), excluding defined flag values.
+
 
 +
This is the statistical standard deviation of the values physically stored in the file.
 +
 
 
== <median> ==
 
== <median> ==
 +
 
''OPTIONAL''
 
''OPTIONAL''
This is the median value stored in the file (that is, no scaling and no offsets applied), excluding defined flag values.
+
 
 +
This is the median of the values physically stored in the file.  
 +
 
 
== <description> ==
 
== <description> ==
 +
 
''OPTIONAL''
 
''OPTIONAL''
 +
 
Use this free-format text field to provide any relevant explanatory notes or caveats.
 
Use this free-format text field to provide any relevant explanatory notes or caveats.

Latest revision as of 19:08, 22 April 2015

The <Field_Statistics_Class> groups together a number of statistics calculated across all values that occur in the file for the associated field.

N.B.: What constitutes the "values that occur in the file for the associated field" is ambiguous when this class is used in a Field within a Group_Field. If you must use both grouped fields and field statistics, use the <description> attribute in the <Field_Statistics_Class> to define specifically what values are included in the calculations.

For additional explanation, see the PDS4 Standards Reference, or contact your PDS node consultant.

Following are the attributes and subclasses you'll find in the <Field_Statistics> class.

Note that in the PDS4 master schema, all classes have capitalized names; attributes never do.

<local_identifier>

OPTIONAL

If you need to refer to this particular set of Field_Statistics from elsewhere in the label, you can give it an identifier here. If you stick to something that looks like a variable name from a typical programming language, you should be OK syntactically.

Note: The following fields refer to values "physically stored" in the data object. While this is not clearly defined in the data dictionary, we take this to mean that any bit mask is applied to the bytes read from the file, but no scaling factor or offset is applied prior to performing the calculation involved in deriving the statistical value. Also, all flag values defined in any associated <Special_Constants> are ignored and excluded from the calculations.

<maximum>

OPTIONAL

This is the maximum value that is physically stored in the file for the associated field.

<minimum>

OPTIONAL

This is the minimum value that is physically stored in the file for the associated field.

<mean>

OPTIONAL

This is the arithmetic mean of the values physically stored in the file.

<standard_deviation>

OPTIONAL

This is the statistical standard deviation of the values physically stored in the file.

<median>

OPTIONAL

This is the median of the values physically stored in the file.

<description>

OPTIONAL

Use this free-format text field to provide any relevant explanatory notes or caveats.