<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>OpenText Analytics Database 26.2.x – Column encoding</title>
    <link>/en/data-analysis/query-optimization/column-encoding/</link>
    <description>Recent content in Column encoding on OpenText Analytics Database 26.2.x</description>
    <generator>Hugo -- gohugo.io</generator>
    
	  <atom:link href="/en/data-analysis/query-optimization/column-encoding/index.xml" rel="self" type="application/rss+xml" />
    
    
      
        
      
    
    
    <item>
      <title>Data-Analysis: Improving column compression</title>
      <link>/en/data-analysis/query-optimization/column-encoding/improving-column-compression/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/en/data-analysis/query-optimization/column-encoding/improving-column-compression/</guid>
      <description>
        
        
        &lt;p&gt;If you see slow performance or a large storage footprint with your &lt;a href=&#34;../../../../en/sql-reference/data-types/numeric-data-types/double-precision-float/&#34;&gt;FLOAT&lt;/a&gt; data, evaluate the data and your business needs to see if it can be contained in a &lt;a href=&#34;../../../../en/sql-reference/data-types/numeric-data-types/numeric/#&#34;&gt;NUMERIC&lt;/a&gt; column with a precision of 18 digits or less. Converting a FLOAT column to a NUMERIC column can improve data compression, reduce the on-disk size of your database, and improve performance of queries on that column.&lt;/p&gt;
&lt;p&gt;When you define a NUMERIC data type, you specify the precision and the scale; NUMERIC data are exact representations of data. FLOAT data types represent variable precision and approximate values; they take up more space in the database.&lt;/p&gt;
&lt;p&gt;Converting FLOAT columns to NUMERIC columns is most effective when:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;NUMERIC precision is 18 digits or less. Performance of NUMERIC data is fine-tuned for the common case of 18 digits of precision. OpenText recommends converting FLOAT columns to NUMERIC columns only if they require precision of 18 digits or less.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;FLOAT precision is bounded, and the values will all fall within a specified precision for a NUMERIC column. One example is monetary values like product prices or financial transaction amounts. For example, a column defined as NUMERIC(11,2) can accommodate prices from 0 to a few million dollars and can store cents, and compresses more efficiently than a FLOAT column.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If you try to load a value into a NUMERIC column that exceeds the specified precision, the database returns an error and does not load the data. If you assign a value with more decimal digits than the specified scale, the value is rounded to match the specified scale and stored in that column.&lt;/p&gt;
&lt;h2 id=&#34;see-also&#34;&gt;See also&lt;/h2&gt;
&lt;a href=&#34;../../../../en/sql-reference/data-types/numeric-data-types/#&#34;&gt;Numeric data types&lt;/a&gt;

      </description>
    </item>
    
    <item>
      <title>Data-Analysis: Using run length encoding</title>
      <link>/en/data-analysis/query-optimization/column-encoding/using-run-length-encoding/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/en/data-analysis/query-optimization/column-encoding/using-run-length-encoding/</guid>
      <description>
        
        
        &lt;p&gt;When you run Database Designer, you can choose to optimize for loads, which minimizes database footprint. In this case, Database Designer applies encodings to columns to maximize query performance. &lt;a href=&#34;../../../../en/sql-reference/statements/create-statements/create-projection/encoding-types/&#34;&gt;Encoding options&lt;/a&gt; include run length encoding (RLE), which replaces sequences (runs) of identical values in a column with a set of pairs, where each pair represents the number of contiguous occurrences for a given value: (&lt;em&gt;&lt;code&gt;occurrences&lt;/code&gt;&lt;/em&gt;, &lt;em&gt;&lt;code&gt;value&lt;/code&gt;&lt;/em&gt;).&lt;/p&gt;
&lt;p&gt;RLE is generally applicable to a column with low-cardinality, and where identical values are contiguous—typically, because table data is sorted on that column. For example, a customer profile table typically includes a gender column that contains values of F and M only. Sorting on gender ensures runs of F or M values that can be expressed as a set of two pairs: (&lt;em&gt;&lt;code&gt;occurrences&lt;/code&gt;&lt;/em&gt;, F) and (&lt;em&gt;&lt;code&gt;occurrences&lt;/code&gt;&lt;/em&gt;, M). So, given 8,147 occurrences of F and 7,956 occurrences of M, and a projection that is sorted primarily on gender, the database can apply RLE and store these values as a single set of two pairs: (8147, F) and (7956, M). Doing so reduces this projection’s footprint and improves query performance.&lt;/p&gt;

      </description>
    </item>
    
  </channel>
</rss>
