Do Nulls and Indexes Work Together?-Indexes

On 24/03/2023 by Robert Corvino

BTree indexes, except in the special case of cluster BTree indexes, do not store completely null entries, but bitmap and cluster indexes do. This side effect can be a point of confusion, but it can actually be used to your advantage when you understand what not storing entirely null keys implies.

To see the effect of the fact that null values are not stored, consider this example:
$ sqlplus eoda/foo@PDB1
SQL> create table t ( x int, y int );Table created.
SQL> create unique index t_idx on t(x,y); Index created.
SQL> insert into t values ( 1, 1 );
SQL> insert into t values ( 1, NULL ); 1 row created.
SQL> insert into t values ( NULL, 1 ); 1 row created.
SQL> insert into t values ( NULL, NULL ); 1 row created.
SQL> analyze index t_idx validate structure; Index analyzed.
SQL> select name, lf_rows from index_stats;

The table has four rows, whereas the index only has three. The first three rows, where at least one of the index key elements was not null, are in the index. The last row with (NULL, NULL) is not in the index. One of the areas of confusion is when the index is a unique index, as just shown. Consider the effect of the following three INSERT statements:
SQL> insert into t values ( NULL, NULL ); 1 row created.
SQL> insert into t values ( NULL, 1 ); insert into t values ( NULL, 1 ) *
ORA-00001: unique constraint (EODA.T_IDX) violated
SQL> insert into t values ( 1, NULL ); insert into t values ( 1, NULL ) *
ORA-00001: unique constraint (EODA.T_IDX) violated

The new (NULL, NULL) row is not considered to be the same as the old row with (NULL, NULL):
SQL> select x, y, count()from tgroup by x,yhaving count() > 1;X Y COUNT(*)

This seems impossible; our unique key isn’t unique if we consider all null entries. The fact is that, in Oracle, (NULL, NULL) is not the same as (NULL, NULL) when considering uniqueness—the SQL standard mandates this. (NULL,NULL) and (NULL,NULL) are considered the same with regard to aggregation, however. The two are unique for comparisons but are the same as far as the GROUP BY clause is concerned. That is something to consider: each unique constraint should have at least one NOT NULL column to be truly unique.

The question that comes up with regard to indexes and null values is, “Why isn’t my query using the index?” The query in question is something like the following:
SQL> select * from T where x is null;

This query cannot use the index we just created—the row (NULL, NULL) simply is not in the index; hence, the use of the index would return the wrong answer. Only if at least one of the columns is defined as NOT NULL can the query use an index. For example, the following shows Oracle will use an index for an X IS NULL predicate if there is an index with X on the leading edge and at least one other column in the index is defined as NOT NULL in the base table:
SQL> create table t ( x int, y int NOT NULL ); Table created.
SQL> create unique index t_idx on t(x,y); Index created.
SQL> insert into t values ( 1, 1 );1 row created.
SQL> insert into t values ( NULL, 1 ); 1 row created.
SQL> begindbms_stats.gather_table_stats(user,’T’);end;/
PL/SQL procedure successfully completed.

When we go to query that table this time, we’ll discover this:
SQL> set autotrace on
SQL> select * from t where x is null;

Previously, I said that you can use to your advantage the fact that totally null entries are not stored in a B*Tree index—here is how. Say you have a table with a column that takes exactly two values. The values are very skewed; say, 90 percent or more of the rows take on one value, and 10 percent or less take on the other value. You can index this column efficiently to gain quick access to the minority rows. This comes in handy when you would like to use an index to get to the minority rows, but you want to full scan toget to the majority rows, and you want to conserve space. The solution is to use a null for majority rows and whatever value you want for minority rows or, as demonstrated earlier, use a function-based index to index only the non-null return values from a function.

Now that you know how a B*Tree will treat null values, you can use that to your advantage and take precautions with unique constraints on sets of columns that all allow nulls (be prepared to have more than one row that is all null as a possibility in this case).

M	T	W	T	F	S	S
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

Do Nulls and Indexes Work Together?-Indexes

Leave a Reply Cancel reply

Archives

Calendar

Categories