Informix Guide to SQL: Tutorial

Informix Guide to SQL: Tutorial
Chapter 5: Programming with SQL

Home Contents Index Master Index New Book

Retrieving Multiple Rows

When any chance exists that a query could return more than one row, the program must execute the query differently. Multirow queries are handled in two stages. First, the program starts the query. (No data is returned immediately.) Then the program requests the rows of data one at a time.

These operations are performed using a special data object called a cursor. A cursor is a data structure that represents the current state of a query. The following list shows the general sequence of program operations:

1. The program declares the cursor and its associated SELECT statement, which merely allocates storage to hold the cursor.

2. The program opens the cursor, which starts the execution of the associated SELECT statement and detects any errors in it.

3. The program fetches a row of data into host variables and processes it.

4. The program closes the cursor after the last row is fetched.

5. When the cursor is no longer needed, the program frees the cursor to deallocate the resources it uses.

These operations are performed with SQL statements named DECLARE, OPEN, FETCH, CLOSE, and FREE.

Declaring a Cursor

You use the DECLARE statement to declare a cursor. This statement gives the cursor a name, specifies its use, and associates it with a statement. The following example is written in INFORMIX-ESQL/C:

EXEC SQL DECLARE the_item CURSOR FOR

	SELECT order_num, item_num, stock_num

	INTO o_num, i_num, s_num

	FROM items
	FOR READ ONLY;

The declaration gives the cursor a name (the_item in this case) and associates it with a SELECT statement. (Chapter 6, "Modifying Data Through SQL Programs," discusses how a cursor can also be associated with an INSERT statement.)

The SELECT statement in this example contains an INTO clause. The INTO clause specifies which variables receive data. You can also specify which variables receive data by using the FETCH statement as discussed in "Locating the INTO Clause".

The DECLARE statement is not an active statement; it merely establishes the features of the cursor and allocates storage for it. You can use the cursor declared in the preceding example to read once through the items table. Cursors can be declared to read backward and forward (see "Cursor Input Modes"). This cursor, because it lacks a FOR UPDATE clause and because it is designated FOR READ ONLY, is used only to read data, not to modify it. (The use of cursors to modify data is covered in Chapter 6, "Modifying Data Through SQL Programs.")

Opening a Cursor

The program opens the cursor when it is ready to use it. The OPEN statement activates the cursor. It passes the associated SELECT statement to the database server, which begins the search for matching rows. The database server processes the query to the point of locating or constructing the first row of output. It does not actually return that row of data, but it does set a return code in SQLSTATE and SQLCODE for SQL APIs. The following example shows the OPEN statement:

EXEC SQL OPEN the_item;

Because the database server is seeing the query for the first time, many errors are detected. After the program opens the cursor, it should test SQLSTATE or SQLCODE. If the SQLSTATE value is greater than 02000, or the SQLCODE contains a negative number, the cursor is not usable. An error might be present in the SELECT statement, or some other problem might prevent the database server from executing the statement.

If SQLSTATE is equal to 00000, or SQLCODE contains a zero, the SELECT statement is syntactically valid, and the cursor is ready for use. At this point, however, the program does not know if the cursor can produce any rows.

Fetching Rows

The program uses the FETCH statement to retrieve each row of output. This statement names a cursor and can also name the host variables to receive the data. The following example shows the completed INFORMIX-ESQL/C code:

EXEC SQL DECLARE the_item CURSOR FOR

	SELECT order_num, item_num, stock_num

		INTO :o_num, :i_num, :s_num

		FROM items;

EXEC SQL OPEN the_item;

while(SQLCODE == 0)

{

	EXEC SQL FETCH the_item;

	if(SQLCODE == 0)

		printf("%d, %d, %d", o_num, i_num, s_num);

}

Detecting End of Data

In the previous example, the while condition prevents execution of the loop in case the OPEN statement returns an error. The same condition terminates the loop when SQLCODE is set to 100 to signal the end of data. However, the loop contains a second test of SQLCODE. This test is necessary because, if the SELECT statement is valid yet finds no matching rows, the OPEN statement returns a zero, but the first fetch returns 100, end of data, and no data. The following example shows another way to write the same loop:

EXEC SQL DECLARE the_item CURSOR FOR

	SELECT order_num, item_num, stock_num

	INTO :o_num, :i_num, :s_num 

	FROM items;

EXEC SQL OPEN the_item;

if(SQLCODE == 0)

	EXEC SQL FETCH the_item;        /* fetch 1st row

while(SQLCODE == 0)

{

	printf("%d, %d, %d", o_num, i_num, s_num);

	EXEC SQL FETCH the_item;

}

In this version, the case of zero returned rows is handled early, so no second test of SQLCODE exists within the loop. These versions have no measurable difference in performance because the time cost of a test of SQLCODE is a tiny fraction of the cost of a fetch.

Locating the INTO Clause

The INTO clause names the host variables that are to receive the data returned by the database server. The INTO clause must appear in either the SELECT or the FETCH statement. However it cannot appear in both. The following example specifies host variables in the FETCH statement:

EXEC SQL DECLARE the_item CURSOR FOR

	SELECT order_num, item_num, stock_num

		FROM items;

EXEC SQL OPEN the_item;   

while(SQLCODE == 0)   

{

	EXEC SQL FETCH the_item INTO :o_num, :i_num, :s_num;

	if(SQLCODE == 0)

		printf("%d, %d, %d", o_num, i_num, s_num);

}

This form lets you fetch different rows into different locations. For example, you could use this form to fetch successive rows into successive elements of an array.

Cursor Input Modes

For purposes of input, a cursor operates in one of two modes, sequential or scrolling. A sequential cursor can fetch only the next row in sequence so a sequential cursor can read through a table only once each time the sequential cursor is opened. A scroll cursor can fetch the next row or any prior row, so it can read rows multiple times. The following example shows a sequential cursor declared in INFORMIX-ESQL/C:

EXEC SQL declare pcurs cursor for

	select customer_num, lname, city

		from customer;

After the cursor is opened, it can be used only with a sequential fetch that retrieves the next row of data, as the following example shows.

EXEC SQL fetch p_curs into :cnum, :clname, :ccity;

Each sequential fetch returns a new row.

A scroll cursor is declared with the keywords SCROLL CURSOR, as the following example from INFORMIX-ESQL/C shows:

	EXEC SQL DECLARE s_curs SCROLL CURSOR FOR

   		SELECT order_num, order_date FROM orders

   			WHERE customer_num > 104

Use the scroll cursor with a variety of fetch options. The ABSOLUTE option specifies the rank number of the row to fetch.

	EXEC SQL FETCH ABSOLUTE :numrow s_curs

  		INTO :nordr, :nodat

This statement fetches the row whose position is given in the host variable numrow. You can also fetch the current row again or fetch the first row and then scan through the entire list again. However, these features have a price, as the next section describes.

The Active Set of a Cursor

Once a cursor is opened, it stands for some selection of rows. The set of all rows that the query produces is called the active set of the cursor. It is easy to think of the active set as a well-defined collection of rows and to think of the cursor as pointing to one row of the collection. This situation is true as long as no other programs are modifying the same data concurrently.

Creating the Active Set

When a cursor is opened, the database server does whatever is necessary to locate the first row of selected data. Depending on how the query is phrased, this action can be very easy, or it can require a great deal of work and time. Consider the following declaration of a cursor:

EXEC SQL DECLARE easy CURSOR FOR

	SELECT fname, lname FROM customer

		WHERE state = 'NJ'

Because this cursor queries only a single table in a simple way, the database server quickly determines whether any rows satisfy the query and identifies the first one. The first row is the only row the cursor finds at this time. The rest of the rows in the active set remain unknown. As a contrast, consider the following declaration of a cursor:

EXEC SQL DECLARE hard SCROLL CURSOR FOR

	SELECT C.customer_num, O.order_num, sum (items.total_price)

		FROM customer C, orders O, items I

		WHERE C.customer_num = O.customer_num

			AND O.order_num = I.order_num

			AND O.paid_date is null

		GROUP BY C.customer_num, O.order_num

The active set of this cursor is generated by joining three tables and grouping the output rows. The optimizer might be able to use indexes to produce the rows in the correct order, but generally the use of ORDER BY or GROUP BY clauses requires the database server to generate all the rows, copy them to a temporary table, and sort the table, before it can know which row to present first.

In cases where the active set is entirely generated and saved in a temporary table, the database server can take quite some time to open the cursor. Afterward, it can tell the program exactly how many rows the active set contains. This information is not made available, however. One reason is that you can never be sure which method the optimizer uses. If the optimizer can avoid sorts and temporary tables, it does; but very small changes in the query, in the sizes of the tables, or in the available indexes can change its methods.

int part_list[200];



boom(top_part)

int top_part;

{

	long this_part, child_part;

	int next_to_do = 0, next_free = 1;

	part_list[next_to_do] = top_part;



	EXEC SQL DECLARE part_scan CURSOR FOR

		SELECT child INTO child_part FROM contains

			WHERE parent = this_part;

	while(next_to_do < next_free)

	{

		this_part = part_list[next_to_do];

		EXEC SQL OPEN part_scan;

		while(SQLCODE == 0)

		{

			EXEC SQL FETCH part_scan;

			if(SQLCODE == 0)	

			{

				part_list[next_free] = child_part;

				next_free += 1;

			}

		}

		EXEC SQL CLOSE part_scan;

		next_to_do += 1;

	}

	return (next_free - 1);

}

Technically speaking, each row of the contains table is the head node of a directed acyclic graph, or tree. The function performs a breadth-first search of the tree whose root is the part number passed as its parameter. The function uses a cursor named part_scan to return all the rows with a particular value in the parent column. The innermost while loop opens the part_scan cursor, fetches each row in the selection set, and closes the cursor when the part number of each component has been retrieved.

This function addresses the heart of the parts-explosion problem, but the function is not a complete solution. For example, it does not allow for components that appear at more than one level in the tree. Furthermore, a practical contains table would also have a column count, giving the count of child parts used in each parent. A program that returns a total count of each component part is much more complicated.

The iterative approach described earlier is not the only way to approach the parts-explosion problem. If the number of generations has a fixed limit, you can solve the problem with a single SELECT statement using nested, outer self-joins.

If up to four generations of parts can be contained within one top-level part, the following SELECT statement returns all of them:

SELECT a.parent, a.child, b.child, c.child, d.child

	FROM contains a

		OUTER (contains b,

			OUTER (contains c, outer contains d))

	WHERE a.parent = top_part_number

		AND a.child = b.parent

		AND b.child = c.parent

		AND c.child = d.parent

This SELECT statement returns one row for each line of descent rooted in the part given as top_part_number. Null values are returned for levels that do not exist. (Use indicator variables to detect them.) To extend this solution to more levels, select additional nested outer joins of the contains table.You can also revise this solution to return counts of the number of parts at each level.

Informix Guide to SQL: TutorialChapter 5: Programming with SQL Home Contents Index Master Index New Book

Retrieving Multiple Rows

Detecting End of Data

Informix Guide to SQL: Tutorial
Chapter 5: Programming with SQL

Home Contents Index Master Index New Book