Java objects

In this chapter we will review the structure of a Java program
and use some of the classes that are provided in the Java libraries.

Program Structure

A Java program is a set of class definitions.  One class definition
is designated as the startup class.  It must contain a method
named main that is where the execution of the program begins.

A minimal Java program consists of a class definition with a single
method definition:

verbatim
class Hello {

  // main: generate some simple output

  public static void main (String[] args) {
    System.out.println ("Hello, world.");
  }
}
verbatim

Some people judge the quality of a programming language by
the simplicity of the ``Hello, World.'' program.  By this
standard, Java does not do very well.

Although this program only defines one class, named Hello,
it is free to use the built-in Java classes.  This program
uses the class named System which contains an object
named out on which we invoke a method named println.
This method has the effect of printing a String.

Packages

The built-in Java classes are divided into a number of packages,
including java.lang, which contains many of the most commonly
used classes (like System), and java.awt, the Java Abstract
Window Toolkit (AWT), which contains classes for windows, buttons,
graphics, etc.

In order to use a package, you have to import it; for example,
the statement import java.awt.Point imports a class named
Point from the AWT.  The classes in java.lang are imported
automatically, which is why the Hello class didn't require an
import statement.

All import statements appear at the beginning of the program,
outside the class definition.

Point objects

At the most basic level, a point is two numbers (coordinates)
that we treat collectively as a single object.  In mathematical
notation, points are often written in parentheses, with a comma
separating the coordinates.  For example, (0, 0) indicates
the origin, and (x, y) indicates the point x units to the
right and y units up from the origin.

In Java, a point is represented by a Point object.  To
create a new point, you have to use the new command:

verbatim
    Point blank;
    blank = new Point (3, 4);
verbatim

The first line is a conventional variable declaration: blank has
type Point.  The second line invokes the new command,
specifies the type of the new object, and provides arguments.  It will
probably not surprise you that the arguments are the coordinates of
the new point, (3, 4).

The result of the new command is a reference to the new
point.  This reference is assigned to the variable blank.
A standard way to diagram this assignment is shown in the figure.

[Figure: reference.eps]

As usual, the name of the variable blank appears outside the box
and its value appears inside the box.  In this case, the value is a
reference, which is shown graphically with a dot and an arrow.  The
arrow points to the object that is referred to.

The big box shows the newly-created object with two values
in it.  The names x and y are the names of the 
instance variables.

Taken together, all the variables, values, and objects in a
program are called the state.  Diagrams like this that
show the state of the program are called state diagrams.
As the program runs, the state changes, so you should think
of a state diagram as a snapshot of a particular point in the
execution.

Instance variables

The pieces of data that make up an object are sometimes called
components, records, or fields.  In Java they are called instance
variables because each object, which is an instance of its
type, has its own copy of the instance variables.

It's like the glove compartment of a car.  Each car is an instance
of the type ``car,'' and each car has its own glove compartment.  If
you asked me to get something from the glove compartment of your car,
you would have to tell me which car is yours.

Similarly, if you want to read a value from an instance variable, you
have to specify the object you want to get it from.  In Java this is
done using the dot operator.

verbatim
    int x = blank.x;
verbatim

The expression blank.x means ``go to the object blank
refers to, and get the value of x.''  In this case we assign
the value to a local variable named x.  Notice that there is no
conflict between the local variable named x and the instance
variable named x.  The purpose of the dot operator is to identify
which variable you are referring to unambiguously.

You can use the dot operator as part of any Java expression, so the
following are legal.

verbatim
    System.out.println (blank.x + ", " + blank.y);
    int distance = blank.x * blank.x + blank.y * blank.y;
verbatim

The first line prints 3, 4; the second line calculates
the value 25.
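
To try these fragments, they can be wrapped in a small program.  The
following is a minimal sketch; the class name PointDemo and the helper
sumOfSquares are made up for illustration and do not appear elsewhere
in this book.

```java
import java.awt.Point;

class PointDemo {

  // sumOfSquares: reads both instance variables with the dot operator
  public static int sumOfSquares (Point p) {
    return p.x * p.x + p.y * p.y;
  }

  public static void main (String[] args) {
    Point blank = new Point (3, 4);

    // the local variable x is distinct from the instance variable blank.x
    int x = blank.x;
    System.out.println (x);                         // prints 3

    System.out.println (blank.x + ", " + blank.y);  // prints 3, 4
    System.out.println (sumOfSquares (blank));      // prints 25
  }
}
```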

Objects as parameters

You can pass objects as parameters in the usual way.  For
example

verbatim
  public static void printPoint (Point p) {
    System.out.println ("(" + p.x + ", " + p.y + ")");
  }
verbatim

is a method that takes a point as an argument and prints it in
the standard format.  If you invoke printPoint (blank),
it will print (3, 4).  Actually, Java has a built-in
method for printing Points.  If you invoke
System.out.println (blank), you get

verbatim
java.awt.Point[x=3,y=4]
verbatim

This is a standard format Java uses for printing objects.  It prints
the name of the type, followed by the contents of the object,
including the names and values of the instance variables.

As a second example, the method distance 
takes two Points as parameters and calculates the distance
between them.

verbatim
  public static double distance (Point p1, Point p2) {
    double dx = (double)(p2.x - p1.x);
    double dy = (double)(p2.y - p1.y);
    return Math.sqrt (dx*dx + dy*dy);
  }
verbatim

The typecasts are not really necessary; I just added them as a
reminder that the instance variables in a Point are integers.
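
To see that the typecasts can be dropped, here is a sketch of the same
method without them, wrapped in a program that can be run.  The class
name DistanceDemo is made up for illustration.  Java converts the
int differences to double automatically when they are assigned.

```java
import java.awt.Point;

class DistanceDemo {

  // distance: same calculation, relying on automatic conversion
  // from int to double instead of explicit typecasts
  public static double distance (Point p1, Point p2) {
    double dx = p2.x - p1.x;
    double dy = p2.y - p1.y;
    return Math.sqrt (dx*dx + dy*dy);
  }

  public static void main (String[] args) {
    Point origin = new Point (0, 0);
    Point blank = new Point (3, 4);
    System.out.println (distance (origin, blank));   // prints 5.0
  }
}
```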

Rectangles

Rectangles are similar to points, except that they have four
instance variables, named x, y, width and 
height.  Other than that, everything is pretty much the same.

verbatim
    Rectangle box = new Rectangle (0, 0, 100, 200);
verbatim

creates a new Rectangle object and makes box refer to it.
The figure shows the effect of this assignment.

[Figure: rectangle.eps]

If you print box, you get

verbatim
java.awt.Rectangle[x=0,y=0,width=100,height=200]
verbatim

Again, this is the result of a built-in Java method that knows how
to print Rectangle objects.

Objects as return types

You can write methods that return objects.  For example,
findCenter takes a Rectangle as an argument and
returns a Point that contains the coordinates of the
center of the Rectangle:

verbatim
  public static Point findCenter (Rectangle box) {
    int x = box.x + box.width/2;
    int y = box.y + box.height/2;
    return new Point (x, y);
  }
verbatim

Notice that you can use new to create a new object,
and then immediately use the result as a return value.
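
As a usage sketch, the following program applies findCenter to the
Rectangle from the previous section.  The class name CenterDemo is
made up for illustration.

```java
import java.awt.Point;
import java.awt.Rectangle;

class CenterDemo {

  // findCenter: returns a new Point at the center of box
  public static Point findCenter (Rectangle box) {
    int x = box.x + box.width/2;
    int y = box.y + box.height/2;
    return new Point (x, y);
  }

  public static void main (String[] args) {
    Rectangle box = new Rectangle (0, 0, 100, 200);
    Point center = findCenter (box);
    System.out.println (center);   // prints java.awt.Point[x=50,y=100]
  }
}
```

Because the divisions are integer divisions, a Rectangle with an odd
width or height gets a center that is rounded toward the upper-left
corner.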

Objects are mutable

You can change the contents of an object by making an assignment
to one of its instance variables.  For example, to ``move''
a rectangle without changing its size, you could modify the
x and y values:

verbatim
    box.x = box.x + 50;
    box.y = box.y + 100;
verbatim

The result is shown in the figure:

[Figure: rectangle2.eps]

We could take this code and encapsulate it in a method, and
generalize it to move the rectangle by any amount:

verbatim
  public static void moveRect (Rectangle box, int dx, int dy) {
    box.x = box.x + dx;
    box.y = box.y + dy;
  }
verbatim

The variables dx and dy indicate how far to move the
rectangle in each direction.  Invoking this method has the effect of
modifying the Rectangle that is passed as an argument.

verbatim
    Rectangle box = new Rectangle (0, 0, 100, 200);
    moveRect (box, 50, 100);
    System.out.println (box);
verbatim

prints java.awt.Rectangle[x=50,y=100,width=100,height=200].

Modifying objects by passing them as arguments to methods can be
useful, but it can also make debugging more difficult because it is
not always clear which method invocations do or do not modify their
arguments.  Later, I will discuss some pros and cons of this
programming style.

In the meantime, we can enjoy the luxury of Java's built-in
methods, which include translate, which does exactly
the same thing as moveRect, although the syntax for
invoking it is a little different.  Instead of passing the
Rectangle as an argument, we invoke translate 
on the Rectangle and pass only dx and dy
as arguments.

verbatim
    box.translate (50, 100);
verbatim

The effect is exactly the same.
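
To check this claim, the following sketch moves two identical
Rectangles, one with moveRect and one with translate, and compares
the results.  The class name MoveDemo and the variable names are made
up for illustration.

```java
import java.awt.Rectangle;

class MoveDemo {

  // moveRect: modifies the Rectangle that is passed as an argument
  public static void moveRect (Rectangle box, int dx, int dy) {
    box.x = box.x + dx;
    box.y = box.y + dy;
  }

  public static void main (String[] args) {
    Rectangle box1 = new Rectangle (0, 0, 100, 200);
    Rectangle box2 = new Rectangle (0, 0, 100, 200);

    moveRect (box1, 50, 100);     // Rectangle passed as an argument
    box2.translate (50, 100);     // method invoked on the Rectangle

    System.out.println (box1);
    System.out.println (box2);
    System.out.println (box1.equals (box2));   // prints true
  }
}
```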

Aliasing

Remember that when you make an assignment to an object variable, you
are assigning a reference to an object.  It is possible to have
multiple variables that refer to the same object.  For example,
this code:

verbatim
    Rectangle box1 = new Rectangle (0, 0, 100, 200);
    Rectangle box2 = box1;
verbatim

generates a state diagram that looks like this:

[Figure: aliasing.eps]

Both box1 and box2 refer or ``point'' to the same object.
In other words, this object has two names, box1 and box2.
When a person uses two names, it's called aliasing.  Same thing
with objects.

When two variables are aliased, any changes that affect one
variable also affect the other.  For example:

verbatim
    System.out.println (box2.width);
    box1.grow (50, 50);
    System.out.println (box2.width);
verbatim

The first line prints 100, which is the width of the
Rectangle referred to by box2.  The second
line invokes the grow method on box1, which
expands the Rectangle by 50 pixels in every direction
(see the documentation for more details).  The effect
is shown in the figure:

[Figure: aliasing2.eps]

As should be clear from this figure, whatever changes are
made to box1 also apply to box2.  Thus, the
value printed by the third line is 200, the width of
the expanded rectangle.  (As an aside, it is perfectly legal
for the coordinates of a Rectangle to be negative.)

As you can tell even from this simple example, code that
involves aliasing can get confusing fast, and it can be
difficult to debug.  In general, aliasing should be avoided
or used with care.
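
One way to avoid aliasing is to copy the object instead of copying the
reference.  The sketch below uses the Rectangle constructor that takes
another Rectangle as its argument; the class name AliasDemo and the
helper widthAfterGrowing are made up for illustration.

```java
import java.awt.Rectangle;

class AliasDemo {

  // widthAfterGrowing: grows box, then reports the width of other
  public static int widthAfterGrowing (Rectangle box, Rectangle other) {
    box.grow (50, 50);
    return other.width;
  }

  public static void main (String[] args) {
    // alias: both variables refer to the same object
    Rectangle box1 = new Rectangle (0, 0, 100, 200);
    Rectangle box2 = box1;
    System.out.println (widthAfterGrowing (box1, box2));   // prints 200

    // copy: a separate object that starts with the same contents
    Rectangle box3 = new Rectangle (0, 0, 100, 200);
    Rectangle copy = new Rectangle (box3);
    System.out.println (widthAfterGrowing (box3, copy));   // prints 100
  }
}
```

The copy costs a little space and time, but changes made through one
variable no longer show up through the other.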

null

When you create an object variable, remember that you are
creating a reference to an object.  Until you make
the variable point to an object, the value of the variable
is null.  null is a special value in Java (and
a Java keyword) that is used to mean ``no object.''

The declaration Point blank; is equivalent to this
initialization

verbatim
    Point blank = null;
verbatim

and is shown in the following state diagram:

[Figure: reference2.eps]

The value null is represented by a dot with no arrow.

If you try to use a null object, either by accessing an instance
variable or invoking a method, you will get a 
NullPointerException.  The system will print an error message
and terminate the program.

verbatim
    Point blank = null;
    int x = blank.x;              // NullPointerException
    blank.translate (50, 50);     // NullPointerException
verbatim

On the other hand, it is legal to pass a null object as an argument or
receive one as a return value.  In fact, it is common to do so, for
example to represent an empty set or indicate an error condition.
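
When a method might receive null, a common defensive pattern is to
test for it before using the dot operator.  Here is a minimal sketch;
the class name NullDemo and the helper describe are made up for
illustration.

```java
import java.awt.Point;

class NullDemo {

  // describe: returns a printable form of p, treating null as "no point"
  public static String describe (Point p) {
    if (p == null) {
      return "no point";
    }
    return "(" + p.x + ", " + p.y + ")";
  }

  public static void main (String[] args) {
    Point blank = new Point (3, 4);
    System.out.println (describe (blank));   // prints (3, 4)

    blank = null;
    System.out.println (describe (blank));   // prints no point
  }
}
```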

Garbage collection

In the section on aliasing we talked about what happens when
more than one variable refers to the same object.  What happens
when no variable refers to an object?  For example:

verbatim
    Point blank = new Point (3, 4);
    blank = null;
verbatim

The first line creates a new Point object and makes
blank refer to it.  The second line changes blank
so that instead of referring to the object, it refers to
nothing (the null object).

[Figure: reference3.eps]

If no one refers to an object, then no one can read or write any of
its values, or invoke a method on it.  In effect, it ceases to exist.
We could keep the object in memory, but it would only waste space, so
periodically as your program runs, the Java system looks for stranded
objects and reclaims them, in a process called garbage
collection.  Later, the memory space occupied by the object will
be available to be used as part of a new object.

You don't have to do anything to make garbage collection work,
and in general you will not be aware of it.

Objects and primitives

There are two kinds of types in Java, primitive types and
object types.  Primitives, like int and boolean,
begin with lower-case letters; object types begin with
upper-case letters.  This distinction is useful because it
reminds us of some of the differences between them:

itemize

When you declare a primitive variable, you get storage
space for a primitive value.  When you declare an object variable, you
get a space for a reference to an object.  In order to get space for
the object itself, you have to use the new command.

If you don't initialize an instance variable that has a primitive
type, it is given a default value that depends on the type.  For
example, 0 for ints and false for booleans.
The default value for object types is null, which indicates
no object.

Primitive variables are well isolated in the sense that there is
nothing you can do in one method that will affect a variable in
another method.  Object variables can be tricky to work with because
they are not as well isolated.  If you pass a reference to an object
as an argument, the method you invoke might modify the object, in which
case you will see the effect.  The same is true when you invoke a
method on an object.  Of course, that can be a good thing, but you
have to be aware of it.

itemize
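
The difference in isolation can be seen in a short experiment.  This
is a sketch only; the class name IsolationDemo and the methods addOne
and moveRight are made up for illustration.

```java
import java.awt.Point;

class IsolationDemo {

  // addOne: changes only this method's own copy of n
  public static void addOne (int n) {
    n = n + 1;
  }

  // moveRight: changes the object that the caller's variable refers to
  public static void moveRight (Point p) {
    p.x = p.x + 1;
  }

  public static void main (String[] args) {
    int n = 3;
    addOne (n);
    System.out.println (n);     // prints 3; the caller's n is unaffected

    Point p = new Point (3, 4);
    moveRight (p);
    System.out.println (p.x);   // prints 4; the object was modified
  }
}
```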

There is one other difference between primitives and object
types.  You cannot add new primitives to the Java language
(unless you get yourself on the standards committee), but
you can create new object types.  We'll see how in the next
chapter.

Glossary

description

[package:]  A collection of classes.  The built-in Java
classes are organized in packages.

[AWT:]  The Abstract Window Toolkit, one of the biggest
and most commonly-used Java packages.

[instance:]  An example from a category.  My cat is an
instance of the category ``feline things.''  Every object is
an instance of some class.

[instance variable:]  One of the named data items that make
up an object.  Each object (instance) has its own copy of
the instance variables for its class.

[reference:]  A value that indicates an object.  In a
state diagram, a reference appears as an arrow.

[aliasing:] The condition when two or more variables refer
to the same object.

[garbage collection:]  The process of finding objects that
have no references and reclaiming their storage space.

[state:] A complete description of all the variables and
objects and their values, at a given point during the execution
of a program.

[state diagram:] A snapshot of the state of a program, shown
graphically.

description


User-defined objects

Class definitions and object types

Every time you write a class definition, you create a new
object type, with the same name as the class.  In the previous
chapter, when we defined the class named Hello,
we also created an object type named Hello.  We
didn't create any variables with type Hello, and we
didn't use the new command to create any Hello
objects, but we could have!

That example may not make any sense, since there is no
reason to create a Hello object, and it is not clear
what it would be good for if we did.  In this chapter, we
will look at some examples of class definitions that create
useful new object types.

Here are the most important ideas in this chapter:

itemize

Defining a new class also creates a new object type
with the same name.

A class definition is like a template for objects:
it determines what instance variables the objects have and
what methods can operate on them.

Every object belongs to some object type; hence, it
is an instance of some class.

When you invoke the new command to create an object, Java
invokes a special method called a constructor to initialize the
instance variables.  You provide one or more constructors as part of
the class definition.

Typically all the methods that operate on a type go in the
class definition for that type.

itemize

Here are some syntax issues about class definitions:

itemize

By convention, class names (and hence object types) begin with a
capital letter, which helps distinguish them from primitive types and
variable names.

You usually put one class definition in each file, and the name
of the file must be the same as the name of the class, with the suffix
.java.  For example, the Time class is defined in the file
named Time.java.

In any program, one class is designated as the startup
class.  The startup class must contain a method named main, which
is where the execution of the program begins.  Other classes may
have a method named main, but they will not be executed.

itemize

With those issues out of the way, let's look at an example of
a user-defined type, Time.

Time

A common motivation for creating a new object type is to take several
related pieces of data and encapsulate them into an object that can be
manipulated (passed as an argument, operated on) as a single unit.  We
have already seen two built-in types like this, Point and
Rectangle.

Another example, which we will implement ourselves, is Time,
which is used to record the time of day.  The various pieces of
information that form a time are the hour, minute and second.  Because
every Time object will contain these data, we need to create
instance variables to hold them.

The first step is to decide what type each variable should be.  It
seems clear that hour and minute should be integers.  Just
to keep things interesting, let's make second a double, so
we can record fractions of a second.

Instance variables are declared at the beginning of the class
definition, outside of any method definition, like this:

verbatim
class Time {
  int hour, minute;
  double second;
}
verbatim

All by itself, this code fragment is a legal class definition.  The
state diagram for a Time object would look like this:

[Figure: time.eps]

After declaring the instance variables, the next step is usually
to define a constructor for the new class.

Constructors

The usual role of a constructor is to initialize the instance
variables.  The syntax for constructors is similar to that
of other methods, with three exceptions:

itemize

The name of the constructor is the same as the name of
the class.

Constructors have no return type and no return value.

The keyword static is omitted.

itemize

Here is an example for the Time class:

verbatim
  public Time () {
    this.hour = 0;
    this.minute = 0;
    this.second = 0.0;
  }
verbatim

Notice that where you would expect to see a return type,
between public and Time, there is nothing.  That's
how we (and the compiler) can tell that this is a constructor.

This constructor does not take any arguments, as indicated by the
empty parentheses ().  Each line of the constructor initializes
an instance variable to an arbitrary default value (in this case,
midnight).  The name this is a special
keyword that is the name of the object we are creating.  You can use
this the same way you use the name of any other object.  For
example, you can read and write the instance variables of this,
and you can pass this as an argument to other methods.

But you do not declare this and you do not use new
to create it.  In fact, you are not even allowed to make an
assignment to it!  this is created by the system; all you
have to do is store values in its instance variables.

A common error when writing constructors is to put a return
statement at the end.  Resist the temptation.

More constructors

Constructors can be overloaded, just like other methods,
which means that you can provide multiple constructors
with different parameters.  Java knows which constructor
to invoke by matching the arguments of the new
command with the parameters of the constructors.

It is very common to have one constructor that takes no
arguments (shown above), and one constructor that takes
a parameter list that is identical to the list of instance
variables.  For example:

verbatim
  public Time (int hour, int minute, double second) {
    this.hour = hour;
    this.minute = minute;
    this.second = second;
  }
verbatim

The names and types of the parameters are exactly the same as
the names and types of the instance variables.  All the
constructor does is copy the information from the parameters
to the instance variables.

If you go back and look at the documentation for Points
and Rectangles, you will see that both classes provide
constructors like this.  Overloading constructors provides the
flexibility to create an object first and then fill in the
blanks, or to collect all the information before creating
the object.

So far this might not seem very interesting, and in fact it
is not.  Writing constructors is a boring, mechanical process.
Once you have written two, you will find that you can churn them
out in your sleep, just by looking at the list of instance
variables.

Creating a new object

Although constructors look like methods, you never invoke them
directly.  Instead, when you use the new command, the system
allocates space for the new object and then 
invokes your constructor to initialize the instance variables.

The following program demonstrates two ways to create and
initialize Time objects:

verbatim
class Time {
  int hour, minute;
  double second;

  public Time () {
    this.hour = 0;
    this.minute = 0;
    this.second = 0.0;
  }

  public Time (int hour, int minute, double second) {
    this.hour = hour;
    this.minute = minute;
    this.second = second;
  }

  public static void main (String[] args) {

    // one way to create and initialize a Time object
    Time t1 = new Time ();
    t1.hour = 11;
    t1.minute = 8;
    t1.second = 3.14159;
    System.out.println (t1);

    // another way to do the same thing
    Time t2 = new Time (11, 8, 3.14159);
    System.out.println (t2);
  }
}
verbatim

As an exercise, figure out the flow of execution through this
program.

In main, the first time we invoke the new command,
we provide no arguments, so Java invokes the first constructor.
The next few lines assign values to each of the instance
variables.

The second time we invoke the new command, we provide
arguments that match the parameters of the second constructor.
This way of initializing the instance variables is more concise
(and slightly more efficient), but it can be harder to read, since
it is not as clear which values are assigned to which instance
variables.

Printing an object

The output of this program is:

verbatim
Time@80cc7c0
Time@80cc807
verbatim

When Java prints the value of a user-defined object type, it prints
the name of the type and a special hexadecimal (base 16) code that is
unique for each object.  This code is not meaningful in itself; in
fact, it can vary from machine to machine and even from run to run.
But it can be useful for debugging, in case you want to keep track of
individual objects.

In order to print objects in a way that is more meaningful to users
(as opposed to programmers), you usually want to write a method
called something like printTime:

verbatim
  public static void printTime (Time t) {
    System.out.println (t.hour + ":" + t.minute + ":" + t.second);
  }
verbatim

The output of this method, if we pass either t1 or t2 as
an argument, is 11:8:3.14159.  Although this is recognizable
as a time, it is not quite in the standard format.  For example, if
the number of minutes or seconds is less than 10, we expect a leading
0 as a place-keeper.  Also, we might want to drop the decimal
part of the seconds.  In other words, we want something like
11:08:03.

In most languages, there are simple ways to control the output format
for numbers.  Early versions of Java had no simple ways; Java 5 added
System.out.printf and String.format, which use C-style format strings.

Java provides very powerful tools for printing formatted things
like times and dates, and also for interpreting formatted input.
Unfortunately, these tools are not very easy to use, so I am going to
leave them out of this book.  If you want, though, you can take a look
at the documentation for the Date class in the 
java.util package.
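
For the simpler job of padding numbers, String.format (available
since Java 5) may be enough.  The following is a minimal sketch, not
part of the Time class developed in this chapter; the class name
FormatDemo and the helper formatTime are made up for illustration.
Note that it truncates the seconds rather than rounding them.

```java
class FormatDemo {

  // formatTime: pads each field to two digits with leading zeros
  // and drops the fractional part of the seconds
  public static String formatTime (int hour, int minute, double second) {
    return String.format ("%02d:%02d:%02d", hour, minute, (int) second);
  }

  public static void main (String[] args) {
    System.out.println (formatTime (11, 8, 3.14159));   // prints 11:08:03
  }
}
```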

Operations on objects

Even though we can't print times in an optimal format, we can still
write methods that manipulate Time objects.  In the next few
sections, I will demonstrate several of the possible interfaces for
methods that operate on objects.  For some operations, you will have a
choice of several possible interfaces, so you should consider the pros
and cons of each of these:

description

[pure function:]  Takes objects and/or primitives as
arguments but does not modify the objects.  The return value is
either a primitive or a new object created inside the method.

[modifier:]  Takes objects as arguments and modifies some
or all of them.  Often returns void.

[fill-in method:]  One of the arguments is an ``empty''
object that gets filled in by the method.  Technically, this is
a type of modifier.

description

Pure functions

A method is considered a pure function if the result depends only on
the arguments, and it has no side effects like modifying an argument
or printing something.  The only result of invoking a pure function is
the return value.

One example is after, which compares two Times and
returns a boolean that indicates whether the first operand
comes after the second:

verbatim
  public static boolean after (Time time1, Time time2) {
    if (time1.hour > time2.hour) return true;
    if (time1.hour < time2.hour) return false;

    if (time1.minute > time2.minute) return true;
    if (time1.minute < time2.minute) return false;

    if (time1.second > time2.second) return true;
    return false;
  }
verbatim

What is the result of this method if the two times are equal?  Does
that seem like the appropriate result for this method?  If you were
writing the documentation for this method, would you mention that case
specifically?
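
A small test program makes it easy to answer questions like these by
experiment.  The sketch below repeats after inside a stripped-down
Time class that has only the three-argument constructor.  The main
method is made up for illustration; it tries the equal-times case in
both orders and leaves the output for you to inspect.

```java
class Time {
  int hour, minute;
  double second;

  public Time (int hour, int minute, double second) {
    this.hour = hour;
    this.minute = minute;
    this.second = second;
  }

  // after: compares hours first, then minutes, then seconds
  public static boolean after (Time time1, Time time2) {
    if (time1.hour > time2.hour) return true;
    if (time1.hour < time2.hour) return false;

    if (time1.minute > time2.minute) return true;
    if (time1.minute < time2.minute) return false;

    if (time1.second > time2.second) return true;
    return false;
  }

  public static void main (String[] args) {
    // two distinct objects that represent the same time of day
    Time t1 = new Time (11, 8, 3.14159);
    Time t2 = new Time (11, 8, 3.14159);

    System.out.println (after (t1, t2));
    System.out.println (after (t2, t1));
  }
}
```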

A second example is addTime, which calculates the sum of two
times.  For example, if it is 9:14:30, and your breadmaker takes
3 hours and 35 minutes, you could use addTime to figure out when
the bread will be done.

Here is a rough draft of this method that is not quite right:

verbatim
  public static Time addTime (Time t1, Time t2) {
    Time sum = new Time ();
    sum.hour = t1.hour + t2.hour;
    sum.minute = t1.minute + t2.minute;
    sum.second = t1.second + t2.second;
    return sum;
  }
verbatim

Although this method returns a Time object, it is not
a constructor.  You should go back and compare the syntax of
a method like this with the syntax of a constructor, because
it is easy to get confused.

Here is an example of how to use this method.  If currentTime
contains the current time and breadTime contains the amount
of time it takes for your breadmaker to make bread, then you
could use addTime to figure out when the bread will be
done.

verbatim
    Time currentTime = new Time (9, 14, 30.0);
    Time breadTime = new Time (3, 35, 0.0);
    Time doneTime = addTime (currentTime, breadTime);
    printTime (doneTime);
verbatim

The output of this program is 12:49:30.0, which is
correct.  On the other hand, there are cases where the result
is not correct.  Can you think of one?

The problem is that this method does not deal with cases
where the number of seconds or minutes adds up to 60 or
more.  In that case, we have to ``carry'' the extra seconds
into the minutes column, or extra minutes into the hours
column.

Here's a second, corrected version of this method.

verbatim
  public static Time addTime (Time t1, Time t2) {
    Time sum = new Time ();
    sum.hour = t1.hour + t2.hour;
    sum.minute = t1.minute + t2.minute;
    sum.second = t1.second + t2.second;

    if (sum.second >= 60.0) {
      sum.second -= 60.0;
      sum.minute += 1;
    }
    if (sum.minute >= 60) {
      sum.minute -= 60;
      sum.hour += 1;
    }
    return sum;
  }
verbatim

Although it's correct, it's starting to get big.  Later,
I will suggest an alternate approach to this problem that
will be much shorter.

This code demonstrates two operators you may not have seen,
+= and -=.  These operators provide a concise
way to increment and decrement variables.  They are similar
to ++ and --, except (1) they work on doubles
as well as ints, and (2) the amount of the increment
does not have to be 1.  The statement sum.second -= 60.0;
is equivalent to sum.second = sum.second - 60;

Modifiers

As an example of a modifier, consider increment,
which adds a given number of seconds to a Time object.
Again, a rough draft of this method looks like:

verbatim
  public static void increment (Time time, double secs) {
    time.second += secs;

    if (time.second >= 60.0) {
      time.second -= 60.0;
      time.minute += 1;
    }
    if (time.minute >= 60) {
      time.minute -= 60;
      time.hour += 1;
    }
  }
verbatim

The first line performs the basic operation; the remainder
deals with the same cases we saw before.

Is this method correct?  What happens if the argument secs
is much greater than 60?  In that case, it is not enough to
subtract 60 once; we have to keep doing it until second
is below 60.  We can do that by simply replacing the if
statements with while statements:

verbatim
  public static void increment (Time time, double secs) {
    time.second += secs;

    while (time.second >= 60.0) {
      time.second -= 60.0;
      time.minute += 1;
    }
    while (time.minute >= 60) {
      time.minute -= 60;
      time.hour += 1;
    }
  }
verbatim

This solution is correct, but not very efficient.
Can you think of a solution that does not require iteration?

Fill-in methods

Occasionally you will see methods like addTime written
with a different interface (different arguments and return values).
Instead of creating a new object every time addTime is
invoked, we could require the caller to provide an ``empty''
object where addTime should store the result.  Compare
the following with the previous version:

verbatim
  public static void addTimeFill (Time t1, Time t2, Time sum) {
    sum.hour = t1.hour + t2.hour;
    sum.minute = t1.minute + t2.minute;
    sum.second = t1.second + t2.second;

    if (sum.second >= 60.0) {
      sum.second -= 60.0;
      sum.minute += 1;
    }
    if (sum.minute >= 60) {
      sum.minute -= 60;
      sum.hour += 1;
    }
  }
verbatim

One advantage of this approach is that the caller has the option
of reusing the same object repeatedly to perform a series
of additions.  This can be slightly more efficient, although
it can be confusing enough to cause subtle errors.
For the vast majority of programming, it is worth
spending a little run time to avoid a lot of debugging time.

Which is best?

Anything that can be done with modifiers and fill-in methods can also
be done with pure functions.  In fact, there are programming
languages, called functional programming languages, that only
allow pure functions.  Some programmers believe that programs that use
pure functions are faster to develop and less error-prone than
programs that use modifiers.  Nevertheless, there are times when
modifiers are convenient, and some cases where functional programs
are less efficient.

In general, I recommend that you write pure functions whenever
it is reasonable to do so, and resort to modifiers only if there
is a compelling advantage.  This approach might be called a
functional programming style.

Incremental development vs. planning

In this chapter I have demonstrated an approach to program
development I refer to as rapid prototyping with iterative
improvement.  In each case, I wrote a rough draft (or prototype)
that performed the basic calculation, and then tested it on
a few cases, correcting flaws as I found them.

Although this approach can be effective, it can lead to code
that is unnecessarily complicated---since it deals with many
special cases---and unreliable---since it is hard to convince
yourself that you have found all the errors.

An alternative is high-level planning, in which a little insight
into the problem can make the programming much easier.  In
this case the insight is that a Time is really a three-digit
number in base 60!  The second is the ``ones column,''
the minute is the ``60's column'', and the hour
is the ``3600's column.''

When we wrote addTime and increment, we were effectively
doing addition in base 60, which is why we had to ``carry'' from one
column to the next.

Thus an alternate approach to the whole problem is to convert
Times into doubles and take advantage of the fact that
the computer already knows how to do arithmetic with doubles.
Here is a method that converts a Time into a double:

verbatim
  public static double convertToSeconds (Time t) {
    int minutes = t.hour * 60 + t.minute;
    double seconds = minutes * 60 + t.second;
    return seconds;
  }
verbatim

Now all we need is a way to convert from a double
to a Time object.  We could write a method to
do it, but it might make more sense to write it as a third
constructor:

verbatim
  public Time (double secs) {
    this.hour = (int) (secs / 3600.0);
    secs -= this.hour * 3600.0;
    this.minute = (int) (secs / 60.0);
    secs -= this.minute * 60;
    this.second = secs;
  }
verbatim

This constructor is a little different from the others, since
it involves some calculation along with assignments to the
instance variables.

You might have to think a bit to convince yourself that the technique
I am using to convert from one base to another is correct.  Assuming
you are convinced, we can use these methods to rewrite addTime:

verbatim
  public static Time addTime (Time t1, Time t2) {
    double seconds = convertToSeconds (t1) + convertToSeconds (t2);
    return new Time (seconds);
  }
verbatim

This is much shorter than the original version, and it is much easier
to demonstrate that it is correct (assuming, as usual, that the
methods it invokes are correct).  As an exercise, rewrite 
increment the same way.


Generalization

In some ways converting from base 60 to base 10 and back is
harder than just dealing with times.  Base conversion is more
abstract; our intuition for dealing with times is better.

But if we have the insight to treat times as base 60 numbers,
and make the investment of writing the conversion methods
(convertToSeconds and the third constructor), we get
a program that is shorter, easier to read and debug, and more
reliable.

It is also easier to add more features later.  For example, imagine
subtracting two Times to find the duration between them.  The
naive approach would be to implement subtraction complete with
``borrowing.''  Using the conversion methods would be much easier.
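
To make that concrete, here is a minimal sketch of subtraction built
on the conversion methods.  The names TimeMath and subtractTime are
hypothetical, the nested Time class is a cut-down copy of the one in
this chapter, and the sketch assumes the first Time is not earlier than
the second.

```java
// Sketch: the duration between two Times, computed by converting to seconds.
// TimeMath and subtractTime are hypothetical names for this example.
class TimeMath {

  static class Time {
    int hour, minute;
    double second;

    Time (int hour, int minute, double second) {
      this.hour = hour;  this.minute = minute;  this.second = second;
    }

    // the third constructor from the text: build a Time from seconds
    Time (double secs) {
      this.hour = (int) (secs / 3600.0);
      secs -= this.hour * 3600.0;
      this.minute = (int) (secs / 60.0);
      secs -= this.minute * 60;
      this.second = secs;
    }
  }

  public static double convertToSeconds (Time t) {
    int minutes = t.hour * 60 + t.minute;
    double seconds = minutes * 60 + t.second;
    return seconds;
  }

  // Assumes t1 is not earlier than t2, so the difference is not negative.
  public static Time subtractTime (Time t1, Time t2) {
    double seconds = convertToSeconds (t1) - convertToSeconds (t2);
    return new Time (seconds);
  }

  public static void main (String[] args) {
    Time end = new Time (11, 8, 15.0);
    Time start = new Time (9, 14, 30.0);
    Time d = subtractTime (end, start);
    System.out.println (d.hour + ":" + d.minute + ":" + d.second);  // prints 1:53:45.0
  }
}
```

No borrowing logic appears anywhere; the base conversion does all of
the work.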

Ironically, sometimes making a problem harder (more general)
makes it easier (fewer special cases, fewer opportunities for error).

Algorithms

When you write a general solution for a class of problems, as opposed
to a specific solution to a single problem, you have written an 
algorithm.  This word is not easy to define, so I will try a couple
of approaches.

First, consider some things that are not algorithms.  For example,
when you learned to multiply single-digit numbers, you probably
memorized the multiplication table.  In effect, you memorized 100
specific solutions, so that knowledge is not really algorithmic.

But if you were ``lazy,'' you probably cheated by learning a few
tricks.  For example, to find the product of n and 9, you can
write n-1 as the first digit and 10-n as the second digit.  This
trick is a general solution for multiplying any single-digit number by 9.
That's an algorithm!
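
The trick can be written down mechanically: for a single digit n from
1 to 9, the first digit of the product is n-1 and the second digit is
10-n.  The following sketch (the names NineTrick and timesNine are made
up for this example) builds the product that way and prints the whole
table.

```java
// Sketch of the times-nine trick; NineTrick and timesNine are hypothetical names.
class NineTrick {

  // Product of n and 9, built digit by digit.  Assumes n is between 1 and 9.
  public static int timesNine (int n) {
    int firstDigit = n - 1;
    int secondDigit = 10 - n;
    return firstDigit * 10 + secondDigit;
  }

  public static void main (String[] args) {
    for (int n = 1; n <= 9; n++) {
      System.out.println (n + " * 9 = " + timesNine (n));
    }
  }
}
```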

Similarly, the techniques you learned for addition with carrying,
subtraction with borrowing, and long division are all algorithms.  One
of the characteristics of algorithms is that they do not require any
intelligence to carry out.  They are mechanical processes in which
each step follows from the last according to a simple set of rules.

In my opinion, it is embarrassing that humans spend so much
time in school learning to execute algorithms that,
quite literally, require no intelligence.

On the other hand, the process of designing algorithms is
interesting, intellectually challenging, and a central part
of what we call programming.

Some of the things that people do naturally, without difficulty
or conscious thought, are the most difficult to express
algorithmically.  Understanding natural language is a good
example.  We all do it, but so far no one has been able to
explain how we do it, at least not in the form of an
algorithm.

Later in this class, you will have the opportunity to design
simple algorithms for a variety of problems.  If you take
the next class in the Computer Science sequence, Data Structures,
you will see some of the most interesting, clever, and
useful algorithms computer science has produced.

Glossary

description

[class:]  Previously, I defined a class as a collection
of related methods.  In this chapter we learned that a class
definition is also a template for a new type of object.

[instance:]  A member of a class.  Every object is an
instance of some class.

[constructor:]  A special method that initializes the instance
variables of a newly-constructed object.

[project:]  A collection of one or more class definitions
(one per file) that make up a program.

[startup class:]  The class that contains the main
method where execution of the program begins.

[function:]  A method whose result depends only on its
parameters, and that has no side-effects other than returning
a value.

[functional programming style:]  A style of program design
in which the majority of methods are functions.  

[modifier:]  A method that changes one or more of the objects
it receives as parameters, and usually returns void.

[fill-in method:]  A type of method that takes an ``empty''
object as a parameter and fills in its instance variables instead
of generating a return value.  This type of method is usually
not the best choice.

[algorithm:]  A set of instructions for solving a class of
problems by a mechanical, unintelligent process.

description


Arrays

An array is a set of values where each value is identified by an
index.  You can make an array of ints, doubles, or any
other type, but all the values in an array have to have the same type.

Syntactically, array types look like other Java types except they are
followed by [].  For example, int[] is the type ``array of
integers'' and double[] is the type ``array of doubles.''

You can declare variables with these types in the usual ways:

verbatim
    int[] count;
    double[] values;
verbatim

Until you initialize these variables, they do not refer to any array
(an uninitialized instance variable of an array type is null).
To create the array itself, use the new command.

verbatim
    count = new int[4];
    values = new double[size];
verbatim

The first assignment makes count refer to an array of 4
integers; the second makes values refer to an array of 
doubles.  The number of elements in values depends on 
size.  You can use any integer expression as an array
size.

The following figure shows how arrays are represented in state
diagrams:

figure=array.eps

The large numbers inside the boxes are the elements of
the array.  The small numbers outside the boxes are the
indices used to identify each box.  When you allocate a new
array, the elements are initialized to zero.

Accessing elements

To store values in the array, use the
[] operator.  For example count[0] refers to the
``zeroeth'' element of the array, and count[1] refers to the
``oneth'' element.  You can use the [] operator anywhere in an
expression:

verbatim
    count[0] = 7;
    count[1] = count[0] * 2;
    count[2]++;
    count[3] -= 60;
verbatim

All of these are legal assignment statements.  Here is the
effect of this code fragment:

figure=array2.eps

By now you should have noticed that the four elements of this array
are numbered from 0 to 3, which means that there is no element with
the index 4.  This should sound familiar, since we saw the same thing
with String indices.  Nevertheless, it is a common error to go
beyond the bounds of an array, which will cause an 
ArrayIndexOutOfBoundsException.  As with all exceptions, you get an error
message and the program quits.
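
The following sketch shows the exception being triggered on purpose.
It uses a try statement to catch the exception, which is not something
this chapter covers, so treat that part as an assumption of the
example; the names BoundsDemo and isValidIndex are also made up here.

```java
// Sketch: which indices of a 4-element array are legal?
// BoundsDemo and isValidIndex are hypothetical names for this example.
class BoundsDemo {

  // Returns true if array[index] can be read, false if reading it
  // causes an ArrayIndexOutOfBoundsException.
  public static boolean isValidIndex (int[] array, int index) {
    try {
      int unused = array[index];
      return true;
    } catch (ArrayIndexOutOfBoundsException e) {
      return false;
    }
  }

  public static void main (String[] args) {
    int[] count = new int[4];
    System.out.println (isValidIndex (count, 0));    // prints true
    System.out.println (isValidIndex (count, 3));    // prints true
    System.out.println (isValidIndex (count, 4));    // prints false
    System.out.println (isValidIndex (count, -1));   // prints false
  }
}
```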

You can use any expression as an index, as long as it has type 
int.  One of the most common ways to index an array is with a loop
variable.  For example:

verbatim
    int i = 0;
    while (i < 4) {
      System.out.println (count[i]);
      i++;
    }
verbatim

This is a standard while loop that counts from 0
up to 4, and when the loop variable i is 4, the
condition fails and the loop terminates.  Thus, the body
of the loop is only executed when i is 0, 1, 2 and 3.

Each time through the loop we use i as an index into
the array, printing the ith element.  This type of
array traversal is very common.  Arrays and loops go together
like fava beans and a nice Chianti.


Copying arrays

When you copy an array variable, remember that you are
copying a reference to the array.  For example:

verbatim
    double[] a = new double [3];
    double[] b = a;
verbatim

This code creates one array of three doubles, and
sets two different variables to refer to it.
This situation is a form of aliasing.

figure=array3.eps

Any change made through either variable
will be visible through the other.  This is not usually the
behavior you want; instead, you should make a copy of the
array, by allocating a new array and copying each element from
one to the other.

verbatim
    double[] b = new double [3];

    // the arrays have 3 elements, so the valid indices are 0, 1 and 2
    int i = 0;
    while (i < 3) {
      b[i] = a[i];
      i++;
    }
verbatim
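
To see the difference between an alias and a copy, the following
sketch changes the original array after both have been set up.  The
names CopyDemo, changeOriginal, alias and copy are invented for this
example.

```java
// Sketch: an alias sees later changes to the array, a copy does not.
// CopyDemo and changeOriginal are hypothetical names for this example.
class CopyDemo {

  // Returns { alias[0], copy[0] } as seen after a[0] is changed to 17.0.
  public static double[] changeOriginal () {
    double[] a = new double [3];
    double[] alias = a;               // second reference to the same array
    double[] copy = new double [3];   // a separate array

    int i = 0;
    while (i < 3) {
      copy[i] = a[i];
      i++;
    }

    a[0] = 17.0;
    return new double[] { alias[0], copy[0] };
  }

  public static void main (String[] args) {
    double[] result = changeOriginal ();
    System.out.println (result[0]);   // prints 17.0
    System.out.println (result[1]);   // prints 0.0
  }
}
```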

for loops

The loops we have written so far have a number of elements
in common.  All of them start by initializing a variable;
they have a test, or condition, that depends on that variable;
and inside the loop they do something to that variable,
like increment it.

This type of loop is so common that there is an alternate
loop statement, called for, that expresses it more
concisely.  The general syntax looks like this:

verbatim
    for (INITIALIZER; CONDITION; INCREMENTOR) {
      BODY
    }
verbatim

This statement is exactly equivalent to

verbatim
    INITIALIZER;
    while (CONDITION) {
      BODY
      INCREMENTOR
    }
verbatim

except that it is more concise and, since it puts all the
loop-related statements in one place, it is easier to read.
For example:

verbatim
    for (int i = 0; i < 4; i++) {
      System.out.println (count[i]);
    }
verbatim

is equivalent to 

verbatim
    int i = 0;
    while (i < 4) {
      System.out.println (count[i]);
      i++;
    }
verbatim

As an exercise, write a for loop to copy the elements
of an array.

Arrays and objects

In many ways, arrays behave like objects:

itemize

When you declare an array variable, you get a reference
to an array.

You have to use the new command to create the array
itself.

When you pass an array as an argument, you pass a reference,
which means that the invoked method can change the contents
of the array.

itemize

Some of the objects we have looked at, like Rectangles, are
similar to arrays, in the sense that they are named collections of
values.  This raises the question, ``How is an array of 4 integers
different from a Rectangle object?''

If you go back to the definition of ``array'' at the beginning
of the chapter, you will see one difference, which is that the
elements of an array are identified by indices, whereas the
elements (instance variables) of an object have names
(like x, width, etc.).

Another difference between arrays and objects is that all the
elements of an array have to be the same type.  Although that
is also true of Rectangles, we have seen other objects
that have instance variables with different types (like
Time).

Array length

Actually, arrays do have one named instance variable: length.
Not surprisingly, it contains the length of the array (number
of elements).  It is a good idea to use this value as the upper
bound of a loop, rather than a constant value.  That way, if
the size of the array changes, you won't have to go through the
program changing all the loops; they will work correctly for any
size array.

verbatim
    for (int i = 0; i < a.length; i++) {
      b[i] = a[i];
    }
verbatim

The last time the body of the loop gets executed, i
is a.length - 1, which is the index of the last element.  When
i is equal to a.length, the condition fails and the body
is not executed, which is a good thing, since it would cause an
exception.  This code assumes that the array b contains at least
as many elements as a.

As an exercise, write a method called cloneArray that takes an
array of integers as a parameter, creates a new array that is the same
size, copies the elements from the first array into the new one, and
then returns a reference to the new array.

Random numbers

Most computer programs do the same thing every time they are executed,
so they are said to be deterministic.  Usually, determinism is a
good thing, since we expect the same calculation to yield the same
result.  For some applications, though, we would like the
computer to be unpredictable.  Games are an obvious example, but
there are many more.

Making a program truly nondeterministic turns out to be not
so easy, but there are ways to make it at least seem
nondeterministic.  One of them is to generate random numbers and
use them to determine the outcome of the program.  Java provides
a built-in method that generates pseudorandom numbers, which
are not truly random in the mathematical sense, but 
for our purposes, they will do.

Check out the documentation of the random method in the 
Math class.  The return value is a double that is greater than or
equal to 0.0 and strictly less than 1.0.
Each time you invoke random you get a different
randomly-generated number.  To see a sample, run this loop:

verbatim
    for (int i = 0; i < 10; i++) {
      double x = Math.random ();
      System.out.println (x);
    }
verbatim

To generate a random double between 0.0 and an upper bound like
high, you can multiply x by high.  How would you
generate a random number between low and high?  How would
you generate a random integer?
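
The scaling step described above (multiplying by high) might look like
the following sketch.  The names RandomScale and randomBelow are made
up for this example, and it assumes high is positive; the two
follow-up questions are left open.

```java
// Sketch: scaling Math.random () to the range from 0.0 to high.
// RandomScale and randomBelow are hypothetical names for this example.
class RandomScale {

  // Returns a random double between 0.0 and high.  Assumes high > 0.0.
  public static double randomBelow (double high) {
    double x = Math.random ();
    return x * high;
  }

  public static void main (String[] args) {
    for (int i = 0; i < 5; i++) {
      System.out.println (randomBelow (100.0));
    }
  }
}
```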

Statistics

The numbers generated by random are supposed to be distributed
uniformly.  If you have taken statistics, you know what that means.
Among other things, it means that if we divide the range of possible
values into equal sized ``buckets,'' and count the number of times a
random value falls in each bucket, each bucket should get the same
number of hits (eventually).

In the next few sections, we will write programs that generate
a sequence of random numbers and check whether this property
holds true.

Array of random numbers

The first step is to generate a large number of random values
and store them in an array.  By ``large number,'' of course,
I mean 8.  It's always a good idea to start with a manageable
number, to help with debugging, and then increase it later.

The following method takes a single argument, the size of
the array.  It allocates a new array of doubles, fills
it with random values, and returns a reference to the new
array.

verbatim
  public static double[] randomArray (int n) {
    double[] a = new double[n];
    for (int i = 0; i<a.length; i++) {
      a[i] = Math.random ();
    }
    return a;
  }
verbatim

The return type is double[], which means that
this method returns an array of doubles.
To test this method, it is convenient to have a method that
prints the contents of an array.

verbatim
  public static void printArray (double[] a) {
    for (int i = 0; i<a.length; i++) {
      System.out.println (a[i]);
    }
  }
verbatim

The following code generates an array and prints it:

verbatim
    int numValues = 8;
    double[] array = randomArray (numValues);
    printArray (array);
verbatim

On my machine the output is

verbatim
0.7344558779885422
0.6224282219647016
0.09591424515329172
0.2992298398883563
0.7736458103088713
0.7069110192991597
0.7042440765950522
0.977839532249852
verbatim

which is pretty random-looking.  Your results may differ.

If these numbers are really random, we expect half of them to be
greater than 0.5 and half to be less.  In fact, six are greater than
0.5, so that's a little high.

If we divide the range into four buckets---from 0.0 to 0.25,
0.25 to 0.5, 0.5 to 0.75, and 0.75 to 1.0---we expect 2
values to fall in each bucket.  In fact, we get
1, 1, 4, 2.  Again, not exactly what we expected.

Do these results mean the values are not really random?  It's
hard to tell.  With so few values, the chances are slim
that we would get exactly what we expect.  But as the number
of values increases, the outcome should be more predictable.

To test this theory, we'll write some programs that divide the range
into buckets and count the number of values in each.

Counting

A good approach to problems like this is to think of simple methods
that are easy to write, and that might turn out to be useful.  Then
you can combine them into a solution.  Of course, it is not easy to
know ahead of time which methods are likely to be useful, but as you
gain experience you will have a better idea.

In this case, I have a method in mind called inBucket that
counts the number of elements in an array that fall in a given bucket.
The parameters are the array and two doubles that specify the
lower and upper bounds of the bucket.

verbatim
  public static int inBucket (double[] a, double low, double high) {
    int count = 0;
    for (int i=0; i<a.length; i++) {
      if (a[i] >= low && a[i] < high) count++;
    }
    return count;
  }
verbatim

I haven't been very careful about whether something equal
to low or high falls in the bucket, but you can
see from the code that low is in and high is out.
That should prevent me from counting any elements twice.

Now, to divide the range into two pieces, we could write

verbatim
    int low = inBucket (a, 0.0, 0.5);
    int high = inBucket (a, 0.5, 1.0);
verbatim

To divide it into four pieces:

verbatim
    int bucket1 = inBucket (a, 0.0, 0.25);
    int bucket2 = inBucket (a, 0.25, 0.5);
    int bucket3 = inBucket (a, 0.5, 0.75);
    int bucket4 = inBucket (a, 0.75, 1.0);
verbatim

You might want to try out this program using a larger numValues.
As numValues increases, are the numbers in each bucket levelling
off?
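
As a quick check of inBucket, the following sketch runs it on the
eight sample values printed earlier and reproduces the hand count of
1, 1, 4 and 2.  The class name BucketCheck is made up for this example,
and the array is written with the initializer syntax (values listed
between squiggly braces), which creates and fills the array in one
step.

```java
// Sketch: checking inBucket against the hand count from the text.
// BucketCheck is a hypothetical name for this example.
class BucketCheck {

  // Counts the elements of a that are >= low and < high.
  public static int inBucket (double[] a, double low, double high) {
    int count = 0;
    for (int i = 0; i < a.length; i++) {
      if (a[i] >= low && a[i] < high) count++;
    }
    return count;
  }

  public static void main (String[] args) {
    // the eight sample values shown in the text
    double[] a = { 0.7344558779885422, 0.6224282219647016,
                   0.09591424515329172, 0.2992298398883563,
                   0.7736458103088713, 0.7069110192991597,
                   0.7042440765950522, 0.977839532249852 };

    System.out.println (inBucket (a, 0.0, 0.25));    // prints 1
    System.out.println (inBucket (a, 0.25, 0.5));    // prints 1
    System.out.println (inBucket (a, 0.5, 0.75));    // prints 4
    System.out.println (inBucket (a, 0.75, 1.0));    // prints 2
  }
}
```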

Many buckets

Of course, as the number of buckets increases, we don't
want to have to rewrite the program, especially since the
code is getting big and repetitive.  Any time you find yourself
doing something more than a few times, you should be looking
for a way to automate it.

Let's say that we wanted 8 buckets.  The width of each
bucket would be one eighth of the range, which is 0.125.
To count the number of values in each bucket, we need to
be able to generate the bounds of each bucket automatically,
and we need to have some place to store the 8 counts.

We can solve the first problem with a loop:

verbatim
    int numBuckets = 8;
    double bucketWidth = 1.0 / numBuckets;

    for (int i = 0; i < numBuckets; i++) {
      double low = i * bucketWidth;
      double high = low + bucketWidth;
      System.out.println (low + " to " + high);
    }
verbatim

This code uses the loop variable i to multiply by the bucket
width, in order to find the low end of each bucket.  The output of
this loop is:

verbatim
0.0 to 0.125
0.125 to 0.25
0.25 to 0.375
0.375 to 0.5
0.5 to 0.625
0.625 to 0.75
0.75 to 0.875
0.875 to 1.0
verbatim

You can confirm that each bucket is the same width, that they
don't overlap, and that they cover the whole range from 0.0
to 1.0.

Now we just need a way to store 8 integers, preferably
so we can use an index to access each one.  Immediately,
you should be thinking ``array!''

What we want is an array of 8 integers, which we can allocate outside
the loop; then, inside the loop, we'll invoke inBucket and store
the result:

verbatim
    int numBuckets = 8;
    int[] buckets = new int [numBuckets];
    double bucketWidth = 1.0 / numBuckets;

    for (int i = 0; i<numBuckets; i++) {
      double low = i * bucketWidth;
      double high = low + bucketWidth;
      //System.out.println (low + " to " + high);

      buckets[i] = inBucket (a, low, high);
    }
verbatim

The tricky thing here is that I am using the loop variable
as an index into the buckets array, in addition to using
it to compute the range of each bucket.

This code works.  I cranked the number of values up to 1000
and divided the range into 8 buckets.  The output is:

verbatim
129
109
142
118
131
124
121
126
verbatim

which is pretty close to 125 in each bucket.  At least, it's
close enough that I can believe the random number generator is
working.

A single-pass solution

Although this code works, it is not as efficient as it could
be.  Every time it invokes inBucket, it traverses the
entire array.  As the number of buckets increases, that gets
to be a lot of traversals.

It would be better to make a single pass through the array,
and for each value, compute which bucket it falls in.  Then
we could increment the appropriate counter.

In the previous section, we took an index, i, and
multiplied it by the bucketWidth in order to find
the lower bound of a given bucket.  Now we want to take a
value in the range 0.0 to 1.0, and find the index of the
bucket where it falls.

Since this problem is the inverse of the previous problem we might
guess that we should divide by the bucketWidth instead of
multiplying.  That guess is correct.

Remember that since bucketWidth = 1.0 / numBuckets, dividing by
bucketWidth is the same as multiplying by numBuckets.
If we take a number in the range 0.0 to 1.0 and multiply by
numBuckets, we get a number in the range from 0.0
to numBuckets.  If we round that number to the next
lower integer, we get exactly what we are looking for---the
index of the appropriate bucket.

verbatim
    int numBuckets = 8;
    int[] buckets = new int [numBuckets];

    for (int i = 0; i < numValues; i++) {
      int index = (int) (a[i] * numBuckets);
      buckets[index]++;
    }
verbatim

Here I am using a typecast to round the value down to the
next integer and convert it to type int at the same
time.
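
To watch the typecast at work on a few individual values, here is a
small sketch.  The names IndexDemo and bucketIndex are made up for this
example, and it assumes the value is at least 0.0 and less than 1.0.

```java
// Sketch: the typecast throws away the fraction part of the product.
// IndexDemo and bucketIndex are hypothetical names for this example.
class IndexDemo {

  // Index of the bucket that value falls in.
  // Assumes value is at least 0.0 and less than 1.0.
  public static int bucketIndex (double value, int numBuckets) {
    return (int) (value * numBuckets);
  }

  public static void main (String[] args) {
    // three of the sample values from the text, with 8 buckets
    System.out.println (bucketIndex (0.09591424515329172, 8));   // prints 0
    System.out.println (bucketIndex (0.6224282219647016, 8));    // prints 4
    System.out.println (bucketIndex (0.977839532249852, 8));     // prints 7
  }
}
```

For example, 0.6224... times 8 is about 4.98, and the typecast
discards the fraction, leaving 4.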

Is it possible for this calculation to produce an index
that is out of range (either negative or greater than
buckets.length-1)?  If so, how would you fix it?

An array like buckets, that contains counts of the
number of values in each range, is called a histogram.
As an exercise, write a method called histogram
that takes an array and a number of buckets as parameters,
and that returns a histogram with the given number of
buckets.

Glossary

description

[array:]  A named collection of values, where all the
values have the same type, and each value is identified by
an index.

[element:]  One of the values in an array.  The []
operator selects elements of an array.

[index:]  An integer variable or value used to indicate
an element of an array.

[deterministic:]  A program that does the same thing every
time it is invoked.

[pseudorandom:]  A sequence of numbers that appear to be
random, but which are actually the product of a deterministic
computation.

[histogram:]  An array of integers where each integer
counts the number of values that fall into a certain range.

description


Arrays of Objects


Composition

Java, and most other languages, take advantage of composition, the
ability to combine language features in a variety of arrangements.
One example is the use of method invocation as part of an
expression.  Another example is the nested structure of statements:
you can put an if statement within a while loop, or within
another if statement, etc.

Having seen this pattern, and having learned about arrays and objects,
you should not be surprised to learn that you can have arrays of
objects.  In fact, you can also have objects that contain arrays (as
instance variables); you can have arrays that contain arrays; you can
have objects that contain objects, and so on.

In the next two chapters we will look at some examples of these
combinations, using Card objects as an example.

Card objects

If you are not familiar with common playing cards, now would be a good
time to get a deck, or else this chapter might not make much sense.
There are 52 cards in a deck, each of which belongs to one of four
suits and one of 13 ranks.  The suits are Spades, Hearts, Diamonds and
Clubs (in descending order in Bridge).  The ranks are Ace, 2, 3, 4, 5,
6, 7, 8, 9, 10, Jack, Queen and King.  Depending on what game you are
playing, the rank of the Ace may be higher than King or lower than 2.

If we want to define a new object to represent a playing card, it is
pretty obvious what the instance variables should be: rank and
suit.  It is not as obvious what type the instance variables
should be.  One possibility is Strings, containing things like
"Spade" for suits and "Queen" for ranks.  One problem with
this implementation is that it would not be easy to compare cards to
see which had higher rank or suit.

An alternative is to use integers to encode the ranks and
suits.  By ``encode,'' I do not mean what some people think, which
is to encrypt, or translate into a secret code.  What a computer
scientist means by ``encode'' is something like ``define a mapping
between a sequence of numbers and the things I want to represent.''
For example,

Spades    ↦   3
Hearts    ↦   2
Diamonds  ↦   1
Clubs     ↦   0

The symbol ↦ is mathematical notation for ``maps to.''
The obvious feature of this mapping is that the suits map to
integers in order, so we can compare suits by comparing integers.
The mapping for ranks is fairly obvious; each of the numerical
ranks maps to the corresponding integer, and for face cards:

Jack   ↦   11
Queen  ↦   12
King   ↦   13

The reason I am using mathematical notation for these mappings is
that they are not part of the Java program.  They are part of the
program design, but they never appear explicitly in the code.
The class definition for the Card type looks like this:

verbatim
class Card {

  int suit, rank;

  public Card () {
    this.suit = 0;  this.rank = 0;
  }

  public Card (int suit, int rank) {
    this.suit = suit;  this.rank = rank;
  }
}
verbatim

As usual, I am providing two constructors, one of which takes
a parameter for each instance variable and the other of which
takes no parameters.

To create an object that represents the 3 of Clubs, we would
use the new command:

verbatim
   Card threeOfClubs = new Card (0, 3);
verbatim

The first argument, 0, represents the suit Clubs.

The printCard method

When you create a new class, the first step is usually to declare the
instance variables and write constructors.  The second step is often
to write the standard methods that every object should have, including
one that prints the object, and one or two that compare objects.  I
will start with printCard.

In order to print Card objects in a way that humans
can read easily, we want to map the integer codes onto words.
A natural way to do that is with an array of Strings.  You
can create an array of Strings the same way you create an
array of primitive types:

verbatim
    String[] suits = new String [4];
verbatim

Then we can set the values of the elements of the array.

verbatim
    suits[0] = "Clubs";
    suits[1] = "Diamonds";
    suits[2] = "Hearts";
    suits[3] = "Spades";
verbatim

Creating an array and initializing the elements is such a common
operation that Java provides a special syntax for it:

verbatim
    String[] suits = { "Clubs", "Diamonds", "Hearts", "Spades" };
verbatim

The effect of this statement is identical to that of the
separate declaration, allocation, and assignment.  A state
diagram of this array might look like:

figure=stringarray.eps

The elements of the array are references to the Strings,
rather than Strings themselves.  This is true of all arrays of
objects, as I will discuss in more detail later.  For now, all we need
is another array of Strings to decode the ranks:

verbatim
  String[] ranks = { "narf", "Ace", "2", "3", "4", "5", "6",
        "7", "8", "9", "10", "Jack", "Queen", "King" };
verbatim

The reason for the "narf" is to act as a place-keeper
for the zeroeth element of the array, which will never be
used.  The only valid ranks are 1--13.  This wasted entry is not
necessary, of course.  We could have started at 0, as usual, but
it is certainly more mnemonic to encode 2 as 2, and 3 as 3, etc.

Using these arrays, we can select the appropriate Strings by
using the suit and rank as indices.  In the method
printCard,

verbatim
  public static void printCard (Card c) {
    String[] suits = { "Clubs", "Diamonds", "Hearts", "Spades" };
    String[] ranks = { "narf", "Ace", "2", "3", "4", "5", "6",
                 "7", "8", "9", "10", "Jack", "Queen", "King" };

    System.out.println (ranks[c.rank] + " of " + suits[c.suit]);
  }
verbatim

the expression suits[c.suit] means ``use the instance variable
suit from the object c as an index into the array named
suits, and select the appropriate string.''  The output of this
code

verbatim
    Card card = new Card (1, 11);
    printCard (card);
verbatim

is Jack of Diamonds.

The sameCard method

The word ``same'' is one of those things that occur in natural
language that seem perfectly clear until you give it some thought,
and then you realize there is more to it than you expected.

For example, if I say ``Chris and I have the same car,'' I
mean that his car and mine are the same make and model, but they are
two different cars.  If I say ``Chris and I have the same mother,'' I
mean that his mother and mine are one and the same.  So the
idea of ``sameness'' is different depending on the context.

When you talk about objects, there is a similar ambiguity.  For
example, if two Cards are the same, does that mean they
contain the same data (rank and suit), or they are actually
the same Card object?

To see if two references refer to the same object, we can use
the == operator.  For example:

verbatim
    Card card1 = new Card (1, 11);
    Card card2 = card1;

    if (card1 == card2) {
      System.out.println ("card1 and card2 are the same object.");
    }
verbatim

This type of equality is called shallow equality because
it only compares the references, not the contents of the objects.

 equality
 identity
 shallow equality
 deep equality

To compare the contents of the objects---deep equality---it
is common to write a method with a name like sameCard.

verbatim
  public static boolean sameCard (Card c1, Card c2) {
    return (c1.suit == c2.suit && c1.rank == c2.rank);
  }
verbatim

Now if we create two different objects that contain the same
data, we can use sameCard to see if they represent the
same card:

verbatim
    Card card1 = new Card (1, 11);
    Card card2 = new Card (1, 11);

    if (sameCard (card1, card2)) {
      System.out.println ("card1 and card2 are the same card.");
    }
verbatim

In this case, card1 and card2
are two different objects that contain the same data

[Figure: card.eps]

so the condition is true.  What does the state diagram look like when
card1 == card2 is true?

 aliasing

In general, you should never use the
== operator on Strings because it tests 
shallow equality---whether the two Strings are the
same object---rather than what you probably want, which is
to compare the contents of the Strings.
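Here is a small sketch of the difference.  It uses the standard equals method of String to compare contents; the class name and the helper sameContents are made up for the illustration.  The two Strings are created with new so that they are certain to be distinct objects.

```java
class StringEqualityDemo {
  // Deep equality: the same sequence of characters.
  static boolean sameContents (String a, String b) {
    return a.equals (b);
  }

  public static void main (String[] args) {
    String s1 = new String ("Jack");
    String s2 = new String ("Jack");   // a second, distinct object

    System.out.println (s1 == s2);               // false: different objects
    System.out.println (sameContents (s1, s2));  // true: same characters
  }
}
```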

The compareCard method
 compareCard
 operator!conditional
 conditional operator

For primitive types, there are conditional operators that
compare values and determine when one is greater or less
than another.  These operators (< and > and the others)
don't work for object types.  For Strings there is
a built-in compareTo method.  For Cards we have
to write our own, which we will call compareCard.
Later, we will use this method to sort a deck of cards.
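As a quick illustration of the String case, the sketch below invokes compareTo and reduces the result to -1, 0 or 1 with Integer.signum.  The helper name sign is my own.  Notice that compareTo orders Strings alphabetically, so it says nothing about which card is higher.

```java
class CompareToDemo {
  // Hypothetical helper: -1 if a comes first, 1 if b comes first, 0 if equal.
  static int sign (String a, String b) {
    return Integer.signum (a.compareTo (b));
  }

  public static void main (String[] args) {
    System.out.println (sign ("Ace", "King"));    // -1
    System.out.println (sign ("King", "Ace"));    // 1
    System.out.println (sign ("Jack", "Jack"));   // 0
  }
}
```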

 ordering
 complete ordering
 partial ordering

Some sets are completely ordered, which means that you can
compare any two elements and tell which is bigger.  For
example, the integers and the floating-point numbers are
totally ordered.  Some sets are unordered, which means that
there is no meaningful way to say that one element is bigger
than another.  For example, the fruits are unordered, which
is why we cannot compare apples and oranges.  In Java,
the boolean type is unordered; we cannot say that
true is greater than false.

The set of playing cards is partially ordered, which means that
sometimes we can compare cards and sometimes not.  For example, I know
that the 3 of Clubs is higher than the 2 of Clubs, and the 3 of
Diamonds is higher than the 3 of Clubs.  But which is better, the 3 of
Clubs or the 2 of Diamonds?  One has a higher rank, but the other has
a higher suit.

 comparable

In order to make cards comparable, we have to decide which is more
important, rank or suit.  The choice is completely arbitrary, but for
the sake of choosing, I will say that suit is more important.  As
evidence, I would point out that when you buy a new deck of cards, it
comes sorted with all the Clubs together, followed by all the
Diamonds, and so on.

With that decided, we can write compareCard.  It
will take two Cards as parameters and return 1 if
the first card wins, -1 if the second card wins, and 0 if
they tie (indicating deep equality).  It is sometimes confusing
to keep those return values straight, but they are pretty
standard for comparison methods.

First we compare the suits:

verbatim
    if (c1.suit > c2.suit) return 1;
    if (c1.suit < c2.suit) return -1;
verbatim

If neither statement is true, then the suits must be equal,
and we have to compare ranks:

verbatim
    if (c1.rank > c2.rank) return 1;
    if (c1.rank < c2.rank) return -1;
verbatim

If neither of these is true, the ranks must be equal, so we return
0.  In this ordering, aces will appear lower than deuces (2s).

As an exercise, fix it so that aces are ranked higher than Kings, and
encapsulate this code in a method.
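For reference, here is one way the two fragments might be assembled into a method.  It is only a sketch: the Card class is a stand-in for the one defined earlier in the chapter, and the adjustment for aces is deliberately left out, since that is the interesting part of the exercise.

```java
// Stand-in for the Card class defined earlier in the chapter (assumed shape).
class Card {
  int suit, rank;

  Card (int suit, int rank) {
    this.suit = suit;
    this.rank = rank;
  }
}

class CompareCardDemo {
  // Returns 1 if c1 wins, -1 if c2 wins, 0 if they tie.
  // Suit is compared first; rank only breaks ties between suits.
  public static int compareCard (Card c1, Card c2) {
    if (c1.suit > c2.suit) return 1;
    if (c1.suit < c2.suit) return -1;
    if (c1.rank > c2.rank) return 1;
    if (c1.rank < c2.rank) return -1;
    return 0;
  }

  public static void main (String[] args) {
    Card threeOfClubs = new Card (0, 3);
    Card twoOfDiamonds = new Card (1, 2);
    System.out.println (compareCard (twoOfDiamonds, threeOfClubs));  // 1
    System.out.println (compareCard (threeOfClubs, threeOfClubs));   // 0
  }
}
```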

Arrays of cards
 array!of object
 object!array of
 deck

The reason I chose Cards as the objects for this chapter is that
there is an obvious use for an array of cards---a deck.  Here is some
code that creates a new deck of 52 cards:

verbatim
    Card[] deck = new Card [52];
verbatim

Here is the state diagram for this object:

 state diagram

[Figure: cardarray.eps]

The important thing to see here is that the array contains
only references to objects; it does not contain any
Card objects.  The values of the array elements are
initialized to null.  You can access the elements of
the array in the usual way:

verbatim
    if (deck[3] == null) {
      System.out.println ("No cards yet!");
    }
verbatim

But if you try to access the instance variables of the
non-existent Cards, you will get a NullPointerException.

 exception!NullPointer
 run-time error
 null

verbatim
    int rank = deck[2].rank;  // NullPointerException
verbatim

Nevertheless, that is the correct syntax for accessing the rank
of the ``twoeth'' card in the deck (really the third---we started
at zero, remember?).  This is another example of composition, the
combination of the syntax for accessing an element of an array
and an instance variable of an object.

 composition
 loop!nested

The easiest way to populate the deck with Card objects
is to write a nested loop:

verbatim
    int index = 0;
    for (int suit = 0; suit <= 3; suit++) {
      for (int rank = 1; rank <= 13; rank++) {
        deck[index] = new Card (suit, rank);
        index++;
      }
    }
verbatim

The outer loop enumerates the suits, from 0 to 3.  For
each suit, the inner loop enumerates the ranks, from 1
to 13.  Since the outer loop iterates 4 times, and
the inner loop iterates 13 times, the total number of times
the body is executed is 52 (13 times 4).
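One consequence of filling the deck suit by suit is that the position of a card follows from its suit and rank: it is 13 times the suit, plus the rank minus 1.  That formula is my own observation rather than something the loop computes, so the sketch below checks it against the same nested loop.  The names DeckIndexDemo and expectedIndex are made up.

```java
class DeckIndexDemo {
  // Position of a card in a deck filled suit by suit, rank by rank.
  static int expectedIndex (int suit, int rank) {
    return 13 * suit + (rank - 1);
  }

  public static void main (String[] args) {
    int index = 0;
    boolean ok = true;
    for (int suit = 0; suit <= 3; suit++) {
      for (int rank = 1; rank <= 13; rank++) {
        if (index != expectedIndex (suit, rank)) ok = false;
        index++;
      }
    }
    System.out.println (index);                   // 52
    System.out.println (ok);                      // true
    System.out.println (expectedIndex (1, 11));   // 23
  }
}
```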

 index

I used the variable index to keep track of where in the
deck the next card should go.  The following state diagram
shows what the deck looks like after the first two cards
have been allocated:

[Figure: cardarray2.eps]

As an exercise, encapsulate this deck-building code in a method called
buildDeck that takes no parameters and that returns a
fully-populated array of Cards.

 encapsulation

The printDeck method
printdeck
 printDeck
 print!array of Cards

Whenever you are working with arrays, it is convenient to have
a method that will print the contents of the array.  We have
seen the pattern for traversing an array several times, so the
following method should be familiar:

verbatim
  public static void printDeck (Card[] deck) {
    for (int i=0; i<deck.length; i++) {
      printCard (deck[i]);
    }
  }
verbatim

Since deck has type Card[], an element of deck
has type Card.  Therefore, deck[i] is a legal argument
for printCard.

Searching
findcard
 searching
 linear search
 findCard

The next method I want to write is findCard, which searches
through an array of Cards to see whether it contains a certain
card.  It may not be obvious why this method would be useful, but it
gives me a chance to demonstrate two ways to go searching for things,
a linear search and a bisection search.

 traverse
 loop!search

Linear search is the more obvious of the two; it involves traversing
the deck and comparing each card to the one we are looking for.  If we
find it we return the index where the card appears.  If it is not in
the deck, we return -1.

verbatim
  public static int findCard (Card[] deck, Card card) {
    for (int i = 0; i< deck.length; i++) {
      if (sameCard (deck[i], card)) return i;
    }
    return -1;
  }
verbatim

The arguments of findCard are named card and deck.
It might seem odd to have a variable with the same name as a type (the
card variable has type Card).  This is legal and common,
although it can sometimes make code hard to read.  In this case,
though, I think it works.

 statement!return
 return!inside loop

The method returns as soon as it discovers
the card, which means that we do not have to traverse the entire
deck if we find the card we are looking for.  If the loop terminates
without finding the card, we know the card is not in the deck
and return -1.

If the cards in the deck are not in order, there is no way to search
that is faster than this.  We have to look at every card, since
otherwise there is no way to be certain the card we want is not
there.

 bisection search

But when you look for a word in a dictionary, you don't search
linearly through every word.  The reason is that the words are in
alphabetical order.  As a result, you probably use an algorithm that
is similar to a bisection search:

 enumerate

Start in the middle somewhere.

Choose a word on the page and compare it to the word you
are looking for.

If you found the word you are looking for, stop.

If the word you are looking for comes after the word on
the page, flip to somewhere later in the dictionary and go to
step 2.

If the word you are looking for comes before the word on
the page, flip to somewhere earlier in the dictionary and go to
step 2.

 enumerate

If you ever get to the point where there are two adjacent
words on the page and your word comes between them, you can
conclude that your word is not in the dictionary.
The only alternative is that your word has been misfiled somewhere,
but that contradicts our assumption that the words are in alphabetical
order.

In the case of a deck of cards, if we know that the cards are
in order, we can write a version of findCard that is
much faster.  The best way to write a bisection
search is with a recursive method.  That's because bisection
is naturally recursive.

 findBisect

The trick is to write a method called findBisect that takes
two indices as parameters, low and high, indicating the
segment of the array that should be searched (including both
low and high).

enumerate

To search the array, choose an index between low and 
high (call it mid) and compare the card at that index to the
card you are looking for.

If you found it, stop.

If the card at mid is higher than your card, search
in the range from low to mid-1.

If the card at mid is lower than your card, search
in the range from mid+1 to high.

enumerate

Steps 3 and 4 look suspiciously like recursive
invocations.  Here's what this all looks like translated into
Java code:

verbatim
  public static int findBisect (Card[] deck, Card card, int low, int high) {
    int mid = (high + low) / 2;
    int comp = compareCard (deck[mid], card);

    if (comp == 0) {
      return mid;
    } else if (comp > 0) {
      return findBisect (deck, card, low, mid-1);
    } else {
      return findBisect (deck, card, mid+1, high);
    }
  }
verbatim

Rather than call compareCard three times, I called it once
and stored the result.

Although this code contains the kernel of a bisection search, it
is still missing a piece.  As it is currently written,
if the card is not in the deck, it will recurse forever.  We
need a way to detect this condition and deal with it properly
(by returning -1).

 recursion

The easiest way to tell that your card is not in the deck
is if there are no cards in the deck, which is the
case if high is less than low.  Well, there are
still cards in the deck, of course, but what I mean is that
there are no cards in the segment of the deck indicated by
low and high.

With a check for that condition added near the beginning, the method works correctly:

verbatim
  public static int findBisect
               (Card[] deck, Card card, int low, int high) {
    System.out.println (low + ", " + high);

    if (high < low) return -1;

    int mid = (high + low) / 2;
    int comp = compareCard (deck[mid], card);

    if (comp == 0) {
      return mid;
    } else if (comp > 0) {
      return findBisect (deck, card, low, mid-1);
    } else {
      return findBisect (deck, card, mid+1, high);
    }
  }
verbatim

I added a print statement at the beginning so I could watch
the sequence of recursive calls and convince myself
that it would eventually reach the base case.  I tried out the
following code:

verbatim
    Card card1 = new Card (1, 11);
    System.out.println (findBisect (deck, card1, 0, 51));
verbatim

And got the following output:

verbatim
0, 51
0, 24
13, 24
19, 24
22, 24
23
verbatim

Then I made up a card that is not in the deck (the 15 of Diamonds),
and tried to find it.  I got the following:

verbatim
0, 51
26, 51
26, 37
26, 30
26, 27
26, 25
-1
verbatim

These tests don't prove that this program is correct.  In fact, no
amount of testing can prove that a program is correct.  On the other
hand, by looking at a few cases and examining the code, you might be
able to convince yourself.
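One inexpensive way to look at more than a few cases is to search for every card in a sorted deck and check that each one turns up at its own index.  The sketch below does that, and also looks for the made-up 15 of Diamonds.  It is still only a test, not a proof.  The Card class is a stand-in for the one defined earlier, findBisect is the version above without the print statement, and the names sortedDeck and checkAll are my own.

```java
// Stand-in for the Card class defined earlier in the chapter (assumed shape).
class Card {
  int suit, rank;

  Card (int suit, int rank) {
    this.suit = suit;
    this.rank = rank;
  }
}

class BisectCheck {
  static int compareCard (Card c1, Card c2) {
    if (c1.suit > c2.suit) return 1;
    if (c1.suit < c2.suit) return -1;
    if (c1.rank > c2.rank) return 1;
    if (c1.rank < c2.rank) return -1;
    return 0;
  }

  static int findBisect (Card[] deck, Card card, int low, int high) {
    if (high < low) return -1;

    int mid = (high + low) / 2;
    int comp = compareCard (deck[mid], card);

    if (comp == 0) {
      return mid;
    } else if (comp > 0) {
      return findBisect (deck, card, low, mid-1);
    } else {
      return findBisect (deck, card, mid+1, high);
    }
  }

  // Hypothetical helper: a 52-card deck in sorted order.
  static Card[] sortedDeck () {
    Card[] deck = new Card [52];
    int index = 0;
    for (int suit = 0; suit <= 3; suit++) {
      for (int rank = 1; rank <= 13; rank++) {
        deck[index] = new Card (suit, rank);
        index++;
      }
    }
    return deck;
  }

  // True if every card is found at its own index and a card that
  // is not in the deck yields -1.
  static boolean checkAll () {
    Card[] deck = sortedDeck ();
    for (int i = 0; i < deck.length; i++) {
      if (findBisect (deck, deck[i], 0, deck.length-1) != i) return false;
    }
    return findBisect (deck, new Card (1, 15), 0, deck.length-1) == -1;
  }

  public static void main (String[] args) {
    System.out.println (checkAll ());   // true
  }
}
```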

 testing
 correctness

The number of recursive calls is fairly small, typically 6 or 7.
That means we only had to invoke compareCard 6 or 7 times,
compared to up to 52 times if we did a linear search.  In general,
bisection is much faster than a linear search, especially for
large arrays.
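To get a feeling for the numbers, here is a small sketch that counts how many cards a bisection search can examine in the worst case.  It relies on the fact that after each comparison the remaining segment has at most half as many cards (rounded down).  The names SearchCost and maxProbes are made up.  An unsuccessful search makes one extra recursive call, the one that reaches the base case.

```java
class SearchCost {
  // Largest number of cards a bisection search examines in a sorted
  // array of n elements: each comparison leaves at most n/2 candidates.
  static int maxProbes (int n) {
    int probes = 0;
    while (n > 0) {
      n = n / 2;
      probes++;
    }
    return probes;
  }

  public static void main (String[] args) {
    System.out.println (maxProbes (52));        // 6
    System.out.println (maxProbes (1000000));   // 20
  }
}
```

A linear search of a million elements might need a million comparisons, so the advantage grows quickly with the size of the array.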

Two common errors in recursive programs are forgetting to include a
base case and writing the recursive call so that the base case is never
reached.  Either error will cause an infinite recursion, in which case
Java will (eventually) throw a StackOverflowError.

 recursion!infinite
 infinite recursion
 exception!StackOverflow

Decks and subdecks
 deck
 subdeck

Looking at the interface to findBisect

verbatim
  public static int findBisect
	       (Card[] deck, Card card, int low, int high)
verbatim

it might make sense to treat three of the parameters, deck, 
low and high, as a single parameter that specifies a 
subdeck.

 parameter!abstract
 abstract parameter

This kind of thing is quite common, and I sometimes think of it as an
abstract parameter.  By ``abstract,'' I mean something
that is not literally part of the program text, but which describes the
function of the program at a higher level.

For example, when you invoke a method and pass an array and the bounds
low and high, there is nothing that prevents the invoked
method from accessing parts of the array that are out of bounds.  So
you are not literally sending a subset of the deck; you are really
sending the whole deck.  But as long as the recipient plays by the
rules, it makes sense to think of it, abstractly, as a subdeck.

There is one other example of this kind of abstraction that you might
have noticed in Section objectops, when I referred to an
``empty'' data structure.  The reason I put ``empty'' in quotation
marks was to suggest that it is not literally accurate.  All variables
have values all the time.  When you create them, they are given
default values.  So there is no such thing as an empty object.

But if the program guarantees that the current value of a variable is
never read before it is written, then the current value is irrelevant.
Abstractly, it makes sense to think of such a variable as ``empty.''

This kind of thinking, in which a program comes to take on
meaning beyond what is literally encoded, is a very important
part of thinking like a computer scientist.  Sometimes,
the word ``abstract'' gets used so often and in so many contexts
that it comes to lose its meaning.  Nevertheless, abstraction
is a central idea in computer science (as well as many other
fields).

 abstraction

A more general definition of ``abstraction'' is ``The process of
modeling a complex system with a simplified description in order to
suppress unnecessary details while capturing relevant behavior.''

Glossary

description

[encode:]  To represent one set of values using another
set of values, by constructing a mapping between them.

[shallow equality:]  Equality of references.  Two
references that point to the same object.

[deep equality:]  Equality of values.  Two references
that point to objects that have the same value.

[abstract parameter:]  A set of parameters that act together
as a single parameter.

[abstraction:]  The process of interpreting a program
(or anything else) at a higher level than what is literally
represented by the code.

 encode
 shallow equality
 deep equality
 abstract parameter
 abstraction

description


Objects of Arrays
deck
 deck
 array!of Cards

In the previous chapter, we worked with an array of objects,
but I also mentioned that it is possible to have an object
that contains an array as an instance variable.  In this
chapter I am going to create a new kind of object, called a Deck,
that contains an array of Cards as an instance variable.

 instance variable
 variable!instance

The class definition looks like this

verbatim
class Deck {
  Card[] cards;

  public Deck (int n) {
    cards = new Card[n];
  }
}
verbatim

The name of the instance variable is cards to help
distinguish the Deck object from the array of Cards
that it contains.  Here is a state diagram showing what a
Deck object looks like with no cards allocated:

 state diagram
 constructor

[Figure: deckobject.eps]

As usual, the constructor initializes the instance variable, but in
this case it uses the new command to create the array of cards.
It doesn't create any cards to go in it, though.  For that we could
write another constructor that creates a standard 52-card deck and
populates it with Card objects:

verbatim
  public Deck () {
    cards = new Card[52];
    int index = 0;
    for (int suit = 0; suit <= 3; suit++) {
      for (int rank = 1; rank <= 13; rank++) {
        cards[index] = new Card (suit, rank);
        index++;
      }
    }
  }
verbatim

Notice how similar this method is to buildDeck, except
that we had to change the syntax to make it a constructor.
To invoke it, we use the new command:

 new
 statement!new

verbatim
    Deck deck = new Deck ();
verbatim

Now that we have a Deck class, it makes sense to put
all the methods that pertain to Decks in the Deck
class definition.  Looking at the methods we have written so
far, one obvious candidate is printDeck (Section printdeck).
Here's how it looks, rewritten to work with a Deck
object:

 printDeck

verbatim
  public static void printDeck (Deck deck) {
    for (int i=0; i<deck.cards.length; i++) {
      Card.printCard (deck.cards[i]);
    }
  }
verbatim

The most obvious thing we have to change is the type of the parameter,
from Card[] to Deck.  The second change is that we can no
longer use deck.length to get the length of the array, because
deck is a Deck object now, not an array.  It contains an
array, but it is not, itself, an array.  Therefore, we have to write
deck.cards.length to extract the array from the Deck
object and get the length of the array.

For the same reason, we have to use deck.cards[i] to access an
element of the array, rather than just deck[i].  The last change
is that the invocation of printCard has to say explicitly that
printCard is defined in the Card class.

For some of the other methods, it is not obvious whether they should
be included in the Card class or the Deck class.  For
example, findCard takes a Card and a Deck as
arguments; you could reasonably put it in either class.  As an
exercise, move findCard into the Deck class and rewrite it
so that the first parameter is a Deck object rather than an
array of Cards.

Shuffling
shuffle
 shuffling

For most card games you need to be able to shuffle the deck;
that is, put the cards in a random order.  In Section random
we saw how to generate random numbers, but it is not obvious how
to use them to shuffle a deck.

One possibility is to model the way humans shuffle, which is usually
by dividing the deck in two and then reassembling the deck by choosing
alternately from each deck.  Since humans usually don't shuffle
perfectly, after about 7 iterations the order of the deck is pretty
well randomized.  But a computer program would have the annoying
property of doing a perfect shuffle every time, which is not really
very random.  In fact, after 8 perfect shuffles, you would find the
deck back in the same order you started in.  For a discussion of that
claim, see http://www.wiskit.com/marilyn/craig.html or do a web
search with the keywords ``perfect shuffle.''
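You can also check the claim yourself.  The sketch below represents the deck as an array of integers and assumes the kind of perfect shuffle in which the deck is cut exactly in half and the original top card stays on top.  The names PerfectShuffle, shuffle and shufflesToRestore are made up, and the code assumes an even number of cards.

```java
import java.util.Arrays;

class PerfectShuffle {
  // One perfect shuffle: cut the deck exactly in half and interleave
  // the halves, keeping the original top card on top.
  static int[] shuffle (int[] deck) {
    int half = deck.length / 2;
    int[] result = new int[deck.length];
    for (int i = 0; i < half; i++) {
      result[2*i] = deck[i];
      result[2*i + 1] = deck[half + i];
    }
    return result;
  }

  // Smallest number of perfect shuffles that restores the original
  // order of a deck of n cards (n is assumed to be even).
  static int shufflesToRestore (int n) {
    int[] original = new int[n];
    for (int i = 0; i < n; i++) original[i] = i;

    int[] deck = shuffle (original);
    int count = 1;
    while (!Arrays.equals (deck, original)) {
      deck = shuffle (deck);
      count++;
    }
    return count;
  }

  public static void main (String[] args) {
    System.out.println (shufflesToRestore (52));   // 8
  }
}
```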

A better shuffling algorithm is to traverse the deck one card at a
time, and at each iteration pick a card at random from the current
position onward and swap it with the current card.

 pseudocode

Here is an outline of how this algorithm works.  To sketch the
program, I am using a combination of Java statements and English
words that is sometimes called pseudocode:

verbatim
    for (int i=0; i<deck.cards.length; i++) {
      // choose a random number between i and deck.cards.length
      // swap the ith card and the randomly-chosen card
    }
verbatim

The nice thing about using pseudocode is that it often makes it
clear what methods you are going to need.  In this case, we
need something like randomInt, which chooses a random
integer between the parameters low and high,
and swapCards which takes two indices and switches the
cards at the indicated positions.

 random number

You can probably figure out how to write randomInt
by looking at Section random, although you will have to
be careful about possibly generating indices that are out of range.

 swapCards
 reference

You can also figure out swapCards yourself.  The only
tricky thing is to decide whether to swap just the references
to the cards or the contents of the cards.  Does it matter
which one you choose?  Which is faster?

I will leave the remaining implementation of these methods
as an exercise to the reader.
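If you get stuck, here is one possible sketch to compare with your own.  It is not the only reasonable design.  I am assuming Math.random as the source of random numbers, the range of randomInt includes both low and high, and swapCards swaps references rather than contents.  The Card and Deck classes are stand-ins for the ones in the text, and the name shuffleDeck is my own choice.

```java
// Stand-ins for the Card and Deck classes in the text (assumed shape).
class Card {
  int suit, rank;

  Card (int suit, int rank) {
    this.suit = suit;
    this.rank = rank;
  }
}

class Deck {
  Card[] cards;

  Deck () {
    cards = new Card[52];
    int index = 0;
    for (int suit = 0; suit <= 3; suit++) {
      for (int rank = 1; rank <= 13; rank++) {
        cards[index] = new Card (suit, rank);
        index++;
      }
    }
  }
}

class ShuffleSketch {
  // Random integer between low and high, including both.
  static int randomInt (int low, int high) {
    return low + (int) (Math.random () * (high - low + 1));
  }

  // Swap the references at positions i and j; the Card objects
  // themselves are not modified.
  static void swapCards (Deck deck, int i, int j) {
    Card temp = deck.cards[i];
    deck.cards[i] = deck.cards[j];
    deck.cards[j] = temp;
  }

  static void shuffleDeck (Deck deck) {
    for (int i = 0; i < deck.cards.length; i++) {
      // the upper bound is length-1 so the index stays in range
      int j = randomInt (i, deck.cards.length - 1);
      swapCards (deck, i, j);
    }
  }
}
```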

Sorting
sorting
 sorting

Now that we have messed up the deck, we need a way to put it
back in order.  Ironically, there is an algorithm for
sorting that is very similar to the algorithm for shuffling.

Again, we are going to traverse the deck and at each location
choose another card and swap.  The only difference is that
this time instead of choosing the other card at random, we
are going to find the lowest card remaining in the deck.

By ``remaining in the deck,'' I mean cards that are at or
to the right of the index i.

verbatim
    for (int i=0; i<deck.cards.length; i++) {
      // find the lowest card at or to the right of i
      // swap the ith card and the lowest card
    }
verbatim

Again, the pseudocode helps with the design of the helper
methods.  In this case we can use swapCards again,
so we only need one new one, called findLowestCard,
that takes an array of cards and an index where it should
start looking.

 helper method
 method!helper

Once again, I am going to leave the implementation up to
the reader.
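Again, if you want something to compare your version with, here is one possible sketch.  It assumes the suit-first ordering of compareCard from the previous chapter, and the method name sortDeck is my own choice.  Card and Deck are stand-ins for the classes in the text.

```java
// Stand-ins for the Card and Deck classes in the text (assumed shape).
class Card {
  int suit, rank;

  Card (int suit, int rank) {
    this.suit = suit;
    this.rank = rank;
  }
}

class Deck {
  Card[] cards;

  Deck (int n) {
    cards = new Card[n];
  }
}

class SortSketch {
  static int compareCard (Card c1, Card c2) {
    if (c1.suit > c2.suit) return 1;
    if (c1.suit < c2.suit) return -1;
    if (c1.rank > c2.rank) return 1;
    if (c1.rank < c2.rank) return -1;
    return 0;
  }

  // Index of the lowest card at or to the right of start.
  static int findLowestCard (Card[] cards, int start) {
    int lowest = start;
    for (int i = start + 1; i < cards.length; i++) {
      if (compareCard (cards[i], cards[lowest]) < 0) lowest = i;
    }
    return lowest;
  }

  static void swapCards (Deck deck, int i, int j) {
    Card temp = deck.cards[i];
    deck.cards[i] = deck.cards[j];
    deck.cards[j] = temp;
  }

  static void sortDeck (Deck deck) {
    for (int i = 0; i < deck.cards.length; i++) {
      int lowest = findLowestCard (deck.cards, i);
      swapCards (deck, i, lowest);
    }
  }
}
```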

Subdecks
 subdeck

How should we represent a hand or some other subset of a full deck?
One good choice is to make a Deck object that
has fewer than 52 cards.

We might want a method, subdeck, that takes a Deck
and a range of indices, and that returns a new Deck that
contains the specified subset of the cards:

verbatim
  public static Deck subdeck (Deck deck, int low, int high) {
    Deck sub = new Deck (high-low+1);

    for (int i = 0; i<sub.cards.length; i++) {
      sub.cards[i] = deck.cards[low+i];
    }
    return sub;
  }
verbatim

The length of the subdeck is high-low+1 because both the low
card and high card are included.  This sort of computation can be
confusing, and lead to ``off-by-one'' errors.  Drawing a picture is
usually the best way to avoid them.

 constructor
 overloading

Because we provide an argument with the new command, the
constructor that gets invoked will be the first one, which only
allocates the array and doesn't allocate any cards.  Inside the
for loop, the subdeck gets populated with copies of the
references from the deck.

The following is a state diagram of a subdeck being created with the
parameters low=3 and high=7.  The result is a hand with 5
cards that are shared with the original deck; i.e. they are aliased.

[Figure: subdeck.eps]

 aliasing
 reference

I have suggested that aliasing is not generally a good idea, since
changes in one subdeck will be reflected in others, which is not the
behavior you would expect from real cards and decks.  But if the
objects in question are immutable, then aliasing can be a reasonable
choice.  In this case, there is probably no reason ever to change the
rank or suit of a card.  Instead we will create each card
once and then treat it as an immutable object.  So for Cards
aliasing is a reasonable choice.
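You can see the aliasing directly with the == operator.  The sketch below builds a full deck, takes the subdeck from 3 to 7, and checks that the hand and the deck refer to the very same Card objects.  Card and Deck are stand-ins for the classes in the text, and the class name SubdeckAliasDemo is made up.

```java
// Stand-ins for the Card and Deck classes in the text (assumed shape).
class Card {
  int suit, rank;

  Card (int suit, int rank) {
    this.suit = suit;
    this.rank = rank;
  }
}

class Deck {
  Card[] cards;

  Deck (int n) {
    cards = new Card[n];
  }

  Deck () {
    cards = new Card[52];
    int index = 0;
    for (int suit = 0; suit <= 3; suit++) {
      for (int rank = 1; rank <= 13; rank++) {
        cards[index] = new Card (suit, rank);
        index++;
      }
    }
  }
}

class SubdeckAliasDemo {
  static Deck subdeck (Deck deck, int low, int high) {
    Deck sub = new Deck (high-low+1);
    for (int i = 0; i < sub.cards.length; i++) {
      sub.cards[i] = deck.cards[low+i];   // copies the reference only
    }
    return sub;
  }

  public static void main (String[] args) {
    Deck deck = new Deck ();
    Deck hand = subdeck (deck, 3, 7);

    System.out.println (hand.cards.length);                // 5
    System.out.println (hand.cards[0] == deck.cards[3]);   // true: same object
  }
}
```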

As an exercise, write a version of findBisect that takes a
subdeck as an argument, rather than a deck and an index range.  Which
version is more error-prone?  Which version do you think is more
efficient?

Shuffling and dealing
 shuffling
 dealing

In Section shuffle I wrote pseudocode for a shuffling algorithm.
Assuming that we have a method called shuffleDeck that takes
a deck as an argument and shuffles it, we can create and shuffle
a deck:

verbatim
    Deck deck = new Deck ();
    shuffleDeck (deck);
verbatim

Then, to deal out several hands, we can use subdeck:

verbatim
   Deck hand1 = subdeck (deck, 0, 4);
   Deck hand2 = subdeck (deck, 5, 9);
   Deck pack = subdeck (deck, 10, 51);
verbatim

This code puts the first 5 cards in one hand, the next 5 cards
in the other, and the rest into the pack.

When you thought about dealing, did you think we should give out one
card at a time to each player in the round-robin style that is common
in real card games?  I thought about it, but then realized that it is
unnecessary for a computer program.  The round-robin convention is
intended to mitigate imperfect shuffling and make it more difficult
for the dealer to cheat.  Neither of these is an issue for a computer.

This example is a useful reminder of one of the dangers of engineering
metaphors: sometimes we impose restrictions on computers that are
unnecessary, or expect capabilities that are lacking, because we
unthinkingly extend a metaphor past its breaking point.  Beware of
misleading analogies.

Mergesort
 efficiency
 sorting
 mergesort
mergesort

In Section sorting, we saw a simple sorting algorithm that turns
out not to be very efficient.  In order to sort $n$ items, it has to
traverse the array $n$ times, and each traversal takes an amount of
time that is proportional to $n$.  The total time, therefore, is
proportional to $n^2$.

In this section I will sketch a more efficient algorithm called 
mergesort.  To sort $n$ items, mergesort takes time proportional to
$n \log n$.  That may not seem impressive, but as $n$ gets big, the
difference between $n^2$ and $n \log n$ can be enormous.  Try out a
few values of $n$ and see.
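If you would rather let the computer do the arithmetic, here is a small sketch that prints both quantities for a few sizes.  I am assuming a base-2 logarithm; a different base would only change the results by a constant factor.  The class and method names are made up.

```java
class GrowthDemo {
  static double nSquared (int n) {
    return (double) n * n;
  }

  // n times the base-2 logarithm of n
  static double nLogN (int n) {
    return n * (Math.log (n) / Math.log (2));
  }

  public static void main (String[] args) {
    int[] sizes = { 10, 100, 1000, 1000000 };
    for (int i = 0; i < sizes.length; i++) {
      int n = sizes[i];
      System.out.println (n + "\t" + nSquared (n) + "\t" + nLogN (n));
    }
  }
}
```

For a million items the first quantity is a trillion, while the second is only about twenty million.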

The basic idea behind mergesort is this: if you have two subdecks,
each of which has been sorted, it is easy (and fast) to merge them
into a single, sorted deck.  Try this out with a deck of cards:

enumerate

Form two subdecks with about 10 cards each and sort
them so that when they are face up the lowest cards are on
top.  Place both decks face up in front of you.

Compare the top card from each deck and choose the
lower one.  Flip it over and add it to the merged deck.

Repeat step two until one of the decks is empty.
Then take the remaining cards and add them to the merged
deck.

enumerate

The result should be a single sorted deck.  Here's what this
looks like in pseudocode:

verbatim
  public static Deck merge (Deck d1, Deck d2) {
    // create a new deck big enough for all the cards
    Deck result = new Deck (d1.cards.length + d2.cards.length);

    // use the index i to keep track of where we are in
    // the first deck, and the index j for the second deck
    int i = 0;
    int j = 0;

    // the index k traverses the result deck
    for (int k = 0; k<result.cards.length; k++) {

      // if d1 is empty, d2 wins; if d2 is empty, d1 wins;
      // otherwise, compare the two cards

      // add the winner to the new deck
    }
    return result;
  }
verbatim

The best way to test merge is to build and shuffle a deck,
use subdeck to form two (small) hands, and then use the sort
routine from Section sorting to sort the two halves.  Then
you can pass the two halves to merge to see if it works.
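Here is one way the pseudocode might be filled in, as a sketch to compare against your own attempt.  It assumes the suit-first compareCard from the previous chapter, and when two cards tie it takes the one from the first deck.  Card and Deck are stand-ins for the classes in the text.

```java
// Stand-ins for the Card and Deck classes in the text (assumed shape).
class Card {
  int suit, rank;

  Card (int suit, int rank) {
    this.suit = suit;
    this.rank = rank;
  }
}

class Deck {
  Card[] cards;

  Deck (int n) {
    cards = new Card[n];
  }
}

class MergeSketch {
  static int compareCard (Card c1, Card c2) {
    if (c1.suit > c2.suit) return 1;
    if (c1.suit < c2.suit) return -1;
    if (c1.rank > c2.rank) return 1;
    if (c1.rank < c2.rank) return -1;
    return 0;
  }

  public static Deck merge (Deck d1, Deck d2) {
    Deck result = new Deck (d1.cards.length + d2.cards.length);
    int i = 0;   // next unused card in d1
    int j = 0;   // next unused card in d2

    for (int k = 0; k < result.cards.length; k++) {
      // if d1 is used up, d2 wins; if d2 is used up, d1 wins;
      // otherwise the lower card wins (ties go to d1)
      boolean takeFirst;
      if (i == d1.cards.length) {
        takeFirst = false;
      } else if (j == d2.cards.length) {
        takeFirst = true;
      } else {
        takeFirst = compareCard (d1.cards[i], d2.cards[j]) <= 0;
      }

      // add the winner to the new deck
      if (takeFirst) {
        result.cards[k] = d1.cards[i];
        i++;
      } else {
        result.cards[k] = d2.cards[j];
        j++;
      }
    }
    return result;
  }
}
```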

 testing

If you can get that working, try a simple implementation of
mergeSort:

verbatim
  public static Deck mergeSort (Deck deck) {
    // find the midpoint of the deck
    // divide the deck into two subdecks
    // sort the subdecks using sortDeck
    // merge the two halves and return the result
  }
verbatim

Then, if you get that working, the real fun begins!  The magical thing
about mergesort is that it is recursive.  At the point where you sort
the subdecks, why should you invoke the old, slow version of 
sort?  Why not invoke the spiffy new mergeSort you are in the
process of writing?

 recursion

Not only is that a good idea, it is necessary in order to
achieve the performance advantage I promised.  In order to make it
work, though, you have to add a base case so that it doesn't recurse
forever.  A simple base case is a subdeck with 0 or 1 cards.  If 
mergesort receives such a small subdeck, it can return it
unmodified, since it is already sorted.

The recursive version of mergesort should look something
like this:

verbatim
  public static Deck mergeSort (Deck deck) {
    // if the deck is 0 or 1 cards, return it

    // find the midpoint of the deck
    // divide the deck into two subdecks
    // sort the subdecks using mergesort
    // merge the two halves and return the result
  }
verbatim

As usual, there are two ways to think about recursive programs:
you can think through the entire flow of execution, or you
can make the ``leap of faith.''  I have deliberately constructed
this example to encourage you to make the leap of faith.

 leap of faith

When you were using sortDeck to sort the subdecks, you didn't
feel compelled to follow the flow of execution, right?  You just
assumed that the sortDeck method would work because you already
debugged it.  Well, all you did to make mergeSort recursive was
replace one sort algorithm with another.  There is no reason to read
the program differently.

Well, actually you have to give some thought to getting the
base case right and making sure that you reach it eventually,
but other than that, writing the recursive version should be
no problem.  Good luck!
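Once you have made your own attempt, you might compare it with the following sketch.  It is one possible version, not the definitive one.  It assumes the suit-first compareCard from the previous chapter and the subdeck method from Section Subdecks, and it includes a compact merge so that it can run on its own.  Card and Deck are stand-ins for the classes in the text.

```java
// Stand-ins for the Card and Deck classes in the text (assumed shape).
class Card {
  int suit, rank;

  Card (int suit, int rank) {
    this.suit = suit;
    this.rank = rank;
  }
}

class Deck {
  Card[] cards;

  Deck (int n) {
    cards = new Card[n];
  }
}

class MergeSortSketch {
  static int compareCard (Card c1, Card c2) {
    if (c1.suit > c2.suit) return 1;
    if (c1.suit < c2.suit) return -1;
    if (c1.rank > c2.rank) return 1;
    if (c1.rank < c2.rank) return -1;
    return 0;
  }

  static Deck subdeck (Deck deck, int low, int high) {
    Deck sub = new Deck (high-low+1);
    for (int i = 0; i < sub.cards.length; i++) {
      sub.cards[i] = deck.cards[low+i];
    }
    return sub;
  }

  static Deck merge (Deck d1, Deck d2) {
    Deck result = new Deck (d1.cards.length + d2.cards.length);
    int i = 0;
    int j = 0;
    for (int k = 0; k < result.cards.length; k++) {
      // take from d1 if d2 is used up, or if d1 still has cards
      // and its next card is not higher than d2's next card
      if (j == d2.cards.length ||
          (i < d1.cards.length &&
           compareCard (d1.cards[i], d2.cards[j]) <= 0)) {
        result.cards[k] = d1.cards[i++];
      } else {
        result.cards[k] = d2.cards[j++];
      }
    }
    return result;
  }

  public static Deck mergeSort (Deck deck) {
    int n = deck.cards.length;
    if (n <= 1) return deck;              // base case: already sorted

    int mid = n / 2;                      // first index of the second half
    Deck first = mergeSort (subdeck (deck, 0, mid-1));
    Deck second = mergeSort (subdeck (deck, mid, n-1));
    return merge (first, second);
  }

  public static void main (String[] args) {
    Deck deck = new Deck (4);
    deck.cards[0] = new Card (3, 1);
    deck.cards[1] = new Card (0, 12);
    deck.cards[2] = new Card (1, 11);
    deck.cards[3] = new Card (0, 2);

    Deck sorted = mergeSort (deck);
    for (int k = 0; k < sorted.cards.length; k++) {
      System.out.println (sorted.cards[k].suit + ", " + sorted.cards[k].rank);
    }
  }
}
```

Each recursive call works on a strictly smaller subdeck, which is what guarantees that the base case is eventually reached.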

Glossary

description

[pseudocode:]  A way of designing programs by writing
rough drafts in a combination of English and Java.

[helper method:]  Often a small method that does not
do anything enormously useful by itself, but which helps
another, more useful, method.

 pseudocode
 helper method
 method!helper


description


Object-oriented programming

Programming languages and styles
 programming language
 language!programming
 programming style
 object-oriented programming
 functional programming
 procedural programming
 programming!object-oriented
 programming!functional
 programming!procedural

There are many programming languages in the world, and almost as many
programming styles (sometimes called paradigms).  Three styles that
have appeared in this book are procedural, functional, and
object-oriented.  Although Java is usually thought of as an
object-oriented language, it is possible to write Java programs in any
style.  The style I have demonstrated in this book is pretty much
procedural.  Existing Java programs and the built-in Java packages are
written in a mixture of all three styles, but they tend to be more
object-oriented than the programs in this book.

It's not easy to define what object-oriented programming is,
but here are some of its characteristics:

itemize

Object definitions (classes) usually correspond to 
relevant real-world objects.  For example, in Chapter deck,
the creation of the Deck class was a step toward object-oriented
programming.

The majority of methods are object methods (the kind you
invoke on an object) rather than class methods (the kind you just
invoke).  So far all the methods we have written have been class
methods.  In this chapter we will write some object methods.

The language feature most associated with object-oriented
programming is inheritance.  I will cover inheritance later in
this chapter.

itemize

 inheritance

Recently object-oriented programming has become quite popular, and
there are people who claim that it is superior to other styles in
various ways.  I hope that by exposing you to a variety of styles I
have given you the tools you need to understand and evaluate these
claims.

Object and class methods
 object method
 method!object
 class method
 method!class
 static

There are two types of methods in Java, called class methods and
object methods.  So far, every method we have written has been a
class method.  Class methods are identified by the keyword 
static in the first line.  Any method that does not have the keyword
static is an object method.

Although we have not written any object methods, we have invoked some.
Whenever you invoke a method ``on'' an object, it's an object method.
For example, println is an object method we invoked on System.out,
which is a PrintStream object.  Also, many of the String
methods, like charAt and indexOf, are object methods.

Anything that can be written as a class method can also be written as an
object method, and vice versa.  Sometimes it is just more natural to
use one or the other.  For reasons that will be clear soon, object
methods are often shorter than the corresponding class methods.

The current object
 current object
 object!current
 this

When you invoke a method on an object, that object becomes the
current object.  Inside the method, you can refer to the instance
variables of the current object by name, without having to specify the
name of the object.

 constructor

Also, you can refer to the current object using the keyword 
this.  We have already seen this used in constructors.  In
fact, you can think of constructors as being a special kind of object
method.

Complex numbers
 complex number
 Complex
 class!Complex
 arithmetic!complex

As a running example for the rest of this chapter we will consider a
class definition for complex numbers.  Complex numbers are useful for
many branches of mathematics and engineering, and many computations
are performed using complex arithmetic.  A complex number is the sum
of a real part and an imaginary part, and is usually written in the
form $x + yi$, where $x$ is the real part, $y$ is the imaginary part,
and $i$ represents the square root of -1.
Thus, $i \cdot i = -1$.

The following is a class definition for a new object type called 
Complex:

verbatim
class Complex {

  // instance variables
  double real, imag;

  // constructor
  public Complex () {
    this.real = 0.0;  this.imag = 0.0;
  }

  // constructor
  public Complex (double real, double imag) {
    this.real = real;  this.imag = imag;
  }
}
verbatim

There should be nothing surprising here.  The instance variables
are two doubles that contain the real and imaginary parts.
The two constructors are the usual kind: one takes no parameters
and assigns default values to the instance variables, the other
takes parameters that are identical to the instance variables.
As we have seen before, the keyword this is used to refer
to the object being initialized.

 instance variable
 variable!instance
 constructor

In main, or anywhere else we want to create Complex
objects, we have the option of creating the object and then
setting the instance variables, or doing both at the same time:

verbatim
    Complex x = new Complex ();
    x.real = 1.0;
    x.imag = 2.0;
    Complex y = new Complex (3.0, 4.0);
verbatim

A function on Complex numbers
 operator!Complex
 method!function
 pure function

Let's look at some of the operations we might want to perform
on complex numbers.  The absolute value of a complex number $x + yi$
is defined to be $\sqrt{x^2 + y^2}$.  The abs method is
a pure function that computes the absolute value.  Written as
a class method, it looks like this:

verbatim
  // class method
  public static double abs (Complex c) {
    return Math.sqrt (c.real * c.real + c.imag * c.imag);
  }
verbatim

This version of abs calculates the absolute value of c,
the Complex object it receives as a parameter.  The next version
of abs is an object method; it calculates the absolute value of
the current object (the object the method was invoked on).  Thus,
it does not receive any parameters:

verbatim
  // object method
  public double abs () {
    return Math.sqrt (real*real + imag*imag);
  }
verbatim

I removed the keyword static to indicate that this is an object
method.  Also, I eliminated the unnecessary parameter.  Inside the
method, I can refer to the instance variables real and 
imag by name without having to specify an object.  Java knows
implicitly that I am referring to the instance variables of the
current object.  If I wanted to make it explicit, I could have used
the keyword this:

verbatim
  // object method
  public double abs () {
    return Math.sqrt (this.real * this.real + this.imag * this.imag);
  }
verbatim

But that would be longer and not really any clearer.  To invoke
this method, we invoke it on an object, for example

verbatim
    Complex y = new Complex (3.0, 4.0);
    double result = y.abs();
verbatim
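
Putting the pieces so far into one file gives a small program that can be compiled and run. This is only a sketch: the class name ComplexAbsDemo is made up for the example, and keeping both versions of abs in the same class is an assumption (Java allows it because their parameter lists differ).

```java
class Complex {
    // instance variables
    double real, imag;

    public Complex() {
        this.real = 0.0;
        this.imag = 0.0;
    }

    public Complex(double real, double imag) {
        this.real = real;
        this.imag = imag;
    }

    // class method: the object arrives as a parameter
    public static double abs(Complex c) {
        return Math.sqrt(c.real * c.real + c.imag * c.imag);
    }

    // object method: works on the current object
    public double abs() {
        return Math.sqrt(real * real + imag * imag);
    }
}

// Hypothetical driver class, not part of the text.
class ComplexAbsDemo {
    public static void main(String[] args) {
        Complex y = new Complex(3.0, 4.0);
        System.out.println(Complex.abs(y));   // class method, prints 5.0
        System.out.println(y.abs());          // object method, prints 5.0
    }
}
```

Because the driver is a different class, the class method has to be invoked as Complex.abs; inside Complex itself the name abs alone would do.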

Another function on Complex numbers

Another operation we might want to perform on complex numbers
is addition.  You can add complex numbers by adding the real
parts and adding the imaginary parts.  Written as a class method,
that looks like:

verbatim
  public static Complex add (Complex a, Complex b) {
    return new Complex (a.real + b.real, a.imag + b.imag);
  }
verbatim

To invoke this method, we would pass both operands as arguments:

verbatim
    Complex sum = add (x, y);
verbatim

Written as an object method, it would take only one argument,
which it would add to the current object:

verbatim
  public Complex add (Complex b) {
    return new Complex (real + b.real, imag + b.imag);
  }
verbatim

Again, we can refer to the instance variables of the current
object implicitly, but to refer to the instance variables of
b we have to name b explicitly using dot notation.
To invoke this method, you invoke it on one of the operands
and pass the other as an argument.

verbatim
    Complex sum = x.add (y);
verbatim

From these examples you can see that the current object (this)
can take the place of one of the parameters.  For this reason,
the current object is sometimes called an implicit parameter.
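
To see the implicit parameter at work, here is a minimal sketch with both versions of add side by side. The driver class ComplexAddDemo is an assumption made for the example; everything inside Complex comes from the definitions above.

```java
class Complex {
    double real, imag;

    public Complex(double real, double imag) {
        this.real = real;
        this.imag = imag;
    }

    // class method: both operands are explicit parameters
    public static Complex add(Complex a, Complex b) {
        return new Complex(a.real + b.real, a.imag + b.imag);
    }

    // object method: the current object is the implicit first operand
    public Complex add(Complex b) {
        return new Complex(real + b.real, imag + b.imag);
    }
}

// Hypothetical driver class, not part of the text.
class ComplexAddDemo {
    public static void main(String[] args) {
        Complex x = new Complex(1.0, 2.0);
        Complex y = new Complex(3.0, 4.0);

        Complex sum1 = Complex.add(x, y);   // invoked on the class
        Complex sum2 = x.add(y);            // invoked on x

        System.out.println(sum1.real + " " + sum1.imag);   // 4.0 6.0
        System.out.println(sum2.real + " " + sum2.imag);   // 4.0 6.0
    }
}
```

Both calls produce the same result; the only difference is whether the first operand appears in the parameter list or in front of the dot.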

A modifier
 modifier
 method!modifier

As yet another example, we'll look at conjugate, which is
a modifier method that transforms a Complex number into
its complex conjugate.  The complex conjugate of $x + yi$ is
$x - yi$.

As a class method, this looks like:

verbatim
  public static void conjugate (Complex c) {
    c.imag = -c.imag;
  }
verbatim

As an object method, it looks like

verbatim
  public void conjugate () {
    imag = -imag;
  }
verbatim
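
Since conjugate is a modifier, its effect shows up in the object after the call rather than in a return value. The following sketch makes that visible; the driver class ConjugateDemo is hypothetical, and both versions of the method are kept in one class only for comparison.

```java
class Complex {
    double real, imag;

    public Complex(double real, double imag) {
        this.real = real;
        this.imag = imag;
    }

    // class method: modifies the object passed as a parameter
    public static void conjugate(Complex c) {
        c.imag = -c.imag;
    }

    // object method: modifies the current object
    public void conjugate() {
        imag = -imag;
    }
}

// Hypothetical driver class, not part of the text.
class ConjugateDemo {
    public static void main(String[] args) {
        Complex z = new Complex(1.0, 2.0);

        z.conjugate();                              // object method
        System.out.println(z.real + " " + z.imag);  // 1.0 -2.0

        Complex.conjugate(z);                       // class method undoes it
        System.out.println(z.real + " " + z.imag);  // 1.0 2.0
    }
}
```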

By now you should be getting the sense that converting a method
from one kind to another is a mechanical process.  With a little
practice, you will be able to do it without giving it much
thought, which is good because you should not be constrained to
writing one kind of method or the other.  You should be equally
familiar with both so that you can choose whichever one seems
most appropriate for the operation you are writing.

For example, I think that add should be written as a class
method because it is a symmetric operation of two operands, and
it makes sense for both operands to appear as parameters.  It just
seems odd to invoke the method on one of the operands and pass
the other as an argument.

On the other hand, simple operations that apply to a single object
can be written most concisely as object methods (even if they
take some additional arguments).

The toString method
 toString
 method!toString

There are two object methods that are common to many object
types: toString and equals.  toString converts
the object to some reasonable string representation that can
be printed.  equals is used to compare objects.

When you print an object using print or println,
Java checks to see whether you have provided an object method
named toString, and if so it invokes it.  If not, it
invokes a default version of toString that produces
the output described in the earlier section on printing objects.

Here is what toString might look like for the Complex
class:

verbatim
  public String toString () {
    return real + " + " + imag + "i";
  }
verbatim

The return type for toString is String, naturally,
and it takes no parameters.  You can invoke toString in
the usual way:

verbatim
    Complex x = new Complex (1.0, 2.0);
    String s = x.toString ();
verbatim

or you can invoke it indirectly through print:

verbatim
    System.out.println (x);
verbatim

Whenever you pass an object to print or println, Java
invokes the toString method on that object and prints the result.
In this case, the output is 1.0 + 2.0i.
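
The sketch below collects the ways toString gets invoked. The driver class ToStringDemo is made up for the example; the third call relies on the fact that string concatenation also converts an object by invoking its toString method.

```java
class Complex {
    double real, imag;

    public Complex(double real, double imag) {
        this.real = real;
        this.imag = imag;
    }

    public String toString() {
        return real + " + " + imag + "i";
    }
}

// Hypothetical driver class, not part of the text.
class ToStringDemo {
    public static void main(String[] args) {
        Complex x = new Complex(1.0, 2.0);

        String s = x.toString();          // invoked in the usual way
        System.out.println(s);            // 1.0 + 2.0i

        System.out.println(x);            // invoked indirectly by println
        System.out.println("x = " + x);   // invoked by string concatenation
    }
}
```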

This version of toString does not look good if the imaginary
part is negative.  As an exercise, fix it.

The equals method
 equals
 method!equals

When you use the == operator to compare two objects,
what you are really asking is, ``Are these two things the same
object?''  That is, do both variables refer to the same location
in memory?

For many types, that is not the appropriate definition of
equality.  For example, two complex numbers are equal if their
real parts are equal and their imaginary parts are equal.

When you create a new object type, you can provide your own
definition of equality by providing an object method called
equals.  For the Complex class, this looks like:

verbatim
  public boolean equals (Complex b) {
    return (real == b.real && imag == b.imag);
  }
verbatim

By convention, equals is always an object method.  The return
type has to be boolean.
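
To make the difference between identity and equality concrete, here is a small sketch that compares the same objects both ways. The driver class EqualsDemo and the variable names are assumptions made for the example.

```java
class Complex {
    double real, imag;

    public Complex(double real, double imag) {
        this.real = real;
        this.imag = imag;
    }

    public boolean equals(Complex b) {
        return (real == b.real && imag == b.imag);
    }
}

// Hypothetical driver class, not part of the text.
class EqualsDemo {
    public static void main(String[] args) {
        Complex a = new Complex(1.0, 2.0);
        Complex b = new Complex(1.0, 2.0);   // same contents, different object
        Complex c = a;                       // second reference to the same object

        System.out.println(a == b);          // false: two distinct objects
        System.out.println(a == c);          // true: one object, two names
        System.out.println(a.equals(b));     // true: same real and imaginary parts
    }
}
```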

The documentation of equals in the Object class
provides some guidelines you should keep in mind when you
make up your own definition of equality:

quote

The equals method implements an equivalence relation: 

itemize

It is reflexive: for any reference value x, 
x.equals(x) should return true.

It is symmetric: for any reference values x and y,
x.equals(y) should return true if and only if 
y.equals(x) returns true.

It is transitive: for any reference values x, y, and
z, if x.equals(y) returns true and y.equals(z)
returns true, then x.equals(z) should return true.

It is consistent: for any reference values x and y,
multiple invocations of x.equals(y) consistently return true or
consistently return false.

For any reference value x, x.equals(null) should
return false.

itemize

quote

The definition of equals I provided satisfies all these
conditions except one.  Which one?  As an exercise, fix it.

Invoking one object method from another
 method!invoking

As you might expect, it is legal and common to invoke
one object method from another.  For example, to normalize a
complex number, you divide through (both parts) by the absolute
value.  It may not be obvious why this is useful, but it is.

Let's write the method normalize as an object method, and
let's make it a modifier.

verbatim
  public void normalize () {
    double d = this.abs();
    real = real/d;
    imag = imag/d;
  }
verbatim

The first line finds the absolute value of the current object
by invoking abs on the current object.  In this case
I named the current object explicitly, but I could have left
it out.  If you invoke one object method within another, Java
assumes that you are invoking it on the current object.
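
Here is a runnable sketch of normalize with the current object left implicit in the call to abs. The driver class NormalizeDemo is hypothetical.

```java
class Complex {
    double real, imag;

    public Complex(double real, double imag) {
        this.real = real;
        this.imag = imag;
    }

    public double abs() {
        return Math.sqrt(real * real + imag * imag);
    }

    public void normalize() {
        double d = abs();     // same as this.abs()
        real = real / d;
        imag = imag / d;
    }
}

// Hypothetical driver class, not part of the text.
class NormalizeDemo {
    public static void main(String[] args) {
        Complex z = new Complex(3.0, 4.0);
        z.normalize();
        System.out.println(z.real + " " + z.imag);   // 0.6 0.8
        System.out.println(z.abs());                 // very close to 1.0
    }
}
```

One caveat about this sketch: if both parts are zero, abs returns 0.0 and the divisions produce NaN, so normalize quietly assumes that the number is not zero.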

As an exercise, rewrite normalize as a pure function. 
Then rewrite it as a class method.

Oddities and errors
 method!object
 method!class
 overloading

If you have both object methods and class methods in the same class
definition, it is easy to get confused.  A common way to organize a
class definition is to put all the constructors at the beginning,
followed by all the object methods and then all the class methods.

You can have an object method and a class method with the same
name, as long as they do not have the same number and types of
parameters.  As with other kinds of overloading, Java decides
which version to invoke by looking at the arguments you provide.

 static

Now that we know what the keyword static means, you
have probably figured out that main is a class method,
which means that there is no ``current object'' when it is invoked.

 current object
 this
 instance variable
 variable!instance

Since there is no current object in a class method, it is an
error to use the keyword this.  If you try, you might get
an error message like: ``Undefined variable: this.''  Also, you
cannot refer to instance variables without using dot notation
and providing an object name.  If you try, you might get 
``Can't make a static reference to nonstatic variable...''
This is not one of the better error messages, since it uses
some non-standard language.  For example, by ``nonstatic
variable'' it means ``instance variable.''  But once you know
what it means, you know what it means.
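
The following sketch shows both mistakes as commented-out lines next to the form that works. The driver class StaticContextDemo is made up for the example, and the messages quoted in the comments are the ones a recent javac prints; the exact wording depends on the compiler version.

```java
class Complex {
    double real, imag;

    public Complex(double real, double imag) {
        this.real = real;
        this.imag = imag;
    }

    // class method: there is no current object in here
    public static void conjugate(Complex c) {
        // imag = -imag;
        //   error: non-static variable imag cannot be referenced
        //   from a static context

        // this.imag = -this.imag;
        //   error: non-static variable this cannot be referenced
        //   from a static context

        c.imag = -c.imag;     // fine: dot notation names the object
    }
}

// Hypothetical driver class, not part of the text.
class StaticContextDemo {
    public static void main(String[] args) {
        Complex z = new Complex(1.0, 2.0);
        Complex.conjugate(z);
        System.out.println(z.imag);   // -2.0
    }
}
```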

Inheritance
 inheritance

The language feature that is most often associated with
object-oriented programming is inheritance.  Inheritance is the
ability to define a new class that is a modified version of a
previously-defined class (including built-in classes).

The primary advantage of this feature is that you can add new methods
or instance variables to an existing class without modifying the
existing class.  This is particularly useful for built-in classes,
since you can't modify them even if you want to.

The reason inheritance is called ``inheritance'' is that the
new class inherits all the instance variables and methods
of the existing class.  Extending this metaphor, the existing
class is sometimes called the parent class.

Drawable rectangles
 Rectangle
 class!Rectangle
 drawable

As an example of inheritance, we are going to take the existing
Rectangle class and make it ``drawable.''  That is, we are going to
create a new class called DrawableRectangle that will have all
the instance variables and methods of a Rectangle, plus an
additional method called draw that will take a Graphics
object as a parameter and draw the rectangle.

The class definition looks like this:

verbatim
import java.awt.*;

class DrawableRectangle extends Rectangle {

  public void draw (Graphics g) {
    g.drawRect (x, y, width, height);
  }
}
verbatim

Yes, that's really all there is in the whole class definition.  The
first line imports the java.awt package, which is where 
Rectangle and Graphics are defined.

 AWT
 import
 statement!import

The next line indicates that DrawableRectangle inherits from
Rectangle.  The keyword extends is used to identify the
parent class.

The rest is the definition of the draw method, which refers to
the instance variables x, y, width and height.
It might seem odd to refer to instance variables that don't appear in
this class definition, but remember that they are inherited from the
parent class.

To create and draw a DrawableRectangle, you could use
the following:

verbatim
  public static void draw
	       (Graphics g, int x, int y, int width, int height) {
    DrawableRectangle dr = new DrawableRectangle ();
    dr.x = 10;  dr.y = 10;
    dr.width = 200;  dr.height = 200;
    dr.draw (g);
  }
verbatim

The parameters of draw are a Graphics object and
the bounding box of the drawing area (not the coordinates of the
rectangle).

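It might seem odd to use the new command for a class
that has no constructors.  When a class definition contains no
constructors, Java provides a default constructor that takes no
parameters and invokes the parameterless constructor of the parent
class, so there is no problem there.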

 constructor

We can set the instance variables of dr and invoke methods
on it in the usual way.  When we invoke draw, Java invokes
the method we defined in DrawableRectangle.  If we invoked
grow or some other Rectangle method on dr, Java
would know to use the method defined in the parent class.
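
If you want to try DrawableRectangle without opening a window, one option is to draw into an off-screen image. The sketch below does that; using BufferedImage as the drawing surface, the 300 by 300 image size, and the names DrawableRectangleDemo, render and isWhite are all assumptions made for the example, not something the text prescribes.

```java
import java.awt.Color;
import java.awt.Graphics;
import java.awt.Rectangle;
import java.awt.image.BufferedImage;

class DrawableRectangle extends Rectangle {
    public void draw(Graphics g) {
        g.drawRect(x, y, width, height);
    }
}

// Hypothetical driver class, not part of the text.
class DrawableRectangleDemo {
    // Draws one rectangle into an off-screen image and returns the image.
    static BufferedImage render() {
        BufferedImage image =
            new BufferedImage(300, 300, BufferedImage.TYPE_INT_RGB);
        Graphics g = image.getGraphics();
        g.setColor(Color.WHITE);

        DrawableRectangle dr = new DrawableRectangle();
        dr.x = 10;  dr.y = 10;
        dr.width = 200;  dr.height = 200;
        dr.draw(g);          // method defined in DrawableRectangle
        g.dispose();
        return image;
    }

    // Returns true if the pixel at (px, py) was painted white.
    static boolean isWhite(BufferedImage image, int px, int py) {
        return (image.getRGB(px, py) & 0xFFFFFF) == 0xFFFFFF;
    }

    public static void main(String[] args) {
        BufferedImage image = render();
        System.out.println(isWhite(image, 10, 10));   // true: on the outline
        System.out.println(isWhite(image, 50, 50));   // false: inside, untouched

        DrawableRectangle dr = new DrawableRectangle();
        dr.width = 200;  dr.height = 200;
        dr.grow(5, 5);       // method inherited from Rectangle
        System.out.println(dr.width);                 // 210
    }
}
```

The call to grow works even though DrawableRectangle never mentions it, because the method is inherited from Rectangle.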

The class hierarchy
 class hierarchy
 Object
 parent class
 class!parent

In Java, every class except one extends some other class.  The
exception is the most basic class, called Object.  It contains no
instance variables, but it does provide the methods equals and
toString, among others.

Many classes extend Object, including almost all of the classes
we have written and many of the built-in classes, like 
Rectangle.  Any class that does not explicitly name a parent inherits
from Object by default.

Some inheritance chains are longer, though.  For example, Frame
extends Window, which extends Container, which extends
Component, which extends Object.  No matter how long the
chain, Object is the ultimate parent of all classes.

All the classes in Java can be organized into a ``family tree'' that
is called the class hierarchy.  Object usually appears at the
top, with all the ``child'' classes below.  If you look at the
documentation of Frame, for example, you will see the part of
the hierarchy that makes up Frame's pedigree.
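
You can also ask Java for a pedigree at run time. The sketch below walks up the chain of parent classes using getSuperclass, which returns null once it steps past Object. The class name HierarchyDemo and the method name pedigree are made up for the example.

```java
import java.awt.Frame;

// Hypothetical helper class, not part of the text.
class HierarchyDemo {
    // Returns the chain of parent classes, starting with c itself,
    // with the names separated by " -> ".
    static String pedigree(Class<?> c) {
        StringBuilder sb = new StringBuilder();
        while (c != null) {
            sb.append(c.getName());
            c = c.getSuperclass();       // null after Object
            if (c != null) {
                sb.append(" -> ");
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(pedigree(String.class));
        System.out.println(pedigree(Frame.class));
    }
}
```

For String the chain is short: java.lang.String -> java.lang.Object. For Frame it passes through Window, Container and Component before reaching Object.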

Object-oriented everything?
 object-oriented design

Inheritance is a powerful feature.  Some programs that would be
complicated without inheritance can be written concisely and simply
with it.  Also, inheritance can facilitate code reuse, since you can
customize the behavior of built-in classes without having to modify
them.

On the other hand, inheritance can make programs difficult to read,
since it is sometimes not clear, when a method is invoked, where to
find the definition.  For example, one of the methods you can invoke
on a Frame is getBounds.  Can you find the documentation
for getBounds?  It turns out that getBounds is defined in
the parent of the parent of the parent of Frame.

Also, many of the things that can be done using inheritance can
be done almost as elegantly (or more so) without it.

Glossary

description

[object method:]  A method that is invoked on an object,
and that operates on that object, which is referred to by
the keyword this in Java or ``the current object'' in
English.  Object methods do not have the keyword static.

[class method:]  A method with the keyword static.
Class methods are not invoked on objects and they do not have
a current object.

[current object:]  The object on which an object method
is invoked.  Inside the method,
the current object is referred to by this.

[this:]  The keyword that refers to the current object.

[implicit:]  Anything that is left unsaid or implied.  Within
an object method, you can refer to the instance variables
implicitly (without naming the object).

[explicit:]  Anything that is spelled out completely.  Within
a class method, all references to the instance variables have to
be explicit.

 object method
 class method
 current object
 this
 implicit
 explicit

description


Linked lists
list

References in objects

In the last chapter we saw that the instance variables of an
object can be arrays, and I mentioned that they can be objects,
too.

One of the more interesting possibilities is that an object
can contain a reference to another object of the same type.
There is a common data structure, the list, that takes advantage
of this feature.

Lists are made up of nodes, where each node contains a
reference to the next node in the list.  In addition, each node
usually contains a unit of data called the cargo.  In our
first example, the cargo will be a single integer, but later we
will see how to write a generic list that can contain objects
of any type.

The Node class

As usual when we write a new class, we'll start with the instance
variables, one or two constructors and toString so that we
can test the basic mechanism of creating and displaying the new
type.

verbatim
public class Node {
    int cargo;
    Node next;

    public Node () {
        cargo = 0;
        next = null;
    }

    public Node (int cargo, Node next) {
        this.cargo = cargo;
        this.next = next;
    }

    public String toString () {
        return cargo + "";
    }
}
verbatim

The declarations of the instance variables follow naturally
from the specification, and the rest follows mechanically from
the instance variables.  The expression cargo + "" is
an awkward but concise way to convert an integer to a String.

To test the implementation so far, we would put something like
this in main:

verbatim
    Node node = new Node (1, null);
    System.out.println (node);
verbatim

The result is simply

verbatim
1
verbatim

To make that interesting, we need a list with more than
one node!  First let's create three nodes:

verbatim
    Node node1 = new Node (1, null);
    Node node2 = new Node (2, null);
    Node node3 = new Node (3, null);
verbatim

This code creates three nodes, but we don't yet have a list
because the nodes are not linked.  The state diagram
looks like this:

figure=list1.eps

To link up the nodes, we have to make the first node refer to the
second and the second node refer to the third.

verbatim
    node1.next = node2;
    node2.next = node3;
    node3.next = null;
verbatim

The reference of the third node is null, which indicates that
it is the end of the list.  Now the state diagram looks like:

figure=list2.eps

Now we know how to create nodes and link them into lists.  What
might be less clear at this point is why.

Lists as collections

The thing that makes lists useful is that they are a way
of collecting information into a single object.  In the example,
the first node of the list serves as a reference to the entire
list.

If we want to pass the list as a parameter, all
we have to pass is a reference to the first node.  The method
printList takes a single node as an argument.  Starting
with the head of the list, it prints each node until it gets
to the end (indicated by the null reference).

verbatim
    public static void printList (Node list) {
        Node node = list;

        while (node != null) {
            System.out.print (node);
            node = node.next;
        }
        System.out.println ();
    }
verbatim

To invoke this method we just have to pass a reference to the
first node:

verbatim
        printList (node1);
verbatim

Inside printList we have a reference to the first node
of the list, but there is no variable that refers to the other
nodes.  We have to use the next value from each node
to get to the next node.

The diagram shows the value of list and the values that
node takes on:

figure=list3.eps

This way of moving through a list is called a traversal,
just like the similar pattern of moving through the elements of
an array.

The output of this method is

verbatim
123
verbatim
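
For reference, here is the whole example assembled into one runnable file: the Node class, the three linked nodes, and the traversal. The class name PrintListDemo and the method buildList are assumptions made for the sketch.

```java
class Node {
    int cargo;
    Node next;

    public Node(int cargo, Node next) {
        this.cargo = cargo;
        this.next = next;
    }

    public String toString() {
        return cargo + "";
    }
}

// Hypothetical driver class, not part of the text.
class PrintListDemo {
    // Builds the three-node list 1 -> 2 -> 3 and returns its first node.
    static Node buildList() {
        Node node1 = new Node(1, null);
        Node node2 = new Node(2, null);
        Node node3 = new Node(3, null);
        node1.next = node2;
        node2.next = node3;
        return node1;
    }

    public static void printList(Node list) {
        Node node = list;
        while (node != null) {
            System.out.print(node);
            node = node.next;
        }
        System.out.println();
    }

    public static void main(String[] args) {
        printList(buildList());   // prints 123
    }
}
```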

By convention, lists are usually printed in parentheses with commas
between the elements, as in (1, 2, 3).  As an exercise, modify
printList so that it generates output in this format.

As another exercise, rewrite printList using a for loop
instead of a while loop.

Lists and recursion

Recursion and lists go together like fava beans and a nice
Chianti.  For example, here is a recursive algorithm for printing
a list backwards:

enumerate

Separate the list into two pieces: the first node (called
the head) and the rest (called the tail).

Print the tail backwards.

Print the head.

enumerate

Of course, Step 2, the recursive call, assumes that we have a
way of printing a list backwards.  But if we assume that
the recursive call works, then it should be clear that this
algorithm will print the list backwards.

Now all we need is a base case, and a way of proving that for
any list we will eventually get to the base case.  A natural
choice for the base case is a list with a single element, but
an even better choice is the empty list, represented by null.

verbatim
    public static void printBackward (Node list) {
        if (list == null) return;

        Node head = list;
        Node tail = list.next;

        printBackward (tail);
        System.out.print (head);
    }
verbatim

The first line handles the base case by doing nothing.  The
next two lines split the list into head and tail.
The last two lines print the list.

We invoke this method exactly as we invoked printList:

verbatim
        printBackward (node1);
verbatim

The result is a backwards list.
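
A runnable version of the recursive method, as a sketch; the driver class PrintBackwardDemo is made up for the example, and the list is built with nested constructor calls to keep it short.

```java
class Node {
    int cargo;
    Node next;

    public Node(int cargo, Node next) {
        this.cargo = cargo;
        this.next = next;
    }

    public String toString() {
        return cargo + "";
    }
}

// Hypothetical driver class, not part of the text.
class PrintBackwardDemo {
    public static void printBackward(Node list) {
        if (list == null) return;     // base case: the empty list

        Node head = list;
        Node tail = list.next;

        printBackward(tail);          // print the tail backwards
        System.out.print(head);       // then print the head
    }

    public static void main(String[] args) {
        // the list 1 -> 2 -> 3, built from the last node to the first
        Node node1 = new Node(1, new Node(2, new Node(3, null)));
        printBackward(node1);         // prints 321
        System.out.println();
    }
}
```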

Can we prove that this method will always terminate?   In other
words, will it always reach the base case?  In fact, the answer
is no.  There are some lists that will make this method crash.


Infinite lists

There is nothing to prevent a node from referring back to
an earlier node in the list, including itself.  For example,
this figure shows a list with two nodes, one of which refers
to itself.

figure=list4.eps

If we invoke printList on this list, it will loop forever.
If we invoke printBackward it will recurse infinitely.
This sort of behavior makes infinite lists difficult to work
with.

Nevertheless, they are occasionally useful.  For example, we
might represent a number as a list of digits and use an infinite
list to represent a repeating fraction.

Regardless, it is problematic that we cannot prove that printList
and printBackward terminate.  The best we can do is the
hypothetical statement, ``If the list contains no loops, then these
methods will terminate.''  This sort of claim is called a 
precondition.  It imposes a constraint on one of the parameters and
describes the behavior of the method if the constraint is satisfied.
We will see more examples soon.
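
If you would rather check the precondition than trust it, one possibility is a method that reports whether a list contains a loop. The sketch below uses a technique the text does not cover, often called the tortoise and the hare: two references move through the list at different speeds, and they can only meet if the list loops back on itself. The names LoopCheckDemo and hasLoop are hypothetical.

```java
class Node {
    int cargo;
    Node next;

    public Node(int cargo, Node next) {
        this.cargo = cargo;
        this.next = next;
    }
}

// Hypothetical helper class, not part of the text.
class LoopCheckDemo {
    // Returns true if following next references from list
    // ever leads back to a node that was already visited.
    static boolean hasLoop(Node list) {
        Node slow = list;     // advances one node per step
        Node fast = list;     // advances two nodes per step

        while (fast != null && fast.next != null) {
            slow = slow.next;
            fast = fast.next.next;
            if (slow == fast) return true;   // they met inside a loop
        }
        return false;         // fast reached the end of the list
    }

    public static void main(String[] args) {
        Node node3 = new Node(3, null);
        Node node2 = new Node(2, node3);
        Node node1 = new Node(1, node2);
        System.out.println(hasLoop(node1));   // false

        node3.next = node2;                   // make the list loop back
        System.out.println(hasLoop(node1));   // true
    }
}
```

Unlike printList, this method terminates for every list, with or without loops, so it is safe to call before a traversal.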


The fundamental ambiguity theorem

There is a part of printBackward that might have raised
an eyebrow:

verbatim
        Node head = list;
        Node tail = list.next;
verbatim

After the first assignment, head and list have the same
type and the same value.  So why did I create a new variable?

The reason is that the two variables play different roles.  We think
of head as a reference to a single node, and we think of
list as a reference to the first node of a list.  These
``roles'' are not part of the program; they are in the mind of the
programmer.

The second assignment creates a new reference to the second node
in the list, but in this case we think of it as a list.
So, even though head and tail have the same
type, they play different roles.

This ambiguity is useful, but it can make programs with lists
difficult to read.  I often use variable names like node
and list to document how I intend to use a variable, and
sometimes I create additional variables to disambiguate.

I could have written printBackward without head
and tail, but I think it makes it harder to understand:

verbatim
    public static void printBackward (Node list) {
        if (list == null) return;

        printBackward (list.next);
        System.out.print (list);
    }

verbatim

Looking at the two function calls, we have to remember that
printBackward treats its argument as a list and print
treats its argument as a single object.

Object methods for nodes

You might have wondered why printList and printBackward
are class methods.  I have made the claim that anything that can
be done with class methods can also be done with object methods;
it's just a question of which form is cleaner.

In this case it turns out that there is a more compelling reason
to make these class methods.  It is perfectly legal to send null
as an argument to a class method, but it is not legal to invoke
an object method on a null object.

verbatim
	Node node = null;
	printList (node);       // legal
	node.printList ();      // NullPointerException
verbatim

This limitation makes it awkward to write list-manipulating
code in a clean, object-oriented style.  A little later we
will see a way to get around this, though.
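
The sketch below runs both calls and catches the exception so that the program can report it. The object-method version of printList and the driver class NullDemo are assumptions made for the example.

```java
class Node {
    int cargo;
    Node next;

    public Node(int cargo, Node next) {
        this.cargo = cargo;
        this.next = next;
    }

    public String toString() {
        return cargo + "";
    }

    // class method: works even when list is null
    public static void printList(Node list) {
        Node node = list;
        while (node != null) {
            System.out.print(node);
            node = node.next;
        }
        System.out.println();
    }

    // object method: needs a current object to be invoked on
    public void printList() {
        printList(this);
    }
}

// Hypothetical driver class, not part of the text.
class NullDemo {
    public static void main(String[] args) {
        Node node = null;

        Node.printList(node);         // legal: prints an empty line

        try {
            node.printList();         // there is no object to invoke it on
        } catch (NullPointerException e) {
            System.out.println("NullPointerException");
        }
    }
}
```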


Modifying lists

Obviously one way to modify a list is to change the cargo of
one of the nodes, but the more interesting operations are the
ones that add, remove, or reorder the nodes.

As an example, we'll write a method that removes the second
node in the list and returns a reference to the removed node.

verbatim
    public static Node removeSecond (Node list) {
        Node first = list;
        Node second = list.next;

        // make the first node refer to the third
        first.next = second.next;

        // separate the second node from the rest of the list
        second.next = null;
        return second;
    }
verbatim

Again, I am using temporary variables to make the code more
readable.  Here is how to use this method.

verbatim
        printList (node1);
        Node removed = removeSecond (node1);
        printList (removed);
        printList (node1);
verbatim

The output is

verbatim
(1, 2, 3)           the original list
(2)                 the removed node
(1, 3)              the modified list
verbatim

Here is a state diagram showing the effect of this operation.

figure=list5.eps

What happens if we invoke this method and pass a list with only
one element (a singleton)?  What happens if we pass the empty list as an
argument?  Is there a precondition for this method?


Wrappers and helpers

For some list operations it is useful to divide the labor into
two methods.  For example, to print a list backwards in the
conventional list format, (3, 2, 1), we can use the
printBackward method to print 3, 2, but we need
a separate method to print the parentheses and the first node.
We'll call it printBackwardNicely.

verbatim
    public static void printBackwardNicely (Node list) {
        System.out.print ("(");

        if (list != null) {
            Node head = list;
            Node tail = list.next;
            printBackward (tail);
            System.out.print (head);
        }
        System.out.println (")");
    }
verbatim

Again, it is a good idea to check methods like this to see
if they work with special cases like an empty list or
a singleton.

Elsewhere in the program, when we use this method, we will
invoke printBackwardNicely directly and it will invoke
printBackward on our behalf.  In that sense, 
printBackwardNicely acts as a wrapper, and it uses
printBackward as a helper.
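
Here is the wrapper and its helper in one runnable sketch. It assumes a variant of printBackward that prints a comma and a space after each node, which is what produces the conventional format; the driver class WrapperDemo is also made up for the example.

```java
class Node {
    int cargo;
    Node next;

    public Node(int cargo, Node next) {
        this.cargo = cargo;
        this.next = next;
    }

    public String toString() {
        return cargo + "";
    }
}

// Hypothetical driver class, not part of the text.
class WrapperDemo {
    // helper: prints the nodes in reverse order, each followed by ", "
    // (the separator is an assumption of this sketch)
    static void printBackward(Node list) {
        if (list == null) return;
        printBackward(list.next);
        System.out.print(list + ", ");
    }

    // wrapper: adds the parentheses and prints the first node last
    static void printBackwardNicely(Node list) {
        System.out.print("(");

        if (list != null) {
            Node head = list;
            Node tail = list.next;
            printBackward(tail);
            System.out.print(head);
        }
        System.out.println(")");
    }

    public static void main(String[] args) {
        Node list = new Node(1, new Node(2, new Node(3, null)));
        printBackwardNicely(list);                // (3, 2, 1)
        printBackwardNicely(new Node(1, null));   // (1)
        printBackwardNicely(null);                // ()
    }
}
```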

The List class

There are a number of subtle problems with the way we have been
implementing lists.  In an odd reversal of cause and effect, I will
propose an alternative implementation first and then explain what
problems it solves.

First, we will create a new class called List.  Its instance
variables are an integer that contains the length of the list and a
reference to the first node in the list.  List objects (with an upper
case L) serve as handles for manipulating lists of Node objects.

verbatim
public class List {
    int length;
    Node head;

    public List () {
        length = 0;
        head = null;
    }
}
verbatim

One nice thing about the List class is that it gives
us a natural place to put wrapper functions like
printBackwardNicely, which we can make an object
method in the List class.

verbatim
    public void printBackward () {
        System.out.print ("(");

        if (head != null) {
            Node tail = head.next;
            Node.printBackward (tail);
            System.out.print (head);
        }
        System.out.println (")");
    }
verbatim

Just to make things confusing, I renamed printBackwardNicely.
Now there are two methods named printBackward: one in the 
Node class (the helper) and one in the List class (the
wrapper).  In order for the wrapper to invoke the helper, it has to
identify the class explicitly (Node.printBackward).

What happens to printBackward if you reverse the order
of the two statements?

verbatim
            Node.printBackward (tail);
            System.out.print (head);
verbatim

Aside from providing a nice place to put wrapper functions,
the List class also makes it easier to add or remove
the first element of a list.  For example, addNewFirstNode
is an object method for Lists; it
takes a node as an argument and places it at the beginning of
the list.

verbatim
    public void addNewFirstNode (Node node) {
        node.next = head;
        head = node;
        length++;
    }
verbatim

As always, to check code like this it is a good idea to think about
the special cases.  For example, what happens if the list is initially
empty?
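
A small sketch for experimenting with addNewFirstNode: it starts from an empty List and adds two nodes. The driver class ListDemo is hypothetical; Node and List are cut down to the parts this example needs.

```java
class Node {
    int cargo;
    Node next;

    public Node(int cargo, Node next) {
        this.cargo = cargo;
        this.next = next;
    }
}

class List {
    int length;
    Node head;

    public List() {
        length = 0;
        head = null;
    }

    public void addNewFirstNode(Node node) {
        node.next = head;     // the old first node, or null if the list was empty
        head = node;
        length++;
    }
}

// Hypothetical driver class, not part of the text.
class ListDemo {
    public static void main(String[] args) {
        List list = new List();
        list.addNewFirstNode(new Node(2, null));
        list.addNewFirstNode(new Node(1, null));

        System.out.println(list.length);            // 2
        System.out.println(list.head.cargo);        // 1
        System.out.println(list.head.next.cargo);   // 2
    }
}
```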


Invariants

Some lists are ``well-formed;'' others are not.  For example, if
a list contains a loop, it will cause many of our methods to
crash, so we might want to require that lists contain no loops.
Another requirement is that the length value in the List
object should be equal to the actual number of nodes in the list.

Requirements like this are called invariants because, ideally,
they should be true of every object all the time.  Specifying invariants
for objects is a useful programming practice because it makes it
easier to prove the correctness of code, check the integrity of
data structures, and detect errors.

One thing that is sometimes confusing about invariants is that
there are some times when they are violated.  For example, in the
middle of addNewFirstNode, after we have added the node, but
before we have incremented length, the invariant is
violated.  This kind of violation is acceptable; in fact, it is
often impossible to modify an object without violating an
invariant for at least a little while.  Normally the requirement
is that every method that violates an invariant must restore
the invariant.

If there is any significant stretch of code in which the invariant
is violated, it is important for the comments to make that clear,
so that no operations are performed that depend on the invariant.
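
One way to make these invariants concrete is to write a method that
checks them.  The following is only a sketch: IntNode, IntList,
and isWellFormed are hypothetical names that stand in for the Node
and List classes of this chapter, and the check relies on the
assumption that a list with a loop will appear to have more nodes than
length claims.

```java
public class InvariantDemo {

    // Hypothetical stand-in for the chapter's Node class.
    static class IntNode {
        int cargo;
        IntNode next;

        IntNode(int cargo, IntNode next) {
            this.cargo = cargo;
            this.next = next;
        }
    }

    // Hypothetical stand-in for the chapter's List class.
    static class IntList {
        int length = 0;
        IntNode head = null;

        void addNewFirstNode(IntNode node) {
            node.next = head;
            head = node;
            // the invariant is violated here: length is one too small
            length++;
            // the invariant is restored before the method returns
        }

        // Returns true if the list contains no loop and length
        // equals the actual number of nodes.
        boolean isWellFormed() {
            int count = 0;
            for (IntNode node = head; node != null; node = node.next) {
                count++;
                // more nodes than length claims: either length is
                // wrong or the list loops back on itself
                if (count > length) return false;
            }
            return count == length;
        }
    }

    public static void main(String[] args) {
        IntList list = new IntList();
        list.addNewFirstNode(new IntNode(2, null));
        list.addNewFirstNode(new IntNode(1, null));
        System.out.println(list.isWellFormed());   // true

        list.head.next.next = list.head;           // introduce a loop
        System.out.println(list.isWellFormed());   // false
    }
}
```

A method like this is mostly useful during debugging; invoking it at
the beginning and end of each method is one way to catch the moment an
invariant is broken and not restored.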


Glossary

description

[list:] A data structure that implements a collection using
a sequence of linked nodes.

[node:] An element of a list, usually implemented as an object
that contains a reference to another object of the same type.

[cargo:] An item of data contained in a node. 

[link:] An object reference embedded in an object.

[generic:] A kind of data structure that can contain data
of any type.

[precondition:] An assertion that must be true in order for a
method to work correctly.

[invariant:] An assertion that should be true of an object at
all times (except while the object is being modified).

[wrapper method:] A method that acts as a middle-man between a caller
and a helper method, often making the interface to the helper
method cleaner.

description


Stacks

Abstract data types

The data types we have looked at so far are all concrete, in the
sense that we have completely specified how they are implemented.
For example, the Card class represents a card using two
integers.  As I discussed at the time, that is not the only way
to represent a card; there are many alternative implementations.

An abstract data type, or ADT, specifies a set of operations (or
methods) and the semantics of the operations (what they do) but it
does not specify the implementation of the operations.  That's
what makes it abstract.

Why is that useful?

itemize

It simplifies the task of specifying an algorithm if you
can perform operations on objects without having to think at the
same time about how the operations are performed.

Since there are usually many ways to implement an ADT,
it might be useful to write an algorithm that can be used with
any of the possible implementations.

Well-known ADTs, like the Stack ADT in this chapter,
are often implemented in standard libraries so they can be written
once and used by many programmers.

The operations on ADTs provide a common high-level language
for specifying and talking about algorithms.

itemize


The Stack ADT

In this chapter we will look at one common ADT, the stack.  A
stack is a collection, meaning that it is a data structure that
contains multiple elements.  Other collections we have seen include
arrays and lists.

As I said, an ADT is defined by the operations you can perform
on it.  Stacks can perform only the following operations:

description

[construction:] You can create a new, empty stack.

[push:] Add a new item to the stack.

[pop:] Remove and return an item from the stack.  The item
that is returned is always the last one that was added.

[check:] Check whether the stack is empty.

description

A stack is sometimes called a ``last in, first out,'' or LIFO
data structure, because the last item added is the first to
be removed.

The Java Stack Object

Java provides a built-in object type called Stack that
implements the Stack ADT.  You should make some effort to keep
these two things---the ADT and the Java implementation---straight.
Before using the Stack class, we have to import it from
java.util.

Then the syntax for constructing a new Stack is

verbatim
    Stack stack = new Stack ();
verbatim

Initially the stack is empty, as we can confirm with the
empty method, which returns a boolean:

verbatim
    System.out.println (stack.empty ());
verbatim

A stack is a generic data structure, which means that we can
add any type of item to it.  In the Java implementation, though,
we can only add object types.  For our first example, we'll
use Node objects, as defined in the previous chapter.
Let's start by creating and printing a short list.

verbatim
    List list = new List ();
    list.addNewFirstNode (new Node (3, null));
    list.addNewFirstNode (new Node (2, null));
    list.addNewFirstNode (new Node (1, null));
    list.print ();
verbatim

The output is (1, 2, 3).  To put a Node object onto
the stack, use the push method:

verbatim
    stack.push (list.head);
verbatim

The following loop traverses the list and pushes all the nodes
onto the stack:

verbatim
for (Node node = list.head; node != null; node = node.next) {
    stack.push (node);
}
verbatim

We can remove an element from the stack with the pop method.

verbatim
    Object obj = stack.pop ();
verbatim

The return type from pop is Object!  That's because the
stack implementation doesn't really know the type of the objects it
contains.  When we pushed the Node objects, they were automatically
converted to Objects.  When we get them back from the stack,
we have to cast them back to Nodes.

verbatim
Node node = (Node) obj;
System.out.println (node);
verbatim

Unfortunately, the burden falls on the programmer to keep track of the
objects in the stack and cast them back to the right type when they
are removed.  If you try to cast an object to the wrong type, you get
a ClassCastException.
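
To see what a bad cast looks like in practice, here is a small sketch.
The class name CastDemo and the method popAsInteger are
made up for this example.  The last few lines assume a version of Java
with generics (Java 5 or later), where a type parameter lets the
compiler keep track of the element type so that no cast is needed.

```java
import java.util.Stack;

public class CastDemo {

    // Pops one item and tries to treat it as an Integer.
    // Returns a short description of what happened.
    static String popAsInteger(Stack stack) {
        Object obj = stack.pop();
        try {
            Integer n = (Integer) obj;
            return "Integer " + n;
        } catch (ClassCastException e) {
            return "not an Integer";
        }
    }

    @SuppressWarnings("unchecked")
    public static void main(String[] args) {
        Stack stack = new Stack();          // no type parameter, as in the text
        stack.push(Integer.valueOf(17));
        stack.push("seventeen");

        System.out.println(popAsInteger(stack));   // not an Integer
        System.out.println(popAsInteger(stack));   // Integer 17

        // With a type parameter, only Integers can be pushed,
        // and pop returns an Integer directly.
        Stack<Integer> numbers = new Stack<Integer>();
        numbers.push(Integer.valueOf(17));
        Integer top = numbers.pop();
        System.out.println(top);                   // 17
    }
}
```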

The following loop is a common idiom for popping all the elements
from a stack, stopping when it is empty:

verbatim
while (!stack.empty ()) {
    Node node = (Node) stack.pop ();
    System.out.print (node + " ");
}
verbatim

The output is 3 2 1.  In other words, we just used a stack
to print the elements of a list backwards!  Granted, it's not the
standard format for printing a list, but using a stack it was
remarkably easy to do.

You should compare this code to the implementations of 
printBackward in the previous chapter.  There is a natural parallel
between the recursive version of printBackward and the stack
algorithm here.  The difference is that printBackward uses the
run-time stack to keep track of the nodes while it traverses the list,
and then prints them on the way back from the recursion.  The stack
algorithm does the same thing, just using a Stack object instead
of the run-time stack.
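
The parallel is easier to see with the two versions side by side.
This is a sketch under some assumptions: the Node class here is a
minimal stand-in with an int cargo, and both methods build a String
instead of printing, so the results are easy to compare.

```java
import java.util.Stack;

public class BackwardDemo {

    // Minimal stand-in for the Node class from the previous chapter.
    static class Node {
        int cargo;
        Node next;

        Node(int cargo, Node next) {
            this.cargo = cargo;
            this.next = next;
        }
    }

    // Recursive version: the run-time stack remembers each node
    // until the recursive call on the rest of the list returns.
    static String backwardRecursive(Node node) {
        if (node == null) return "";
        return backwardRecursive(node.next) + node.cargo + " ";
    }

    // Stack version: an explicit Stack object remembers the nodes.
    static String backwardWithStack(Node head) {
        Stack<Node> stack = new Stack<Node>();
        for (Node node = head; node != null; node = node.next) {
            stack.push(node);
        }
        StringBuilder sb = new StringBuilder();
        while (!stack.empty()) {
            sb.append(stack.pop().cargo).append(" ");
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        Node list = new Node(1, new Node(2, new Node(3, null)));
        System.out.println(backwardRecursive(list));   // 3 2 1
        System.out.println(backwardWithStack(list));   // 3 2 1
    }
}
```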


Wrapper classes

For every primitive type in Java, there is a built-in object type
called a wrapper class.  For example, the wrapper class for
int is called Integer; for double it is called
Double.

Wrapper classes are useful for several reasons:

itemize

Each wrapper class contains special values (like the
minimum and maximum values for the type) and methods that are useful
for converting between types.

You can instantiate wrapper classes and create objects
that contain primitive values.  In other words, you can wrap
a primitive value up in an object, which is useful if you want
to invoke a method that requires an object type.

itemize

Creating wrapper objects

The most straightforward way to create a wrapper object is
to use its constructor:

verbatim
    Integer i = new Integer (17);
    Double d = new Double (3.14159);
    Character c = new Character ('b');
verbatim

Technically String is not a wrapper class, because there
is no corresponding primitive type, but the syntax for creating
a String object is the same:

verbatim
    String s = new String ("fred");
verbatim

On the other hand, no one ever uses the constructor for
String objects, because you can get the same effect
with a simple String value:

verbatim
    String s = "fred";
verbatim
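
In recent versions of Java (9 and later) the wrapper constructors are
marked as deprecated, although they still work.  The sketch below shows
two alternatives: the valueOf class methods, and autoboxing, in
which the compiler wraps a primitive value for you.  The class name
WrapperCreation is made up for the example.

```java
public class WrapperCreation {

    public static void main(String[] args) {
        // valueOf methods: an alternative to the constructors
        Integer i = Integer.valueOf(17);
        Double d = Double.valueOf(3.14159);
        Character c = Character.valueOf('b');

        // autoboxing: the compiler inserts the conversion
        Integer j = 17;

        System.out.println(i + " " + d + " " + c + " " + j);
        System.out.println(i.equals(j));    // true: they contain the same value
    }
}
```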


Creating more wrapper objects

Some of the wrapper classes have a second constructor that takes
a String as an argument and tries to convert
to the appropriate type.  For example:

verbatim
    Integer i = new Integer ("17");
    Double d = new Double ("3.14159");
verbatim

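The type conversion process is not very robust.
If a String is not in the right format, the constructor
throws a NumberFormatException.  For Integer, any
character that is not part of the number, including a decimal point
or a space, will cause the conversion to fail.  Double is slightly
more forgiving: it ignores spaces at the beginning and end of the
String, but it still rejects anything else that is not part of
the number.

verbatim
    Integer i = new Integer ("17.1");        // WRONG!!
    Integer j = new Integer ("17 ");         // WRONG!!
verbatim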

It is usually a good idea to check the format of the String
before you try to convert it.
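
Here is a sketch of two ways to protect a conversion.  The names
SafeParse, looksLikeInteger and parseOrDefault are
hypothetical.  The first method checks the format by hand; the second
lets the conversion fail and catches the exception.  Note that a String
of digits can pass the first check and still be too big to fit in an
int, so the second approach is the more complete of the two.

```java
public class SafeParse {

    // Returns true if s is an optional minus sign followed by
    // one or more digits, with nothing else.
    static boolean looksLikeInteger(String s) {
        if (s == null || s.length() == 0) return false;
        int start = (s.charAt(0) == '-') ? 1 : 0;
        if (start == s.length()) return false;        // just "-"
        for (int i = start; i < s.length(); i++) {
            if (!Character.isDigit(s.charAt(i))) return false;
        }
        return true;
    }

    // Converts s to an int, or returns fallback if the conversion fails.
    static int parseOrDefault(String s, int fallback) {
        try {
            return Integer.parseInt(s);
        } catch (NumberFormatException e) {
            return fallback;
        }
    }

    public static void main(String[] args) {
        System.out.println(looksLikeInteger("17"));      // true
        System.out.println(looksLikeInteger("17.1"));    // false
        System.out.println(looksLikeInteger("17 "));     // false
        System.out.println(parseOrDefault("17", 0));     // 17
        System.out.println(parseOrDefault("17.1", -1));  // -1
    }
}
```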


Getting the values out

Java knows how to print wrapper objects, so the easiest
way to extract a value is just to print the object:

verbatim
    Integer i = new Integer (17);
    Double d = new Double (3.14159);
    System.out.println (i);
    System.out.println (d);
verbatim

Alternatively, you can use the toString method to
convert the contents of the wrapper object to a String:

verbatim
    String istring = i.toString();
    String dstring = d.toString();
verbatim

Finally, if you just want to extract the primitive value
from the object, there is an object method in each wrapper
class that does the job:

verbatim
    int iprim = i.intValue ();
    double dprim = d.doubleValue ();
verbatim

There are also methods for converting wrapper objects into
different primitive types.
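
For example, every numeric wrapper class provides doubleValue,
intValue and longValue.  The following sketch (the class
name ValueDemo is made up, and it uses valueOf to create
the objects) shows what they return.

```java
public class ValueDemo {

    public static void main(String[] args) {
        Integer i = Integer.valueOf(17);
        Double d = Double.valueOf(3.14159);

        double fromInt = i.doubleValue();   // 17.0
        int fromDouble = d.intValue();      // 3: the fraction part is dropped
        long asLong = i.longValue();        // 17

        System.out.println(fromInt);
        System.out.println(fromDouble);
        System.out.println(asLong);
    }
}
```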


Useful methods in the wrapper classes

As I mentioned, the wrapper classes contain useful methods that
pertain to each type.  For example, Integer contains methods for
interpreting and printing integers in different bases.  If you have a
String that contains a number in base 6, you can convert to base
10 using parseInt.

verbatim
    String base6 = "12345";
    int base10 = Integer.parseInt (base6, 6);
    System.out.println (base10);
verbatim

Since parseInt is a class method, you invoke it by
naming the class and the method in dot notation.

Base 6 might not be all that useful, but hexadecimal
(base 16) and octal (base 8) come up often in computer
science.

In the Character class, there are lots of methods
for converting characters to upper and lower case, and for
checking whether a character is a number, letter, or symbol.
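
Here are a few of these methods in action.  This is just a sample
chosen for illustration; the class name BaseDemo is made up.

```java
public class BaseDemo {

    public static void main(String[] args) {
        // interpreting Strings in other bases
        int fromHex = Integer.parseInt("ff", 16);      // 255
        int fromOctal = Integer.parseInt("17", 8);     // 15

        // printing integers in other bases
        String hex = Integer.toString(255, 16);        // "ff"
        String binary = Integer.toBinaryString(5);     // "101"

        // converting and classifying characters
        char upper = Character.toUpperCase('b');       // 'B'
        boolean digit = Character.isDigit('7');        // true
        boolean letter = Character.isLetter('7');      // false

        System.out.println(fromHex + " " + fromOctal);
        System.out.println(hex + " " + binary);
        System.out.println(upper + " " + digit + " " + letter);
    }
}
```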


Postfix expressions

In most programming languages, mathematical expressions are
written with the operator between the two operands, as in
1+2.  This format is called infix.  An alternate
format used by some calculators is called postfix.  In
postfix, the operator follows the operands, as in 1 2+.

The reason postfix is sometimes useful is that there is a
natural way to evaluate a postfix expression using a stack.

itemize

Starting at the beginning of the expression, get one
term (operator or operand) at a time.

	itemize

	If the term is an operand, push it on the stack.

	If the term is an operator, pop two operands off
	the stack, perform the operation on them, and push the
	result back on the stack.

	itemize

When we get to the end of the expression, there should
be exactly one operand left on the stack.  That operand is the
result.

itemize

As an exercise, apply this algorithm to the expression
1 2 + 3 *.

This example demonstrates one of the advantages of postfix:
there is no need to use parentheses to control the order of
operations.  To get the same result in infix, we would have to
write (1 + 2) * 3.  What is the postfix expression that
is equivalent to 1 + 2 * 3?


Parsing

In order to implement the algorithm from the previous section,
we need to be able to traverse a string and break it into operands
and operators.  This process is an example of parsing, and
the results---the individual chunks of the string---are called
tokens.

Java provides a built-in class called StringTokenizer
that parses a string and breaks it into tokens.  To use it, you
have to import it from java.util.

In its simplest form, the StringTokenizer uses spaces
to mark the boundaries between tokens.  A character that marks
a boundary like this is called a delimiter.

We can create a StringTokenizer in the usual way, passing
as an argument the string we want to parse.

verbatim
    StringTokenizer st = new StringTokenizer ("Here are four tokens.");
verbatim

The following loop is a standard idiom for extracting the tokens
from a StringTokenizer.

verbatim
    while (st.hasMoreTokens ()) {
        System.out.println (st.nextToken());
    }
verbatim

The output is

verbatim
Here
are
four
tokens.
verbatim

For parsing expressions, we have the option of specifying additional
characters that will be used as delimiters:

verbatim
    StringTokenizer st = new StringTokenizer ("11 22+33*", " +-*/");
verbatim

The second argument is a String that lists all the characters
that will be used as delimiters.  Now the output is:

verbatim
11
22
33
verbatim

This succeeds at extracting all the operands but we have lost the
operators.  Fortunately, there is one more option for 
StringTokenizers.

verbatim
    StringTokenizer st = new StringTokenizer ("11 22+33*", " +-*/", true);
verbatim

The third argument says, ``Yes, we would like to treat the delimiters
as tokens.''  Now the output is

verbatim
11
 
22
+
33
*
verbatim

This is just the stream of tokens we would like for evaluating
this expression.
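
Putting the tokenizer together with the stack algorithm from the
previous section gives a small postfix evaluator.  What follows is one
possible sketch, not the only way to do it.  The names Postfix,
evaluate and apply are made up; the operands are assumed to
be non-negative integers; and the space tokens are simply skipped.
A malformed expression with too few operands would cause an
EmptyStackException.

```java
import java.util.Stack;
import java.util.StringTokenizer;

public class Postfix {

    // Evaluates a postfix expression made of non-negative integers
    // and the operators + - * /.
    static int evaluate(String expr) {
        Stack<Integer> stack = new Stack<Integer>();
        StringTokenizer st = new StringTokenizer(expr, " +-*/", true);

        while (st.hasMoreTokens()) {
            String token = st.nextToken();
            if (token.equals(" ")) {
                continue;                     // spaces only separate operands
            }
            if (token.length() == 1 && "+-*/".indexOf(token) >= 0) {
                int right = stack.pop();      // the second operand was pushed last
                int left = stack.pop();
                stack.push(apply(token.charAt(0), left, right));
            } else {
                stack.push(Integer.parseInt(token));
            }
        }

        int result = stack.pop();
        if (!stack.empty()) {
            throw new IllegalArgumentException("too many operands: " + expr);
        }
        return result;
    }

    // Performs one arithmetic operation.
    static int apply(char op, int left, int right) {
        switch (op) {
            case '+': return left + right;
            case '-': return left - right;
            case '*': return left * right;
            default:  return left / right;    // '/'
        }
    }

    public static void main(String[] args) {
        System.out.println(evaluate("11 22+33*"));   // (11 + 22) * 33 = 1089
    }
}
```

The order of the two pops matters for subtraction and division: the
operand on top of the stack is the right-hand one.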


Implementing ADTs

One of the fundamental goals of an ADT is to separate the
interests of the implementor, who writes the code that implements
the ADT, and the ``user'' or ``client'' who uses the ADT to
implement something else.  The implementor only has to worry
about whether the implementation is correct---in accord
with the specification of the ADT---and not how it will be used.

Conversely, the user assumes that the implementation of the
ADT is correct and doesn't worry about the details.  When you
are using one of Java's built-in classes, you have the luxury
of thinking exclusively as a user.

When you implement an ADT, on the other hand, you often have
to write client code to test it.  In that case, you sometimes
have to think carefully about which side of the line you are
on.

In this section we will switch gears and look at one way of
implementing the Stack ADT, using an array.  So put on your
implementor hat.


The array implementation of the Stack ADT
arraystack

The instance variables for this stack implementation are
an array of Objects, which will contain the items on
the stack, and an integer index which will keep track of
the next available space in the array.
Initially, the array is empty and the index is 0.

To add an element to the stack (push), we'll copy 
a reference to it onto the stack and increment the index.
To remove an element (pop) we have to decrement the
index first and then copy the element out.

Here's what that looks like in Java:

verbatim
public class Stack {
    Object[] array;
    int index;

    public Stack () {
        this.array = new Object[128];
        this.index = 0;
    }
}
verbatim

As usual, once we have chosen the instance variables, it is
a mechanical process to write a constructor.
For now, the default size is 128 items.  Later we will consider
better ways of handling this.

Checking for an empty stack is trivial.

verbatim
    public boolean empty () {
        return index == 0;
    }
verbatim

It is important to remember, though, that the number of elements in
the stack is not the same as the size of the array.  Initially the
size is 128, but the number of elements is 0.

The implementations of push and pop follow naturally from
the specification.

verbatim
    public void push (Object item) {
        array[index] = item;
        index++;
    }

    public Object pop () {
        index--;
        return array[index];
    }
verbatim

To test these methods, we can take advantage of the client code
we used to exercise the built-in Stack.  All we have to do is
comment out the line import java.util.Stack.  Then, instead
of using the stack implementation from java.util the
program will use the implementation we just wrote.

If everything goes according to plan, the program should
work without any additional changes.  Again, one of the strengths
of using an ADT is that you can change implementations without
changing client code.

Resizing arrays

A weakness of this implementation is that it chooses
an arbitrary size for the array when the Stack is created.  If
the user pushes more than 128 items onto the stack, it will cause
an ArrayIndexOutOfBoundsException.

An alternative is to let the client code specify the size of
the array.  This alleviates the problem, but it requires the client
to know ahead of time how many items are needed, and that is not
always possible.

A better solution is to check whether the array is full and make
it bigger when necessary.  Since we have no idea how big the
array needs to be, it is a reasonable strategy to start with a
small size and double it each time it overflows.

Here's the improved version of push:

verbatim
    public void push (Object item) {
        if (full ()) resize ();

        // at this point we can prove that index < array.length

        array[index] = item;
        index++;
    }
verbatim

Before putting the new item in the array, we check if the array
is full.  If so, we invoke resize.  After the if statement,
we know that either (1) there was room in the array, or (2) the
array has been resized and there is room.  If everything is
correct, then we can prove that index < array.length and 
therefore the next statement cannot cause an exception.

Now all we have to do is implement full and resize.

verbatim
    private boolean full () {
        return index == array.length;
    }

    private void resize () {
        Object[] newArray = new Object[array.length * 2];

        // we assume that the old array is full
        for (int i=0; i<array.length; i++) {
            newArray[i] = array[i];
        }
        array = newArray;
    }
verbatim

Both methods are declared private, which means that they
cannot be invoked from another class definition, only from
within this one.  This is acceptable, since there is no reason
for client code to use these functions, and desirable, since
it enforces the boundary between the implementation and the
client.

The implementation of full is trivial; it just checks
whether the index has gone beyond the range of valid indices.

The implementation of resize is straightforward, with
the caveat that it assumes that the old array is full.  In other
words, that assumption is a precondition of this method.  It is
easy to see that this precondition is satisfied, since the only
way resize is invoked is if full returns true,
which can only happen if index == array.length.

At the end of resize, we replace the old array with
the new one (causing the old to be garbage collected).  The
new array.length is twice as big as the old, and 
index hasn't changed, so now it must be true that
index < array.length.  This assertion is a postcondition
of resize: something that must be true when the method
is complete (as long as its preconditions were satisfied).

Preconditions, postconditions, and invariants are useful tools
for analyzing programs and demonstrating their correctness.
In this example I have demonstrated a programming style that
facilitates program analysis and a style of documentation that
helps demonstrate correctness.
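
The comments in push and resize can also be turned into
checks that Java performs at run time.  The sketch below collects the
pieces of this section into one class and states the precondition and
postcondition with assert statements, which are only checked
when the program is run with assertions enabled (java -ea).
Several details are my own assumptions: the class is named
ArrayStack to avoid a clash with java.util.Stack, the
constructor takes the initial size as a parameter so that a small
array can be used to exercise resize, capacity exists
only for the demonstration, and pop throws an exception when the
stack is empty.

```java
import java.util.EmptyStackException;

public class ArrayStack {
    private Object[] array;
    private int index;

    // size must be at least 1, or doubling would never make room
    public ArrayStack(int size) {
        if (size < 1) throw new IllegalArgumentException("size must be positive");
        array = new Object[size];
        index = 0;
    }

    public boolean empty() {
        return index == 0;
    }

    // exposed only so the demonstration can watch the array grow
    public int capacity() {
        return array.length;
    }

    public void push(Object item) {
        if (full()) resize();
        assert index < array.length;      // there is room for the item
        array[index] = item;
        index++;
    }

    public Object pop() {
        if (empty()) throw new EmptyStackException();
        index--;
        return array[index];
    }

    private boolean full() {
        return index == array.length;
    }

    private void resize() {
        assert full();                    // precondition
        Object[] newArray = new Object[array.length * 2];
        for (int i = 0; i < array.length; i++) {
            newArray[i] = array[i];
        }
        array = newArray;
        assert index < array.length;      // postcondition
    }

    public static void main(String[] args) {
        ArrayStack stack = new ArrayStack(2);
        for (int i = 1; i <= 5; i++) {
            stack.push(Integer.valueOf(i));
        }
        System.out.println(stack.capacity());   // 8: the array doubled twice
        while (!stack.empty()) {
            System.out.print(stack.pop() + " ");    // 5 4 3 2 1
        }
        System.out.println();
    }
}
```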


Glossary

description

[abstract data type (ADT):]  A data type (usually a collection
of objects) that is defined by a set of operations, but that can
be implemented in a variety of ways.

[wrapper class:]  One of the Java classes, like Double
and Integer, that provide objects to contain primitive types,
and methods that operate on primitives.

[infix:]  A way of writing mathematical expressions with the
operators between the operands.

[postfix:]  A way of writing mathematical expressions with the
operators after the operands.

[parse:]  To read a string of characters or tokens and analyze
their grammatical structure.

[token:]  A set of characters that are treated as a unit for
purposes of parsing, like the words in a natural language.

[delimiter:]  A character that is used to separate tokens,
like the punctuation in a natural language.

[predicate:]  A mathematical statement that is either true or
false.

[postcondition:]  A predicate that must be true at the end of
a method (provided that the preconditions were true at the
beginning).


description


Queues and Priority Queues
queue

This chapter presents two new ADTs: Queues and Priority Queues.
In real life a queue is a line of customers waiting for service
of some kind.  In most cases, the first customer in line is the
next customer to be served.  There are exceptions, though.  For
example, at airports customers whose flight is leaving imminently
are sometimes taken from the middle of the queue.  Also, at
supermarkets a polite customer might let someone with only a
few items go first. 

The rule that determines who goes next is called a 
queueing discipline.  The simplest queueing discipline is
called FIFO, for ``first-in-first-out.''  The most general
queueing discipline is priority queueing, in which each customer
is assigned a priority, and the customer with the highest priority
goes first, regardless of the order of arrival.  The reason I
say this is the most general discipline is that the priority
can be based on anything: what time a flight leaves, how many
groceries the customer has, or how important the customer is.
Of course, not all queueing disciplines are ``fair,'' but
fairness is in the eye of the beholder.

As with most ADTs, there are a number of ways to implement a queue.
Since a queue is a collection of items, we can use any of the basic
mechanisms for storing collections: arrays, lists, or vectors.
Our choice among them will be based in part on their performance---
how long it takes to perform the operations we want to perform---
and partly on ease of implementation.


The queue ADT

The queue ADT is defined by the following operations:

description

[construction:] Create a new, empty queue.

[insert:] Add a new item to the queue.

[remove:] Remove and return an item from the queue.  The item
that is returned is the first one that was added.

[empty:] Check whether the queue is empty.

description

To demonstrate a queue implementation, I will take advantage of the
List class from the chapter on lists.  Also, I will assume that
we have a class named Customer that defines all the information
about each customer, and the operations we can perform on customers.

As far as our implementation goes, it does not matter what kind of
object is in the Queue, so we can make it generic.  Here is
what the implementation looks like.

verbatim
public class Queue {
    public List list;

    public Queue () {
        list = new List ();
    }

    public boolean empty () {
        return list.empty ();
    }

    public void insert (Object obj) {
        list.addNewLastNode (obj);
    }

    public Object remove () {
        return list.removeFirstNode ();
    }
}
verbatim

A queue object contains a single instance variable, which is
the list that implements it.  For each of the other methods,
all we have to do is invoke the corresponding method from the
List class.

Veneer

An implementation like this is called a veneer.  In
real life, veneer is a thin coating of good-quality wood used
in furniture making to hide lower-quality wood underneath.
Computer scientists use this metaphor to describe a small
piece of code that hides the details of an implementation and
provides a simpler, or more standard, interface.

This example demonstrates one of the nice things about a
veneer, which is that it is easy to implement, and one of
the dangers of using a veneer, which is the performance
hazard!

Normally when we invoke a method we are not concerned with the
details of its implementation.  But there is one ``detail''
we might want to know---the performance characteristics of the
method.  How long does it take, as a function of the number
of items in the list?

First let's look at removeFirstNode.

verbatim
    public Object removeFirstNode () {
        Object result = head;
        if (head != null) {
            head = head.next;
        }
        return result;
    }
verbatim

There are no loops or function calls here, so that is a pretty
good sign that this method is ``constant time,'' which is to
say that it takes the same amount of time no matter how many
items are in the list.  Actually, it might be slightly faster
when the list is empty, since we can skip the body of the conditional,
but that difference is insignificant.

The performance of addNewLastNode is very different.

verbatim
    public void addNewLastNode (Object obj) {
        // special case: empty list
        if (head == null) {
            head = new Node (obj, null);
            return;
        }
        Node last;
        for (last = head; last.next != null; last = last.next) {
            // traverse the list to find the last node
        }
        last.next = new Node (obj, null);
    }
verbatim

The first conditional handles the special case of adding a new
node to an empty list.  In the general case, though, we have to
traverse the list to find the last element so we can make it
refer to the new node.

This traversal takes time proportional to the length of the
list.  Since the run time is a linear function of the length,
we would say that this method is ``linear time.''  Compared to
constant time, that's very bad. 
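
To get a feeling for how bad, we can count how many links
addNewLastNode has to follow while a queue is being filled.
The following sketch is an experiment, not part of the implementation:
the steps counter and the names VeneerCost and
stepsToInsert are made up, and the Node and List classes are
cut down to the bare minimum.

```java
public class VeneerCost {

    static class Node {
        Object cargo;
        Node next;

        Node(Object cargo, Node next) {
            this.cargo = cargo;
            this.next = next;
        }
    }

    static class List {
        Node head = null;
        long steps = 0;      // hypothetical counter: links followed so far

        void addNewLastNode(Object obj) {
            if (head == null) {
                head = new Node(obj, null);
                return;
            }
            Node last;
            for (last = head; last.next != null; last = last.next) {
                steps++;     // one step for each link we follow
            }
            last.next = new Node(obj, null);
        }
    }

    // Returns the total number of links followed while adding n items.
    static long stepsToInsert(int n) {
        List list = new List();
        for (int i = 0; i < n; i++) {
            list.addNewLastNode(Integer.valueOf(i));
        }
        return list.steps;
    }

    public static void main(String[] args) {
        System.out.println(stepsToInsert(100));   // 4851
        System.out.println(stepsToInsert(200));   // 19701
        System.out.println(stepsToInsert(400));   // 79401
    }
}
```

Each time the number of items doubles, the total work roughly
quadruples.  That is what we should expect if each insertion takes time
proportional to the current length of the list.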

Linked Queue

We would like an implementation of the Queue ADT that can
perform all operations in constant time.  One way to
accomplish that is to implement a linked queue, which
is similar to a linked list in the sense that it is made up
of zero or more linked Node objects, except that it
maintains a reference to both the first and last nodes.

[Figure: queue1.eps]

Here's what the new Queue implementation looks like:

verbatim
public class Queue {
    public Node first, last;

    public Queue () {
        first = null;
        last = null;
    }

    public boolean empty () {
        return first == null;
    }
}
verbatim

So far it is straightforward.  In an empty queue, both first
and last are null.  To check whether a queue is empty, we only
have to check one of them.

insert is a little more complicated because we have to
deal with several special cases.

verbatim
    public void insert (Object obj) {
        Node node = new Node (obj, null);
        if (last != null) {
            last.next = node;
        }
        last = node;
        if (first == null) {
            first = last;
        }
    }
verbatim

The first condition checks whether last refers
to a node; if it does, then we have to make that node's next
field refer to the new node.

The second condition deals with the special case where the list
was initially empty.  In this case both first and last
refer to the new node.

remove also deals with several special cases.

verbatim
    public Object remove () {
        Node result = first;
        if (first != null) {
            first = first.next;
        }
        if (first == null) {
            last = null;
        }
        return result;
    }
verbatim

The first condition checks whether there were any nodes in
the queue.  If so, we have to copy the reference to the next node
into first.  The second condition deals with the special
case that the queue is now empty, in which case we have to
make last null.

As an exercise, draw diagrams showing both operations in
both the normal case and in the special cases, and convince
yourself that they are correct.

Clearly, this implementation is more complicated than the
veneer implementation, and it is more difficult to demonstrate
that it is correct.  The advantage is that we have achieved
the goal: both insert and remove are constant
time.
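
Here is a sketch that exercises the special cases.  It differs from the
code above in two ways that are my own choices: the Node class is a
minimal stand-in, and remove returns the cargo of the first node
(or null if the queue is empty), not the node itself, so the
client gets back exactly what it inserted.

```java
public class LinkedQueueDemo {

    // Minimal stand-in for the chapter's Node class.
    static class Node {
        Object cargo;
        Node next;

        Node(Object cargo, Node next) {
            this.cargo = cargo;
            this.next = next;
        }
    }

    static class Queue {
        Node first = null;
        Node last = null;

        boolean empty() {
            return first == null;
        }

        void insert(Object obj) {
            Node node = new Node(obj, null);
            if (last != null) {
                last.next = node;     // link the old last node to the new one
            }
            last = node;
            if (first == null) {
                first = last;         // the queue was empty
            }
        }

        // Returns the cargo of the first node, or null if the queue is empty.
        Object remove() {
            if (first == null) return null;
            Object result = first.cargo;
            first = first.next;
            if (first == null) {
                last = null;          // the queue is now empty
            }
            return result;
        }
    }

    public static void main(String[] args) {
        Queue queue = new Queue();
        queue.insert("a");
        queue.insert("b");
        queue.insert("c");
        while (!queue.empty()) {
            System.out.print(queue.remove() + " ");   // a b c
        }
        System.out.println();
        System.out.println(queue.last == null);       // true
    }
}
```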

Circular buffer

Another common implementation of a queue is a circular
buffer.  ``Buffer'' is a general name for a temporary storage
location, although it often refers to an array, as it does in
this case.  What it means to say a buffer is ``circular'' should
become clear in a minute.

The implementation of a circular buffer is similar to the array
implementation of a Stack in the previous chapter.  The
queue items are stored in an array, and we use indices to
keep track of where we are in the array.  In the Stack implementation,
there was a single index that pointed to the next available space.
In the Queue implementation, there are two indices: first
points to the space in the array that contains the first customer
in line and next points to the next available space.

The following figure shows a queue with two items (represented
by dots).

[Figure: queue2.eps]

There are two ways to represent the variables first and
next.  Literally, they are integers, and their values are
shown in boxes on the right.  Abstractly, though, they are
indices of the array, so they can also be shown as arrows
pointing to locations in the array.  The arrow representation is
convenient, but you should remember that the indices are not
references; they are just integers.

Here is an incomplete array implementation of a queue:

verbatim
public class Queue {
    public Object[] array;
    public int first, next;

    public Queue () {
        array = new Object[128];
        first = 0;
        next = 0;
    }

    public boolean empty () {
        return first == next;
    }
verbatim

The instance variables and the constructor are straightforward,
although again we have the problem that we have to choose an
arbitrary size for the array.  Later we will solve that problem,
as we did with the Stack, by resizing the array if it gets full.

The implementation of empty is a little surprising, though.
We might have thought that first == 0 would indicate an
empty queue, but that neglects the fact that the head of
the queue moves through the array.  Once we see the implementation
of insert and remove, that will make more sense.

verbatim
    public void insert (Object item) {
        array[next] = item;
        next++;
    }

    public Object remove () {
        Object result = array[first];
        first++;
        return result;
    }
verbatim

insert looks very much like push in the array implementation of the Stack;
it puts the new item in the next available space and then increments
the index.

remove is similar.  It takes the first item from the queue
and then increments the index that points to the head of the queue.
The following figure shows what the queue looks like after both
items have been removed.

[Figure: queue3.eps]

It is always true that next points to an available space.
If first catches up with next and points to the same
space, then first is referring to an ``empty'' location,
and the queue is empty.  I put ``empty'' in quotation marks because
it is possible that the location that first points to actually
contains a value (we do nothing to ensure that empty locations contain
null); on the other hand, since we know the queue is empty, we
will never read this location, so we can think of it, abstractly,
as empty.

As an exercise, fix remove so that it returns null
if the queue is empty.

The next problem with this implementation is that eventually it
will run out of space.  When we add an item we increment next
and when we remove an item we increment first, but we never
decrement either.  What happens when we get to the end of the
array?

The following figure shows the queue after we add four more items:

[Figure: queue4.eps]

The array is now full.  There is no ``next available space,'' so there
is nowhere for next to point.  One possibility is that we could
resize the array, as we did with the Stack implementation.  But in that
case the array would keep getting bigger regardless of how many items
were actually in the queue.  A better solution is to wrap around to the
beginning of the array and reuse the spaces there.  This ``wrap around''
is the reason this implementation is called a circular buffer.

One way to wrap the index around is to add a special case whenever
we increment an index:

verbatim
        next++;
        if (next == array.length) next = 0; 
verbatim

A fancy alternative is to use the modulus operator:

verbatim
        next = (next + 1) % array.length;
verbatim

Either way, we have one last problem to solve.  How do we know
if the queue is really full, meaning that we cannot insert
another item?  The following figure shows what the queue looks
like when it is ``full.''

figure=queue5.eps,width=4in

There is still one empty space in the array, but the queue is
full because if we insert another item, then we have to increment
next such that next == first, and in that case it
would appear that the queue was empty!

To avoid that, we sacrifice one space in the array.  So how
can we tell if the queue is full?

verbatim
        if ((next + 1) % array.length == first)
verbatim

And what should we do if the array is full?  Well, now resizing
the array is probably the only option.
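To see the wrap-around and the full test working together, here is a minimal sketch of a fixed-capacity circular buffer.  The class name CircularQueue, the full method, and the decision to throw an IllegalStateException when the queue is full are assumptions made for illustration; the sketch does not resize.

```java
// A minimal sketch of a fixed-capacity circular buffer (no resizing).
// CircularQueue, full() and the exception are illustrative assumptions.
class CircularQueue {
    private Object[] array;
    private int first;   // index of the item at the head of the queue
    private int next;    // index of the next available space

    public CircularQueue (int capacity) {
        // one space is sacrificed so that full and empty look different
        array = new Object [capacity + 1];
        first = 0;
        next = 0;
    }

    public boolean empty () {
        return first == next;
    }

    public boolean full () {
        return (next + 1) % array.length == first;
    }

    public void insert (Object item) {
        if (full ()) {
            throw new IllegalStateException ("queue is full");
        }
        array[next] = item;
        next = (next + 1) % array.length;    // wrap around at the end
    }

    public Object remove () {
        if (empty ()) return null;
        Object result = array[first];
        first = (first + 1) % array.length;  // wrap around at the end
        return result;
    }
}
```

Because both indices wrap with the modulus operator, the spaces at the beginning of the array are reused as soon as items have been removed from them.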

As an exercise, put together all the code from this section and
write an implementation of a Queue using a circular buffer that
resizes itself when necessary.

Priority queue

A Priority Queue is an ADT with exactly the same interface as
a Queue, but different semantics.  The interface is:

description

[construction:] Create a new, empty queue.

[insert:] Add a new item to the queue.

[remove:] Remove and return an item from the queue.  The item
that is returned is the one with the highest priority.

[empty:] Check whether the queue is empty.

description

The semantic difference is that the item that is removed from
the queue is not necessarily the first one that was added.  Rather,
it is whatever item in the queue has the highest priority.
What the priorities are, and how they compare to each other, are not
specified by the Priority Queue implementation.  They depend on
the items that are in the queue.

For example, if the items in the queue have names, we might choose
them in alphabetical order.  If they are bowling scores, we might
choose from highest to lowest, but if they are golf scores, we would
go from lowest to highest.

So we face a new problem.  We would like an implementation of
Priority Queue that is generic---it should work with any kind
of object---but at the same time the code that implements Priority
Queue needs to have the ability to compare the objects it contains.

We have seen a way to implement generic data structures using
Objects, but that does not solve this problem, because
there is no way to compare Objects unless we know what type
they are.

The answer lies in a new Java feature called an abstract class.

Abstract class

An abstract class is a set of classes.  The abstract class definition
specifies the requirements a class must satisfy to be a member.

Often abstract classes have names that end in ``able'' to indicate
the fundamental capability the abstract class requires.  For
example, any class that provides a method named draw can
be a member of the abstract class named Drawable.  Any
class that contains a method named run can be a member
of the abstract class Runnable.

As of Java 2, Java provides a built-in abstract class that we
can use in an implementation of a Priority Queue.  It is called
Comparable, and it means what it says.  Any class that
belongs to the Comparable abstract class has to provide
a method named compareTo that compares two objects and
returns a value indicating whether one is larger or smaller than
the other, or whether they are the same.

Many of the built-in Java classes are members of the Comparable
abstract class, including the numeric wrapper classes like Integer
and Double.
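As a quick check of what compareTo reports, here is a small sketch that compares a few built-in objects.  The class CompareDemo and the method describe are hypothetical names; the code uses Comparable without type parameters, in keeping with the rest of this chapter, and only the sign of the result is interpreted.

```java
// A sketch showing how compareTo behaves for some built-in classes.
// CompareDemo and describe are hypothetical names used for illustration.
@SuppressWarnings({"rawtypes", "unchecked"})
class CompareDemo {

    // translate the sign of compareTo into a word
    static String describe (Comparable a, Comparable b) {
        int c = a.compareTo (b);
        if (c < 0) return "less";
        if (c > 0) return "greater";
        return "equal";
    }

    public static void main (String[] args) {
        // Integers compare numerically
        System.out.println (describe (Integer.valueOf (3), Integer.valueOf (7)));
        // Strings compare lexicographically
        System.out.println (describe ("pear", "apple"));
        // Doubles compare numerically
        System.out.println (describe (Double.valueOf (2.5), Double.valueOf (2.5)));
    }
}
```

The three lines printed are less, greater and equal.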

In the next section I will show how to write an ADT that manipulates
an abstract class.  Then we will see how to write a new (concrete)
class that belongs to an existing abstract class.  Then we will see
how to write a new abstract class.

Array implementation of Priority Queue

In the implementation of the Priority Queue, every time we specify
the type of the items in the queue, we specify the abstract class
Comparable.  For example, the instance variables are an
array of Comparables and an integer:

verbatim
public class PriorityQueue {
    private Comparable[] array;
    private int index;
}
verbatim

As usual, index is the index of the next available location in the
array.  The instance variables are declared private so that
other classes cannot have direct access to them.

The constructor and empty are similar to what we have seen
before.  I chose the initial size for the array arbitrarily.

verbatim
    public PriorityQueue () {
        array = new Comparable [16];
        index = 0;
    }

    public boolean empty () {
        return index == 0;
    }
verbatim

insert is similar to push:

verbatim
    public void insert (Comparable item) {
        if (index == array.length) {
            resize ();
        }
        array[index] = item;
        index++;
    }
verbatim

I am omitting the implementation of resize to save space.
Finally, the only interesting method is remove, which has
to traverse the array to find and remove the largest item:

verbatim
    public Comparable remove () {
        if (index == 0) return null;

        int maxIndex = 0;

        // find the item with the highest priority
        for (int i=1; i<index; i++) {
            if (array[i].compareTo (array[maxIndex]) > 0) {
                maxIndex = i;
            }
        }
        Comparable result = array[maxIndex];

        // move the last item into the empty slot
        index--;
        array[maxIndex] = array[index];
        return result;
    }
verbatim

As we traverse the array, maxIndex keeps track of the
index of the largest element we have seen so far.  What it
means to be the ``largest'' is determined by compareTo.
In this case the compareTo method is provided by the
Integer class, and it does what we expect---larger
(more positive) numbers win.
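For reference, here are the pieces of this section assembled into one class that compiles on its own.  The resize method is omitted in the text, so the version shown here (copy everything into an array twice as big) is only one plausible way to write it, not necessarily the intended one.

```java
// A sketch of the array-based Priority Queue from this section.
// The resize method is an assumption; the text omits it.
@SuppressWarnings({"rawtypes", "unchecked"})
class PriorityQueue {
    private Comparable[] array;
    private int index;

    public PriorityQueue () {
        array = new Comparable [16];
        index = 0;
    }

    public boolean empty () {
        return index == 0;
    }

    // assumed implementation: copy the items into an array twice as big
    private void resize () {
        Comparable[] bigger = new Comparable [array.length * 2];
        System.arraycopy (array, 0, bigger, 0, index);
        array = bigger;
    }

    public void insert (Comparable item) {
        if (index == array.length) {
            resize ();
        }
        array[index] = item;
        index++;
    }

    public Comparable remove () {
        if (index == 0) return null;

        // find the item with the highest priority
        int maxIndex = 0;
        for (int i=1; i<index; i++) {
            if (array[i].compareTo (array[maxIndex]) > 0) {
                maxIndex = i;
            }
        }
        Comparable result = array[maxIndex];

        // move the last item into the empty slot
        index--;
        array[maxIndex] = array[index];
        return result;
    }
}
```

Note that insert takes constant time (apart from the occasional resize) while remove has to look at every item, so this version is simple rather than fast.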

A Priority Queue client

The implementation of Priority Queue is written entirely
in terms of Comparable objects.  But there is no
such thing as a Comparable object!  Go ahead, try
to create one:

verbatim
    Comparable comp = new Comparable ();       // ERROR
verbatim

You'll get a compile-time message that says something like
``java.lang.Comparable is an interface.  It can't be instantiated.''
In Java, abstract classes are called interfaces.  I have
avoided this word so far because it also means several other
things, but now you have to know.

Why can't abstract classes be instantiated?  Because an abstract
class only specifies requirements (you must have a compareTo
method); it does not provide an implementation.

To create a Comparable object, you have to create one of
the objects that implements Comparable, like Integer.
Then you can use that object anywhere a Comparable is called
for.

verbatim
        PriorityQueue pq = new PriorityQueue ();
        Integer item = new Integer (17);
        pq.insert (item);
verbatim

This code creates a new, empty Priority Queue and a new Integer
object.  Then it inserts the Integer into the queue.  
insert is expecting a Comparable as a parameter, so it is
perfectly happy to take an Integer.  If we try to pass a 
Rectangle, which does not implement Comparable, we get a
compile-time message like, ``Incompatible type for method.  Explicit
cast needed to convert java.awt.Rectangle to java.lang.Comparable.''

That's the compiler telling us that if we want to make that conversion,
we have to do it explicitly.  We might try to do what it says:

verbatim
        Rectangle rect = new Rectangle ();
        pq.insert ((Comparable) rect);
verbatim

But in that case we get a run-time error, a ClassCastException.
When the Rectangle tries to pass as a Comparable, the
run-time system checks whether it satisfies the requirements, and
rejects it.  So that proves that you can't fit a square peg in a
round hole.
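The run-time check can be observed directly by catching the exception.  In this sketch, Box is a hypothetical stand-in for Rectangle (any class that does not implement Comparable would do), and CastDemo and isComparable are names invented for the example.

```java
// A sketch of the run-time check performed by a cast to Comparable.
// CastDemo, Box and isComparable are hypothetical names.
@SuppressWarnings("rawtypes")
class CastDemo {

    // Box does not implement Comparable
    static class Box {
        int width, height;
    }

    // returns true if obj can be used where a Comparable is called for
    static boolean isComparable (Object obj) {
        try {
            Comparable c = (Comparable) obj;    // checked at run time
            return true;
        } catch (ClassCastException e) {
            return false;
        }
    }

    public static void main (String[] args) {
        System.out.println (isComparable (Integer.valueOf (17)));   // true
        System.out.println (isComparable (new Box ()));             // false
    }
}
```

In real code the instanceof operator is the usual way to ask this question; the try statement is used here only to make the exception visible.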

To get items out of the queue, we have to reverse the process:

verbatim
    while (!pq.empty ()) {
        item = (Integer) pq.remove ();
        System.out.println (item);
    }
verbatim

This loop removes all the items from the queue and prints them.
We are assuming that the items in the queue are Integers.
If they were not, we would get a ClassCastException.

The Golfer class

Finally, let's look at how we can make a new class that implements
Comparable.  As an example of something with an unusual definition
of ``highest'' priority, we'll use golfers:

verbatim
public class Golfer implements Comparable {
    String name;
    int score;

    public Golfer (String name, int score) {
        this.name = name;
        this.score = score;
    }
}
verbatim

The class definition and the constructor are pretty much the same as
always; the difference is that we have to declare that Golfer
implements Comparable.  If we try to compile Golfer.java at
this point, we get something like ``class Golfer must be declared
abstract. It does not define int compareTo(java.lang.Object) from
interface java.lang.Comparable.''  In other words, to be a Comparable,
Golfer has to provide a method named compareTo.  So
let's write one:

verbatim
    public int compareTo (Object obj) {
        Golfer that = (Golfer) obj;

        int a = this.score;
        int b = that.score;

        // for golfers, low is good!
        if (a<b) return 1;
        if (a>b) return -1;
        return 0;
    }
verbatim

Two things here are a little surprising.  First, the parameter
is an Object.  That's because in general the caller doesn't
know what type the objects are that are being compared.  For
example, in PriorityQueue.java when we invoke compareTo,
we pass a Comparable as a parameter.  We don't have to
know whether it is an Integer or a Golfer or whatever.

Second, inside compareTo we have to convert the parameter from
an Object to a Golfer.  As usual, there is a risk
when we do this kind of cast: if we cast to the wrong type we
get an exception.

Finally, we can create some golfers:

verbatim
        Golfer tiger = new Golfer ("Tiger Woods", 61);
        Golfer phil = new Golfer ("Phil Mickelson", 72);
        Golfer hal = new Golfer ("Hal Sutton", 69);
verbatim

And put them in the queue:

verbatim
        pq.insert (tiger);
        pq.insert (phil);
        pq.insert (hal);
verbatim

When we pull them out:

verbatim
        while (!pq.empty ()) {
            golfer = (Golfer) pq.remove ();
            System.out.println (golfer);
        }
verbatim

They appear in descending order (for golfers):

verbatim
        Tiger Woods     61
        Hal Sutton      69
        Phil Mickelson  72
verbatim

When we switched from Integers to Golfers, we didn't
have to make any changes in PriorityQueue.java at all.  So
we succeeded in maintaining a barrier between PriorityQueue
and the classes that use it, allowing us to reuse the code without
modification.  Furthermore, we were able to give the client code
control over the definition of compareTo, making this
implementation of PriorityQueue more versatile.
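The loop that prints the golfers relies on a toString method that this section does not show.  Here is a sketch of a complete Golfer class; the toString format (name, a tab, then the score) and the GolferDemo class are assumptions.

```java
// A sketch of a complete Golfer class.  The toString format and the
// GolferDemo class are assumptions; the rest follows this section.
class Golfer implements Comparable {
    String name;
    int score;

    public Golfer (String name, int score) {
        this.name = name;
        this.score = score;
    }

    public int compareTo (Object obj) {
        Golfer that = (Golfer) obj;

        // for golfers, low is good!
        if (this.score < that.score) return 1;
        if (this.score > that.score) return -1;
        return 0;
    }

    // assumed format: name, a tab, then the score
    public String toString () {
        return name + "\t" + score;
    }
}

class GolferDemo {
    public static void main (String[] args) {
        Golfer tiger = new Golfer ("Tiger Woods", 61);
        Golfer phil = new Golfer ("Phil Mickelson", 72);

        // a positive result means tiger has the higher priority
        System.out.println (tiger.compareTo (phil) > 0);   // true
        System.out.println (tiger);
    }
}
```

Since println invokes toString on its argument, this version of Golfer prints each golfer's name and score on one line.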


Glossary

description

[queue:]  An ordered set of objects waiting for a service of
some kind.

[queueing discipline:]  The rules that determine which member
of a queue is removed next.

[FIFO:]  ``first in, first out,'' a queueing discipline in which
the first member to arrive is the first to be removed.

[priority queue:]  a queueing discipline in which
each member has a priority determined by external factors.
The member with the highest priority is the first to be removed.

[Priority Queue:]  An ADT that defines the operations one
might perform on a priority queue.

[veneer:]  A class definition that implements an ADT with
method definitions that are invocations of other methods, sometimes
with simple transformations.  The veneer does no significant work,
but it improves or standardizes the interface seen by the client.

[performance hazard:]  A danger associated with a veneer that
some of the methods might be implemented inefficiently in a way
that is not apparent to the client.

[linked queue:]  An implementation of a queue using a linked
list and references to the first and last nodes.

[circular buffer:]  An implementation of a queue using an
array and indices of the first element and the next available space.

[abstract class:]  A set of classes.  The abstract class specification
lists the requirements a class must satisfy to be included in the set.

[interface:]  The Java word for an abstract class.  Not to be
confused with the broader meaning of the word interface.


description


Trees

This chapter presents a new data structure called a tree, some of its
uses and two ways to implement it.

A possible source of confusion is the distinction
between an ADT, a data structure, and an implementation of an ADT or
data structure.  There is no universal answer, because something
that is an ADT at one level might in turn be the implementation of
another ADT.

To help keep some of this straight, it is sometimes useful to draw
a diagram showing the relationship between an ADT and its possible
implementations.  This figure shows that there are two implementations
of a tree:

figure=tree_adt.eps

In the next chapter we will see a more complicated figure that
depicts the heap implementation of a Priority Queue.

A tree node

Like lists, trees are made up of nodes.  A common kind of tree is
a binary tree, in which each node contains a reference to two
other nodes (possibly null).  The class definition looks like this:

verbatim
public class Tree {
    Object cargo;
    Tree left, right;
}
verbatim

Like list nodes, tree nodes contain cargo: in this case a generic 
Object.  The other instance variables are called left and 
right, in accordance with a standard way to represent trees
graphically:

figure=tree1.eps

The top of the tree (the node referred to by tree) is
called the root.  In keeping with the tree
metaphor, the other nodes are called branches and the nodes
at the tips with null references are called leaves.  It
may seem odd that we draw the picture with the root at the top
and the leaves at the bottom, but that is not the strangest thing.

To make things worse, computer scientists mix in yet another
metaphor: the family tree.  The top node is sometimes called
a parent and the nodes it refers to are its children.
Nodes with the same parent are called siblings, and so on.

Finally, there is also a geometric vocabulary for talking
about trees.  I already mentioned left and right, but there is
also ``up'' (toward the parent/root) and ``down'' (toward the
children/leaves).  Also, all the nodes that are the same
distance from the root comprise a level of the tree.

I don't know why we need three metaphors for talking about trees,
but there it is.

Building trees

The process of assembling tree nodes is very similar
to the process of assembling lists.

We have a constructor for tree nodes that initializes the instance
variables.

verbatim
    public Tree (Object cargo, Tree left, Tree right) {
        this.cargo = cargo;
        this.left = left;
        this.right = right;
    }
verbatim

We allocate the child nodes first:

verbatim
    Tree left = new Tree (new Integer(2), null, null);
    Tree right = new Tree (new Integer(3), null, null);
verbatim

We can create the parent node and link it to the children
at the same time:

verbatim
    Tree tree = new Tree (new Integer(1), left, right);
verbatim

This code produces the state shown in the previous figure.

By now, any time you see a new compound data type, your first
question should be, ``How can I traverse it?''  The most natural
way to traverse a tree is recursively.  For example, to
add up all the integers in a tree, we could write this class
method:

verbatim
    public static int total (Tree tree) {
        if (tree == null) return 0;
        Integer cargo = (Integer) tree.cargo;
        return cargo.intValue() + total (tree.left) + total (tree.right);
    }
verbatim

This is a class method because we would like to use null to
represent the empty tree, and make the empty tree the base case of the
recursion.  If the tree is empty, the method returns 0.
Otherwise it makes two recursive calls to find the total value of its
two children.  Finally, it adds in its own cargo and returns the
total.
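Here are the fragments from this section assembled into something that compiles and runs.  Putting total and main inside the Tree class is a convenience for this sketch only.

```java
// A sketch that builds the three-node tree from this section and adds
// up its cargo.  Keeping total and main in Tree is just for the demo.
class Tree {
    Object cargo;
    Tree left, right;

    public Tree (Object cargo, Tree left, Tree right) {
        this.cargo = cargo;
        this.left = left;
        this.right = right;
    }

    // the empty tree (null) is the base case of the recursion
    public static int total (Tree tree) {
        if (tree == null) return 0;
        Integer cargo = (Integer) tree.cargo;
        return cargo.intValue() + total (tree.left) + total (tree.right);
    }

    public static void main (String[] args) {
        Tree left = new Tree (Integer.valueOf (2), null, null);
        Tree right = new Tree (Integer.valueOf (3), null, null);
        Tree tree = new Tree (Integer.valueOf (1), left, right);
        System.out.println (total (tree));    // prints 6
    }
}
```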

Although this method works, there is some difficulty fitting it into
an object-oriented design.
It should not appear in the Tree class because it
requires the cargo to be Integer objects.  If we make that
assumption in Tree.java then we lose the advantages of a generic
data structure.

On the other hand, this code accesses the instance variables
of the Tree nodes, so it ``knows'' more than it should about
the implementation of the tree.  If we changed that implementation
later (and we will) this code would break.

Later in this chapter we will develop ways to solve this problem,
allowing client code to traverse trees containing any kinds of
objects without breaking the abstraction barrier between the client
code and the implementation.

Before we get there, though, let's look at an application of trees.

Expression trees

A tree is a natural way to represent the structure of an expression.
For example, the infix expression 1 + 2 * 3 and the postfix
expression 1 2 3 * + represent the same computation, but in both
cases we have to know semantic rules in order to evaluate the expression.
In the first case we have to know
the order of operations; in the second case we have to know the stack
model of evaluation.  Without these rules, the expressions are
ambiguous.  A tree is an unambiguous representation of a computation.

The following figure represents the same computation:

figure=tree2.eps

The nodes can be operands like 1 and 2 or operators
like + and *.  Operands are leaf nodes; operators 
contain references to their operands (all of these operators
are binary, meaning they have exactly two operands).

Looking at this figure, there is no question what the order of
operations is: the multiplication happens first in order to compute
the first operand of the addition.

Expression trees like this have many uses.  The example we are
going to look at is translation from one format (postfix) to
another (infix).  Similar trees are used inside compilers to parse,
optimize and translate programs.

Traversal

I already pointed out that recursion provides a natural way to
traverse a tree.  We can print the contents of an expression tree
like this:

verbatim
    public static void print (Tree tree) {
        if (tree == null) return;
        System.out.print (tree.cargo + " ");
        print (tree.left);
        print (tree.right);
    }
verbatim

In other words, to print a tree, first print the contents of
the root, then print the entire left subtree, then print the
entire right subtree.  This way of traversing a tree is called
preorder, because the contents of the root appear before
(pre-) the contents of the children.

For the example expression the output is + 1 * 2 3.  This
is different from both postfix and infix; it is a new format called
prefix, in which the operators appear before their operands.

You might begin to suspect that if we traverse the tree in a
different order we might get the expression back in a different
format.  For example, if we print the subtrees first, and then
the root node:

verbatim
    public static void printPostorder (Tree tree) {
        if (tree == null) return;
        printPostorder (tree.left);
        printPostorder (tree.right);
        System.out.print (tree.cargo + " ");
    }
verbatim

We get the expression in postfix (1 2 3 * +)!  As the
name of the previous method implies, this order of traversal
is called postorder.  Finally, to traverse a tree inorder,
we print the left tree, then the root, then the right tree:

verbatim
    public static void printInorder (Tree tree) {
        if (tree == null) return;
        printInorder (tree.left);
        System.out.print (tree.cargo + " ");
        printInorder (tree.right);
    }
verbatim

The result is 1 + 2 * 3, which is the expression in infix.

Now, to be fair, I have to point out that I have omitted an
important complication.  Sometimes when we write an expression
in infix we have to use parentheses to preserve the order of
operations.  So an inorder traversal is not quite sufficient to
generate an infix expression.

Nevertheless, with a few improvements, the expression tree 
and the three recursive traversals provide 
a general way to translate expressions from one format to
another.
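One of those improvements can be sketched briefly.  The following version builds strings instead of printing, and its infix method wraps every operator node in parentheses so that the order of operations survives the translation.  The class name ExprTree and the methods leaf, preorder, postorder and infix are assumptions made for this example; fully parenthesizing is only one of several ways to handle the problem.

```java
// A sketch of the three traversals that returns strings, plus an inorder
// variant that adds parentheses.  All names here are illustrative.
class ExprTree {
    Object cargo;
    ExprTree left, right;

    ExprTree (Object cargo, ExprTree left, ExprTree right) {
        this.cargo = cargo;
        this.left = left;
        this.right = right;
    }

    static ExprTree leaf (Object cargo) {
        return new ExprTree (cargo, null, null);
    }

    static String preorder (ExprTree t) {
        if (t == null) return "";
        return t.cargo + " " + preorder (t.left) + preorder (t.right);
    }

    static String postorder (ExprTree t) {
        if (t == null) return "";
        return postorder (t.left) + postorder (t.right) + t.cargo + " ";
    }

    // inorder, with parentheses around every operator and its operands
    static String infix (ExprTree t) {
        if (t == null) return "";
        if (t.left == null && t.right == null) return t.cargo.toString ();
        return "(" + infix (t.left) + " " + t.cargo + " " + infix (t.right) + ")";
    }

    public static void main (String[] args) {
        // 1 + 2 * 3
        ExprTree a = new ExprTree ("+", leaf ("1"),
                         new ExprTree ("*", leaf ("2"), leaf ("3")));
        // (1 + 2) * 3
        ExprTree b = new ExprTree ("*",
                         new ExprTree ("+", leaf ("1"), leaf ("2")), leaf ("3"));

        System.out.println (preorder (a));     // + 1 * 2 3
        System.out.println (postorder (a));    // 1 2 3 * +
        System.out.println (infix (a));        // (1 + (2 * 3))
        System.out.println (infix (b));        // ((1 + 2) * 3)
    }
}
```

A plain inorder traversal would print 1 + 2 * 3 for both trees, which is why the parentheses are needed.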

Encapsulation

There is still one problem with the way we have been traversing
trees: it breaks down the barrier between the client code (the
application that builds the tree) and the Tree implementation.
Ideally, tree code should be general; it shouldn't know anything
about expression trees.
And the code that generates and traverses the expression tree shouldn't
know about the implementation of the trees.  This design criterion
is called object encapsulation.

In the current version, the Tree code knows too much about
the application.  Instead, the Tree class should provide
the general capability of traversing a tree in various ways.  As
it traverses, it should perform operations on each node that are
specified by the application.

To facilitate this separation of interests, we will
create a new abstract class, called Visitable.  The items
stored in a tree will be required to be visitable, which means
that they define a method named visit that does whatever
it is the application wants done to each node.  That way the
Tree can perform the traversal and the application can perform
the node operations.

Here are the steps we have to perform to wedge an abstract class
between an application and a library:

enumerate

Define an abstract class that specifies the operations we
need the library to be able to perform.

Rewrite the library to use the new abstract class instead
of generic Objects.

Define a concrete class that implements the abstract class
in a way that is appropriate for the application.

Rewrite the application to use the new concrete class.

enumerate

The next few sections demonstrate these steps.

Defining an abstract class

An abstract class definition looks a lot like a concrete class
definition, except that it only specifies the interface of each
method and not an implementation.  The definition of Visitable
is

verbatim
public interface Visitable {
    public void visit ();
}
verbatim

That's it!  The word interface is Java's keyword for an
abstract class.  The definition of visit looks like any other
method definition, except that it has no body.  This definition
specifies that any class that implements Visitable has to have
a method named visit that takes no parameters and that returns
void.

Like other class definitions, abstract class definitions go in a file
with the same name as the class (in this case Visitable.java).

Implementing an abstract class

For the current application, ``visiting'' a tree node means printing
its contents.  Since the contents of an expression tree are tokens,
we'll create a new concrete class called Token that implements
Visitable:

verbatim
public class Token implements Visitable {
    String str;

    public Token (String str) {
        this.str = str;
    }

    public void visit () {
        System.out.print (str + " ");
    }
}
verbatim

When we compile this class definition (which is in a file named 
Token.java), the compiler checks whether the methods provided satisfy
the requirements specified by the abstract class.  If not, it will
produce an error message.  For example, if we misspell the name of the
method that is supposed to be visit, we might get
something like, ``class Token must be declared abstract. It does not
define void visit() from interface Visitable.''

The next step is to modify the parser to put Token objects
into the tree instead of Strings.  Here is a small example:

verbatim
    // assumes import java.util.StringTokenizer;
    String expr = "1 2 3 * +";
    StringTokenizer st = new StringTokenizer (expr, " +-*/", true);
    String token = st.nextToken();
    Tree tree = new Tree (new Token (token), null, null);
verbatim
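To show how the four steps fit together, here is a sketch in which the tree performs the traversal and each node's cargo decides what ``visiting'' means.  The method name visitPreorder and the VisitDemo class are assumptions, and the tree is built by hand rather than by a parser.

```java
// A sketch of a tree that stores Visitable cargo.  visitPreorder and
// VisitDemo are hypothetical names; the tree is built by hand.
interface Visitable {
    public void visit ();
}

class Token implements Visitable {
    String str;

    public Token (String str) {
        this.str = str;
    }

    public void visit () {
        System.out.print (str + " ");
    }
}

class Tree {
    Visitable cargo;
    Tree left, right;

    public Tree (Visitable cargo, Tree left, Tree right) {
        this.cargo = cargo;
        this.left = left;
        this.right = right;
    }

    // the tree chooses the order; the cargo chooses the action
    public static void visitPreorder (Tree tree) {
        if (tree == null) return;
        tree.cargo.visit ();
        visitPreorder (tree.left);
        visitPreorder (tree.right);
    }
}

class VisitDemo {
    public static void main (String[] args) {
        Tree two = new Tree (new Token ("2"), null, null);
        Tree three = new Tree (new Token ("3"), null, null);
        Tree times = new Tree (new Token ("*"), two, three);
        Tree one = new Tree (new Token ("1"), null, null);
        Tree plus = new Tree (new Token ("+"), one, times);

        Tree.visitPreorder (plus);    // prints + 1 * 2 3
        System.out.println ();
    }
}
```

Notice that Tree no longer mentions printing or Strings at all; a different application could supply a different class that implements Visitable and reuse the traversal unchanged.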


Array implementation of trees

What does it mean to ``implement'' a tree?  So far we have been
talking about trees as data structures.  To say what a tree is,
we would explain how to build one and what basic operations we
can perform (like traversal).

But the Tree type we have been working with is not the
only kind of thing we would like to recognize as a tree.  There is
a set of operations trees should be able to perform, and anything
that can perform them should be recognized as a tree.

In other words, a tree is an ADT.
It is defined by the following operations.

description

[construction:] Build an empty tree.

[empty:] Is this tree the empty tree?

[left:] Return the left child of this node, or an
empty tree if there is none.

[right:] Return the right child of this node, or an
empty tree if there is none.

[parent:] Return the parent of this node, or an empty
tree if this node is the root.

description

In the implementation we have seen, the empty tree is represented
by the special value null.  left and right are
performed by accessing the instance variables of the node.  We
have not implemented parent yet (you might think about how
to do it).

There is another implementation of trees that uses arrays and
indices instead of objects and references.  To see how it works,
we will start by looking at a hybrid implementation that uses
both arrays and objects.

This figure shows a tree like the ones we have been looking at,
although it is laid out sideways, with the root at the left and
the leaves on the right.  At the bottom there is an array of
references that refer to the objects in the trees.

figure=tree3.eps

In this particular case, the cargo of each node is the same as
the array index of the node, but of course that is not true in
general.  You might notice that array index 1 refers to the
root node and array index 0 is empty.  The reason for
that will become clear soon.

So now we have a tree where each node has a unique index.
Furthermore, the indices have been assigned to the nodes according
to a deliberate pattern, in order to achieve the following
results:

enumerate

The left child of the node with index $i$ has index $2i$.

The right child of the node with index $i$ has index $2i + 1$.

The parent of the node with index $i$ has index $i/2$ (rounded down).

enumerate

Using these formulas, we can implement left, right
and parent just by doing arithmetic; we don't have to
use the references at all!
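A few lines of code are enough to check the arithmetic.  IndexDemo is a hypothetical class; the three methods simply restate the rules above, relying on the fact that integer division in Java rounds down for positive numbers.

```java
// A sketch that checks the index arithmetic for the first few nodes.
// IndexDemo is a hypothetical name.
class IndexDemo {
    static int left (int i)   { return 2*i; }
    static int right (int i)  { return 2*i + 1; }
    static int parent (int i) { return i/2; }    // integer division rounds down

    public static void main (String[] args) {
        for (int i=1; i<=3; i++) {
            System.out.println (i + ": left " + left (i)
                                  + ", right " + right (i)
                                  + ", parent " + parent (i));
        }
        // going down and then up gets us back where we started
        System.out.println (parent (left (5)) == 5);     // true
        System.out.println (parent (right (5)) == 5);    // true
    }
}
```

The same arithmetic suggests why index 0 is not used: the left child of node 0 would be node 0 itself.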

Since we don't use the references, we can get rid of them,
which means that what used to be a tree node is now just cargo
and nothing else.  That means we can implement the tree as an
array of cargo objects; we don't need tree nodes at all.

Here's what one implementation looks like:

verbatim
public class Tree {
    Object[] array;
    static String empty = "empty node";

    public Tree () {
        array = new Object [128];
        for (int i=0; i<array.length; i++) {
            array[i] = empty;
        }
    }
verbatim

Most of this is no surprise.  The instance variable
is an array of Objects.  The constructor initializes this array
with an arbitrary initial size (we can always resize it later).

There are a few unexpected things, too.  The variable empty
looks like an instance variable, but the keyword static
makes it a class variable.  Like instance variables, class
variables are declared outside of any method, and they are accessible
from within any method (unless shadowed by a local variable or
parameter).  But unlike instance variables, there is only one copy
of the class variable for the entire class, not one for every
object.

In this case, the class variable is useful because it provides a
special value that will be used to identify ``empty'' spaces in
the array (items that have not been set yet).  Thus, the implementation
of empty is

verbatim
    public boolean empty (int i) {
        return (array[i] == empty);
    }
verbatim

The only thing that is odd about this method is the way we identify
the node.  Instead of a reference, we are using an integer index.
The array the index refers to, of course, is the instance variable
of this, the current object.

The implementation of left, right
and parent is just arithmetic:

verbatim
    public int left (int i)   { return 2*i; }
    public int right (int i)  { return 2*i + 1; }
    public int parent (int i) { return i/2; }
verbatim

Again, all references to the nodes of the tree are integer indices.
We could use these indices directly to add and remove objects from
the array.  To help avoid errors, it is often a good idea to provide
accessor methods to perform those operations:

verbatim
    public Object getNode (int i) {
        return array[i];
    }

    public void setNode (int i, Object obj) {
        array[i] = obj;
    }
verbatim

Finally we are ready to build a tree.  In another class (the client),
we would write

verbatim
    Tree tree = new Tree ();
    int root = 1;
    tree.setNode (root, "one");
verbatim

The constructor builds an empty tree.  In this case we assume that
the client knows that the index of the root is 1 although it
would be preferable for the tree implementation to provide that
information.  Anyway, the setNode invocation puts the
string "one" into the root node.

To add children to the root node:

verbatim
    tree.setNode (tree.left (root), "two");
    tree.setNode (tree.right (root), "three");
verbatim

In the tree class we would like to provide a method that prints
the contents of the tree in preorder.

verbatim
    public void print (int i) {
        if (empty (i)) return;
        System.out.println (array[i]);
        print (left (i));
        print (right (i));
    }
verbatim

We invoke this method from the client by passing the root as
a parameter.

verbatim
    tree.print (root);
verbatim

The output is

verbatim
one
two
three
verbatim

Also, we can check that root has a left child

verbatim
    System.out.println (tree.empty (tree.left (root)));
verbatim

and that the child doesn't.

verbatim
    System.out.println (tree.empty (tree.left (tree.left (root))));
verbatim
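Here is the whole array implementation in one piece, with a main method standing in for the client.  One detail is my own addition rather than part of the code above: empty treats any index past the end of the array as an empty node, so that print cannot run off the end of the array when the tree is deep.

```java
// A sketch of the array implementation of a tree, assembled from this
// section.  The bounds check in empty is an added assumption.
class Tree {
    Object[] array;
    static String empty = "empty node";

    public Tree () {
        array = new Object [128];
        for (int i=0; i<array.length; i++) {
            array[i] = empty;
        }
    }

    public boolean empty (int i) {
        // assumption: indices beyond the array count as empty nodes
        if (i >= array.length) return true;
        return (array[i] == empty);
    }

    public int left (int i)   { return 2*i; }
    public int right (int i)  { return 2*i + 1; }
    public int parent (int i) { return i/2; }

    public Object getNode (int i) {
        return array[i];
    }

    public void setNode (int i, Object obj) {
        array[i] = obj;
    }

    // print the subtree rooted at index i in preorder
    public void print (int i) {
        if (empty (i)) return;
        System.out.println (array[i]);
        print (left (i));
        print (right (i));
    }

    public static void main (String[] args) {
        Tree tree = new Tree ();
        int root = 1;
        tree.setNode (root, "one");
        tree.setNode (tree.left (root), "two");
        tree.setNode (tree.right (root), "three");

        tree.print (root);
        System.out.println (tree.empty (tree.left (root)));               // false
        System.out.println (tree.empty (tree.left (tree.left (root))));  // true
    }
}
```

Java allows a class variable and a method to share the name empty, because it can tell from the parentheses which one is meant.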

This implementation provides the basic operations required to
be a tree, but it leaves a lot to be desired.  As I pointed out,
we expect the client to have a lot of information about the
implementation, and the interface the client sees, with indices
and all, is not very pretty.

Also, we have the usual problem with array implementations, which
is that the initial size of the array is arbitrary and it might have
to be resized.  This last problem can be solved by replacing the
array with a Vector.

The Vector class

The Vector is a built-in Java class in the java.util
package.  It is an implementation of an array of Objects,
with the added feature that it can resize itself automatically,
so we don't have to.

The Vector class provides methods named get and
set that are equivalent to the getNode and setNode
methods we wrote for the Tree class.
You should review the other Vector operations by consulting
the online documentation.

Each Vector has a
capacity, which is the amount of space that has been allocated
to store values, and a size, which is the number of values that
are actually in the vector.

In general, it is the responsibility of the client code to make
sure that the vector has sufficient size before invoking
set or get.  One way to make that guarantee is to
use the add and insertElementAt methods (and variations thereof),
which increase the size of the vector if necessary.  Another
is to use the setSize method to resize the vector before
setting a new element.

Most of the time the client doesn't have to worry about
capacity.  Whenever add, insertElementAt and setSize
are invoked, the capacity of the vector is increased automatically.
For performance reasons, some applications might want to take
control of this function, which is why there are additional methods
for increasing and decreasing capacity.
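The difference between size and capacity is easy to see in a short experiment.  In this sketch, VectorDemo and trySet are hypothetical names, and the vector is declared as Vector<String> (a newer notation than the plain Vector used in this chapter) only to avoid compiler warnings.

```java
import java.util.Vector;

// A sketch of the relationship between size, capacity and set.
// VectorDemo and trySet are hypothetical names.
class VectorDemo {

    // try to store value at index i; report whether the vector was big enough
    static boolean trySet (Vector<String> v, int i, String value) {
        try {
            v.set (i, value);
            return true;
        } catch (ArrayIndexOutOfBoundsException e) {
            return false;
        }
    }

    public static void main (String[] args) {
        Vector<String> v = new Vector<String> ();
        System.out.println (v.size ());                    // 0
        System.out.println (trySet (v, 3, "three"));       // false: size is 0

        v.setSize (4);                    // elements 0 through 3 are now null
        System.out.println (trySet (v, 3, "three"));       // true

        v.add ("four");                   // add increases the size by one
        System.out.println (v.size ());                    // 5
        System.out.println (v.capacity () >= v.size ());   // true
    }
}
```

The first call to trySet fails even though the new vector already has space allocated, because set checks the index against the size, not the capacity.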

Because the client code has no access to the implementation of
a vector, it is not clear how we should traverse one.  Of course,
one possibility is to use a loop variable as an index into the
vector:

verbatim
        for (int i=0; i<v.size(); i++) {
            System.out.println (v.get(i));
        }
verbatim

There's nothing wrong with that, but there is another way that
serves to demonstrate the Iterator class.  Vectors provide
a method named iterator that returns an Iterator object
that makes it possible to traverse the vector.

The Iterator class
iterator

Iterator is an abstract class (declared in Java as an interface)
in the java.util package.  It specifies three methods:

description

[hasNext:] Does this iteration have more elements?

[next:] Return the next element, or throw an exception if
there is none.

[remove:] Remove from the original collection the last
element that was returned.

description

The following example uses an iterator to traverse and print the
elements of a vector.

verbatim
        Iterator iterator = vector.iterator ();
        while (iterator.hasNext ()) {
            System.out.println (iterator.next ());
        }
verbatim

In a previous section we used the Visitable abstract class to
allow a client to traverse a data structure without knowing the
details of its implementation.  Iterators provide another way to do
the same thing.  In the first case, the library performs the iteration
and invokes client code to ``visit'' each element.  In the second
case the library gives the client an object that it can use to
select elements one at a time (albeit in an order controlled by
the library).

As an exercise, write a concrete class named PreIterator that
implements the Iterator interface, and write a method named 
preorder for the Tree class that returns a PreIterator
that selects the elements of the Tree in preorder.


Glossary

description

[binary tree:]  A tree in which each node refers to 0, 1, or
2 dependent nodes.

[root:]  The top-most node in a tree, to which no other nodes
refer.

[leaf:]  A bottom-most node in a tree, which refers to no other
nodes.

[parent:]  The node that refers to a given node.

[child:]  One of the nodes referred to by a node.

[level:]  The set of nodes equidistant from the root.

[prefix notation:]  A way of writing a mathematical expression
with each operator appearing before its operands.

[preorder:]  A way to traverse a tree, visiting each node
before its children.

[postorder:]  A way to traverse a tree, visiting the children
of each node before the node itself.

[inorder:]  A way to traverse a tree, visiting the left subtree,
then the root, then the right subtree.

[class variable:]  A static variable declared outside of any
method.  It is accessible from any method.

[binary operator:]  An operator that takes two operands.

[object encapsulation:]  The design goal of keeping
as separate as possible the implementations of two objects.  Neither
class should have to know the details of the implementation of
the other.

description


Heap

The Heap

A heap is a special kind of tree that happens to be an efficient
implementation of a priority queue.  This figure shows the relationships
among the data structures in this chapter.

figure=heap_adt.eps

Ordinarily we try to maintain as much distance as possible
between an ADT and its implementation, but in the case of the 
Heap, this barrier breaks down a little.  The reason is that
we are interested in the performance of the operations we implement.
For each implementation
there are some operations that are easy to implement and efficient
and others that are clumsy and slow.

It turns out that the array implementation of a tree works
particularly well as an implementation of a Heap.  The 
operations the array performs well are
exactly the operations we need to implement a Heap.

To understand this relationship, we will proceed in a few steps.
First, we need to develop ways of comparing the performance of
various implementations.  Next, we will look at the operations
Heaps perform.  Finally, we will compare the Heap implementation
of a Priority Queue to the others (arrays and lists) and see
why the Heap is considered particularly efficient.

Performance evaluation

When we compare algorithms, we would like to have a way to tell
when one is faster than another, or takes less space, or uses less
of some other resource.  It is hard to answer those questions in
detail, because the time and space used by an algorithm depend on the
implementation of the algorithm, the particular problem being
solved, and the hardware the program runs on.

The objective of this section is to develop a way of talking about
performance that is independent of all of those things, and only
depends on the algorithm itself.  To start, we will focus on run
time; later we will talk about other resources.

Our decisions are guided by a series of constraints:

enumerate

First, the performance of an algorithm depends on the
hardware it runs on, so we usually don't talk about run time
in absolute terms like seconds.  Instead, we usually
count the number of abstract operations the algorithm performs.

Second, performance often depends on the particular
problem we are trying to solve -- some problems are easier than
others.  To compare algorithms, we usually focus on either the
worst-case scenario or an average (or common) case.

Third, performance depends on the size of the problem
(usually, but not always, the number of elements in a collection).
We address this dependence explicitly by
expressing run time as a function of problem size.

Finally, performance depends on
details of the implementation like object allocation overhead
and method invocation overhead.  We usually ignore these details
because they don't affect the rate at which the
number of abstract operations increases with problem size.

enumerate

To make this process more concrete, consider two algorithms we
have already seen for sorting an array of integers.  The
first is selection sort, which we saw in Section sorting.
Here is the pseudocode we used there.

verbatim
    selectionsort (array) {
        for (int i=0; i<array.length; i++) {
          // find the lowest item at or to the right of i
          // swap the ith item and the lowest item
        }
    }
verbatim

This is called a selection sort because each time through the loop it
selects the lowest card remaining in the deck.  To perform the
operations specified in the pseudocode, we wrote helper methods named
findLowest and swap.  In pseudocode,
findLowest looks like this:

verbatim
    // find the index of the lowest item between
    // i and the end of the array

    findLowest (array, i) {
        // lowest contains the index of the lowest item so far
        lowest = i;
        for (int j=i+1; j<array.length; j++) {
          // compare the jth item to the lowest item so far
          // if the jth item is lower, replace lowest with j
        }
        return lowest;
    }
verbatim

And swap looks like this:

verbatim
    swap (i, j) {
        // store a reference to the ith card in temp
        // make the ith element of the array refer to the jth card
        // make the jth element of the array refer to temp
    }
verbatim

To analyze the performance of this algorithm,
the first step is to decide what operations to count.  Obviously,
the program does a lot of things: it increments i, compares
it to the length of the array, searches for the lowest remaining
element of the array, and so on.  It is not obvious what the ``right''
thing is to count.

It turns out that a good choice is the number of
times we compare two items.  Many other choices would yield
the same result in the end, but this is easy to do and we will
find that it allows us to compare most easily with other sort
algorithms.

The next step is to decide whether we are interested in the
best case performance, the worst case, the average case, or
something else.  For this algorithm, it turns out not to matter,
but for mergesort it is easiest to do worst case analysis, so
we'll look at the worst case for both.

The next step is to define the ``problem size.''  In this case
it is natural to choose the size of the array, which we call
$n$.

Finally, we would like to derive an expression that tells us how
many abstract operations (specifically, comparisons) we have to
do, as a function of $n$.

We start by analyzing the helper functions.  swap copies
several references, but it doesn't perform any comparisons, so
we ignore the time spent performing swaps.
findLowest starts at i and traverses the array,
comparing each item to lowest.  The number of
items we look at is $n-i$, so the total number of
comparisons is $n-i-1$.

Next we consider how many times findLowest
gets invoked and what the value of $i$ is each time.  The last
time it has anything to compare, $i$ is $n-2$, so the number of
comparisons is 1.  The previous iteration performs 2 comparisons,
and so on.  During the first iteration, $i$ is 0 and the
number of comparisons is $n-1$.

So the total number of comparisons is $1 + 2 + \cdots + (n-1)$.
This sum is equal to $n^2/2 - n/2$.  To describe this algorithm,
we would typically ignore the lower order term ($n/2$) and say
that the total amount of work is proportional to $n^2$.  Since
the leading order term is quadratic, we might also say that this
algorithm is quadratic.
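
One way to check this arithmetic is to instrument the sort and count.  The following is a sketch, not the implementation from Section sorting: it sorts an array of ints rather than cards, and the class name SelectionSortCount and the static counter are assumptions made for the illustration.

```java
class SelectionSortCount {
    static int comparisons = 0;

    // returns the index of the lowest item at or to the right of i
    static int findLowest (int[] array, int i) {
        int lowest = i;
        for (int j = i+1; j < array.length; j++) {
            comparisons++;                       // one comparison of two items
            if (array[j] < array[lowest]) lowest = j;
        }
        return lowest;
    }

    static void swap (int[] array, int i, int j) {
        int temp = array[i];
        array[i] = array[j];
        array[j] = temp;
    }

    // sorts the array in place and returns the number of comparisons
    static int selectionSort (int[] array) {
        comparisons = 0;
        for (int i = 0; i < array.length; i++) {
            swap (array, i, findLowest (array, i));
        }
        return comparisons;
    }

    public static void main (String[] args) {
        for (int n = 1; n <= 1000; n *= 10) {
            int[] array = new int[n];
            for (int k = 0; k < n; k++) array[k] = n - k;   // reverse order
            int count = selectionSort (array);
            System.out.println (n + " items: " + count
                                + " comparisons; n(n-1)/2 = " + n*(n-1)/2);
        }
    }
}
```

The count is the same no matter what order the items start in, which is why the choice between best case and worst case does not matter for this algorithm.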

Analysis of mergesort

In Section mergesort I claimed that mergesort takes time
that is proportional to $n \log n$, but I didn't explain how
or why.  Now I will.

Again, we start by looking at pseudocode for the algorithm.
For mergesort, it's

verbatim
  mergeSort (array) {
    // find the midpoint of the array
    // divide the array into two halves
    // sort the halves recursively
    // merge the two halves and return the result
  }
verbatim

At each level of the recursion, we split the array in half,
make two recursive calls, and then merge the halves.  Graphically,
the process looks like this:

figure=mergetree.eps,width=5in

Each line in the diagram is a level of the recursion.  At the
top, a single array divides into two halves.  At the bottom, $n$
arrays (with one element each) are merged into $n/2$ arrays (with
2 elements each).

The first two columns of the table show the number of arrays at each
level and the number of items in each array.  The third column shows
the number of merges that take place at each level of recursion.  The
next column is the one that takes the most thought: it shows the
number of comparisons each merge performs.

If you look at the pseudocode (or your implementation) of
merge, you should convince yourself that in the worst case it
takes $m-1$ comparisons, where $m$ is the total number of items
being merged.

The next step is to multiply the number of merges at each level
by the amount of work (comparisons) per merge.  The result is the
total work at each level.  At this point we take advantage of a small
trick.  We know that in the end we are only interested in the
leading-order term in the result, so we can go ahead
and ignore the $-1$ term in the comparisons per merge.  If we do
that, then the total work at each level is simply $n$.

Next we need to know the number of levels as a function
of $n$.  Well, we start with an array of $n$ items and divide it in
half until we get to 1.  That's the same as starting at 1 and
multiplying by 2 until we get to $n$.  In other words, we want to know
how many times we have to multiply 2 by itself before we get to $n$.
The answer is that the number of levels, $l$, is the logarithm,
base 2, of $n$.

Finally, we multiply the amount of work per level, $n$, by the
number of levels, $\log_2 n$, to get $n \log_2 n$, as promised.
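
As a sanity check on this result, we can count the comparisons a mergesort actually performs.  This is a sketch under stated assumptions: it sorts ints, it is not the implementation from Section mergesort, and the names MergeSortCount and count are invented for the example.

```java
import java.util.Arrays;
import java.util.Random;

class MergeSortCount {
    static long comparisons = 0;

    // merges two sorted arrays, counting each comparison of two items
    static int[] merge (int[] a, int[] b) {
        int[] result = new int[a.length + b.length];
        int i = 0, j = 0;
        for (int k = 0; k < result.length; k++) {
            if (i == a.length) {
                result[k] = b[j++];          // a is used up: no comparison
            } else if (j == b.length) {
                result[k] = a[i++];          // b is used up: no comparison
            } else {
                comparisons++;
                result[k] = (a[i] <= b[j]) ? a[i++] : b[j++];
            }
        }
        return result;
    }

    static int[] mergeSort (int[] array) {
        if (array.length <= 1) return array;
        int mid = array.length / 2;
        int[] left = Arrays.copyOfRange (array, 0, mid);
        int[] right = Arrays.copyOfRange (array, mid, array.length);
        return merge (mergeSort (left), mergeSort (right));
    }

    // sorts n pseudo-random items and returns the number of comparisons
    static long count (int n) {
        Random random = new Random (17);
        int[] array = new int[n];
        for (int k = 0; k < n; k++) array[k] = random.nextInt ();
        comparisons = 0;
        mergeSort (array);
        return comparisons;
    }

    public static void main (String[] args) {
        for (int n = 16; n <= 4096; n *= 4) {
            double bound = n * Math.log (n) / Math.log (2);
            System.out.println (n + " items: " + count (n)
                                + " comparisons; n log2 n = " + bound);
        }
    }
}
```

Because each merge of $m$ items takes at most $m-1$ comparisons, the count should never exceed $n \log_2 n$ when $n$ is a power of two.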

It might not be obvious at first that $n \log_2 n$ is better than
$n^2$, but for large values of $n$, it is.
As an exercise, write a program that prints $n \log_2 n$ and
$n^2$ for a range of values of $n$.  For what value of $n$ are
they equal?

Overhead

Performance analysis takes a lot of hand-waving.  First we ignored most
of the operations the program performs and counted only comparisons.
Then we decided to consider only worst case performance.  During the
analysis we took the liberty of rounding a few things off, and when we
finished, we casually discarded the lower-order terms.

When we interpret the results of this analysis, we have to keep
all this hand-waving in mind.  Because mergesort is $n \log_2 n$,
we consider it a better algorithm than selection sort, but that
doesn't mean that mergesort is always faster.  It just means
that eventually, if we sort bigger and bigger arrays, mergesort
will win.

How long that takes depends on the details of the implementation,
including the additional work, besides the comparisons we counted,
that each algorithm performs.  This extra work is sometimes called
overhead.  It doesn't affect the performance analysis, but
it does affect the run time of the algorithm.

For example, our implementation of mergesort actually allocates
subarrays before making the recursive calls and then lets them
get garbage collected after they are merged.  Looking again at
the diagram of mergesort, we can see that the total amount of
space that gets allocated is proportional to $n \log n$, and the total
number of objects that get allocated is about $2n$.  All that allocating takes
time.

Even so, it is most often true that a bad implementation of a good
algorithm is better than a good implementation of a bad algorithm.
The reason is that for large values of $n$ the good algorithm is
better and for small values of $n$ it doesn't matter because both
algorithms are good enough.

Priority Queue implementations

In Chapter queue we looked at an implementation of a Priority
Queue based on an array.  The items in the array are unsorted, so it
is easy to add a new item (at the end), but harder to remove an
item because we have to search for the item with the highest
priority.

An alternative is an implementation based on a sorted list.
In this case when we insert a new item we traverse the list and
put the new item in the right spot.  This implementation takes
advantage of a property of lists, which is that it is easy to
insert a new node into the middle.  Removing an item from
such a list is easy, assuming we keep the item with the highest
priority at the beginning.

Performance analysis of these operations is straightforward.
Adding an item to the end of an array or
removing a node from the beginning of a list takes the same amount
of time regardless of the number of items.  These operations
are said to be constant time.

Any time we traverse an array or list, performing a constant-time
operation on each element, the run time is
proportional to the number of items.  Thus, removing something
from the array and adding something to the list are both
linear operations; the run time is a linear
function of the number of items.

So how long does it take to insert and then remove $n$ items
from a Priority Queue?  For the array implementation, $n$
insertions take time proportional to $n$, but the removals
take longer.  The first removal has to traverse all $n$ items;
the second has to traverse $n-1$, and so on, until the last
removal, which only has to look at 1 item.  Thus, the total
time is $1 + 2 + \cdots + n$.  We have already seen that the
leading term of this sum is proportional to $n^2$.

The analysis of the list implementation is similar.  The
first insertion doesn't require any traversal, but after that
we have to traverse at least part of the list each time we
insert a new item.  In general we don't know how much of the
list we will have to traverse, since it depends on the data
and what order they are inserted, but we can assume that on
average we have to traverse half of the list.  Unfortunately,
even traversing half of the list is still a linear operation.

So, once again, to insert and remove $n$ items takes time
proportional to $n^2$.  Thus, based on this analysis we cannot
say which implementation is better; both the array and the
list are quadratic.

If we implement a Priority Queue using a heap, then we can
perform both insertions and removals in time proportional
to $\log n$.  Thus the total time for $n$ items is $n \log n$,
which is better than $n^2$.  That's why, at the beginning of
the chapter, I said that a heap is a particularly efficient
implementation of a Priority Queue.

Definition of a Heap

A heap is a special kind of tree.  It has two properties
that are not generally true for other trees:

description

[completeness:] The tree is complete, which means that
nodes are added from top to bottom, left to right, without
leaving any spaces.

[heapness:] The item in the tree with the highest priority
is at the top of the tree, and the same is true for every subtree.

description

Both of these properties bear a little explaining.
This figure shows a number of trees that are considered
complete or not complete:

figure=tree4.eps,width=4in

A leaf node is considered
a complete tree, as is a node with two children.  The empty
tree is also considered complete.
If a node
has one child, then it must be on the left and it must be
a leaf node.  Otherwise, the tree is not complete.

We can handle the more complicated cases by applying these
rules recursively.  In other words, a tree is complete if
the root obeys the rules and all the subtrees are also complete.

It is natural to write these rules as a recursive method:

verbatim
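    public boolean isComplete (Tree tree) {
        // the empty tree and a single leaf are both complete
        if (tree == null || isLeaf (tree)) return true;

        // assumes the linked Tree has instance variables left and right
        Tree left = tree.left;
        Tree right = tree.right;

        // a lone child has to be on the left...
        if (left == null && right != null) return false;

        // ...and it has to be a leaf
        if (left != null && right == null)
            return isLeaf (left);

        // two children: both subtrees have to be complete
        return isComplete (left) && isComplete (right);
    }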
verbatim

For this example I used the linked implementation of a tree.  As
an exercise, write the same method for the array implementation.

The heap property is similarly recursive.  In order for a
tree to be a heap, the largest value in the tree has to be at
the root, and the same has to be true for each subtree.
As another exercise, write a method that checks whether a tree
has the heap property.

Heap remove

It might seem odd that we are going to remove things from the
heap before we insert any, but it turns out that removal is easier
to explain.

At first glance, we might think that removing an item from the
heap would be a constant-time operation, since the item with
the highest priority is always at the root.  The problem is that
once we remove the root node, we are left with something that
is no longer a heap.  Before we can return the result, we have
to restore the heap property.  We will call this operation
reheapify.

The situation is shown in the following figure:

figure=tree5.eps,height=2in

The root node has priority r and two subtrees, A and B.
The value at the root of Subtree A is a and the value at
the root of Subtree B is b.

We assume that before we remove r from the tree, the
tree is a heap.  That implies that r is the largest value
in the heap and that a and b are the largest values
in their respective subtrees.

Once we remove r, we have to make the resulting tree
a heap again.  In other words we need to make sure it has
the properties of completeness and heapness.

The best way to ensure completeness is to remove the bottom-most,
right-most node, and put its value at the root.  In a general tree
implementation, we would have to traverse the tree to find this node,
but in the array implementation, we can find it in constant time
because it is always the last (non-null) element of the array.

Of course, the chances are that the last value is not the highest,
so putting it at the root breaks the heapness property.  Fortunately
it is easy to restore.  We know that the largest value in the
heap is either a or b.  Therefore we can select whichever
is larger and swap it with the value at the root.

Arbitrarily, let's say that b is larger.  Now the
situation looks like this:

figure=tree6.eps,height=2in

The value c represents the value we copied from the
last entry in the array.  At this point we know that the highest
value is at the root.  Also, since we haven't changed Subtree A
at all, we know that it is still a heap.  The only problem is
that we don't know if Subtree B is a heap, since we just stuck
a (probably low) value at its root.

Wouldn't it be nice if we had a method that could reheapify
Subtree B?  Wait... we do!
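
Here is one way the whole removal might look for the array implementation.  It is a sketch under assumptions, not a definitive version: the class name HeapRemoveSketch is invented, the items are ints, and the root is assumed to be at index 1 so that the children of node i are at 2i and 2i+1.  It also swaps only when a child is actually larger than the value at the root, which lets it stop early.

```java
class HeapRemoveSketch {
    int[] items;   // items[1] is the root; the children of i are 2i and 2i+1
    int size;      // number of items currently in the heap

    // assumes the argument already satisfies the heap property
    HeapRemoveSketch (int[] heapOrdered) {
        size = heapOrdered.length;
        items = new int[size + 1];
        System.arraycopy (heapOrdered, 0, items, 1, size);
    }

    // removes and returns the largest item; assumes the heap is not empty
    int remove () {
        int result = items[1];
        items[1] = items[size];   // move the last item to the root
        size--;
        reheapify (1);
        return result;
    }

    // restores the heap property for the subtree rooted at i, assuming
    // that both of its subtrees are already heaps
    void reheapify (int i) {
        int left = 2*i, right = 2*i + 1;
        int largest = i;
        if (left <= size && items[left] > items[largest]) largest = left;
        if (right <= size && items[right] > items[largest]) largest = right;
        if (largest == i) return;       // the root is already the largest
        int temp = items[i];
        items[i] = items[largest];
        items[largest] = temp;
        reheapify (largest);            // only the changed subtree needs work
    }

    public static void main (String[] args) {
        HeapRemoveSketch heap = new HeapRemoveSketch (new int[] {9, 5, 8, 1, 3, 7});
        while (heap.size > 0) {
            System.out.print (heap.remove () + " ");
        }
        System.out.println ();
    }
}
```

Each call to reheapify moves down one level of the tree, so the number of calls is never more than the number of levels.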

Heap insert

Inserting a new item in a heap is a similar operation, except that
instead of trickling a value down from the top, we trickle it
up from the bottom.

Again, to guarantee completeness, we add the new element at the
bottom-most, rightmost position in the tree, which is the next
available space in the array.

Then to restore the heap property, we compare the new value with
its neighbors.  The situation looks like this:

figure=tree7.eps,height=2in

The new value is c.  We can restore the heap property
of this subtree by comparing c to a.  
If c is smaller, then the heap property is satisfied.
If c is larger, then we swap c and a.  The
swap satisfies the heap property because we know that c
must also be bigger than b, because c > a and
a > b.

Now that this subtree is reheapified, we can work our way up
the tree until we reach the root.
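
The trickle-up step can be sketched for the array implementation as follows.  As before, this is an illustration under assumptions: the class name HeapInsertSketch is invented, the items are ints, and the root is assumed to be at index 1 so that the parent of node i is at i/2.

```java
import java.util.Arrays;

class HeapInsertSketch {
    int[] items = new int[16];  // items[1] is the root; the parent of i is i/2
    int size = 0;               // number of items currently in the heap

    void insert (int value) {
        // make room if the array is full
        if (size + 1 == items.length) {
            items = Arrays.copyOf (items, 2 * items.length);
        }
        size++;
        items[size] = value;     // the next available space in the array

        // trickle up: swap with the parent while the new value is larger
        int i = size;
        while (i > 1 && items[i] > items[i/2]) {
            int temp = items[i];
            items[i] = items[i/2];
            items[i/2] = temp;
            i = i/2;
        }
    }

    // checks that no node is larger than its parent
    boolean isHeap () {
        for (int i = 2; i <= size; i++) {
            if (items[i] > items[i/2]) return false;
        }
        return true;
    }

    public static void main (String[] args) {
        HeapInsertSketch heap = new HeapInsertSketch ();
        int[] values = {3, 1, 4, 1, 5, 9, 2, 6};
        for (int value : values) heap.insert (value);
        System.out.println ("root = " + heap.items[1] + ", heap? " + heap.isHeap ());
    }
}
```

The loop never has to look at the sibling, for the reason given above: the parent was already at least as large as the sibling before the swap.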

Performance of heaps

For both insert and remove, we perform a constant-time operation
to do the actual insertion and removal, but then we have to
reheapify the tree.  In one case we start at the root and work
our way down, comparing items and then recursively reheapifying
one of the subtrees.  In the other case we start at a leaf and
work our way up, again comparing elements at each level of
the tree.

As usual, there are several operations we might want to count,
like comparisons and swaps.  Either choice would work; the
real issue is the number of levels of the tree we examine
and how much work we do at each level.  In both cases we
keep examining levels of the tree until we restore the heap
property, which means we might only visit one, or in the worst
case we might
have to visit them all.  Let's consider the worst case.

At each level, we perform only constant-time operations
like comparisons and swaps.  So the total amount of work is
proportional to the number of levels in the tree, also called
the height of the tree.

So we might say that these operations are linear with respect to
the height of the tree, but the ``problem size'' we are interested
in is not height, it's the number of items in the heap.

As a function of $n$, the height of the tree is approximately $\log_2 n$.
This is not true for all trees, but it is true for complete
trees.  To see why, think of the number of nodes on each level
of the tree.  The first level contains 1, the second contains 2,
the third contains 4, and so on.  The $i$th level contains
$2^{i-1}$ nodes, and the total number in all levels up to $i$ is
$2^i - 1$.  In other words, if all $h$ levels are full, $2^h - 1 = n$,
which means that $h = \log_2 (n+1)$, or roughly $\log_2 n$.

Thus, both insertion and removal take logarithmic time.
To insert and remove $n$ items takes time proportional to
$n \log_2 n$.

Heapsort

The result of the previous section suggests yet another algorithm
for sorting.  Given $n$ items, we insert them into a Heap and
then remove them.  Because of the Heap semantics, they come
out in order.  We have already shown that this algorithm, which
is called heapsort, takes time proportional to $n \log_2 n$,
which is better than selection sort and the same as mergesort.
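
The idea fits in a few lines if we borrow a ready-made heap.  This sketch uses java.util.PriorityQueue as a stand-in for the Heap developed in this chapter; that substitution is an assumption of the example, as is the class name HeapsortSketch.  Note that PriorityQueue gives the smallest item the highest priority, so the items come out in increasing order.

```java
import java.util.PriorityQueue;

class HeapsortSketch {
    // returns a new array with the same items in increasing order
    static int[] heapsort (int[] array) {
        PriorityQueue<Integer> heap = new PriorityQueue<Integer> ();

        // insert all n items
        for (int item : array) heap.add (item);

        // remove them; each removal yields the smallest remaining item
        int[] result = new int[array.length];
        for (int k = 0; k < result.length; k++) {
            result[k] = heap.poll ();
        }
        return result;
    }

    public static void main (String[] args) {
        int[] sorted = heapsort (new int[] {5, 3, 1, 4, 2});
        for (int item : sorted) System.out.print (item + " ");
        System.out.println ();
    }
}
```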

As the value of $n$ gets large, we expect heapsort to be faster
than selection sort, but performance analysis gives us no way
to know whether it will be faster than mergesort.  We would say
that the two algorithms have the same order of growth because
they grow with the same functional form.  Another way to
say the same thing is that they belong to the same complexity
class.

Complexity classes are sometimes written in ``big-O notation''.
For example, $O(n^2)$, pronounced ``oh of
en squared'' is the set of all functions that grow no faster
than $n^2$ for large values of $n$.  To say that an algorithm
is $O(n^2)$ is the same as saying that it
is quadratic.  The other complexity classes we have seen,
in decreasing order of performance, are:

tabular
$O(1)$         &  constant time
$O(\log n)$    &  logarithmic
$O(n)$         &  linear
$O(n \log n)$  &  ``en log en''
$O(n^2)$       &  quadratic
$O(2^n)$       &  exponential
tabular

So far none of the algorithms we have looked at are exponential.
The reason is that these algorithms take so long (for any significant
value of $n$) that they are impractical.


Glossary

description

[selection sort:] The simple sorting algorithm in Section sorting.

[mergesort:]  A better sorting algorithm from Section mergesort.

[heapsort:]  Yet another sorting algorithm.

[complexity class:]  A set of algorithms whose performance
(usually run time) has the same order of growth.

[order of growth:]  A set of functions with the same leading-order
term, and therefore the same qualitative behavior for large values
of $n$.

[overhead:]  Additional time or resources consumed by a program
performing operations other than the abstract operations considered
in performance analysis.

[height:]  The height of a tree is the number of levels.

description


Table

Arrays, Vectors and Tables

Arrays are a generally useful data structure, but they suffer
from two important limitations:

itemize

The size of the array does not depend on the number of
items in it.  If the array is too big, it wastes space.  If
it is too small it might cause an error, or we might have to
write code to resize it.

Although the array can contain any type of item, the
indices of the array have to be integers.  We cannot, for
example, use a String to specify an element of an array.

itemize

In Section vector we saw how the built-in Vector
class solves the first problem.  As the user adds items it
expands automatically.  It is also possible to shrink a Vector
so that the capacity is the same as the current size.

But Vectors don't help with the second problem.  The
indices are still integers.

That's where the Table ADT comes in.  The Table
is a generalization of a Vector that can use any type
as an index.  These generalized indices are called keys.

Just as you would use an index to access a value in an array,
you use a key to access a value in a Table.  So each
key is associated with a value, which is why Tables are sometimes
called associative arrays.

The canonical example of a table is a dictionary, which is a table
that associates words (the keys) with their definitions (the
values).  Because of this example Tables are also sometimes called
Dictionaries.  Also, the association of a particular key with
a particular value is called an entry.


The Table ADT

Like the other ADTs we have looked at, Tables are defined
by the set of operations they support:

description

[constructor:] Make a new, empty table. 

[put:]  Create an entry that associates a value with a key.

[get:]  For a given key, find the corresponding value.

[containsKey:]  Return true if there
is an entry in the Table with the given Key.

description


The built-in Hashtable

Java provides an implementation of the Table ADT called
Hashtable.  It is in the java.util package.
Later in the chapter we'll see why it is called Hashtable.

To demonstrate the use of the Hashtable we'll write
a short program that traverses a String and counts the number
of times each word appears.

We'll create a new class called WordCount that will
build the Table and then print its contents.  Naturally, each
WordCount object will contain a Hashtable:

verbatim
public class WordCount {
    Hashtable ht;

    public WordCount () {
        ht = new Hashtable ();
    }
}
verbatim

The only public methods for WordCount are processLine,
which takes a String and adds its words to the Table, and print,
which prints the results at the end.

processLine breaks the String into words using a 
StringTokenizer and passes each word to processWord.

verbatim
    public void processLine (String s) {
        StringTokenizer st = new StringTokenizer (s, " ,.", false);
        while (st.hasMoreTokens()) {
            String word = st.nextToken();
            processWord (word.toLowerCase ());
        }
    }
verbatim

The interesting work is in processWord.

verbatim
    public void processWord (String word) {
        if (ht.containsKey (word)) {
            Integer i = (Integer) ht.get (word);
            Integer j = new Integer (i.intValue() + 1);
            ht.put (word, j);
        } else {
            ht.put (word, new Integer (1));
        }
    }
verbatim

If the word is already in the table, we get its counter,
increment it, and put the new value.  Otherwise, we just
put a new entry in the table with the counter set to 1.

To print the entries in the table, we need to be able to enumerate
the keys in the table.  Fortunately, the Hashtable implementation
provides a method, keys, that returns an Enumeration
object we can use.  Enumerations are very similar to the
Iterators we saw in Section iterator.  Both are
abstract classes (declared in Java as interfaces) in the java.util
package; you should review the documentation of both.  Here's how
keys is used:

verbatim
    public void print () {
        // enum is a reserved word in Java 5 and later,
        // so the Enumeration is named keys here
        Enumeration keys = ht.keys ();
        while (keys.hasMoreElements ()) {
            String key = (String) keys.nextElement ();
            Integer value = (Integer) ht.get (key);
            System.out.println (" " + key + ", " + value + " "); 
        }
    }
verbatim

Each of the elements of the Enumeration is an Object,
but since we know they are keys, we typecast them to be Strings.
When we get the values from the Table, they are also Objects,
but we know they are counters, so we typecast them to be Integers.

Finally, to count the words in a string:

verbatim
        WordCount wc = new WordCount ();
        wc.processLine ("da doo ron ron ron, da doo ron ron");
        wc.print ();
verbatim

The output is

verbatim
 ron, 5 
 doo, 2 
 da, 2 
verbatim

In general the Enumeration is not in any particular order.
The only guarantee is that all the keys in the table will appear.
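
If we want the output in a predictable order, we have to impose one ourselves.  The following sketch assembles the pieces of WordCount into one runnable class and sorts the keys before printing.  The class name WordCountSketch and the method sortedKeys are made up for this example, and it uses generic types so that the typecasts are not needed.

```java
import java.util.Collections;
import java.util.Enumeration;
import java.util.Hashtable;
import java.util.StringTokenizer;
import java.util.Vector;

class WordCountSketch {
    Hashtable<String, Integer> ht = new Hashtable<String, Integer> ();

    void processLine (String s) {
        StringTokenizer st = new StringTokenizer (s, " ,.", false);
        while (st.hasMoreTokens ()) {
            processWord (st.nextToken ().toLowerCase ());
        }
    }

    void processWord (String word) {
        if (ht.containsKey (word)) {
            ht.put (word, ht.get (word) + 1);   // increment the counter
        } else {
            ht.put (word, 1);                   // first appearance
        }
    }

    // copies the keys into a Vector and sorts them alphabetically
    Vector<String> sortedKeys () {
        Vector<String> keys = new Vector<String> ();
        Enumeration<String> e = ht.keys ();
        while (e.hasMoreElements ()) {
            keys.add (e.nextElement ());
        }
        Collections.sort (keys);
        return keys;
    }

    public static void main (String[] args) {
        WordCountSketch wc = new WordCountSketch ();
        wc.processLine ("da doo ron ron ron, da doo ron ron");
        for (String key : wc.sortedKeys ()) {
            System.out.println (" " + key + ", " + wc.ht.get (key) + " ");
        }
    }
}
```

With the keys sorted, the entry for da is printed first, then doo, then ron, regardless of how the Hashtable happens to store them.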

A Vector implementation

One easy way to implement the Table ADT is to use a Vector
of entries, where each entry is an object that contains a key
and a value.  These objects are called key-value pairs.

A class definition for a KeyValuePair might look like
this:

verbatim
class KeyValuePair {
    Object key, value;

    public KeyValuePair (Object key, Object value) {
        this.key = key;
        this.value = value;
    }

    public String toString () {
        return " " + key + ", " + value + " ";
    }
}
verbatim

Then the implementation of Table looks like this:

verbatim
public class Table {
    Vector v;

    public Table () {
        v = new Vector ();
    }
}
verbatim

To put a new entry in the table, we just add a new
KeyValuePair to the Vector:

verbatim
    public void put (Object key, Object value) {
        KeyValuePair pair = new KeyValuePair (key, value);
        v.add (pair);
    }
verbatim

Then to look up a key in the Table we have to traverse the
Vector and find a KeyValuePair with a matching
key:

verbatim
    public Object get (Object key) {
        Iterator iterator = v.iterator ();
        while (iterator.hasNext ()) {
            KeyValuePair pair = (KeyValuePair) iterator.next ();
            if (key.equals (pair.key)) {
                return pair.value;
            }
        }
        return null;
    }
verbatim

The idiom to traverse a Vector is the one we saw in
Section iterator.  When we compare keys, we use deep
equality (the equals method) rather than shallow
equality (the == operator).  This allows the key class
to specify the definition of equality.  In our example, the
keys are Strings, so it will use the built-in equals
method in the String class.

For most of the built-in classes, the equals method
implements deep equality.  For some classes, though, it is
not easy to define what that means.  For example, see the
documentation of equals for Doubles.
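
A short experiment shows two places where equals for Doubles disagrees with the == operator on the underlying double values.  The class name DoubleEqualsDemo and its two helper methods are invented for this example.

```java
class DoubleEqualsDemo {
    // compares two primitive doubles with the == operator
    static boolean primitiveEqual (double a, double b) {
        return a == b;
    }

    // compares the same two values as Double objects with equals
    static boolean objectEqual (double a, double b) {
        return Double.valueOf (a).equals (Double.valueOf (b));
    }

    public static void main (String[] args) {
        // NaN is never == to anything, but two NaN Doubles are equal
        System.out.println (primitiveEqual (Double.NaN, Double.NaN));   // false
        System.out.println (objectEqual (Double.NaN, Double.NaN));      // true

        // positive and negative zero are ==, but the Doubles are not equal
        System.out.println (primitiveEqual (0.0, -0.0));                // true
        System.out.println (objectEqual (0.0, -0.0));                   // false
    }
}
```

The choice made by equals is what allows a Double to behave sensibly as a key in a table.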

The containsKey method is almost identical to get
except that it returns true or false instead of
an object reference or null.
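
For example, containsKey might be written like this.  To keep the sketch self-contained it is wrapped in a small class of its own, with hypothetical names TableSketch and Pair standing in for Table and KeyValuePair, and it uses generic types so that no typecast is needed.

```java
import java.util.Iterator;
import java.util.Vector;

class TableSketch {
    // stands in for the KeyValuePair class
    static class Pair {
        Object key, value;

        Pair (Object key, Object value) {
            this.key = key;
            this.value = value;
        }
    }

    Vector<Pair> v = new Vector<Pair> ();

    // the simple version of put: always adds a new entry
    void put (Object key, Object value) {
        v.add (new Pair (key, value));
    }

    // same traversal as get, but returns true or false
    boolean containsKey (Object key) {
        Iterator<Pair> iterator = v.iterator ();
        while (iterator.hasNext ()) {
            Pair pair = iterator.next ();
            if (key.equals (pair.key)) {
                return true;
            }
        }
        return false;
    }

    public static void main (String[] args) {
        TableSketch table = new TableSketch ();
        table.put ("da", 2);
        System.out.println (table.containsKey ("da"));
        System.out.println (table.containsKey ("ron"));
    }
}
```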

Actually, the implementation of put is not complete.
If there is already an entry in the table with the given
key, put should update it (give it a new value), not
add another entry with the same key.  So a more correct version
is:

verbatim
    public void put (Object key, Object value) {
        if (containsKey (key)) {
            update (key, value);
        } else {
            KeyValuePair pair = new KeyValuePair (key, value);
            v.add (pair);
        }
    }
verbatim

The update method is not part of the Table ADT, so
it is declared private:

verbatim
    private void update (Object key, Object value) {
        Iterator iterator = v.iterator ();
        while (iterator.hasNext ()) {
            KeyValuePair pair = (KeyValuePair) iterator.next ();
            if (key.equals (pair.key)) {
                pair.value = value;
                break;
            }
        }
    }
verbatim
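
For readers who want to try the pieces together, here is one way they
might be assembled into a single file.  This is a sketch, not the
definitive version: the names SketchPair, SketchTable, find and size
are invented for the example, it uses a generic Vector so the cast
goes away, and it folds the search into one private helper so that
put traverses the Vector only once.

```java
import java.util.Vector;

// Hypothetical stand-in for KeyValuePair.
class SketchPair {
    Object key, value;

    SketchPair(Object key, Object value) {
        this.key = key;
        this.value = value;
    }
}

// Hypothetical stand-in for Table, backed by a Vector of pairs.
class SketchTable {
    private final Vector<SketchPair> v = new Vector<SketchPair>();

    // Returns the pair whose key equals the given key, or null if none.
    private SketchPair find(Object key) {
        for (SketchPair pair : v) {
            if (key.equals(pair.key)) {
                return pair;
            }
        }
        return null;
    }

    public boolean containsKey(Object key) {
        return find(key) != null;
    }

    public Object get(Object key) {
        SketchPair pair = find(key);
        return (pair == null) ? null : pair.value;
    }

    // Updates an existing entry, or adds a new one if the key is absent.
    public void put(Object key, Object value) {
        SketchPair pair = find(key);
        if (pair != null) {
            pair.value = value;
        } else {
            v.add(new SketchPair(key, value));
        }
    }

    // Number of entries (an assumption added for the demonstration).
    public int size() {
        return v.size();
    }

    public static void main(String[] args) {
        SketchTable table = new SketchTable();
        table.put("one", "uno");
        table.put("one", "eins");              // replaces the old value
        System.out.println(table.get("one"));  // prints eins
        System.out.println(table.size());      // prints 1
    }
}
```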

The only method we haven't implemented is keys.  As an
exercise, write this method by building a Vector of
keys and returning the elements of the vector.


The List abstract class

The java.util package defines an abstract class (in Java's
terminology, an interface) called
List that specifies the set of operations a class has to
implement in order to be considered (very abstractly) a list.
This does not mean, of course, that every class that implements
List has to be a linked list.

As a matter of fact, the built-in LinkedList class does
implement List, but so does Vector.

Some of the methods in the List definition are add,
get and iterator.  Notice that all the methods from
the Vector class that we used to implement Table
are defined in the List abstract class.

That means that instead of a Vector, we could have used
any List class.  In Table.java we can replace
Vector with LinkedList, and the program still works!

This kind of type generality can be very useful for tuning
the performance of a program.  You can write the program in
terms of an abstract class like List and then test the
program with several different implementations to see which
yields the best performance.
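
As a rough illustration, the following sketch writes two helper
methods purely in terms of List and then runs them on both a Vector
and a LinkedList.  The class ListDemo and its methods are invented
for this example; only add and the iterator (through the for-each
loop) are used, so either implementation can be passed in.

```java
import java.util.LinkedList;
import java.util.List;
import java.util.Vector;

class ListDemo {
    // Adds the integers 1..n using only the List method add.
    static void fill(List<Integer> list, int n) {
        for (int i = 1; i <= n; i++) {
            list.add(i);
        }
    }

    // Sums the elements; the for-each loop relies on the List's iterator.
    static int sum(List<Integer> list) {
        int total = 0;
        for (int x : list) {
            total += x;
        }
        return total;
    }

    public static void main(String[] args) {
        List<Integer> vector = new Vector<Integer>();
        List<Integer> linked = new LinkedList<Integer>();

        fill(vector, 10);
        fill(linked, 10);

        System.out.println(sum(vector));  // prints 55
        System.out.println(sum(linked));  // prints 55
    }
}
```

Only the two lines that call new mention a concrete class, so
trying a different implementation means changing one line.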


Hash table implementation

The reason that the built-in implementation of the Table ADT
is called Hashtable is that it uses a particularly efficient
implementation of a Table called a hashtable.

Of course, the whole point of defining an ADT is that it allows
us to use an implementation without knowing the details.  So it
is probably a bad thing that the people who wrote the Java
library named this class according to its implementation rather
than its ADT, but I suppose of all the bad things they did, this
one is pretty small.

Anyhoo, you might be wondering what a hashtable is, and why I
say it is particularly efficient.  We'll start by analyzing
the performance of the List implementation we just did.

Looking at the implementation of put, we see that there are
two cases.  If the key is not already in the table, then we
only have to create a new key-value pair and add it to
the List.  Both of these are constant-time operations.

In the other case, we have to traverse the List to find
the existing key-value pair.  This operation is linear (depending
on the length of the list).
For the same reason, get and containsKey are also
linear.

Although linear operations are often good enough, we can do
better.  It turns out that there is a way to implement the
Table ADT so that both put and get are constant-time
operations!

The key is to realize that traversing a list takes time
proportional to the length of the list.  If we can put an upper
bound on the length of the list, then we can put an upper
bound on the traverse time, and anything with a fixed upper
bound is considered constant time.

But how can we limit the length of the lists without limiting
the number of items in the table?  By increasing the number
of lists:  instead of one long list, we'll keep many short
lists.

As long as we know which list to search, we can put a bound
on the amount of searching.

Hash Functions

And that's where hash functions come in.  We need some way to
look at a key and know, without searching, which list it will
be in.  We'll assume that the lists
are in an array (or Vector) so we can refer to them by index.

The solution is to come up with some mapping---almost any
mapping---between the key values and the indices of the lists.
For every possible key there has to be a single index, but
there might be many keys that map to the same index.

For example, imagine an array of 8 lists and a table made
up of keys that are Integers and values that are
Strings.  It might be tempting to use the
intValue of the Integers as indices, since they
are the right type, but there are a whole lot of integers
that do not fall between 0 and 7, which are the only legal
indices.

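The modulus operator provides a simple
(in terms of code) and efficient (in terms of run time) way
to map all the integers into the range from 0 to 7.
The expression

verbatim
    key.intValue() % 8
verbatim

is always in the range from 0 to 7 as long as the key is not
negative.  For a negative key, Java's modulus operator yields a
result between -7 and 0, so we have to take the absolute value of
the result (using Math.abs) to get a legal index.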
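
One caveat is worth checking by hand: in Java the result of the
modulus operator takes the sign of its left operand, so a negative
key produces a negative remainder.  The sketch below (IndexDemo and
its method names are made up for the example) compares the plain
remainder with Math.floorMod, a library method that always returns
a value between 0 and the divisor minus one when the divisor is
positive.

```java
class IndexDemo {
    static final int NUM_LISTS = 8;

    // Plain remainder: can be negative when key is negative.
    static int remainder(int key) {
        return key % NUM_LISTS;
    }

    // Floor modulus: always in the range 0..NUM_LISTS-1.
    static int index(int key) {
        return Math.floorMod(key, NUM_LISTS);
    }

    public static void main(String[] args) {
        System.out.println(remainder(19));  // prints 3
        System.out.println(index(19));      // prints 3
        System.out.println(remainder(-3));  // prints -3, not a legal index
        System.out.println(index(-3));      // prints 5
    }
}
```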

For other types, we can play similar games.  For example,
to convert a Character to an integer, we can use
the built-in method Character.getNumericValue and
for Doubles there is intValue.

For Strings we could get the numeric value of each character and
add them up, or instead we might use a shifted sum.  To
calculate a shifted sum, alternate between adding new values to the
accumulator and shifting the accumulator to the left.
By ``shift to the left'' I mean ``multiply
by a constant.''

To see how this works, take the list of numbers 1, 2, 3, 4, 5, 6
and compute their shifted sum as follows.  First,
initialize the accumulator to 0.  Then,

enumerate

Add the next element of the list to the accumulator.

Multiply the accumulator by 10.

Repeat until the list is finished.

enumerate
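
The steps above can be sketched directly in code.  The class name
ShiftedSumDemo and the method shiftedSum are hypothetical, and the
method works on an array of ints rather than on a String, so it
follows the worked example and leaves the String version open.

```java
class ShiftedSumDemo {
    // Alternates between adding the next value and "shifting" the
    // accumulator by multiplying it by the given constant.
    static long shiftedSum(int[] values, int multiplier) {
        long accumulator = 0;
        for (int value : values) {
            accumulator += value;       // step 1: add the next element
            accumulator *= multiplier;  // step 2: shift to the left
        }
        return accumulator;
    }

    public static void main(String[] args) {
        int[] numbers = {1, 2, 3, 4, 5, 6};
        System.out.println(shiftedSum(numbers, 10));  // prints 1234560
    }
}
```

With a multiplier of 10 the result spells out the original list in
decimal, which shows that the order of the elements affects the
sum; a plain sum would give 21 for any ordering.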

As an exercise, write a method that calculates the shifted sum
of the numeric values of the characters in a String using
a multiplier of 32.

So, for each type, we have a function that takes values of that
type and generates a corresponding integer value.  These
functions are called hash functions, because they often
involve making a hash of the components of the objects.  The
integer value for each object is called its hash code.

There is one other way we might generate a hash code for
Java objects.  Every Java object provides
a method called hashCode that returns an integer
that corresponds to that object.  For the built-in types,
the hashCode method is implemented so that if two objects
contain the same data, they will have the same hash code
(as in deep equality).  The documentation of these methods
explains what the hash function is.  You should check them
out.
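
As a quick check of that claim, here is a small sketch (HashCodeDemo
and sameHash are invented names) that builds two separate objects
with the same contents and compares their hash codes.

```java
class HashCodeDemo {
    // True if the two objects report the same hash code.
    static boolean sameHash(Object a, Object b) {
        return a.hashCode() == b.hashCode();
    }

    public static void main(String[] args) {
        // Two distinct String objects that contain the same characters.
        String a = new String("table");
        String b = new String("table");

        System.out.println(a == b);          // prints false
        System.out.println(sameHash(a, b));  // prints true

        // For Integer, the documented hash code is the int value itself.
        System.out.println(Integer.valueOf(42).hashCode());  // prints 42
    }
}
```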

For user-defined types, it is up to the implementor to provide
an appropriate hash function.  The default hash function, provided
in the Object class, often uses the location of the
object to generate a hash code, so its notion of ``sameness''
is shallow equality.  Most often when we are searching a
hash table for a key, shallow equality is not what we want.
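
Here is a sketch of what goes wrong, and one way to fix it.  The
classes PlainKey and DeepKey are hypothetical: the first inherits
equals and hashCode from Object, and the second overrides both so
that two keys with the same coordinates count as the same key.  The
hash function 31 * x + y is just one reasonable choice, not the only
one.

```java
import java.util.Hashtable;

// Inherits equals and hashCode from Object, so sameness is shallow.
class PlainKey {
    final int x, y;

    PlainKey(int x, int y) {
        this.x = x;
        this.y = y;
    }
}

// Overrides equals and hashCode so that sameness is deep.
class DeepKey {
    final int x, y;

    DeepKey(int x, int y) {
        this.x = x;
        this.y = y;
    }

    @Override
    public boolean equals(Object other) {
        if (!(other instanceof DeepKey)) {
            return false;
        }
        DeepKey that = (DeepKey) other;
        return x == that.x && y == that.y;
    }

    @Override
    public int hashCode() {
        return 31 * x + y;  // equal keys always get equal hash codes
    }
}

class KeyDemo {
    public static void main(String[] args) {
        Hashtable<Object, String> table = new Hashtable<Object, String>();
        table.put(new PlainKey(1, 2), "plain");
        table.put(new DeepKey(1, 2), "deep");

        // Look up each entry with a new key that has the same contents.
        System.out.println(table.get(new PlainKey(1, 2)));  // prints null
        System.out.println(table.get(new DeepKey(1, 2)));   // prints deep
    }
}
```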

Regardless of how the hash code is generated, the last step
is to use the modulus operator to map the hash code
into the range of legal indices.

Resizing a hash table

Let's review.  A hash table consists of an array (or Vector)
of Lists, where each List contains a small number
of key-value pairs.  To add a new entry to a table, we calculate
the hash code of the new key and add the entry to the
corresponding List.

To look up a key, we hash it again (getting the same value
every time) and search the corresponding list.  If the lengths
of the lists are bounded then the search time is bounded.
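
To tie the pieces together, here is a minimal sketch of such a
table with a fixed number of lists.  Everything about it is an
assumption made for illustration: the names ChainedTable, Entry and
listFor, the choice of LinkedList for the individual lists, and the
use of Math.floorMod to keep the index from going negative.  It
does not resize, so the lists are short only while the table holds
few entries.

```java
import java.util.ArrayList;
import java.util.LinkedList;
import java.util.List;

class ChainedTable {
    // One key-value pair stored in one of the lists.
    private static class Entry {
        final Object key;
        Object value;

        Entry(Object key, Object value) {
            this.key = key;
            this.value = value;
        }
    }

    private final List<List<Entry>> lists = new ArrayList<List<Entry>>();

    ChainedTable(int numLists) {
        for (int i = 0; i < numLists; i++) {
            lists.add(new LinkedList<Entry>());
        }
    }

    // Hashes the key and maps the hash code to a legal index.
    private List<Entry> listFor(Object key) {
        int index = Math.floorMod(key.hashCode(), lists.size());
        return lists.get(index);
    }

    public void put(Object key, Object value) {
        List<Entry> list = listFor(key);
        for (Entry entry : list) {
            if (key.equals(entry.key)) {
                entry.value = value;  // key already present: update it
                return;
            }
        }
        list.add(new Entry(key, value));
    }

    public Object get(Object key) {
        // Only one list is searched, no matter how many lists there are.
        for (Entry entry : listFor(key)) {
            if (key.equals(entry.key)) {
                return entry.value;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        ChainedTable table = new ChainedTable(8);
        table.put(1, "one");
        table.put(9, "nine");  // 1 and 9 land in the same list
        System.out.println(table.get(1));  // prints one
        System.out.println(table.get(9));  // prints nine
    }
}
```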

So how do we keep the lists short?  Well, one goal is to keep
them as balanced as possible, so that there are no very long
lists at the same time that others are empty.
This is not easy to do perfectly---it depends on how well we
chose the hash function---but we can usually do a pretty good
job.

Even with perfect balance, the average list length
grows linearly with the number of entries, and we have to put
a stop to that.

The solution is to keep track of the average number of entries
per list, which is called the load factor;
if the load factor gets too high, we have to resize the table.

To resize, we create a new table, usually twice as big as the
original, take all the entries out of the old one, hash them again,
and put them in the new table.  Usually we can get away with using the
same hash function; we just use a different value for the modulus
operator.
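
Here is a sketch of the resizing step on its own.  To keep it short
the lists hold Integer keys with no values, and all the names
(ResizeDemo, emptyTable, add, loadFactor, resize) are invented for
the example.  The hash function stays the same; only the number used
with the modulus changes, because it is taken from the size of the
table being filled.

```java
import java.util.ArrayList;
import java.util.List;

class ResizeDemo {
    // Builds a table with the given number of empty lists.
    static List<List<Integer>> emptyTable(int numLists) {
        List<List<Integer>> table = new ArrayList<List<Integer>>();
        for (int i = 0; i < numLists; i++) {
            table.add(new ArrayList<Integer>());
        }
        return table;
    }

    // Hashes the key and adds it to the corresponding list.
    static void add(List<List<Integer>> table, Integer key) {
        int index = Math.floorMod(key.hashCode(), table.size());
        table.get(index).add(key);
    }

    // Average number of entries per list.
    static double loadFactor(List<List<Integer>> table) {
        int entries = 0;
        for (List<Integer> list : table) {
            entries += list.size();
        }
        return (double) entries / table.size();
    }

    // Creates a table twice as big and rehashes every entry into it.
    static List<List<Integer>> resize(List<List<Integer>> old) {
        List<List<Integer>> bigger = emptyTable(2 * old.size());
        for (List<Integer> list : old) {
            for (Integer key : list) {
                add(bigger, key);
            }
        }
        return bigger;
    }

    public static void main(String[] args) {
        List<List<Integer>> table = emptyTable(4);
        for (int key = 0; key < 12; key++) {
            add(table, key);
        }
        System.out.println(loadFactor(table));  // prints 3.0

        table = resize(table);
        System.out.println(loadFactor(table));  // prints 1.5
    }
}
```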

Performance of resizing

How long does it take to resize the table?  Clearly it is linear
with the number of entries.  That means that most of the
time put takes constant time, but every once in a while
it takes linear time.

At first glance, that sounds bad.  Doesn't that undermine my
claim that we can perform put in constant time?  Well,
frankly, yes.  But with a little wheedling, I can fix it.

Since some put operations take longer than others, let's
figure out the average time for a put operation.  The
average is going to be $c$, the constant time for a simple
put, plus an additional term of $p$, the percentage of
the time I have to resize, times $kn$, the cost
of resizing.

equation
t(n) = c + p kn
equation

I don't know what $c$ and $k$ are, but we can figure out what
$p$ is.  Imagine that we have just resized the hash table by
doubling its size.  If there are $n$ entries, then we can add an
additional $n$ entries before we have to resize again.  So the
percentage of the time we have to resize is $1/n$.

Plugging into the equation, we get

equation
t(n) = c + (1/n) kn = c + k
equation

In other words, $t(n)$ is constant time!
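
If the algebra seems too slick, a simulation can back it up.  The
sketch below does not build a real table; it only counts how many
entries would have to be rehashed while adding n entries, under the
assumption (mine, for the sake of the example) that the table starts
with room for one entry and doubles whenever it is full.  The names
AmortizedDemo and entriesMoved are hypothetical.

```java
class AmortizedDemo {
    // Returns the total number of entries rehashed while adding n
    // entries to a table that doubles its capacity whenever it is full.
    static long entriesMoved(int n) {
        long moved = 0;
        int capacity = 1;
        for (int size = 0; size < n; size++) {
            if (size == capacity) {
                moved += size;   // a resize rehashes every existing entry
                capacity *= 2;
            }
            // adding the new entry itself is the constant-time part
        }
        return moved;
    }

    public static void main(String[] args) {
        int n = 1000;
        long moved = entriesMoved(n);
        System.out.println(moved);               // prints 1023
        System.out.println((double) moved / n);  // prints 1.023
    }
}
```

The resizes happen at sizes 1, 2, 4, and so on, so the total work
is less than $2n$ and the extra cost per put stays below a fixed
constant no matter how large $n$ gets.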


Glossary

description

[table:] An ADT that defines operations on a collection of
entries.

[entry:] An element in a table that contains a key-value pair.

[key:] An index, of any type, used to look up values in a table.

[value:] An element, of any type, stored in a table.

[dictionary:] Another name for a table.

[associative array:] Another name for a dictionary.

[hashtable:] A particularly efficient implementation of a table.

[hash function:] A function that maps values of a certain type
onto integers.

[hash code:] The integer value that corresponds to a given value.

[shifted sum:] A simple hash function often used for compound
objects like Strings.

[load factor:] The number of entries in a hashtable divided
by the number of lists in the hashtable; i.e. the average number
of entries per list.


description