Circle-ellipse problem
Encyclopedia
The circle-ellipse problem in software development
(sometimes known as the square-rectangle problem) illustrates a number of pitfalls which can arise when using subtype polymorphism
in object modelling. The issues are most commonly encountered when using object-oriented programming
.
The problem concerns which subtyping or inheritance
relationship should exist between classes which represent circle
s and ellipse
s (or, similarly, square
s and rectangle
s). More generally, the problem illustrates the difficulties which can occur when a base class contains method
s which mutate an object in a manner which might invalidate a (stronger) invariant found in a derived class, causing the Liskov substitution principle
to be violated.
The existence of the circle-ellipse problem is sometimes used to criticize object-oriented programming. It may also imply that hierarchical taxonomies are difficult to make universal, implying that situational classification systems may be more practical.
that subtype polymorphism
, which is implemented in most OO languages via inheritance
, should be used to model object types which are subsets of each other; this is commonly referred to as the is-a relationship. In the present example, the set of circles is a subset of the set of ellipses; circles can be defined as ellipses whose major and minor axes are the same length. Thus, code written in an OOPL which models shapes will frequently choose to make
If the classes are immutable
, there is no problem with this state of affairs. Circles satisfy all invariants of ellipses; and an (immutable) circle can be used in any context where an immutable ellipse is expected. The relationship satisfies the Liskov substitution principle
.
But some OO designs encourage the use of mutator
s, methods which modify instances of the class. A sub-class has to provide support for all behaviour supported by the super-class; subclasses must implement any mutators defined in a base class. In the present case, the method
A related problem with this inheritance arises when we consider the implementation. An ellipse requires more state to describe than a circle does, as the former needs attributes to specify the length and rotation of the major and minor axes; whereas a circle needs only a radius. It may be possible to avoid this if the language (such as Eiffel
) makes constant values of a class, functions without arguments and data members interchangeable.
Some authors have suggested reversing the relationship between circle and ellipse, on the grounds that an ellipse is a circle with additional capabilities. Unfortunately, ellipses fail to satisfy many of the invariants of circles; if
The problem is sometimes expressed in statements such as "a
Alternately,
, this can be done via the
). This is exactly the implementation that is used in purely functional programming.
In this case methods such as
This means that it is no longer a problem to define
A disadvantage is that changing the value of an instance then requires an assignment
, which is inconvenient and prone to programming errors, e.g.
A second disadvantage is that such an assignment conceptually involves a temporary value,
which could reduce performance and be difficult to optimise.
The
This has a disadvantage of introducing an extra class where all we really want to do is specify that
The disadvantage is that one can no longer use circles where ellipses are expected.
However, an acceptable workaround may be to add a method
Alternatively, one may provide a method
A circle can already be represented by an ellipse. There is no reason to have class Circle unless it needs some circle-specific methods that can't be applied to an ellipse, or unless the programmer wishes to benefit from conceptual and/or performance advantages of the circle's simpler model.
Many object systems in popular use are based on a design which takes it for granted that an object carries the same type over its entire lifetime, from construction to finalization. This is not a limitation of OOP, but rather of particular implementations only.
The following example uses the Common Lisp
Object System (CLOS
) in which objects can change class without losing their identity. All variables or other storage locations which hold a reference to an object continue to hold a reference to that same object after it changes class.
The circle and ellipse models are deliberately simplified to avoid distracting details which are not relevant to the Circle-Ellipse Problem. An ellipse has two semi-axes called
This code can be demonstrated with an interactive session, using the CLISP implementation of Common Lisp.
with arguments (#), no method is applicable.
The following restarts are available:
RETRY :R1 try calling RADIUS again
RETURN :R2 specify return values
ABORT :R3 Abort main loop
Break 1 [6]> :a
[7]> (setf (v-axis obj) 4)
4
[8]> (radius obj)
4
[9]> (class-of obj)
[10]> (setf (radius obj) 9)
9
[11]> (v-axis obj)
9
[12]> (h-axis obj)
9
[13]> (setf (h-axis obj) 8)
8
[14]> (class-of obj)
[15]> (radius obj)
with arguments (#), no method is applicable.
The following restarts are available:
RETRY :R1 try calling RADIUS again
RETURN :R2 specify return values
ABORT :R3 Abort main loop
Break 1 [16]> :a
[17]>
Now, a prisoner is obviously a person. So we could logically create a sub-class:
Just as obviously, this leads us into trouble, since a prisoner is not free to move an arbitrary distance in any direction, yet the contract of the
So, in fact, the class
By analogy, then, a Circle, is not an Ellipse, because it lacks the same degrees of freedom as an Ellipse.
This strongly suggests that inheritance should never be used when the sub-class restricts the freedom implicit in the base class, but should only be used when the sub-class adds extra detail to the concept represented by the base class is in 'Monkey' is-a 'Animal'.
Software development
Software development is the development of a software product...
(sometimes known as the square-rectangle problem) illustrates a number of pitfalls which can arise when using subtype polymorphism
Subtype polymorphism
In programming language theory, subtyping or subtype polymorphism is a form of type polymorphism in which a subtype is a datatype that is related to another datatype by some notion of substitutability, meaning that program constructs, typically subroutines or functions, written to operate on...
in object modelling. The issues are most commonly encountered when using object-oriented programming
Object-oriented programming
Object-oriented programming is a programming paradigm using "objects" – data structures consisting of data fields and methods together with their interactions – to design applications and computer programs. Programming techniques may include features such as data abstraction,...
.
The problem concerns which subtyping or inheritance
Inheritance (computer science)
In object-oriented programming , inheritance is a way to reuse code of existing objects, establish a subtype from an existing object, or both, depending upon programming language support...
relationship should exist between classes which represent circle
Circle
A circle is a simple shape of Euclidean geometry consisting of those points in a plane that are a given distance from a given point, the centre. The distance between any of the points and the centre is called the radius....
s and ellipse
Ellipse
In geometry, an ellipse is a plane curve that results from the intersection of a cone by a plane in a way that produces a closed curve. Circles are special cases of ellipses, obtained when the cutting plane is orthogonal to the cone's axis...
s (or, similarly, square
Square (geometry)
In geometry, a square is a regular quadrilateral. This means that it has four equal sides and four equal angles...
s and rectangle
Rectangle
In Euclidean plane geometry, a rectangle is any quadrilateral with four right angles. The term "oblong" is occasionally used to refer to a non-square rectangle...
s). More generally, the problem illustrates the difficulties which can occur when a base class contains method
Method (computer science)
In object-oriented programming, a method is a subroutine associated with a class. Methods define the behavior to be exhibited by instances of the associated class at program run time...
s which mutate an object in a manner which might invalidate a (stronger) invariant found in a derived class, causing the Liskov substitution principle
Liskov substitution principle
Substitutability is a principle in object-oriented programming. It states that, in a computer program, if S is a subtype of T, then objects of type T may be replaced with objects of type S without altering any of the desirable properties of that program...
to be violated.
The existence of the circle-ellipse problem is sometimes used to criticize object-oriented programming. It may also imply that hierarchical taxonomies are difficult to make universal, implying that situational classification systems may be more practical.
The problem
It is a central tenet of object-oriented analysis and designObject-oriented analysis and design
Object-oriented analysis and design is a software engineering approach that models a system as a group of interacting objects. Each object represents some entity of interest in the system being modeled, and is characterised by its class, its state , and its behavior...
that subtype polymorphism
Subtype polymorphism
In programming language theory, subtyping or subtype polymorphism is a form of type polymorphism in which a subtype is a datatype that is related to another datatype by some notion of substitutability, meaning that program constructs, typically subroutines or functions, written to operate on...
, which is implemented in most OO languages via inheritance
Inheritance (computer science)
In object-oriented programming , inheritance is a way to reuse code of existing objects, establish a subtype from an existing object, or both, depending upon programming language support...
, should be used to model object types which are subsets of each other; this is commonly referred to as the is-a relationship. In the present example, the set of circles is a subset of the set of ellipses; circles can be defined as ellipses whose major and minor axes are the same length. Thus, code written in an OOPL which models shapes will frequently choose to make
class Circle
as a sub-class of class Ellipse
(i.e., inheriting from it).If the classes are immutable
Immutable object
In object-oriented and functional programming, an immutable object is an object whose state cannot be modified after it is created. This is in contrast to a mutable object, which can be modified after it is created...
, there is no problem with this state of affairs. Circles satisfy all invariants of ellipses; and an (immutable) circle can be used in any context where an immutable ellipse is expected. The relationship satisfies the Liskov substitution principle
Liskov substitution principle
Substitutability is a principle in object-oriented programming. It states that, in a computer program, if S is a subtype of T, then objects of type T may be replaced with objects of type S without altering any of the desirable properties of that program...
.
Ellipse
could have a method stretchX (integer Factor)
which alters the length of one of its axes, but not the other, and returns the result in a new Ellipse
object. This would cause no problem for Circle.stretchX
, as it could return a new Ellipse
just as the Ellipse.stretchX
does (whilst remaining unaltered itself).But some OO designs encourage the use of mutator
Mutator method
In computer science, a mutator method is a method used to control changes to a variable.The mutator method, sometimes called a "setter", is most often used in object-oriented programming, in keeping with the principle of encapsulation...
s, methods which modify instances of the class. A sub-class has to provide support for all behaviour supported by the super-class; subclasses must implement any mutators defined in a base class. In the present case, the method
Ellipse.stretchX
alters the length of one of its axes in place. If Circle
inherits from Ellipse
, it must also have a method stretchX
, but the result of this method would be to change a circle into something which is no longer a circle. The Circle class can not simultaneously satisfy its own invariant and the behavioural requirements of the Ellipse.stretchX
method.A related problem with this inheritance arises when we consider the implementation. An ellipse requires more state to describe than a circle does, as the former needs attributes to specify the length and rotation of the major and minor axes; whereas a circle needs only a radius. It may be possible to avoid this if the language (such as Eiffel
Eiffel (programming language)
Eiffel is an ISO-standardized, object-oriented programming language designed by Bertrand Meyer and Eiffel Software. The design of the language is closely connected with the Eiffel programming method...
) makes constant values of a class, functions without arguments and data members interchangeable.
Some authors have suggested reversing the relationship between circle and ellipse, on the grounds that an ellipse is a circle with additional capabilities. Unfortunately, ellipses fail to satisfy many of the invariants of circles; if
Circle
has a method radius
, Ellipse
will now have to provide it as well.The problem is sometimes expressed in statements such as "a
Circle
is not a sort of Ellipse
". This looks confusingly like the absurd "a circle is not a sort of ellipse", and sounds identical, so it is unhelpful. What is actually meant is "an OO-model of a circle should not be a sort of OO-model of an ellipse"Possible solutions
One may solve the problem by changing one's model, or perhaps using a different language, which could be a (not yet implemented) extension of an existing language, or by using a different paradigm. Exactly which option is appropriate will depend on who wroteCircle
and who wrote Ellipse
. If the same author is designing them both from scratch, then the author will be able to define the interface to handle this situation. If the Ellipse
object was already written, and cannot be changed, then the options are more limited.Return success or failure value
Allow the objects to return a "success" or "failure" value for each modifier. This is usually done in the case of file I/O, but can also be helpful here. Now,Ellipse.stretchX
works, and returns 'true', while Circle.stretchX
simply returns 'false'. This is in general good practice, but may require that the original author of Ellipse
anticipated such a problem, and defined the mutators as returning a value.Alternately,
Circle.stretchX
could throw an exception (but depending on the language, this may also require that the original author of Ellipse
declare that it may throw an exception).Return the new value of X
This is a similar solution to the above, but is slightly more powerful.Ellipse.stretchX
now returns the new value of its X dimension. Now, Circle.stretchX
can simply return its current radius. All modifications must be done through Circle.stretch
, which preserves the circle invariant.Allow for a weaker contract on Ellipse
If the interface contract forEllipse
states only that "stretchX modifies the X axis", and does not state "and nothing else will change", then Circle
could simply force the X and Y dimensions to be the same. Circle.stretchX
and Circle.stretchY
both modify both the X and Y size.Circle::stretchX(x) {xSize = ySize = x;}
Circle::stretchY(y) {xSize = ySize = y;}
Convert the Circle into an Ellipse
IfCircle.stretchX
is called, then Circle
changes itself into an Ellipse
. For example, in Common LispCommon Lisp
Common Lisp, commonly abbreviated CL, is a dialect of the Lisp programming language, published in ANSI standard document ANSI INCITS 226-1994 , . From the ANSI Common Lisp standard the Common Lisp HyperSpec has been derived for use with web browsers...
, this can be done via the
CHANGE-CLASS
method. This may be dangerous, however, if some other function is expecting it to be a Circle
. Some languages don't allow this type of change at all, and others impose restrictions on the Ellipse
class to be an acceptable replacement for Circle
.Make all instances constant
One can change the model so that instances of the classes represent constant values (i.e. they are immutableImmutable object
In object-oriented and functional programming, an immutable object is an object whose state cannot be modified after it is created. This is in contrast to a mutable object, which can be modified after it is created...
). This is exactly the implementation that is used in purely functional programming.
In this case methods such as
stretchX
must be changed to yield a new instance, rather than modifying the instance they act on.This means that it is no longer a problem to define
Circle.stretchX
, and the inheritance reflects the mathematical relationship between circles and ellipses.A disadvantage is that changing the value of an instance then requires an assignment
Assignment (computer science)
In computer programming, an assignment statement sets or re-sets the value stored in the storage location denoted by a variable name. In most imperative computer programming languages, assignment statements are one of the basic statements...
, which is inconvenient and prone to programming errors, e.g.
Orbit(planet[i]) := Orbit(planet[i]).stretchX
.A second disadvantage is that such an assignment conceptually involves a temporary value,
which could reduce performance and be difficult to optimise.
Factor out modifiers
One can define a new classMutableEllipse
, and put the modifiers from Ellipse
in it.The
Circle
only inherits queries from Ellipse
.This has a disadvantage of introducing an extra class where all we really want to do is specify that
Circle
does not inherit modifiers from Ellipse
.Impose preconditions on modifiers
One can specify thatEllipse.stretchX
is only allowed on instances satisfying Ellipse.stretchable
, and will otherwise throw an exception. This requires anticipation of the problem when Ellipse is defined.Factor out common functionality into an abstract base class
Create an abstract base class calledRoundObject
and put methods that work with both Circle
s and Ellipse
s in this class. Functions that can deal with either type of object will expect a RoundObject
, and functions that call Ellipse
or Circle
-specific requirements will use the descendant classes.Drop all inheritance relationships
This solves the problem at a stroke, and may sometimes be a reasonable approach.The disadvantage is that one can no longer use circles where ellipses are expected.
However, an acceptable workaround may be to add a method
Circle.toEllipse
which would instantiate and return a mutable ellipse object using the circle's current state. The caveat is that mutations performed on this new ellipse would not affect the original circle, and vice versa.Alternatively, one may provide a method
Circle.asEllipse
, which returns a mutable Ellipse "view" of the Circle
object: an Ellipse instance initialized with the circle's current state, that "redirects" all messages to the Circle
object it was obtained from. The "view" is an interface which a client that requires an ellipse may use to work with a circle, even if there is no actual subtyping relationship between the two classes. For example, the ellipse 's stretchXAxis
method would be implemented to invoke stretchRadius
on the Circle
.Combine class Circle into class Ellipse
Then, wherever a circle was used before, use an ellipse.A circle can already be represented by an ellipse. There is no reason to have class Circle unless it needs some circle-specific methods that can't be applied to an ellipse, or unless the programmer wishes to benefit from conceptual and/or performance advantages of the circle's simpler model.
Inverse inheritance
Majorinc proposed a model that divides methods on modifiers, selectors and general methods. Only selectors can be automatically inherited from superclass, while modifiers should be inherited from subclass to superclass. In general case, the methods must be explicitly inherited. The model can be emulated in languages with multiple inheritance, using abstract classes.Change the Programming Language
This problem has straightforward solutions in a sufficiently powerful OO programming system. Essentially, the Circle-Ellipse problem is one of synchronizing two representations of type: the de-facto type based on the properties of the object, and the formal type associated with the object by the object system. If these two pieces of information, which are ultimately just bits in the machine, are kept synchronized so that they say the same thing, everything is fine. It is clear that a circle cannot satisfy the invariants required of it while its base ellipse methods allow mutation of parameters. However, the possibility exists that when a circle cannot meet the circle invariants, its type can be updated so that it becomes an ellipse. If a circle which has become a de facto ellipse doesn't change type, then its type is a piece of information which is now out of date, reflecting the history of the object (how it was once constructed) and not its present reality (what it has since mutated into).Many object systems in popular use are based on a design which takes it for granted that an object carries the same type over its entire lifetime, from construction to finalization. This is not a limitation of OOP, but rather of particular implementations only.
The following example uses the Common Lisp
Common Lisp
Common Lisp, commonly abbreviated CL, is a dialect of the Lisp programming language, published in ANSI standard document ANSI INCITS 226-1994 , . From the ANSI Common Lisp standard the Common Lisp HyperSpec has been derived for use with web browsers...
Object System (CLOS
CLOS
The Common Lisp Object System is the facility for object-oriented programming which is part of ANSI Common Lisp. CLOS is a powerful dynamic object system which differs radically from the OOP facilities found in more static languages such as C++ or Java. CLOS was inspired by earlier Lisp object...
) in which objects can change class without losing their identity. All variables or other storage locations which hold a reference to an object continue to hold a reference to that same object after it changes class.
The circle and ellipse models are deliberately simplified to avoid distracting details which are not relevant to the Circle-Ellipse Problem. An ellipse has two semi-axes called
h-axis
and v-axis
in the code. Being an ellipse, a circle inherits these, and also has a radius
property, whose value is equal to that of the axes (which must, of course, be equal).This code can be demonstrated with an interactive session, using the CLISP implementation of Common Lisp.
$ clisp -q -i circle-ellipse.lisp
[1]> (make-instance 'ellipse :v-axis 3 :h-axis 3)
[2]> (make-instance 'ellipse :v-axis 3 :h-axis 4)
[3]> (defvar obj (make-instance 'ellipse :v-axis 3 :h-axis 4))
OBJ
[4]> (class-of obj)
[5]> (radius obj)
-
-
- - NO-APPLICABLE-METHOD: When calling #
- - NO-APPLICABLE-METHOD: When calling #
-
with arguments (#
The following restarts are available:
RETRY :R1 try calling RADIUS again
RETURN :R2 specify return values
ABORT :R3 Abort main loop
Break 1 [6]> :a
[7]> (setf (v-axis obj) 4)
4
[8]> (radius obj)
4
[9]> (class-of obj)
[10]> (setf (radius obj) 9)
9
[11]> (v-axis obj)
9
[12]> (h-axis obj)
9
[13]> (setf (h-axis obj) 8)
8
[14]> (class-of obj)
[15]> (radius obj)
-
-
- - NO-APPLICABLE-METHOD: When calling #
- - NO-APPLICABLE-METHOD: When calling #
-
with arguments (#
The following restarts are available:
RETRY :R1 try calling RADIUS again
RETURN :R2 specify return values
ABORT :R3 Abort main loop
Break 1 [16]> :a
[17]>
Challenge the Premise of the Problem
While at first glance it may seem obvious that a Circle is-an Ellipse, consider the following alternate representation of essentially the same problem, stated in terms of Java code.
class Person
{
void walkNorth( int meters ) {...} // No failure or exception allowed
void walkEast( int meters ) {...} // No failure or exception allowed
}
Now, a prisoner is obviously a person. So we could logically create a sub-class:
class Prisoner extends Person
{
void walkNorth( int meters ) {...} // No failure or exception allowed
void walkEast( int meters ) {...} // No failure or exception allowed
}
Just as obviously, this leads us into trouble, since a prisoner is not free to move an arbitrary distance in any direction, yet the contract of the
Person
class states that a Person can.So, in fact, the class
Person
could better be named FreePerson
. If that were the case, then the idea that class Prisoner extends FreePerson
is clearly wrong.By analogy, then, a Circle, is not an Ellipse, because it lacks the same degrees of freedom as an Ellipse.
This strongly suggests that inheritance should never be used when the sub-class restricts the freedom implicit in the base class, but should only be used when the sub-class adds extra detail to the concept represented by the base class is in 'Monkey' is-a 'Animal'.
External links
- http://www.parashift.com/c++-faq-lite/proper-inheritance.html#faq-21.6 A popular C++ FAQ site by Marshall Cline. States and explains the problem.
- Constructive Deconstruction of Subtyping by Alistair CockburnAlistair CockburnAlistair Cockburn is one of the initiators of the agile movement in software development, helping write theManifesto for Agile Software Development in 2001 and the agile PM Declaration of Interdependence in 2005...
on his own web-site. Technical/mathematical discussion of typing and subtyping, with applications to this problem. - http://www.dbdebunk.com/page/page/1409199.htm More on Cure for Madness by C.J. Date from Database Debunkings 24 September 2004 Some implications of the problem for SQLSQLSQL is a programming language designed for managing data in relational database management systems ....
. - http://orafaq.com/usenet/comp.databases.theory/2001/10/01/0001.htm Beginning of a long thread (follow the Maybe reply: links) on Oracle FAQ discussing the issue. Refers to writings of C.J. Date. Some bias towards SmalltalkSmalltalkSmalltalk is an object-oriented, dynamically typed, reflective programming language. Smalltalk was created as the language to underpin the "new world" of computing exemplified by "human–computer symbiosis." It was designed and created in part for educational use, more so for constructionist...
. - LiskovSubstitutionPrinciple at WikiWikiWebWikiWikiWebWikiWikiWeb is a term that has been used to refer to four things: the first wiki, or user-editable website, launched on 25 March 1995 by Ward Cunningham as part of the Portland Pattern Repository ; the Perl-based application that was used to run it, also developed by Cunningham, which was the first...
- Subtyping, Subclassing, and Trouble with OOP, an essay discussing a related problem, whether sets should inherit from bags.
- Subtyping by Constraints in Object-Oriented Databases, an essay dicussing an extended version of the circle-elipse problem in the environment of object-oriented databases.