In this and the next few sections, we’ll learn about a relationship called inheritance that can exist between two classes. We will focus on one particular way of using inheritance in our code design, and through that, will learn how inheritance works in Python.
Suppose we are designing an application to keep track of a company’s employees, including some who are paid based on an annual salary and others who are paid based on an hourly wage. We might choose to define a separate class for these two types of employee so that they could have different attributes. For instance, only a salaried employee has a salary to be stored, and only an hourly-paid employee has an hourly wage to be recorded. The classes could also have different methods for the different ways that their pay is computed.
This design overlooks something important: employees of both types have many things in common. For instance, they all have data like a name, address, and employee id. And even though their pay is computed differently, they all get paid. If we had two classes for the two kinds of employees, all the code for these common elements would be duplicated. This is not only redundant but error prone: if you find a bug or make another kind of improvement in one class, you may forget to make the same changes in the other class. Things get even worse if the company decides to add other kinds of employees, such as those working on commission.
A better design would “factor out” the things that are common to all employees and write them once. This is what inheritance allows us to do, and here is how:
We define a subclass for each specific type of employee. In each subclass, we declare that it is a kind of employee, which will cause it to “inherit” those common elements from the base class without having to define them itself.
Terminology note: if class B
is a subclass of class A
, we also say that A
is a superclass of B
.
In our running example of a company with different kinds of employees, we could define a base class Employee
and two subclasses as follows (for now, we will leave the class bodies empty):
class Employee:
pass
# We use the "(Employee)" part to mark SalariedEmployee as a subclass of Employee.
class SalariedEmployee(Employee):
pass
# We use the "(Employee)" part to mark HourlyEmployee as a subclass of Employee.
class HourlyEmployee(Employee):
pass
It’s useful to know that there are three ways to talk about classes that are in an inheritance relationship:
Now let’s fill in these classes, starting with the methods we want. Once we have that, we can figure out the data that will be needed to implement the methods. To keep our example simple, let’s say that we only need a method for paying an employee, and that it will just print out a statement saying when and how much the employee was paid.
Here’s the outline for the Employee
class with a pay
method.
class Employee:
"""An employee of a company.
"""
def pay(self, pay_date: date) -> None:
"""Pay this Employee on the given date and record the payment.
(Assume this is called once per month.)
"""
pass
If we try to write the body of the pay
method, we run into a problem: it must compute the appropriate pay amount for printing, and that must be done differently for each type of employee. Our solution is to pull that part out into a helper method:
class Employee:
"""An employee of a company.
"""
def get_monthly_payment(self) -> float:
"""Return the amount that this Employee should be paid in one month.
Round the amount to the nearest cent.
"""
pass # We still have to figure this out.
def pay(self, pay_date: date) -> None:
"""Pay this Employee on the given date and record the payment.
(Assume this is called once per month.)
"""
payment = self.get_monthly_payment()
print(f'An employee was paid {payment} on {date}.')
Now method pay
is complete, but we have the same problem with get_monthly_payment
: it has to be different for each type of employee. Clearly, the subclasses must define this method, each in their own way. But we are going to leave the incomplete get_monthly_payment
method in the Employee
class, because it defines part of the interface that every type of Employee
object needs to have. Subclasses will inherit this incomplete method, which they can redefine as appropriate. We’ll see how to do that shortly.
Notice that we did as much as we could in the base class, to avoid repeating code in the subclasses.
Because the Employee
class has a method with no body, client code should not make instances of this incomplete class directly. We do two things to make this clear:
NotImplementedError
. We call such a method an abstract method, and we call a class which has at least one abstract method an abstract class.AbstractEmployee
.Here is the complete definition of our class:
class Employee:
"""An employee of a company.
This is an abstract class. Only subclasses should be instantiated.
"""
def get_monthly_payment(self) -> float:
"""Return the amount that this Employee should be paid in one month.
Round the amount to the nearest cent.
"""
raise NotImplementedError
def pay(self, pay_date: date) -> None:
"""Pay this Employee on the given date and record the payment.
(Assume this is called once per month.)
"""
payment = self.get_monthly_payment()
print(f'An employee was paid {payment} on {pay_date}.')
It is possible for client code to ignore the warning and instantiate this class—Python does not prevent it. But look at what happens when we try to call one of the unimplemented methods on the object:
>>> a = Employee()
>>> # This method is itself abstract:
>>> a.get_monthly_payment()
Traceback...
NotImplementedError
>>> # This method calls a helper method that is abstract:
>>> a.pay(date(2018, 9, 30))
Traceback...
NotImplementedError
Now let’s fill in class SalariedEmployee
, which is a subclass of Employee
. Very importantly, all instances of SalariedEmployee
are also instances of Employee
. We can verify this using the built-in function isinstance
:
>>> # Here we see what isinstance does with an object of a simple built-in type.
>>> isinstance(5, int)
True
>>> isinstance(5, str)
False
>>> # Now let's check how it works with objects of a type that we define.
>>> fred = SalariedEmployee()
>>> # fred's type is as we constructed it: SalariedEmployee.
>>> # More precisely, the object that fred refers to has type SalariedEmployee.
>>> type(fred)
<class 'employee.SalariedEmployee'>
>>> # In other words, the object is an instance of SalariedEmployee.
>>> isinstance(fred, SalariedEmployee)
True
>>> # Here's the important part: it is also an instance of Employee.
>>> isinstance(fred, Employee)
True
Because Python “knows” that fred
is an instance of Employee
, this object will have access to all methods of Employee
! We say that fred
inherits all of the Employee
methods. So even if SalariedEmployee
remains an empty class, its instances can still call the methods get_monthly_payment
and pay
, because they are inherited.
class SalariedEmployee(Employee):
pass
>>> fred = SalariedEmployee()
>>> # fred inherits Employee.get_monthly_payment, and so can call it.
>>> # Of course, it raises an error when called, but it indeed is accessed.
>>> fred.get_monthly_payment()
Traceback...
NotImplementedError
Our SalariedEmployee
and HourlyEmployee
subclasses each inherit two methods: pay
and get_monthly_payment
. The method pay
is complete as it is, and is appropriate for all types of employees, so we needn’t do anything with it. However, get_monthly_payment
needs a new definition that does not raise an error and that defines the behaviour appropriately for the particular kind of employee. We accomplish this simply by defining the method again in the subclass. For now, we will hard-code values in the method, but will generalize this later. We say that this new method definition overrides the inherited definition:
class SalariedEmployee(Employee):
def get_monthly_payment(self) -> float:
# Assuming an annual salary of 60,000
return round(60000.0 / 12.0, 2)
class HourlyEmployee(Employee):
def get_monthly_payment(self) -> float:
# Assuming a 160-hour work month and a $20/hour wage.
return round(160.0 * 20.0, 2)
>>> fred = SalariedEmployee()
>>> fred.get_monthly_payment()
5000.0
>>> jerry = HourlyEmployee()
>>> jerry.get_monthly_payment()
3200.0
We now have a working version of all three classes, albeit a very limited one. Download and run the code that we’ve written so far. You can experiment with it as you continue reading.
The interaction above includes the call fred.get_monthly_payment()
. Since the name get_monthly_payment
could refer to several possible methods (one in class Employee
, one in class SalariedEmployee
, and one in class HourlyEmployee
), we say that the method name must be “resolved”. To understand inheritance, we need to know how Python handles method resolution in general.
This is how it works: whenever code calls a.myMethod()
, Python determines what type(a)
is and looks in that class for myMethod
. If myMethod
is found in this class, that method is called; otherwise, Python next searches for myMethod
in the superclass of the type of a
, and then the superclass of the superclass, etc., until it either finds a definition of myMethod
or it has exhausted all possibilities, in which case it raises an AttributeError
.
In the case of the call fred.get_monthly_payment()
, type(fred)
is SalariedEmployee
, and SalariedEmployee
contains a get_monthly_payment
method. So that is the one called.
This method call is more interesting: fred.pay(date(2018, 9, 30))
. The value of type(fred)
is SalariedEmployee
, but class SalariedEmployee
does not contain a pay
method. So Python next checks in the superclass Employee
, which does contain a pay
method, so then that is called. Straightforward. But then inside Employee.pay
, we have the call self.get_monthly_payment()
. Which get_monthly_payment
is called? We’re already executing a method (pay
) inside the Employee
class, but that doesn’t mean we call Employee.get_monthly_payment
. Good thing, because it raises an error! Remember the rule: type(self)
determines which class Python first looks in for the method. At this point, self
is fred
, whose type is SalariedEmployee
, and that class contains a get_monthly_payment
method. So in this case, when Employee.pay
calls self.get_monthly_payment()
, it gets SalariedEmployee.get_monthly_payment
.