Chapter 4. Functions, the Building Blocks of Code
 | "To create architecture is to put in order. Put what in order? Function and objects." |  |
 | --Le Corbusier |
In this chapter, we're going to explore functions. We already said that everything is an object in Python, and functions are no exception to this. But, what exactly is a function? A function is a sequence of instructions that perform a task, bundled as a unit. This unit can then be imported and used wherever it's needed. There are many advantages to using functions in your code, as we'll see shortly.
I believe the saying, a picture is worth one thousand words, is particularly true when explaining functions to someone who is new to this concept, so please take a look at the following image:
As you can see, a function is a block of instructions, packaged as a whole, like a box. Functions can accept input arguments and produce output values. Both of these are optional, as we'll see in the examples in this chapter.
A function in Python is defined by using the def
keyword, after which the name of the function follows, terminated by a pair of braces (which may or may not contain input parameters) and, finally, a colon (:
) signals the end of the function definition line. Immediately afterwards, indented by four spaces, we find the body of the function, which is the set of instructions that the function will execute when called.
Note
Note that the indentation by four spaces is not mandatory, but it is the amount of spaces suggested by PEP8, and, in practice, it is the most widely used spacing measure.
A function may or may not return output. If a function wants to return output, it does so by using the return
keyword, followed by the desired output. If you have an eagle eye, you may have noticed the little * after Optional in the output section of the preceding picture. This is because a function always returns something in Python, even if you don't explicitly use the return
clause. If the function has no return
statement in its body, it's return value is None
. The reasons behind this design choice are out of the scope of an introductory chapter, so all you need to know is that this behavior will make your life easier, as always, thank you Python.
Why use functions?
Functions are among the most important concepts and constructs of any language, so let me give you a few reasons why we need them:
- They reduce code duplication in a program. By having a specific task taken care of by a nice block of packaged code that we can import and call whenever we want, we don't need to duplicate its implementation.
- They help in splitting a complex task or procedure into smaller blocks, each of which becomes a function.
- They hide the implementation details from their users.
- They improve traceability.
- They improve readability.
Let's look at a few examples to get a better understanding of each point.
Reduce code duplication
Imagine that you are writing a piece of scientific software, and you need to calculate primes up to a limit, as we did in the previous chapter. You write several algorithms and prime numbers, being the basis of many different types of calculations, keep creeping into your code. Well, you have a nice algorithm to calculate them, so you copy and paste it to wherever you need. One day, though, your friend Mister Smarty gives you a better algorithm to calculate prime numbers, and this will save you a lot of time. At this point, you need to go over your whole codebase and replace the old code with the new code.
This is actually a very bad way to go about it. It's error-prone, you never know what lines you are chopping out or leaving there by mistake when you cut and paste code in other code, and you may also risk missing one of the places where prime calculation was done, leaving your software with different versions. Can you imagine if you discovered that the old way was buggy? You would have an undetected bug in your code, and bugs like this are quite hard to spot, especially in big codebases.
So, what should you do? Simple! You write a function, get_prime_numbers(upto)
, and use it anywhere you need a list of primes. When Mister Smarty comes to you and gives you the new code, all you have to do is replace the body of that function with the new implementation, and you're done! The rest of the software will automatically adapt, since it's just calling the function.
Your code will be shorter, it will not suffer from inconsistencies between old and new ways of performing a task, or undetected bugs due to copy and paste failures or oversights. Use functions, and you'll only gain from it, I promise.
Splitting a complex task
Functions are very useful also to split a long or complex task into smaller pieces. The end result is that the code benefits from it in several ways, for example, readability, testability, and reuse. To give you a simple example, imagine that you're preparing a report. Your code needs to fetch data from a data source, parse it, filter it, polish it, and then a whole series of algorithms needs to be run against it, in order to produce the results which will feed the Report
class. It's not uncommon to read procedures like this that are just one big function do_report(data_source)
. There are tens or hundreds of lines of code which end with return report
.
Situations like this are common in code produced by scientists. They have brilliant minds and they care about the correctness of the end result but, unfortunately, sometimes they have no training in programming theory. It is not their fault, one cannot know everything. Now, picture in your head something like a few hundred lines of code. It's very hard to follow through, to find the places where things are changing context (like finishing one task and starting the next one). Do you have the picture in your mind? Good. Don't do it! Instead, look at this code:
data.science.example.py
def do_report(data_source): # fetch and prepare data data = fetch_data(data_source) parsed_data = parse_data(data) filtered_data = filter_data(parsed_data) polished_data = polish_data(filtered_data) # run algorithms on data final_data = analyse(polished_data) # create and return report report = Report(final_data) return report
The previous example is fictitious, of course, but can you see how easy it would be to go through the code? If the end result looks wrong, it would be very easy to debug each of the single data outputs in the do_report
function. Moreover, it's even easier to exclude part of the process temporarily from the whole procedure (you just need to comment out the parts you need to suspend). Code like this is easier to deal with.
Hide implementation details
Let's stay with the preceding example to talk about this point as well. You can see that, by going through the code of the do_report
function, you can get a pretty good understanding without reading one single line of implementation. This is because functions hide the implementation details. This feature means that, if you don't need to delve into details, you are not forced to, in the way you would if do_report
was just one big fat function. In order to understand what was going on, you would have to read the implementation details. You don't need to with functions. This reduces the time you spend reading the code and since, in a professional environment, reading code takes much more time than actually writing it, it's very important to reduce it as much as we can.
Improve readability
Coders sometimes don't see the point in writing a function with a body of one or two lines of code, so let's look at an example that shows you why you should do it.
Imagine that you need to multiply two matrices:
Would you prefer to have to read this code:
matrix.multiplication.nofunc.py
a = [[1, 2], [3, 4]] b = [[5, 1], [2, 1]] c = [[sum(i * j for i, j in zip(r, c)) for c in zip(*b)] for r in a]
Or would you prefer this one:
matrix.multiplication.func.py
# this function could also be defined in another module
def matrix_mul(a, b):
return [[sum(i * j for i, j in zip(r, c)) for c in zip(*b)]
for r in a]
a = [[1, 2], [3, 4]]
b = [[5, 1], [2, 1]]
c = matrix_mul(a, b)
It's much easier to understand that c
is the result of the multiplication between a
and b
in the second example. It's much easier to read through the code and, if you don't need to modify that part, you don't even need to go into the implementation details.
Therefore, readability is improved here while, in the first snippet, you would have to spend time trying to understand what that complicated list comprehension was doing.
Note
Don't worry if you don't understand list comprehensions, we'll study them in the next chapter.
Improve traceability
Imagine that you have written an e-commerce website. You have displayed the product prices all over the pages. Imagine that the prices in your database are stored with no VAT, but you want to display them on the website with VAT at 20%. Here's a few ways of calculating the VAT-inclusive price from the VAT-exclusive price.
vat.py
price = 100 # GBP, no VAT final_price1 = price * 1.2 final_price2 = price + price / 5.0 final_price3 = price * (100 + 20) / 100.0 final_price4 = price + price * 0.2
All these four different ways of calculating a VAT-inclusive price are perfectly acceptable, and I promise you I have found them all in my colleagues' code, over the years. Now, imagine that you have started selling your products in different countries and some of them have different VAT rates so you need to refactor your code (throughout the website) in order to make that VAT calculation dynamic.
How do you trace all the places in which you are performing a VAT calculation? Coding today is a collaborative task and you cannot be sure the VAT has been calculated using only one of those forms. It's going to be hell, believe me.
So, let's write a function that takes the input values, vat
and price
(VAT-exclusive), and returns a VAT-inclusive price.
vat.function.py
def calculate_price_with_vat(price, vat): return price * (100 + vat) / 100
Now you can import that function and apply it in any place of your website where you need to calculate a VAT-inclusive price and when you need to trace those calls, you can search for calculate_price_with_vat
.
Note
Note that, in the preceding example, price
is assumed to be VAT-exclusive, and vat
has a percentage value (for example, 19, 20, 23, and so on).
Scopes and name resolution
Do you remember when we talked about scopes and namespaces in the first chapter? We're going to expand on that concept now. Finally, we can talk about functions and this will make everything easier to understand. Let's start with a very simple example.
scoping.level.1.py
def my_function(): test = 1 # this is defined in the local scope of the function print('my_function:', test) test = 0 # this is defined in the global scope my_function() print('global:', test)
I have defined the name test
in two different places in the previous example. It is actually in two different scopes. One is the global scope (test = 0
), and the other is the local scope of the function my_function
(test = 1
). If you execute the code, you'll see this:
$ python scoping.level.1.py my_function: 1 global: 0
It's clear that test = 1
shadows the assignment test = 0
in my_function
. In the global context, test
is still 0
, as you can see from the output of the program but we define the name test
again in the function body, and we set it to point to an integer of value 1
. Both the two test
names therefore exist, one in the global scope, pointing to an int
object with value 0, the other in the my_function
scope, pointing to an int
object with value 1. Let's comment out the line with test = 1
. Python goes and searches for the name test
in the next enclosing namespace (recall the LEGB rule: Local, Enclosing, Global, Built-in described in Chapter 1, Introduction and First Steps – Take a Deep Breath) and, in this case, we will see the value 0
printed twice. Try it in your code.
Now, let's raise the stakes here and level up:
scoping.level.2.py
def outer(): test = 1 # outer scope def inner(): test = 2 # inner scope print('inner:', test) inner() print('outer:', test) test = 0 # global scope outer() print('global:', test)
In the preceding code, we have two levels of shadowing. One level is in the function outer
, and the other one is in the function inner
. It is far from rocket science, but it can be tricky. If we run the code, we get:
$ python scoping.level.2.py inner: 2 outer: 1 global: 0
Try commenting out the line test = 1
. What do you think the result will be? Well, when reaching the line print('outer:', test)
, Python will have to look for test
in the next enclosing scope, therefore it will find and print 0
, instead of 1
. Make sure you comment out test = 2
as well, to see if you understand what happens, and if the LEGB rule is clear, before proceeding.
Another thing to note is that Python gives you the ability to define a function in another function. The inner function's name is defined within the namespace of the outer function, exactly as would happen with any other name.
The global and nonlocal statements
Going back to the preceding example, we can alter what happens to the shadowing of the test name by using one of these two special statements: global
and nonlocal
. As you can see from the previous example, when we define test = 2
in the function inner
, we overwrite test
neither in the function outer
, nor in the global scope. We can get read access to those names if we use them in a nested scope that doesn't define them, but we cannot modify them because, when we write an assignment instruction, we're actually defining a new name in the current scope.
How do we change this behavior? Well, we can use the nonlocal
statement. According to the official documentation:
"The
nonlocal
statement causes the listed identifiers to refer to previously bound variables in the nearest enclosing scope excluding globals."
Let's introduce it in the function inner
, and see what happens:
scoping.level.2.nonlocal.py
def outer(): test = 1 # outer scope def inner(): nonlocal test test = 2 # nearest enclosing scope print('inner:', test) inner() print('outer:', test) test = 0 # global scope outer() print('global:', test)
Notice how in the body of the function inner
I have declared the test
name to be nonlocal
. Running this code produces the following result:
$ python scoping.level.2.nonlocal.py inner: 2 outer: 2 global: 0
Wow, look at that result! It means that, by declaring test
to be nonlocal
in the function inner
, we actually get to bind the name test
to that declared in the function outer
. If we removed the nonlocal
test
line from the function inner
and tried the same trick in the function outer
, we would get a SyntaxError
, because the nonlocal
statement works on enclosing scopes excluding the global one.
Is there a way to get to that test = 0
in the global namespace then? Of course, we just need to use the global
statement. Let's try it.
scoping.level.2.global.py
def outer(): test = 1 # outer scope def inner(): global test test = 2 # global scope print('inner:', test) inner() print('outer:', test) test = 0 # global scope outer() print('global:', test)
Note that we have now declared the name test
to be global
, which will basically bind it to the one we defined in the global namespace (test = 0
). Run the code and you should get the following:
$ python scoping.level.2.global.py inner: 2 outer: 1 global: 2
This shows that the name affected by the assignment test = 2
is now the global
one. This trick would also work in the outer
function because, in this case, we're referring to the global scope. Try it for yourself and see what changes, get comfortable with scopes and name resolution, it's very important.
The global and nonlocal statements
Going back to the preceding example, we can alter what happens to the shadowing of the test name by using one of these two special statements: global
and nonlocal
. As you can see from the previous example, when we define test = 2
in the function inner
, we overwrite test
neither in the function outer
, nor in the global scope. We can get read access to those names if we use them in a nested scope that doesn't define them, but we cannot modify them because, when we write an assignment instruction, we're actually defining a new name in the current scope.
How do we change this behavior? Well, we can use the nonlocal
statement. According to the official documentation:
"The
nonlocal
statement causes the listed identifiers to refer to previously bound variables in the nearest enclosing scope excluding globals."
Let's introduce it in the function inner
, and see what happens:
scoping.level.2.nonlocal.py
def outer(): test = 1 # outer scope def inner(): nonlocal test test = 2 # nearest enclosing scope print('inner:', test) inner() print('outer:', test) test = 0 # global scope outer() print('global:', test)
Notice how in the body of the function inner
I have declared the test
name to be nonlocal
. Running this code produces the following result:
$ python scoping.level.2.nonlocal.py inner: 2 outer: 2 global: 0
Wow, look at that result! It means that, by declaring test
to be nonlocal
in the function inner
, we actually get to bind the name test
to that declared in the function outer
. If we removed the nonlocal
test
line from the function inner
and tried the same trick in the function outer
, we would get a SyntaxError
, because the nonlocal
statement works on enclosing scopes excluding the global one.
Is there a way to get to that test = 0
in the global namespace then? Of course, we just need to use the global
statement. Let's try it.
scoping.level.2.global.py
def outer(): test = 1 # outer scope def inner(): global test test = 2 # global scope print('inner:', test) inner() print('outer:', test) test = 0 # global scope outer() print('global:', test)
Note that we have now declared the name test
to be global
, which will basically bind it to the one we defined in the global namespace (test = 0
). Run the code and you should get the following:
$ python scoping.level.2.global.py inner: 2 outer: 1 global: 2
This shows that the name affected by the assignment test = 2
is now the global
one. This trick would also work in the outer
function because, in this case, we're referring to the global scope. Try it for yourself and see what changes, get comfortable with scopes and name resolution, it's very important.
Input parameters
At the beginning of this chapter, we saw that a function can take input parameters. Before we delve into all possible type of parameters, let's make sure you have a clear understanding of what passing a parameter to a function means. There are three key points to keep in mind:
- Argument passing is nothing more than assigning an object to a local variable name
- Assigning an object to an argument name inside a function doesn't affect the caller
- Changing a mutable object argument in a function affects the caller
Let's look at an example for each of these points.
Argument passing
Take a look at the following code. We declare a name x
in the global scope, then we declare a function func(y)
and we call it, passing x
. I highlighted the call in the code.
key.points.argument.passing.py
x = 3
def func(y):
print(y)
func(x) # prints: 3
When func
is called with x
, what happens is that within its local scope, a name y
is created, and it's pointed to the same object x
is pointing to. This is better clarified by the following picture:
The right part of the preceding picture depicts the state of the program when execution has reached the end, after func
has returned (None
). Take a look at the Frames column, and note that we have two names, x and func, in the global namespace (Global frame), pointing to an int (with a value of three) and to a function object, respectively. Right below it, in the rectangle titled func, we can see the function's local namespace, in which only one name has been defined: y. Because we have called func with x (line 5 in the left part of the picture), y is pointing to the same object that x is pointing to. This is what happens under the hood when an argument is passed to a function. If we had used the name x instead of y in the function definition, things would have been exactly the same (only maybe a bit confusing at first), there would be a local x in the function, and a global x outside, as we saw in the Scopes and name resolution section.
So, in a nutshell, what really happens is that the function creates in its local scope the names defined as arguments and, when we call it, we basically tell Python which objects those names must be pointed towards.
Assignment to argument names don't affect the caller
This is something that can be tricky to understand at first, so let's look at an example.
key.points.assignment.py
x = 3
def func(x):
x = 7 # defining a local x, not changing the global one
func(x)
print(x) # prints: 3
In the preceding code, when the line x = 7
is executed, what happens is that within the local scope of the function func
, the name x
is pointed to an integer with value 7, leaving the global x
unaltered.
Changing a mutable affects the caller
This is the final point, and it's very important because Python apparently behaves differently with mutables (just apparently though). Let's look at an example:
key.points.mutable.py
x = [1, 2, 3]
def func(x):
x[1] = 42 # this affects the caller!
func(x)
print(x) # prints: [1, 42, 3]
Wow, we actually changed the original object! If you think about it, there is nothing weird in this behavior. The name x
in the function is set to point to the caller object by the function call and within the body of the function, we're not changing x
, in that we're not changing its reference, or, in other words, we are not changing the object x
is pointing to. What we're doing is accessing that object's element at position 1, and changing its value.
Remember point #2: "Assigning an object to an argument name within a function doesn't affect the caller". If that is clear to you, the following code should not be surprising.
key.points.mutable.assignment.py
x = [1, 2, 3] def func(x): x[1] = 42 # this changes the caller! x = 'something else' # this points x to a new string object func(x) print(x) # still prints: [1, 42, 3]
Take a look at the two lines I have highlighted. At first, we just access the caller object again, at position 1, and change its value to number 42. Then, we reassign x
to point to the string 'something else'
. This leaves the caller unaltered, according to point #2, and, in fact, the output is the same as that of the previous snippet.
Take your time to play around with this concept and experiment with prints and calls to the id
function until everything is clear in your mind. This is one of the key aspects of Python and it must be very clear, otherwise you risk introducing subtle bugs into your code.
Now that we have a good understanding of input parameters and how they behave, let's see how we can specify them.
How to specify input parameters
There are five different ways of specifying input parameters. Let's look at them one by one.
Positional arguments
Positional arguments are read from left to right and they are the most common type of arguments.
arguments.positional.py
def func(a, b, c): print(a, b, c) func(1, 2, 3) # prints: 1 2 3
There is not much else to say. They can be as numerous as you want and they are assigned by position. In the function call, 1
comes first, 2
comes second and 3
comes third, therefore they are assigned to a
, b
and c
respectively.
Keyword arguments and default values
Keyword arguments are assigned by keyword using the name=value
syntax.
arguments.keyword.py
def func(a, b, c):
print(a, b, c)
func(a=1, c=2, b=3) # prints: 1 3 2
Keyword arguments act when calling the function instead of respecting the left-to-right positional assignment, k. Keyword arguments are matched by name, even when they don't respect the definition's original position (we'll see that there is a limitation to this behavior later, when we mix and match different types of arguments).
The counterpart of keyword arguments, on the definition side, is default values. The syntax is the same, name=value
, and allows us to not have to provide an argument if we are happy with the given default.
arguments.default.py
def func(a, b=4, c=88):
print(a, b, c)
func(1) # prints: 1 4 88
func(b=5, a=7, c=9) # prints: 7 5 9
func(42, c=9) # prints: 42 4 9
The are two things to notice, which are very important. First of all, you cannot specify a default argument on the left of a positional one. Second, note how in the examples, when an argument is passed without using the argument_name=value
syntax, it must be the first one in the list,, and it is always assigned to a
. Try and scramble those arguments and see what happens. Python error messages are very good at telling you what's wrong. So, for example, if you tried something like this:
func(b=1, c=2, 42) # positional argument after keyword one
You would get the following error:
SyntaxError: non-keyword arg after keyword arg
This informs you that you've called the function incorrectly.
Variable positional arguments
Sometimes you may want to pass a variable number of positional arguments to a function and Python provides you with the ability to do it. Let's look at a very common use case, the minimum
function. This is a function that calculates the minimum of its input values.
arguments.variable.positional.py
def minimum(*n):
# print(n) # n is a tuple
if n: # explained after the code
mn = n[0]
for value in n[1:]:
if value < mn:
mn = value
print(mn)
minimum(1, 3, -7, 9) # n = (1, 3, -7, 9) - prints: -7
minimum() # n = () - prints: nothing
As you can see, when we specify a parameter prepending a *
to its name, we are telling Python that that parameter will be collecting a variable number of positional arguments, according to how the function is called. Within the function, n
is a tuple. Uncomment the print(n)
to see for yourself and play around with it for a bit.
Note
Have you noticed how we checked if n
wasn't empty with a simple if n:
? This is due to the fact that collection objects evaluate to True
when non-empty, and otherwise False
in Python. This is true for tuples, sets, lists, dictionaries, and so on.
One other thing to note is that we may want to throw an error when we call the function with no arguments, instead of silently doing nothing. In this context, we're not concerned about making this function robust, but in understanding variable positional arguments.
Let's make another example to show you two things that, in my experience, are confusing to those who are new to this.
arguments.variable.positional.unpacking.py
def func(*args): print(args) values = (1, 3, -7, 9) func(values) # equivalent to: func((1, 3, -7, 9)) func(*values) # equivalent to: func(1, 3, -7, 9)
Take a good look at the last two lines of the preceding example. In the first one, we call func
with one argument, a four elements tuple. In the second example, by using the *
syntax, we're doing something called unpacking, which means that the four elements tuple is unpacked, and the function is called with four arguments: 1, 3, -7, 9
.
This behavior is part of the magic Python does to allow you to do amazing things when calling functions dynamically.
Variable keyword arguments
Variable keyword arguments are very similar to variable positional arguments. The only difference is the syntax (**
instead of *
) and that they are collected in a dictionary. Collection and unpacking work in the same way, so let's look at an example:
arguments.variable.keyword.py
def func(**kwargs): print(kwargs) # All calls equivalent. They print: {'a': 1, 'b': 42} func(a=1, b=42) func(**{'a': 1, 'b': 42}) func(**dict(a=1, b=42))
All the calls are equivalent in the preceding example. You can see that adding a **
in front of the parameter name in the function definition tells Python to use that name to collect a variable number of keyword parameters. On the other hand, when we call the function, we can either pass name=value
arguments explicitly, or unpack a dictionary using the same **
syntax.
The reason why being able to pass a variable number of keyword parameters is so important may not be evident at the moment, so, how about a more realistic example? Let's define a function that connects to a database. We want to connect to a default database by simply calling this function with no parameters. We also want to connect to any other database by passing the function the appropriate arguments. Before you read on, spend a couple of minutes figuring out a solution by yourself.
arguments.variable.db.py
def connect(**options): conn_params = { 'host': options.get('host', '127.0.0.1'), 'port': options.get('port', 5432), 'user': options.get('user', ''), 'pwd': options.get('pwd', ''), } print(conn_params) # we then connect to the db (commented out) # db.connect(**conn_params) connect() connect(host='127.0.0.42', port=5433) connect(port=5431, user='fab', pwd='gandalf')
Note in the function we can prepare a dictionary of connection parameters (conn_params
) in the function using default values as fallback, allowing them to be overwritten if they are provided in the function call. There are better ways to do this with fewer lines of code but we're not concerned with that now. Running the preceding code yields the following result:
$ python arguments.variable.db.py {'host': '127.0.0.1', 'pwd': '', 'user': '', 'port': 5432} {'host': '127.0.0.42', 'pwd': '', 'user': '', 'port': 5433} {'host': '127.0.0.1', 'pwd': 'gandalf', 'user': 'fab', 'port': 5431}
Note the correspondence between the function calls and the output. Note how default values are either there or overridden, according to what was passed to the function.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Argument passing
Take a look at the following code. We declare a name x
in the global scope, then we declare a function func(y)
and we call it, passing x
. I highlighted the call in the code.
key.points.argument.passing.py
x = 3
def func(y):
print(y)
func(x) # prints: 3
When func
is called with x
, what happens is that within its local scope, a name y
is created, and it's pointed to the same object x
is pointing to. This is better clarified by the following picture:
The right part of the preceding picture depicts the state of the program when execution has reached the end, after func
has returned (None
). Take a look at the Frames column, and note that we have two names, x and func, in the global namespace (Global frame), pointing to an int (with a value of three) and to a function object, respectively. Right below it, in the rectangle titled func, we can see the function's local namespace, in which only one name has been defined: y. Because we have called func with x (line 5 in the left part of the picture), y is pointing to the same object that x is pointing to. This is what happens under the hood when an argument is passed to a function. If we had used the name x instead of y in the function definition, things would have been exactly the same (only maybe a bit confusing at first), there would be a local x in the function, and a global x outside, as we saw in the Scopes and name resolution section.
So, in a nutshell, what really happens is that the function creates in its local scope the names defined as arguments and, when we call it, we basically tell Python which objects those names must be pointed towards.
Assignment to argument names don't affect the caller
This is something that can be tricky to understand at first, so let's look at an example.
key.points.assignment.py
x = 3
def func(x):
x = 7 # defining a local x, not changing the global one
func(x)
print(x) # prints: 3
In the preceding code, when the line x = 7
is executed, what happens is that within the local scope of the function func
, the name x
is pointed to an integer with value 7, leaving the global x
unaltered.
Changing a mutable affects the caller
This is the final point, and it's very important because Python apparently behaves differently with mutables (just apparently though). Let's look at an example:
key.points.mutable.py
x = [1, 2, 3]
def func(x):
x[1] = 42 # this affects the caller!
func(x)
print(x) # prints: [1, 42, 3]
Wow, we actually changed the original object! If you think about it, there is nothing weird in this behavior. The name x
in the function is set to point to the caller object by the function call and within the body of the function, we're not changing x
, in that we're not changing its reference, or, in other words, we are not changing the object x
is pointing to. What we're doing is accessing that object's element at position 1, and changing its value.
Remember point #2: "Assigning an object to an argument name within a function doesn't affect the caller". If that is clear to you, the following code should not be surprising.
key.points.mutable.assignment.py
x = [1, 2, 3] def func(x): x[1] = 42 # this changes the caller! x = 'something else' # this points x to a new string object func(x) print(x) # still prints: [1, 42, 3]
Take a look at the two lines I have highlighted. At first, we just access the caller object again, at position 1, and change its value to number 42. Then, we reassign x
to point to the string 'something else'
. This leaves the caller unaltered, according to point #2, and, in fact, the output is the same as that of the previous snippet.
Take your time to play around with this concept and experiment with prints and calls to the id
function until everything is clear in your mind. This is one of the key aspects of Python and it must be very clear, otherwise you risk introducing subtle bugs into your code.
Now that we have a good understanding of input parameters and how they behave, let's see how we can specify them.
How to specify input parameters
There are five different ways of specifying input parameters. Let's look at them one by one.
Positional arguments
Positional arguments are read from left to right and they are the most common type of arguments.
arguments.positional.py
def func(a, b, c): print(a, b, c) func(1, 2, 3) # prints: 1 2 3
There is not much else to say. They can be as numerous as you want and they are assigned by position. In the function call, 1
comes first, 2
comes second and 3
comes third, therefore they are assigned to a
, b
and c
respectively.
Keyword arguments and default values
Keyword arguments are assigned by keyword using the name=value
syntax.
arguments.keyword.py
def func(a, b, c):
print(a, b, c)
func(a=1, c=2, b=3) # prints: 1 3 2
Keyword arguments act when calling the function instead of respecting the left-to-right positional assignment, k. Keyword arguments are matched by name, even when they don't respect the definition's original position (we'll see that there is a limitation to this behavior later, when we mix and match different types of arguments).
The counterpart of keyword arguments, on the definition side, is default values. The syntax is the same, name=value
, and allows us to not have to provide an argument if we are happy with the given default.
arguments.default.py
def func(a, b=4, c=88):
print(a, b, c)
func(1) # prints: 1 4 88
func(b=5, a=7, c=9) # prints: 7 5 9
func(42, c=9) # prints: 42 4 9
The are two things to notice, which are very important. First of all, you cannot specify a default argument on the left of a positional one. Second, note how in the examples, when an argument is passed without using the argument_name=value
syntax, it must be the first one in the list,, and it is always assigned to a
. Try and scramble those arguments and see what happens. Python error messages are very good at telling you what's wrong. So, for example, if you tried something like this:
func(b=1, c=2, 42) # positional argument after keyword one
You would get the following error:
SyntaxError: non-keyword arg after keyword arg
This informs you that you've called the function incorrectly.
Variable positional arguments
Sometimes you may want to pass a variable number of positional arguments to a function and Python provides you with the ability to do it. Let's look at a very common use case, the minimum
function. This is a function that calculates the minimum of its input values.
arguments.variable.positional.py
def minimum(*n):
# print(n) # n is a tuple
if n: # explained after the code
mn = n[0]
for value in n[1:]:
if value < mn:
mn = value
print(mn)
minimum(1, 3, -7, 9) # n = (1, 3, -7, 9) - prints: -7
minimum() # n = () - prints: nothing
As you can see, when we specify a parameter prepending a *
to its name, we are telling Python that that parameter will be collecting a variable number of positional arguments, according to how the function is called. Within the function, n
is a tuple. Uncomment the print(n)
to see for yourself and play around with it for a bit.
Note
Have you noticed how we checked if n
wasn't empty with a simple if n:
? This is due to the fact that collection objects evaluate to True
when non-empty, and otherwise False
in Python. This is true for tuples, sets, lists, dictionaries, and so on.
One other thing to note is that we may want to throw an error when we call the function with no arguments, instead of silently doing nothing. In this context, we're not concerned about making this function robust, but in understanding variable positional arguments.
Let's make another example to show you two things that, in my experience, are confusing to those who are new to this.
arguments.variable.positional.unpacking.py
def func(*args): print(args) values = (1, 3, -7, 9) func(values) # equivalent to: func((1, 3, -7, 9)) func(*values) # equivalent to: func(1, 3, -7, 9)
Take a good look at the last two lines of the preceding example. In the first one, we call func
with one argument, a four elements tuple. In the second example, by using the *
syntax, we're doing something called unpacking, which means that the four elements tuple is unpacked, and the function is called with four arguments: 1, 3, -7, 9
.
This behavior is part of the magic Python does to allow you to do amazing things when calling functions dynamically.
Variable keyword arguments
Variable keyword arguments are very similar to variable positional arguments. The only difference is the syntax (**
instead of *
) and that they are collected in a dictionary. Collection and unpacking work in the same way, so let's look at an example:
arguments.variable.keyword.py
def func(**kwargs): print(kwargs) # All calls equivalent. They print: {'a': 1, 'b': 42} func(a=1, b=42) func(**{'a': 1, 'b': 42}) func(**dict(a=1, b=42))
All the calls are equivalent in the preceding example. You can see that adding a **
in front of the parameter name in the function definition tells Python to use that name to collect a variable number of keyword parameters. On the other hand, when we call the function, we can either pass name=value
arguments explicitly, or unpack a dictionary using the same **
syntax.
The reason why being able to pass a variable number of keyword parameters is so important may not be evident at the moment, so, how about a more realistic example? Let's define a function that connects to a database. We want to connect to a default database by simply calling this function with no parameters. We also want to connect to any other database by passing the function the appropriate arguments. Before you read on, spend a couple of minutes figuring out a solution by yourself.
arguments.variable.db.py
def connect(**options): conn_params = { 'host': options.get('host', '127.0.0.1'), 'port': options.get('port', 5432), 'user': options.get('user', ''), 'pwd': options.get('pwd', ''), } print(conn_params) # we then connect to the db (commented out) # db.connect(**conn_params) connect() connect(host='127.0.0.42', port=5433) connect(port=5431, user='fab', pwd='gandalf')
Note in the function we can prepare a dictionary of connection parameters (conn_params
) in the function using default values as fallback, allowing them to be overwritten if they are provided in the function call. There are better ways to do this with fewer lines of code but we're not concerned with that now. Running the preceding code yields the following result:
$ python arguments.variable.db.py {'host': '127.0.0.1', 'pwd': '', 'user': '', 'port': 5432} {'host': '127.0.0.42', 'pwd': '', 'user': '', 'port': 5433} {'host': '127.0.0.1', 'pwd': 'gandalf', 'user': 'fab', 'port': 5431}
Note the correspondence between the function calls and the output. Note how default values are either there or overridden, according to what was passed to the function.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Assignment to argument names don't affect the caller
This is something that can be tricky to understand at first, so let's look at an example.
key.points.assignment.py
x = 3
def func(x):
x = 7 # defining a local x, not changing the global one
func(x)
print(x) # prints: 3
In the preceding code, when the line x = 7
is executed, what happens is that within the local scope of the function func
, the name x
is pointed to an integer with value 7, leaving the global x
unaltered.
Changing a mutable affects the caller
This is the final point, and it's very important because Python apparently behaves differently with mutables (just apparently though). Let's look at an example:
key.points.mutable.py
x = [1, 2, 3]
def func(x):
x[1] = 42 # this affects the caller!
func(x)
print(x) # prints: [1, 42, 3]
Wow, we actually changed the original object! If you think about it, there is nothing weird in this behavior. The name x
in the function is set to point to the caller object by the function call and within the body of the function, we're not changing x
, in that we're not changing its reference, or, in other words, we are not changing the object x
is pointing to. What we're doing is accessing that object's element at position 1, and changing its value.
Remember point #2: "Assigning an object to an argument name within a function doesn't affect the caller". If that is clear to you, the following code should not be surprising.
key.points.mutable.assignment.py
x = [1, 2, 3] def func(x): x[1] = 42 # this changes the caller! x = 'something else' # this points x to a new string object func(x) print(x) # still prints: [1, 42, 3]
Take a look at the two lines I have highlighted. At first, we just access the caller object again, at position 1, and change its value to number 42. Then, we reassign x
to point to the string 'something else'
. This leaves the caller unaltered, according to point #2, and, in fact, the output is the same as that of the previous snippet.
Take your time to play around with this concept and experiment with prints and calls to the id
function until everything is clear in your mind. This is one of the key aspects of Python and it must be very clear, otherwise you risk introducing subtle bugs into your code.
Now that we have a good understanding of input parameters and how they behave, let's see how we can specify them.
How to specify input parameters
There are five different ways of specifying input parameters. Let's look at them one by one.
Positional arguments
Positional arguments are read from left to right and they are the most common type of arguments.
arguments.positional.py
def func(a, b, c): print(a, b, c) func(1, 2, 3) # prints: 1 2 3
There is not much else to say. They can be as numerous as you want and they are assigned by position. In the function call, 1
comes first, 2
comes second and 3
comes third, therefore they are assigned to a
, b
and c
respectively.
Keyword arguments and default values
Keyword arguments are assigned by keyword using the name=value
syntax.
arguments.keyword.py
def func(a, b, c):
print(a, b, c)
func(a=1, c=2, b=3) # prints: 1 3 2
Keyword arguments act when calling the function instead of respecting the left-to-right positional assignment, k. Keyword arguments are matched by name, even when they don't respect the definition's original position (we'll see that there is a limitation to this behavior later, when we mix and match different types of arguments).
The counterpart of keyword arguments, on the definition side, is default values. The syntax is the same, name=value
, and allows us to not have to provide an argument if we are happy with the given default.
arguments.default.py
def func(a, b=4, c=88):
print(a, b, c)
func(1) # prints: 1 4 88
func(b=5, a=7, c=9) # prints: 7 5 9
func(42, c=9) # prints: 42 4 9
The are two things to notice, which are very important. First of all, you cannot specify a default argument on the left of a positional one. Second, note how in the examples, when an argument is passed without using the argument_name=value
syntax, it must be the first one in the list,, and it is always assigned to a
. Try and scramble those arguments and see what happens. Python error messages are very good at telling you what's wrong. So, for example, if you tried something like this:
func(b=1, c=2, 42) # positional argument after keyword one
You would get the following error:
SyntaxError: non-keyword arg after keyword arg
This informs you that you've called the function incorrectly.
Variable positional arguments
Sometimes you may want to pass a variable number of positional arguments to a function and Python provides you with the ability to do it. Let's look at a very common use case, the minimum
function. This is a function that calculates the minimum of its input values.
arguments.variable.positional.py
def minimum(*n):
# print(n) # n is a tuple
if n: # explained after the code
mn = n[0]
for value in n[1:]:
if value < mn:
mn = value
print(mn)
minimum(1, 3, -7, 9) # n = (1, 3, -7, 9) - prints: -7
minimum() # n = () - prints: nothing
As you can see, when we specify a parameter prepending a *
to its name, we are telling Python that that parameter will be collecting a variable number of positional arguments, according to how the function is called. Within the function, n
is a tuple. Uncomment the print(n)
to see for yourself and play around with it for a bit.
Note
Have you noticed how we checked if n
wasn't empty with a simple if n:
? This is due to the fact that collection objects evaluate to True
when non-empty, and otherwise False
in Python. This is true for tuples, sets, lists, dictionaries, and so on.
One other thing to note is that we may want to throw an error when we call the function with no arguments, instead of silently doing nothing. In this context, we're not concerned about making this function robust, but in understanding variable positional arguments.
Let's make another example to show you two things that, in my experience, are confusing to those who are new to this.
arguments.variable.positional.unpacking.py
def func(*args): print(args) values = (1, 3, -7, 9) func(values) # equivalent to: func((1, 3, -7, 9)) func(*values) # equivalent to: func(1, 3, -7, 9)
Take a good look at the last two lines of the preceding example. In the first one, we call func
with one argument, a four elements tuple. In the second example, by using the *
syntax, we're doing something called unpacking, which means that the four elements tuple is unpacked, and the function is called with four arguments: 1, 3, -7, 9
.
This behavior is part of the magic Python does to allow you to do amazing things when calling functions dynamically.
Variable keyword arguments
Variable keyword arguments are very similar to variable positional arguments. The only difference is the syntax (**
instead of *
) and that they are collected in a dictionary. Collection and unpacking work in the same way, so let's look at an example:
arguments.variable.keyword.py
def func(**kwargs): print(kwargs) # All calls equivalent. They print: {'a': 1, 'b': 42} func(a=1, b=42) func(**{'a': 1, 'b': 42}) func(**dict(a=1, b=42))
All the calls are equivalent in the preceding example. You can see that adding a **
in front of the parameter name in the function definition tells Python to use that name to collect a variable number of keyword parameters. On the other hand, when we call the function, we can either pass name=value
arguments explicitly, or unpack a dictionary using the same **
syntax.
The reason why being able to pass a variable number of keyword parameters is so important may not be evident at the moment, so, how about a more realistic example? Let's define a function that connects to a database. We want to connect to a default database by simply calling this function with no parameters. We also want to connect to any other database by passing the function the appropriate arguments. Before you read on, spend a couple of minutes figuring out a solution by yourself.
arguments.variable.db.py
def connect(**options): conn_params = { 'host': options.get('host', '127.0.0.1'), 'port': options.get('port', 5432), 'user': options.get('user', ''), 'pwd': options.get('pwd', ''), } print(conn_params) # we then connect to the db (commented out) # db.connect(**conn_params) connect() connect(host='127.0.0.42', port=5433) connect(port=5431, user='fab', pwd='gandalf')
Note in the function we can prepare a dictionary of connection parameters (conn_params
) in the function using default values as fallback, allowing them to be overwritten if they are provided in the function call. There are better ways to do this with fewer lines of code but we're not concerned with that now. Running the preceding code yields the following result:
$ python arguments.variable.db.py {'host': '127.0.0.1', 'pwd': '', 'user': '', 'port': 5432} {'host': '127.0.0.42', 'pwd': '', 'user': '', 'port': 5433} {'host': '127.0.0.1', 'pwd': 'gandalf', 'user': 'fab', 'port': 5431}
Note the correspondence between the function calls and the output. Note how default values are either there or overridden, according to what was passed to the function.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Changing a mutable affects the caller
This is the final point, and it's very important because Python apparently behaves differently with mutables (just apparently though). Let's look at an example:
key.points.mutable.py
x = [1, 2, 3]
def func(x):
x[1] = 42 # this affects the caller!
func(x)
print(x) # prints: [1, 42, 3]
Wow, we actually changed the original object! If you think about it, there is nothing weird in this behavior. The name x
in the function is set to point to the caller object by the function call and within the body of the function, we're not changing x
, in that we're not changing its reference, or, in other words, we are not changing the object x
is pointing to. What we're doing is accessing that object's element at position 1, and changing its value.
Remember point #2: "Assigning an object to an argument name within a function doesn't affect the caller". If that is clear to you, the following code should not be surprising.
key.points.mutable.assignment.py
x = [1, 2, 3] def func(x): x[1] = 42 # this changes the caller! x = 'something else' # this points x to a new string object func(x) print(x) # still prints: [1, 42, 3]
Take a look at the two lines I have highlighted. At first, we just access the caller object again, at position 1, and change its value to number 42. Then, we reassign x
to point to the string 'something else'
. This leaves the caller unaltered, according to point #2, and, in fact, the output is the same as that of the previous snippet.
Take your time to play around with this concept and experiment with prints and calls to the id
function until everything is clear in your mind. This is one of the key aspects of Python and it must be very clear, otherwise you risk introducing subtle bugs into your code.
Now that we have a good understanding of input parameters and how they behave, let's see how we can specify them.
How to specify input parameters
There are five different ways of specifying input parameters. Let's look at them one by one.
Positional arguments
Positional arguments are read from left to right and they are the most common type of arguments.
arguments.positional.py
def func(a, b, c): print(a, b, c) func(1, 2, 3) # prints: 1 2 3
There is not much else to say. They can be as numerous as you want and they are assigned by position. In the function call, 1
comes first, 2
comes second and 3
comes third, therefore they are assigned to a
, b
and c
respectively.
Keyword arguments and default values
Keyword arguments are assigned by keyword using the name=value
syntax.
arguments.keyword.py
def func(a, b, c):
print(a, b, c)
func(a=1, c=2, b=3) # prints: 1 3 2
Keyword arguments act when calling the function instead of respecting the left-to-right positional assignment, k. Keyword arguments are matched by name, even when they don't respect the definition's original position (we'll see that there is a limitation to this behavior later, when we mix and match different types of arguments).
The counterpart of keyword arguments, on the definition side, is default values. The syntax is the same, name=value
, and allows us to not have to provide an argument if we are happy with the given default.
arguments.default.py
def func(a, b=4, c=88):
print(a, b, c)
func(1) # prints: 1 4 88
func(b=5, a=7, c=9) # prints: 7 5 9
func(42, c=9) # prints: 42 4 9
The are two things to notice, which are very important. First of all, you cannot specify a default argument on the left of a positional one. Second, note how in the examples, when an argument is passed without using the argument_name=value
syntax, it must be the first one in the list,, and it is always assigned to a
. Try and scramble those arguments and see what happens. Python error messages are very good at telling you what's wrong. So, for example, if you tried something like this:
func(b=1, c=2, 42) # positional argument after keyword one
You would get the following error:
SyntaxError: non-keyword arg after keyword arg
This informs you that you've called the function incorrectly.
Variable positional arguments
Sometimes you may want to pass a variable number of positional arguments to a function and Python provides you with the ability to do it. Let's look at a very common use case, the minimum
function. This is a function that calculates the minimum of its input values.
arguments.variable.positional.py
def minimum(*n):
# print(n) # n is a tuple
if n: # explained after the code
mn = n[0]
for value in n[1:]:
if value < mn:
mn = value
print(mn)
minimum(1, 3, -7, 9) # n = (1, 3, -7, 9) - prints: -7
minimum() # n = () - prints: nothing
As you can see, when we specify a parameter prepending a *
to its name, we are telling Python that that parameter will be collecting a variable number of positional arguments, according to how the function is called. Within the function, n
is a tuple. Uncomment the print(n)
to see for yourself and play around with it for a bit.
Note
Have you noticed how we checked if n
wasn't empty with a simple if n:
? This is due to the fact that collection objects evaluate to True
when non-empty, and otherwise False
in Python. This is true for tuples, sets, lists, dictionaries, and so on.
One other thing to note is that we may want to throw an error when we call the function with no arguments, instead of silently doing nothing. In this context, we're not concerned about making this function robust, but in understanding variable positional arguments.
Let's make another example to show you two things that, in my experience, are confusing to those who are new to this.
arguments.variable.positional.unpacking.py
def func(*args): print(args) values = (1, 3, -7, 9) func(values) # equivalent to: func((1, 3, -7, 9)) func(*values) # equivalent to: func(1, 3, -7, 9)
Take a good look at the last two lines of the preceding example. In the first one, we call func
with one argument, a four elements tuple. In the second example, by using the *
syntax, we're doing something called unpacking, which means that the four elements tuple is unpacked, and the function is called with four arguments: 1, 3, -7, 9
.
This behavior is part of the magic Python does to allow you to do amazing things when calling functions dynamically.
Variable keyword arguments
Variable keyword arguments are very similar to variable positional arguments. The only difference is the syntax (**
instead of *
) and that they are collected in a dictionary. Collection and unpacking work in the same way, so let's look at an example:
arguments.variable.keyword.py
def func(**kwargs): print(kwargs) # All calls equivalent. They print: {'a': 1, 'b': 42} func(a=1, b=42) func(**{'a': 1, 'b': 42}) func(**dict(a=1, b=42))
All the calls are equivalent in the preceding example. You can see that adding a **
in front of the parameter name in the function definition tells Python to use that name to collect a variable number of keyword parameters. On the other hand, when we call the function, we can either pass name=value
arguments explicitly, or unpack a dictionary using the same **
syntax.
The reason why being able to pass a variable number of keyword parameters is so important may not be evident at the moment, so, how about a more realistic example? Let's define a function that connects to a database. We want to connect to a default database by simply calling this function with no parameters. We also want to connect to any other database by passing the function the appropriate arguments. Before you read on, spend a couple of minutes figuring out a solution by yourself.
arguments.variable.db.py
def connect(**options): conn_params = { 'host': options.get('host', '127.0.0.1'), 'port': options.get('port', 5432), 'user': options.get('user', ''), 'pwd': options.get('pwd', ''), } print(conn_params) # we then connect to the db (commented out) # db.connect(**conn_params) connect() connect(host='127.0.0.42', port=5433) connect(port=5431, user='fab', pwd='gandalf')
Note in the function we can prepare a dictionary of connection parameters (conn_params
) in the function using default values as fallback, allowing them to be overwritten if they are provided in the function call. There are better ways to do this with fewer lines of code but we're not concerned with that now. Running the preceding code yields the following result:
$ python arguments.variable.db.py {'host': '127.0.0.1', 'pwd': '', 'user': '', 'port': 5432} {'host': '127.0.0.42', 'pwd': '', 'user': '', 'port': 5433} {'host': '127.0.0.1', 'pwd': 'gandalf', 'user': 'fab', 'port': 5431}
Note the correspondence between the function calls and the output. Note how default values are either there or overridden, according to what was passed to the function.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
How to specify input parameters
There are five different ways of specifying input parameters. Let's look at them one by one.
Positional arguments
Positional arguments are read from left to right and they are the most common type of arguments.
arguments.positional.py
def func(a, b, c): print(a, b, c) func(1, 2, 3) # prints: 1 2 3
There is not much else to say. They can be as numerous as you want and they are assigned by position. In the function call, 1
comes first, 2
comes second and 3
comes third, therefore they are assigned to a
, b
and c
respectively.
Keyword arguments and default values
Keyword arguments are assigned by keyword using the name=value
syntax.
arguments.keyword.py
def func(a, b, c):
print(a, b, c)
func(a=1, c=2, b=3) # prints: 1 3 2
Keyword arguments act when calling the function instead of respecting the left-to-right positional assignment, k. Keyword arguments are matched by name, even when they don't respect the definition's original position (we'll see that there is a limitation to this behavior later, when we mix and match different types of arguments).
The counterpart of keyword arguments, on the definition side, is default values. The syntax is the same, name=value
, and allows us to not have to provide an argument if we are happy with the given default.
arguments.default.py
def func(a, b=4, c=88):
print(a, b, c)
func(1) # prints: 1 4 88
func(b=5, a=7, c=9) # prints: 7 5 9
func(42, c=9) # prints: 42 4 9
The are two things to notice, which are very important. First of all, you cannot specify a default argument on the left of a positional one. Second, note how in the examples, when an argument is passed without using the argument_name=value
syntax, it must be the first one in the list,, and it is always assigned to a
. Try and scramble those arguments and see what happens. Python error messages are very good at telling you what's wrong. So, for example, if you tried something like this:
func(b=1, c=2, 42) # positional argument after keyword one
You would get the following error:
SyntaxError: non-keyword arg after keyword arg
This informs you that you've called the function incorrectly.
Variable positional arguments
Sometimes you may want to pass a variable number of positional arguments to a function and Python provides you with the ability to do it. Let's look at a very common use case, the minimum
function. This is a function that calculates the minimum of its input values.
arguments.variable.positional.py
def minimum(*n):
# print(n) # n is a tuple
if n: # explained after the code
mn = n[0]
for value in n[1:]:
if value < mn:
mn = value
print(mn)
minimum(1, 3, -7, 9) # n = (1, 3, -7, 9) - prints: -7
minimum() # n = () - prints: nothing
As you can see, when we specify a parameter prepending a *
to its name, we are telling Python that that parameter will be collecting a variable number of positional arguments, according to how the function is called. Within the function, n
is a tuple. Uncomment the print(n)
to see for yourself and play around with it for a bit.
Note
Have you noticed how we checked if n
wasn't empty with a simple if n:
? This is due to the fact that collection objects evaluate to True
when non-empty, and otherwise False
in Python. This is true for tuples, sets, lists, dictionaries, and so on.
One other thing to note is that we may want to throw an error when we call the function with no arguments, instead of silently doing nothing. In this context, we're not concerned about making this function robust, but in understanding variable positional arguments.
Let's make another example to show you two things that, in my experience, are confusing to those who are new to this.
arguments.variable.positional.unpacking.py
def func(*args): print(args) values = (1, 3, -7, 9) func(values) # equivalent to: func((1, 3, -7, 9)) func(*values) # equivalent to: func(1, 3, -7, 9)
Take a good look at the last two lines of the preceding example. In the first one, we call func
with one argument, a four elements tuple. In the second example, by using the *
syntax, we're doing something called unpacking, which means that the four elements tuple is unpacked, and the function is called with four arguments: 1, 3, -7, 9
.
This behavior is part of the magic Python does to allow you to do amazing things when calling functions dynamically.
Variable keyword arguments
Variable keyword arguments are very similar to variable positional arguments. The only difference is the syntax (**
instead of *
) and that they are collected in a dictionary. Collection and unpacking work in the same way, so let's look at an example:
arguments.variable.keyword.py
def func(**kwargs): print(kwargs) # All calls equivalent. They print: {'a': 1, 'b': 42} func(a=1, b=42) func(**{'a': 1, 'b': 42}) func(**dict(a=1, b=42))
All the calls are equivalent in the preceding example. You can see that adding a **
in front of the parameter name in the function definition tells Python to use that name to collect a variable number of keyword parameters. On the other hand, when we call the function, we can either pass name=value
arguments explicitly, or unpack a dictionary using the same **
syntax.
The reason why being able to pass a variable number of keyword parameters is so important may not be evident at the moment, so, how about a more realistic example? Let's define a function that connects to a database. We want to connect to a default database by simply calling this function with no parameters. We also want to connect to any other database by passing the function the appropriate arguments. Before you read on, spend a couple of minutes figuring out a solution by yourself.
arguments.variable.db.py
def connect(**options): conn_params = { 'host': options.get('host', '127.0.0.1'), 'port': options.get('port', 5432), 'user': options.get('user', ''), 'pwd': options.get('pwd', ''), } print(conn_params) # we then connect to the db (commented out) # db.connect(**conn_params) connect() connect(host='127.0.0.42', port=5433) connect(port=5431, user='fab', pwd='gandalf')
Note in the function we can prepare a dictionary of connection parameters (conn_params
) in the function using default values as fallback, allowing them to be overwritten if they are provided in the function call. There are better ways to do this with fewer lines of code but we're not concerned with that now. Running the preceding code yields the following result:
$ python arguments.variable.db.py {'host': '127.0.0.1', 'pwd': '', 'user': '', 'port': 5432} {'host': '127.0.0.42', 'pwd': '', 'user': '', 'port': 5433} {'host': '127.0.0.1', 'pwd': 'gandalf', 'user': 'fab', 'port': 5431}
Note the correspondence between the function calls and the output. Note how default values are either there or overridden, according to what was passed to the function.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Positional arguments
Positional arguments are read from left to right and they are the most common type of arguments.
arguments.positional.py
def func(a, b, c): print(a, b, c) func(1, 2, 3) # prints: 1 2 3
There is not much else to say. They can be as numerous as you want and they are assigned by position. In the function call, 1
comes first, 2
comes second and 3
comes third, therefore they are assigned to a
, b
and c
respectively.
Keyword arguments and default values
Keyword arguments are assigned by keyword using the name=value
syntax.
arguments.keyword.py
def func(a, b, c):
print(a, b, c)
func(a=1, c=2, b=3) # prints: 1 3 2
Keyword arguments act when calling the function instead of respecting the left-to-right positional assignment, k. Keyword arguments are matched by name, even when they don't respect the definition's original position (we'll see that there is a limitation to this behavior later, when we mix and match different types of arguments).
The counterpart of keyword arguments, on the definition side, is default values. The syntax is the same, name=value
, and allows us to not have to provide an argument if we are happy with the given default.
arguments.default.py
def func(a, b=4, c=88):
print(a, b, c)
func(1) # prints: 1 4 88
func(b=5, a=7, c=9) # prints: 7 5 9
func(42, c=9) # prints: 42 4 9
The are two things to notice, which are very important. First of all, you cannot specify a default argument on the left of a positional one. Second, note how in the examples, when an argument is passed without using the argument_name=value
syntax, it must be the first one in the list,, and it is always assigned to a
. Try and scramble those arguments and see what happens. Python error messages are very good at telling you what's wrong. So, for example, if you tried something like this:
func(b=1, c=2, 42) # positional argument after keyword one
You would get the following error:
SyntaxError: non-keyword arg after keyword arg
This informs you that you've called the function incorrectly.
Variable positional arguments
Sometimes you may want to pass a variable number of positional arguments to a function and Python provides you with the ability to do it. Let's look at a very common use case, the minimum
function. This is a function that calculates the minimum of its input values.
arguments.variable.positional.py
def minimum(*n):
# print(n) # n is a tuple
if n: # explained after the code
mn = n[0]
for value in n[1:]:
if value < mn:
mn = value
print(mn)
minimum(1, 3, -7, 9) # n = (1, 3, -7, 9) - prints: -7
minimum() # n = () - prints: nothing
As you can see, when we specify a parameter prepending a *
to its name, we are telling Python that that parameter will be collecting a variable number of positional arguments, according to how the function is called. Within the function, n
is a tuple. Uncomment the print(n)
to see for yourself and play around with it for a bit.
Note
Have you noticed how we checked if n
wasn't empty with a simple if n:
? This is due to the fact that collection objects evaluate to True
when non-empty, and otherwise False
in Python. This is true for tuples, sets, lists, dictionaries, and so on.
One other thing to note is that we may want to throw an error when we call the function with no arguments, instead of silently doing nothing. In this context, we're not concerned about making this function robust, but in understanding variable positional arguments.
Let's make another example to show you two things that, in my experience, are confusing to those who are new to this.
arguments.variable.positional.unpacking.py
def func(*args): print(args) values = (1, 3, -7, 9) func(values) # equivalent to: func((1, 3, -7, 9)) func(*values) # equivalent to: func(1, 3, -7, 9)
Take a good look at the last two lines of the preceding example. In the first one, we call func
with one argument, a four elements tuple. In the second example, by using the *
syntax, we're doing something called unpacking, which means that the four elements tuple is unpacked, and the function is called with four arguments: 1, 3, -7, 9
.
This behavior is part of the magic Python does to allow you to do amazing things when calling functions dynamically.
Variable keyword arguments
Variable keyword arguments are very similar to variable positional arguments. The only difference is the syntax (**
instead of *
) and that they are collected in a dictionary. Collection and unpacking work in the same way, so let's look at an example:
arguments.variable.keyword.py
def func(**kwargs): print(kwargs) # All calls equivalent. They print: {'a': 1, 'b': 42} func(a=1, b=42) func(**{'a': 1, 'b': 42}) func(**dict(a=1, b=42))
All the calls are equivalent in the preceding example. You can see that adding a **
in front of the parameter name in the function definition tells Python to use that name to collect a variable number of keyword parameters. On the other hand, when we call the function, we can either pass name=value
arguments explicitly, or unpack a dictionary using the same **
syntax.
The reason why being able to pass a variable number of keyword parameters is so important may not be evident at the moment, so, how about a more realistic example? Let's define a function that connects to a database. We want to connect to a default database by simply calling this function with no parameters. We also want to connect to any other database by passing the function the appropriate arguments. Before you read on, spend a couple of minutes figuring out a solution by yourself.
arguments.variable.db.py
def connect(**options): conn_params = { 'host': options.get('host', '127.0.0.1'), 'port': options.get('port', 5432), 'user': options.get('user', ''), 'pwd': options.get('pwd', ''), } print(conn_params) # we then connect to the db (commented out) # db.connect(**conn_params) connect() connect(host='127.0.0.42', port=5433) connect(port=5431, user='fab', pwd='gandalf')
Note in the function we can prepare a dictionary of connection parameters (conn_params
) in the function using default values as fallback, allowing them to be overwritten if they are provided in the function call. There are better ways to do this with fewer lines of code but we're not concerned with that now. Running the preceding code yields the following result:
$ python arguments.variable.db.py {'host': '127.0.0.1', 'pwd': '', 'user': '', 'port': 5432} {'host': '127.0.0.42', 'pwd': '', 'user': '', 'port': 5433} {'host': '127.0.0.1', 'pwd': 'gandalf', 'user': 'fab', 'port': 5431}
Note the correspondence between the function calls and the output. Note how default values are either there or overridden, according to what was passed to the function.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Keyword arguments and default values
Keyword arguments are assigned by keyword using the name=value
syntax.
arguments.keyword.py
def func(a, b, c):
print(a, b, c)
func(a=1, c=2, b=3) # prints: 1 3 2
Keyword arguments act when calling the function instead of respecting the left-to-right positional assignment, k. Keyword arguments are matched by name, even when they don't respect the definition's original position (we'll see that there is a limitation to this behavior later, when we mix and match different types of arguments).
The counterpart of keyword arguments, on the definition side, is default values. The syntax is the same, name=value
, and allows us to not have to provide an argument if we are happy with the given default.
arguments.default.py
def func(a, b=4, c=88):
print(a, b, c)
func(1) # prints: 1 4 88
func(b=5, a=7, c=9) # prints: 7 5 9
func(42, c=9) # prints: 42 4 9
The are two things to notice, which are very important. First of all, you cannot specify a default argument on the left of a positional one. Second, note how in the examples, when an argument is passed without using the argument_name=value
syntax, it must be the first one in the list,, and it is always assigned to a
. Try and scramble those arguments and see what happens. Python error messages are very good at telling you what's wrong. So, for example, if you tried something like this:
func(b=1, c=2, 42) # positional argument after keyword one
You would get the following error:
SyntaxError: non-keyword arg after keyword arg
This informs you that you've called the function incorrectly.
Variable positional arguments
Sometimes you may want to pass a variable number of positional arguments to a function and Python provides you with the ability to do it. Let's look at a very common use case, the minimum
function. This is a function that calculates the minimum of its input values.
arguments.variable.positional.py
def minimum(*n):
# print(n) # n is a tuple
if n: # explained after the code
mn = n[0]
for value in n[1:]:
if value < mn:
mn = value
print(mn)
minimum(1, 3, -7, 9) # n = (1, 3, -7, 9) - prints: -7
minimum() # n = () - prints: nothing
As you can see, when we specify a parameter prepending a *
to its name, we are telling Python that that parameter will be collecting a variable number of positional arguments, according to how the function is called. Within the function, n
is a tuple. Uncomment the print(n)
to see for yourself and play around with it for a bit.
Note
Have you noticed how we checked if n
wasn't empty with a simple if n:
? This is due to the fact that collection objects evaluate to True
when non-empty, and otherwise False
in Python. This is true for tuples, sets, lists, dictionaries, and so on.
One other thing to note is that we may want to throw an error when we call the function with no arguments, instead of silently doing nothing. In this context, we're not concerned about making this function robust, but in understanding variable positional arguments.
Let's make another example to show you two things that, in my experience, are confusing to those who are new to this.
arguments.variable.positional.unpacking.py
def func(*args): print(args) values = (1, 3, -7, 9) func(values) # equivalent to: func((1, 3, -7, 9)) func(*values) # equivalent to: func(1, 3, -7, 9)
Take a good look at the last two lines of the preceding example. In the first one, we call func
with one argument, a four elements tuple. In the second example, by using the *
syntax, we're doing something called unpacking, which means that the four elements tuple is unpacked, and the function is called with four arguments: 1, 3, -7, 9
.
This behavior is part of the magic Python does to allow you to do amazing things when calling functions dynamically.
Variable keyword arguments
Variable keyword arguments are very similar to variable positional arguments. The only difference is the syntax (**
instead of *
) and that they are collected in a dictionary. Collection and unpacking work in the same way, so let's look at an example:
arguments.variable.keyword.py
def func(**kwargs): print(kwargs) # All calls equivalent. They print: {'a': 1, 'b': 42} func(a=1, b=42) func(**{'a': 1, 'b': 42}) func(**dict(a=1, b=42))
All the calls are equivalent in the preceding example. You can see that adding a **
in front of the parameter name in the function definition tells Python to use that name to collect a variable number of keyword parameters. On the other hand, when we call the function, we can either pass name=value
arguments explicitly, or unpack a dictionary using the same **
syntax.
The reason why being able to pass a variable number of keyword parameters is so important may not be evident at the moment, so, how about a more realistic example? Let's define a function that connects to a database. We want to connect to a default database by simply calling this function with no parameters. We also want to connect to any other database by passing the function the appropriate arguments. Before you read on, spend a couple of minutes figuring out a solution by yourself.
arguments.variable.db.py
def connect(**options): conn_params = { 'host': options.get('host', '127.0.0.1'), 'port': options.get('port', 5432), 'user': options.get('user', ''), 'pwd': options.get('pwd', ''), } print(conn_params) # we then connect to the db (commented out) # db.connect(**conn_params) connect() connect(host='127.0.0.42', port=5433) connect(port=5431, user='fab', pwd='gandalf')
Note in the function we can prepare a dictionary of connection parameters (conn_params
) in the function using default values as fallback, allowing them to be overwritten if they are provided in the function call. There are better ways to do this with fewer lines of code but we're not concerned with that now. Running the preceding code yields the following result:
$ python arguments.variable.db.py {'host': '127.0.0.1', 'pwd': '', 'user': '', 'port': 5432} {'host': '127.0.0.42', 'pwd': '', 'user': '', 'port': 5433} {'host': '127.0.0.1', 'pwd': 'gandalf', 'user': 'fab', 'port': 5431}
Note the correspondence between the function calls and the output. Note how default values are either there or overridden, according to what was passed to the function.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Variable positional arguments
Sometimes you may want to pass a variable number of positional arguments to a function and Python provides you with the ability to do it. Let's look at a very common use case, the minimum
function. This is a function that calculates the minimum of its input values.
arguments.variable.positional.py
def minimum(*n):
# print(n) # n is a tuple
if n: # explained after the code
mn = n[0]
for value in n[1:]:
if value < mn:
mn = value
print(mn)
minimum(1, 3, -7, 9) # n = (1, 3, -7, 9) - prints: -7
minimum() # n = () - prints: nothing
As you can see, when we specify a parameter prepending a *
to its name, we are telling Python that that parameter will be collecting a variable number of positional arguments, according to how the function is called. Within the function, n
is a tuple. Uncomment the print(n)
to see for yourself and play around with it for a bit.
Note
Have you noticed how we checked if n
wasn't empty with a simple if n:
? This is due to the fact that collection objects evaluate to True
when non-empty, and otherwise False
in Python. This is true for tuples, sets, lists, dictionaries, and so on.
One other thing to note is that we may want to throw an error when we call the function with no arguments, instead of silently doing nothing. In this context, we're not concerned about making this function robust, but in understanding variable positional arguments.
Let's make another example to show you two things that, in my experience, are confusing to those who are new to this.
arguments.variable.positional.unpacking.py
def func(*args): print(args) values = (1, 3, -7, 9) func(values) # equivalent to: func((1, 3, -7, 9)) func(*values) # equivalent to: func(1, 3, -7, 9)
Take a good look at the last two lines of the preceding example. In the first one, we call func
with one argument, a four elements tuple. In the second example, by using the *
syntax, we're doing something called unpacking, which means that the four elements tuple is unpacked, and the function is called with four arguments: 1, 3, -7, 9
.
This behavior is part of the magic Python does to allow you to do amazing things when calling functions dynamically.
Variable keyword arguments
Variable keyword arguments are very similar to variable positional arguments. The only difference is the syntax (**
instead of *
) and that they are collected in a dictionary. Collection and unpacking work in the same way, so let's look at an example:
arguments.variable.keyword.py
def func(**kwargs): print(kwargs) # All calls equivalent. They print: {'a': 1, 'b': 42} func(a=1, b=42) func(**{'a': 1, 'b': 42}) func(**dict(a=1, b=42))
All the calls are equivalent in the preceding example. You can see that adding a **
in front of the parameter name in the function definition tells Python to use that name to collect a variable number of keyword parameters. On the other hand, when we call the function, we can either pass name=value
arguments explicitly, or unpack a dictionary using the same **
syntax.
The reason why being able to pass a variable number of keyword parameters is so important may not be evident at the moment, so, how about a more realistic example? Let's define a function that connects to a database. We want to connect to a default database by simply calling this function with no parameters. We also want to connect to any other database by passing the function the appropriate arguments. Before you read on, spend a couple of minutes figuring out a solution by yourself.
arguments.variable.db.py
def connect(**options): conn_params = { 'host': options.get('host', '127.0.0.1'), 'port': options.get('port', 5432), 'user': options.get('user', ''), 'pwd': options.get('pwd', ''), } print(conn_params) # we then connect to the db (commented out) # db.connect(**conn_params) connect() connect(host='127.0.0.42', port=5433) connect(port=5431, user='fab', pwd='gandalf')
Note in the function we can prepare a dictionary of connection parameters (conn_params
) in the function using default values as fallback, allowing them to be overwritten if they are provided in the function call. There are better ways to do this with fewer lines of code but we're not concerned with that now. Running the preceding code yields the following result:
$ python arguments.variable.db.py {'host': '127.0.0.1', 'pwd': '', 'user': '', 'port': 5432} {'host': '127.0.0.42', 'pwd': '', 'user': '', 'port': 5433} {'host': '127.0.0.1', 'pwd': 'gandalf', 'user': 'fab', 'port': 5431}
Note the correspondence between the function calls and the output. Note how default values are either there or overridden, according to what was passed to the function.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Variable keyword arguments
Variable keyword arguments are very similar to variable positional arguments. The only difference is the syntax (**
instead of *
) and that they are collected in a dictionary. Collection and unpacking work in the same way, so let's look at an example:
arguments.variable.keyword.py
def func(**kwargs): print(kwargs) # All calls equivalent. They print: {'a': 1, 'b': 42} func(a=1, b=42) func(**{'a': 1, 'b': 42}) func(**dict(a=1, b=42))
All the calls are equivalent in the preceding example. You can see that adding a **
in front of the parameter name in the function definition tells Python to use that name to collect a variable number of keyword parameters. On the other hand, when we call the function, we can either pass name=value
arguments explicitly, or unpack a dictionary using the same **
syntax.
The reason why being able to pass a variable number of keyword parameters is so important may not be evident at the moment, so, how about a more realistic example? Let's define a function that connects to a database. We want to connect to a default database by simply calling this function with no parameters. We also want to connect to any other database by passing the function the appropriate arguments. Before you read on, spend a couple of minutes figuring out a solution by yourself.
arguments.variable.db.py
def connect(**options): conn_params = { 'host': options.get('host', '127.0.0.1'), 'port': options.get('port', 5432), 'user': options.get('user', ''), 'pwd': options.get('pwd', ''), } print(conn_params) # we then connect to the db (commented out) # db.connect(**conn_params) connect() connect(host='127.0.0.42', port=5433) connect(port=5431, user='fab', pwd='gandalf')
Note in the function we can prepare a dictionary of connection parameters (conn_params
) in the function using default values as fallback, allowing them to be overwritten if they are provided in the function call. There are better ways to do this with fewer lines of code but we're not concerned with that now. Running the preceding code yields the following result:
$ python arguments.variable.db.py {'host': '127.0.0.1', 'pwd': '', 'user': '', 'port': 5432} {'host': '127.0.0.42', 'pwd': '', 'user': '', 'port': 5433} {'host': '127.0.0.1', 'pwd': 'gandalf', 'user': 'fab', 'port': 5431}
Note the correspondence between the function calls and the output. Note how default values are either there or overridden, according to what was passed to the function.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Keyword-only arguments
Python 3 allows for a new type of parameter: the keyword-only parameter. We are going to study them only briefly as their use cases are not that frequent. There are two ways of specifying them, either after the variable positional arguments, or after a bare *. Let's see an example of both.
arguments.keyword.only.py
def kwo(*a, c): print(a, c) kwo(1, 2, 3, c=7) # prints: (1, 2, 3) 7 kwo(c=4) # prints: () 4 # kwo(1, 2) # breaks, invalid syntax, with the following error # TypeError: kwo() missing 1 required keyword-only argument: 'c' def kwo2(a, b=42, *, c): print(a, b, c) kwo2(3, b=7, c=99) # prints: 3 7 99 kwo2(3, c=13) # prints: 3 42 13 # kwo2(3, 23) # breaks, invalid syntax, with the following error # TypeError: kwo2() missing 1 required keyword-only argument: 'c'
As anticipated, the function, kwo
, takes a variable number of positional arguments (a
) and a keyword-only function, c
. The results of the calls are straightforward and you can uncomment the third call to see what error Python returns.
The same applies to the function, kwo2
, which differs from kwo
in that it takes a positional argument a
, a keyword argument b
, and then a keyword-only argument, c
. You can uncomment the third call to see the error.
Now that you know how to specify different types of input parameters, let's see how you can combine them in function definitions.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Combining input parameters
You can combine input parameters, as long as you follow these ordering rules:
- When defining a function, normal positional arguments come first (
name
), then any default arguments (name=value
), then the variable positional arguments (*name
, or simply*
), then any keyword-only arguments (eithername
orname=value
form is good), then any variable keyword arguments (**name
). - On the other hand, when calling a function, arguments must be given in the following order: positional arguments first (
value
), then any combination of keyword arguments (name=value
), variable positional arguments (*name
), then variable keyword arguments (**name
).
Since this can be a bit tricky when left hanging in the theoretical world, let's look at a couple of quick examples.
arguments.all.py
def func(a, b, c=7, *args, **kwargs): print('a, b, c:', a, b, c) print('args:', args) print('kwargs:', kwargs) func(1, 2, 3, *(5, 7, 9), **{'A': 'a', 'B': 'b'}) func(1, 2, 3, 5, 7, 9, A='a', B='b') # same as previous one
Note the order of the parameters in the function definition, and that the two calls are equivalent. In the first one, we're using the unpacking operators for iterables and dictionaries, while in the second one we're using a more explicit syntax. The execution of this yields (I printed only the result of one call):
$ python arguments.all.py a, b, c: 1 2 3 args: (5, 7, 9) kwargs: {'A': 'a', 'B': 'b'}
Let's now look at an example with keyword-only arguments.
arguments.all.kwonly.py
def func_with_kwonly(a, b=42, *args, c, d=256, **kwargs):
print('a, b:', a, b)
print('c, d:', c, d)
print('args:', args)
print('kwargs:', kwargs)
# both calls equivalent
func_with_kwonly(3, 42, c=0, d=1, *(7, 9, 11), e='E', f='F')
func_with_kwonly(3, 42, *(7, 9, 11), c=0, d=1, e='E', f='F')
Note that I have highlighted the keyword-only arguments in the function declaration. They come after the variable positional argument *args
, and it would be the same if they came right after a single *
(in which case there wouldn't be a variable positional argument). The execution of this yields (I printed only the result of one call):
$ python arguments.all.kwonly.py a, b: 3 42 c, d: 0 1 args: (7, 9, 11) kwargs: {'f': 'F', 'e': 'E'}
One other thing to note are the names I gave to the variable positional and keyword arguments. You're free to choose differently, but be aware that args
and kwargs
are the conventional names given to these parameters, at least generically. Now that you know how to define a function in all possible flavors, let me show you something tricky: mutable defaults.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Avoid the trap! Mutable defaults
One thing to be very aware of with Python is that default values are created at def
time, therefore, subsequent calls to the same function will possibly behave differently according to the mutability of their default values. Let's look at an example:
arguments.defaults.mutable.py
def func(a=[], b={}):
print(a)
print(b)
print('#' * 12)
a.append(len(a)) # this will affect a's default value
b[len(a)] = len(a) # and this will affect b's one
func()
func()
func()
The parameters both have mutable default values. This means that, if you affect those objects, any modification will stick around in subsequent function calls. See if you can understand the output of those calls:
$ python arguments.defaults.mutable.py [] {} ############ [0] {1: 1} ############ [0, 1] {1: 1, 2: 2} ############
It's interesting, isn't it? While this behavior may seem very weird at first, it actually makes sense, and it's very handy, for example, when using memoization techniques (Google an example of that, if you're interested).
Even more interesting is what happens when, between the calls, we introduce one that doesn't use defaults, like this:
arguments.defaults.mutable.intermediate.call.py
func() func(a=[1, 2, 3], b={'B': 1}) func()
When we run this code, this is the output:
$ python arguments.defaults.mutable.intermediate.call.py [] {} ############ [1, 2, 3] {'B': 1} ############ [0] {1: 1} ############
This output shows us that the defaults are retained even if we call the function with other values. One question that comes to mind is, how do I get a fresh empty value every time? Well, the convention is the following:
arguments.defaults.mutable.no.trap.py
def func(a=None): if a is None: a = [] # do whatever you want with `a` ...
Note that, by using the preceding technique, if a
isn't passed when calling the function, you always get a brand new empty list.
Okay, enough with the input, let's look at the other side of the coin, the output.
Return values
Return values of functions are one of those things where Python is light years ahead of most other languages. Functions are usually allowed to return one object (one value) but, in Python, you can return a tuple, and this implies that you can return whatever you want. This feature allows a coder to write software that would be much harder to write in any other language, or certainly more tedious. We've already said that to return something from a function we need to use the return
statement, followed by what we want to return. There can be as many return statements as needed in the body of a function.
On the other hand, if within the body of a function we don't return anything, the function will return None
. This behavior is harmless and, even though I don't have the room here to go into detail explaining why Python was designed like this, let me just tell you that this feature allows for several interesting patterns, and confirms Python as a very consistent language.
I say it's harmless because you are never forced to collect the result of a function call. I'll show you what I mean with an example:
return.none.py
def func(): pass func() # the return of this call won't be collected. It's lost. a = func() # the return of this one instead is collected into `a` print(a) # prints: None
Note that the whole body of the function is comprised only of the pass
statement. As the official documentation tells us, pass
is a null operation. When it is executed, nothing happens. It is useful as a placeholder when a statement is required syntactically, but no code needs to be executed. In other languages, we would probably just indicate that with a pair of curly braces ({}
), which define an empty scope but in Python a scope is defined by indenting code, therefore a statement such as pass
is necessary.
Notice also that the first call of the function func
returns a value (None
) which we don't collect. As I said before, collecting the return value of a function call is not mandatory.
Now, that's good but not very interesting so, how about we write an interesting function? Remember that in Chapter 1, Introduction and First Steps – Take a Deep Breath, we talked about the factorial of a function. Let's write our own here (for simplicity, I will assume the function is always called correctly with appropriate values so I won't sanity-check on the input argument):
return.single.value.py
def factorial(n): if n in (0, 1): return 1 result = n for k in range(2, n): result *= k return result f5 = factorial(5) # f5 = 120
Note that we have two points of return. If n
is either 0
or 1
(in Python it's common to use the in
type of check as I did instead of the more verbose if n ==0 or n == 1:
), we return 1
. Otherwise, we perform the required calculation, and we return result
. Can we write this function a little bit more Pythonically? Yes, but I'll let you figure out that for yourself, as an exercise.
return.single.value.2.py
from functools import reduce
from operator import mul
def factorial(n):
return reduce(mul, range(1, n + 1), 1)
f5 = factorial(5) # f5 = 120
I know what you're thinking, one line? Python is elegant, and concise! I think this function is readable even if you have never seen reduce
or mul
, but if you can't read it or understand it, set aside a few minutes and do some research on the Python documentation until its behavior is clear to you. Being able to look up functions in the documentation and understand code written by someone else is a task every developer needs to be able to perform, so think of this as a good exercise, and good luck!
Tip
To this end, make sure you look up the help
function, which comes in very handy exploring with the console.
Returning multiple values
Unlike in most other languages, in Python it's very easy to return multiple objects from a function. This feature opens up a whole world of possibilities and allows you to code in a style that is hard to reproduce with other languages. Our thinking is limited by the tools we use, therefore when Python gives you more freedom than other languages, it is actually boosting your own creativity as well. To return multiple values is very easy, you just use tuples (either explicitly or implicitly). Let's look at a simple example that mimics the divmod
built-in function:
return.multiple.py
def moddiv(a, b):
return a // b, a % b
print(moddiv(20, 7)) # prints (2, 6)
I could have wrapped the highlighted part in the preceding code in braces, making it an explicit tuple, but there's no need for that. The preceding function returns both the result and the remainder of the division, at the same time.
Returning multiple values
Unlike in most other languages, in Python it's very easy to return multiple objects from a function. This feature opens up a whole world of possibilities and allows you to code in a style that is hard to reproduce with other languages. Our thinking is limited by the tools we use, therefore when Python gives you more freedom than other languages, it is actually boosting your own creativity as well. To return multiple values is very easy, you just use tuples (either explicitly or implicitly). Let's look at a simple example that mimics the divmod
built-in function:
return.multiple.py
def moddiv(a, b):
return a // b, a % b
print(moddiv(20, 7)) # prints (2, 6)
I could have wrapped the highlighted part in the preceding code in braces, making it an explicit tuple, but there's no need for that. The preceding function returns both the result and the remainder of the division, at the same time.
A few useful tips
When writing functions, it's very useful to follow guidelines so that you write them well. I'll quickly point some of them out here:
- Functions should do one thing: Functions that do one thing are easy to describe in one short sentence. Functions which do multiple things can be split into smaller functions which do one thing. These smaller functions are usually easier to read and understand. Remember the data science example we saw a few pages ago.
- Functions should be small: The smaller they are, the easier it is to test them and to write them so that they do one thing.
- The fewer input parameters, the better: Functions which take a lot of arguments quickly become harder to manage (among other issues).
- Functions should be consistent in their return values: Returning
False
orNone
is not the same thing, even if within a Boolean context they both evaluate toFalse
.False
means that we have information (False
), whileNone
means that there is no information. Try writing functions which return in a consistent way, no matter what happens in their body. - Functions shouldn't have side effects: In other words, functions should not affect the values you call them with. This is probably the hardest statement to understand at this point, so I'll give you an example using lists. In the following code, note how
numbers
is not sorted by thesorted
function, which actually returns a sorted copy ofnumbers
. Conversely, thelist.sort()
method is acting on thenumbers
object itself, and that is fine because it is a method (a function that belongs to an object and therefore has the rights to modify it):>>> numbers = [4, 1, 7, 5] >>> sorted(numbers) # won't sort the original `numbers` list [1, 4, 5, 7] >>> numbers # let's verify [4, 1, 7, 5] # good, untouched >>> numbers.sort() # this will act on the list >>> numbers [1, 4, 5, 7]
Follow these guidelines and you'll write better functions, which will serve you well.
Note
Chapter 3, Functions in Clean Code by Robert C. Martin, Prentice Hall is dedicated to functions and it's probably the best set of guidelines I've ever read on the subject.
Recursive functions
When a function calls itself to produce a result, it is said to be recursive. Sometimes recursive functions are very useful in that they make it easier to write code. Some algorithms are very easy to write using the recursive paradigm, while others are not. There is no recursive function that cannot be rewritten in an iterative fashion, so it's usually up to the programmer to choose the best approach for the case at hand.
A recursive function usually has a set of base cases for which the return value doesn't depend on a subsequent call to the function itself and a set of recursive cases, for which the return value is calculated with one or more calls to the function itself.
As an example, we can consider the (hopefully familiar by now) factorial
function N!. The base case is when N is either 0 or 1. The function returns 1 with no need for further calculation. On the other hand, in the general case, N! returns the product 1 * 2 * ... * (N-1) * N. If you think about it, N! can be rewritten like this: N! = (N-1)! * N. As a practical example, consider 5! = 1 * 2 * 3 * 4 * 5 = (1 * 2 * 3 * 4) * 5 = 4! * 5.
Let's write this down in code:
recursive.factorial.py
def factorial(n): if n in (0, 1): # base case return 1 return factorial(n - 1) * n # recursive case
Note
When writing recursive functions, always consider how many nested calls you make, there is a limit. For further information on this, check out sys.getrecursionlimit()
and sys.setrecursionlimit()
.
Recursive functions are used a lot when writing algorithms and they can be really fun to write. As a good exercise, try to solve a couple of simple problems using both a recursive and an iterative approach.
Anonymous functions
One last type of functions that I want to talk about are anonymous functions. These functions, which are called lambdas in Python, are usually used when a fully-fledged function with its own name would be overkill, and all we want is a quick, simple one-liner that does the job.
Imagine that you want a list of all the numbers up to N which are multiples of five. Imagine that you want to filter those out using the filter
function, which takes a function and an iterable and constructs a filter object which you can iterate on, from those elements of iterable for which the function returns True
. Without using an anonymous function, you would do something like this:
filter.regular.py
def is_multiple_of_five(n):
return not n % 5
def get_multiples_of_five(n):
return list(filter(is_multiple_of_five, range(n)))
print(get_multiples_of_five(50))
I have highlighted the main logic of get_multiples_of_five
. Note how the filter uses is_multiple_of_five
to filter the first n natural numbers. This seems a bit excessive, the task is simple and we don't need to keep the is_multiple_of_five
function around for anything else. Let's rewrite it using a lambda function:
filter.lambda.py
def get_multiples_of_five(n):
return list(filter(lambda k: not k % 5, range(n)))
print(get_multiples_of_five(50))
The logic is exactly the same but the filtering function is now a lambda. Defining a lambda is very easy and follows this form: func_name = lambda [parameter_list]: expression
. A function object is returned, which is equivalent to this: def func_name([parameter_list]): return expression
.
Note
Note that optional parameters are indicated following the common syntax of wrapping them in square brackets.
Let's look at another couple of examples of equivalent functions defined in the two forms:
lambda.explained.py
# example 1: adder def adder(a, b): return a + b # is equivalent to: adder_lambda = lambda a, b: a + b # example 2: to uppercase def to_upper(s): return s.upper() # is equivalent to: to_upper_lambda = lambda s: s.upper()
The preceding examples are very simple. The first one adds two numbers, and the second one produces the uppercase version of a string. Note that I assigned what is returned by the lambda
expressions to a name (adder_lambda
, to_upper_lambda
), but there is no need for that when you use lambdas in the way we did in the filter
example before.
Function attributes
Every function is a fully-fledged object and, as such, they have many attributes. Some of them are special and can be used in an introspective way to inspect the function object at runtime. The following script is an example that shows all of them and how to display their value for an example function:
func.attributes.py
def multiplication(a, b=1):
"""Return a multiplied by b. """
return a * b
special_attributes = [
"__doc__", "__name__", "__qualname__", "__module__",
"__defaults__", "__code__", "__globals__", "__dict__",
"__closure__", "__annotations__", "__kwdefaults__",
]
for attribute in special_attributes:
print(attribute, '->', getattr(multiplication, attribute))
I used the built-in getattr
function to get the value of those attributes. getattr(obj, attribute)
is equivalent to obj.attribute
and comes in handy when we need to get an attribute at runtime using its string name. Running this script yields:
$ python func.attributes.py __doc__ -> Return a multiplied by b. __name__ -> multiplication __qualname__ -> multiplication __module__ -> __main__ __defaults__ -> (1,) __code__ -> <code object multiplication at 0x7ff529e79300, file "ch4/func.attributes.py", line 1> __globals__ -> {... omitted ...} __dict__ -> {} __closure__ -> None __annotations__ -> {} __kwdefaults__ -> None
I have omitted the value of the __globals__
attribute, it was too big. An explanation of the meaning of this attribute can be found in the types section of the Python Data Model documentation page.
Built-in functions
Python comes with a lot of built-in functions. They are available anywhere and you can get a list of them by inspecting the builtins
module with dir(__builtins__)
, or by going to the official Python documentation. Unfortunately, I don't have the room to go through all of them here. Some of them we've already seen, such as any
, bin
, bool
, divmod
, filter
, float
, getattr
, id
, int
, len
, list
, min
, print
, set
, tuple
, type
, and zip
, but there are many more, which you should read at least once.
Get familiar with them, experiment, write a small piece of code for each of them, make sure you have them at the tip of your fingers so that you can use them when you need them.
One final example
Before we finish off this chapter, how about a final example? I was thinking we could write a function to generate a list of prime numbers up to a limit. We've already seen the code for this so let's make it a function and, to keep it interesting, let's optimize it a bit.
It turns out that you don't need to divide it by all numbers from 2 to N-1 to decide if a number N is prime. You can stop at . Moreover, you don't need to test the division for all numbers from 2 to , you can just use the primes in that range. I'll leave it to you to figure out why this works, if you're interested. Let's see how the code changes:
primes.py
from math import sqrt, ceil def get_primes(n): """Calculate a list of primes up to n (included). """ primelist = [] for candidate in range(2, n + 1): is_prime = True root = int(ceil(sqrt(candidate))) # division limit for prime in primelist: # we try only the primes if prime > root: # no need to check any further break if candidate % prime == 0: is_prime = False break if is_prime: primelist.append(candidate) return primelist
The code is the same as in the previous chapter. We have changed the division algorithm so that we only test divisibility using the previously calculated primes and we stopped once the testing divisor was greater than the root of the candidate. We used the result list primelist
to get the primes for the division. We calculated the root value using a fancy formula, the integer value of the ceiling of the root of the candidate. While a simple int(k ** 0.5) + 1
would have served our purpose as well, the formula I chose is cleaner and requires me to use a couple of imports, which I wanted to show you. Check out the functions in the math
module, they are very interesting!
Documenting your code
I'm a big fan of code that doesn't need documentation. When you program correctly, choose the right names and take care of the details, your code should come out as self-explanatory and documentation should not be needed. Sometimes a comment is very useful though, and so is some documentation. You can find the guidelines for documenting Python in PEP257 – Docstring conventions, but I'll show you the basics here.
Python is documented with strings, which are aptly called docstrings. Any object can be documented, and you can use either one-line or multi-line docstrings. One-liners are very simple. They should not provide another signature for the function, but clearly state its purpose.
docstrings.py
def square(n): """Return the square of a number n. """ return n ** 2 def get_username(userid): """Return the username of a user given their id. """ return db.get(user_id=userid).username
Using triple double-quoted strings allows you to expand easily later on. Use sentences that end in a period, and don't leave blank lines before or after.
Multi-line comments are structured in a similar way. There should be a one-liner that briefly gives you the gist of what the object is about, and then a more verbose description. As an example, I have documented a fictitious connect
function, using the Sphinx notation, in the following example.
Note
Sphinx is probably the most widely used tool for creating Python documentation. In fact, the official Python documentation was written with it. It's definitely worth spending some time checking it out.
docstrings.py
def connect(host, port, user, password): """Connect to a database. Connect to a PostgreSQL database directly, using the given parameters. :param host: The host IP. :param port: The desired port. :param user: The connection username. :param password: The connection password. :return: The connection object. """ # body of the function here... return connection
Importing objects
Now that you know a lot about functions, let's see how to use them. The whole point of writing functions is to be able to later reuse them, and this in Python translates to importing them into the namespace in which you need them. There are many different ways to import objects into a namespace, but the most common ones are just two: import module_name
and from module_name import function_name
. Of course, these are quite simplistic examples, but bear with me for the time being.
The form import module_name
finds the module module_name
and defines a name for it in the local namespace where the import
statement is executed.
The form from module_name import identifier
is a little bit more complicated than that, but basically does the same thing. It finds module_name
and searches for an attribute (or a submodule) and stores a reference to identifier
in the local namespace.
Both forms have the option to change the name of the imported object using the as
clause, like this:
from mymodule import myfunc as better_named_func
Just to give you a flavor of what importing looks like, here's an example from a test module of a number theory library I wrote some years ago (it's available on Bitbucket):
karma/test_nt.py
import unittest # imports the unittest module from math import sqrt # imports one function from math from random import randint, sample # two imports at once from mock import patch from nose.tools import ( # multiline import assert_equal, assert_list_equal, assert_not_in, ) from karma import nt, utils
I commented some of them and I hope it's easy to follow. When you have a structure of files starting in the root of your project, you can use the dot notation to get to the object you want to import into your current namespace, be it a package, a module, a class, a function, or anything else. The from module import
syntax also allows a catch-all clause from module import *
, which is sometimes used to get all the names from a module into the current namespace at once, but it's frowned upon for several reasons: performances, the risk of silently shadowing other names, and so on. You can read all that there is to know about imports in the official Python documentation but, before we leave the subject, let me give you a better example.
Imagine that you have defined a couple of functions: square(n)
and cube(n)
in a module, funcdef.py
, which is in the lib
folder. You want to use them in a couple of modules which are at the same level of the lib
folder, called func_import.py
, and func_from.py
. Showing the tree structure of that project produces something like this:
├── func_from.py ├── func_import.py ├── lib ├── funcdef.py └── __init__.py
Before I show you the code of each module, please remember that in order to tell Python that it is actually a package, we need to put a __init__.py
module in it.
Note
There are two things to note about the __init__.py
file. First of all, it is a fully fledged Python module so you can put code into it as you would with any other module. Second, as of Python 3.3, its presence is no longer required to make a folder be interpreted as a Python package.
The code is as follows:
funcdef.py
def square(n): return n ** 2 def cube(n): return n ** 3
func_import.py
import lib.funcdef
print(lib.funcdef.square(10))
print(lib.funcdef.cube(10))
func_from.py
from lib.funcdef import square, cube
print(square(10))
print(cube(10))
Both these files, when executed, print 100
and 1000
. You can see how differently we then access the square
and cube
functions, according to how and what we imported in the current scope.
Relative imports
The imports we've seen until now are called absolute, that is to say they define the whole path of the module that we want to import, or from which we want to import an object. There is another way of importing objects into Python, which is called relative import. It's helpful in situations in which we want to rearrange the structure of large packages without having to edit sub-packages, or when we want to make a module inside a package able to import itself. Relative imports are done by adding as many leading dots in front of the module as the number of folders we need to backtrack, in order to find what we're searching for. Simply put, it is something like this:
from .mymodule import myfunc
For a complete explanation of relative imports, refer to PEP328 (https://www.python.org/dev/peps/pep-0328).
In later chapters, we'll create projects using different libraries and we'll use several different types of imports, including relative ones, so make sure you take a bit of time to read up about it in the official Python documentation.
Relative imports
The imports we've seen until now are called absolute, that is to say they define the whole path of the module that we want to import, or from which we want to import an object. There is another way of importing objects into Python, which is called relative import. It's helpful in situations in which we want to rearrange the structure of large packages without having to edit sub-packages, or when we want to make a module inside a package able to import itself. Relative imports are done by adding as many leading dots in front of the module as the number of folders we need to backtrack, in order to find what we're searching for. Simply put, it is something like this:
from .mymodule import myfunc
For a complete explanation of relative imports, refer to PEP328 (https://www.python.org/dev/peps/pep-0328).
In later chapters, we'll create projects using different libraries and we'll use several different types of imports, including relative ones, so make sure you take a bit of time to read up about it in the official Python documentation.
Summary
In this chapter, finally we explored the world of functions. They are extremely important and, from now on, we'll use them basically everywhere. We talked about the main reasons for using them, the most important of which are code reuse and implementation hiding.
We saw that a function object is like a box that takes optional input and produces output. We can feed input values to a function in many different ways, using positional and keyword arguments, and using variable syntax for both types.
Now you should know how to write a function, how to document it, import it into your code, and call it.
The next chapter will force me to push my foot down on the throttle even more so I suggest you take any opportunity you get to consolidate and enrich the knowledge you've gathered until now by putting your nose into the Python official documentation.
Ready for the cool stuff? Let's go!