This tutorial will provide you with the additional tools you need to use Python in our second year courses. We will explore pylab (also called matplotlib), which uses numpy and the plotting packages of matplotlib.

Outline:

Getting your work done with Pylab: plotting, formatted output, reading in data from a file.

Working with numpy.

1. Getting your work done with Pylab

In your second year courses, you will be using Pylab to get work done in a variety of ways. We tested Pylab in the First Steps, Part 2 tutorial.

Plotting in Pylab

Let's start with a look at the basics of plotting. Look at the following code:

#import numpy functions and typesfrom pylab import *
plot([1,2,3,4],[1,4,9,16])
show()

This code will simply plot the elements of the list [1, 2, 3, 4] against the elements ofthe list [1, 4, 9, 16]. The final command in all pylab plotting should always be "show()", which activates the plotting window.

Activity 1: Copy and save the excerpt above to a new script file, and run it in IDLE using F5. The result should be a window like this

Notice the row of icons at the bottom of the window. These icons allow you to explore your plots by panning the plots and changing its settings. For more information on these buttons see the interactive navigation documentation at http://matplotlib.sourceforge.net/users/navigation_toolbar.htm.

To learn more about the very powerful plot command, in the Python shell type from pylab import * help(plot)

The following activity will help you explore several useful capabilties of matplotlib. Activity 2: Do the following:

Copy and paste the following script into a new IDLE script window and run it. Resize the plotting windows to view them properly, if necessary.

Close the plotting windows, then quickly read over the code, trying to get an overall understanding of the different commands.

For guidance on the commands, type from pylab import * in the Python shell and use help() on the commands. This is a quick way to learn about the capabilities of pylab.

#import numpy functions and typesfrom pylab import *
x = arange(1,10,0.5)
xsquare = x**2
xcube = x**3
xsquareroot = x**0.5#open figure 1
figure(1)#basic plot
plot(x,xsquare)#add a label to the x axis
xlabel('abscissa')#add a label to the y axis
ylabel('ordinate')#add a title
title('A practice plot.')#save the figure to the current diretory as a png file
savefig('plot_1.png')#open a second figure
figure(2)#do two plots:#one with red circles with no line joining them#and another with green plus signs joined by a dashed curve
plot(x, xsquare,'ro', x, xcube,'g+--')#x and y labels, title
xlabel('abscissa')
ylabel('ordinate')
title('More practice.')#add a legend
legend(('squared','cubed'))#save the figure
savefig('plot_2.png')
figure(3)#use the subplot function to generate multiple panels within the same plotting window
subplot(3,1,1)#plot black stars with dotted lines
plot(x, xsquareroot,'k*:')
title('square roots')
subplot(3,1,2)#plot red right-pointing triangles with dashed lines
plot(x, xsquare,'r>-')
title('squares')
subplot(3,1,3)#plot magenta hexagons with a solid line
plot(x, xcube,'mh-')
title('cubes')
savefig('plot_3.png')#show() should be the last line
show()

The result will produce a session that looks like this, after rearranging and resizing the windows:

This code demonstrates a lot. It shows examples of:

Plotting with numpy arrays, defined based on the arange() command.

How to open multiple plotting windows with figure()

How to add plotting symbols and change the lines joining the points with strings like 'ro' (red circles) and 'g+--' (green plus signs with dashes). For more information on plotting, type help(plot) and see the pyplot tutorial at http://matplotlib.sourceforge.net/users/pyplot_tutorial.html

How to save your plot to a graphics file with savefig(). To save to a pdf, replace the .png suffix with .pdf.

How to label and title your figures with xlabel, ylabel, title.

How to use the subplot command to create multiple window figures. The basic syntax for subplot is subplot(rows, cols, num), where rows is the number of rows in the plot, cols is the number of columns in the plot, and num is the plot number. See help(subplot) and . The example shows three plots in a single column.

That you should use show() on the last line of code.

Now that we have a basic idea of plotting, let's try plotting Lissajous figures. Suppose we have two independent sinusoidal time signals x(t) and y(t). A Lissajous figure is a parametric plot in the form (x,y) = (x(t), y(t)). Such figures are useful for plotting the motion of particles under the influence of forces whose x and y components are independent.

Activity 3: Do the following

Write a program that will create two time series x(t) and y(t) over the interval 0≤t≤2π, with

Try running your code for various values of n and the phase constant. Use subplot to generate multiple windows, and save the output to a plot file.

Now we will look at a different type of plot: the histogram, which has its own command hist. Histograms are useful to show, for example, how often various outcomes occur in an experiment.

Activity 4: Play dice
Suppose you rolled a die 20 times and recorded the results in an array called roll. Copy and paste the following script into a new IDLE script window and run it to make a histogram:

from pylab import *
#results of my die rolls
roll =array([2,3,5,5,4,2,6,1,2,1,1,6,2,1,4,1,3,4,2,2])#histogram
figure(1)
hist(roll)
show()

It is showing you: five times you rolled a 1, six times you rolled a 2, twice you rolled a 3, etc.

But there is a problem with this histogram: the binning, which python chooses automatically, is lousy. Inside the hist command, you can specify the number of bins immediately after the name of the array, and further change the binning using options such as range= , align= , bins= . You can then use the xticks command to control the labels on the bins.
You can also assign different weights to each bin, using the weights= option inside the hist command. For example, this allows you to "normalize" the histogram, to show you the probability of each outcome.
Try the following script to make a nicely-binned, normalized histogram:

from pylab import *
#results of my die rolls
roll =array([2,3,5,5,4,2,6,1,2,1,1,6,2,1,4,1,3,4,2,2])#for normalization
N =len(roll)
weights_array = ones(N) / N
#histogram
figure(1)
hist(roll,6, weights=weights_array,range=[0.5,6.5], align='mid')
xticks(range(1,7))
xlabel('Roll')
ylabel('Probability of Roll')
title('Nicely binned normalized histogram')
show()

Formatted output & tuples

In the tutorial "First Steps, Part 1" we calculated the height reached by an object projected up from the surface. We worked with these lines of code:

v=12.5
g=9.8
h=v**2/(2*g)
h

You can execute these lines of code again to remind you of what they did. The last line gives 7.9719387755102034
This number is hard to read and has far too many digits after the decimal place. To control this behaviour, we use formatted strings, which in Python requires tuples. Tuples are another type of sequence, like lists, but have a different structure and purpose. Here are some lines of code that define and explore the properties of a tuple.

#define a tuple
my_tuple =(5,4,3,2,1)#print out some elements of my_tuple
my_tuple[0]
my_tuple[3]#get the length of my_tuplelen(my_tuple)

A tuple is enclosed by parentheses () instead of brackets []. The elements can be accessed in a similar way to lists. But tuples are more hardwired than lists: you cannot change the elements of a tuple. For example, the assignment statement "my_tuple[3]=-15" will give an error (try it and see).

Now that we know a bit about tuples, let's see an example of a formatted string in Python

v=12.5
g=9.8
h=v**2/(2*g)
h_cm = h*100
h_int =int(h)str="The height is %f metres or %f centimetres."%(h, h_cm)#See? There's a tuple at the end of this line!printstr

Activity 5: Run this code.
This produces The height is 7.971939 metres or 797.193878 centimetres.

In the line beginning str, the %f are indicators that a floating-point number should be printed. The tuple (h, h_cm) after the % sign indicate the values to be selected. Now if we want to control the number of significant figures, we can use the following line to replace the str = line above:

str="The height is %4.1f metres or %4.1f centimetres."%(h, h_cm)printstr

Activity 6: Replace the string and run the new code.
This produces The height is 8.0 metres or 797.2 centimetres.

The %4.1f strings indicate that each number should be at least four characters long and that at most one figure should be included after the decimal place. Other string-formatting codes, besides %f, are %s for strings, %d for integers, %g for floating point numbers in scientific notation exponential format. More information on string format codes can be found in our reference guide.

Activity 7: We'll repeat a prior example, produce cleaned up output, and plot the results:

Recall Activity 12 of "First Steps, Part 2", in which we found the following expressions for a particle falling under the influence gravity and air resistance:

We wrote the following to print out this solution:

#import numpy functions and typesfrom numpy import *
#define a time array
t = arange(0,20.5,0.5)#terminal speed
s_t =20.0#time constant
tau =5.0#define a speed array
v = -s_t*(1-exp(-t/tau))#define a position array
x = -s_t*(t+tau*(1-exp(-t/tau)))#define an acceleration array
a = -s_t*exp(-t/tau)/tau
for i inrange(len(t)):
print"t, x(t), v(t), a(t)", t[i], x[i], v[i], a[i]

Now, modify this program to print out the program in columns with answers formatted in columns and plot the results in a manner you choose.

You will often need to output results to a file and read other data from an input file. We will show you a couple of tricks to get this done (ASCII output only) without dwelling on the details.

Activity 8: Writing output to a file:
Let's keep working with the falling particle example, xva.py. The following code outputs the x(t), v(t), and a(t) to a file called xva_file.txt. Because we will use the data in xva_file.txt as input into another code in the next example, we will output the data with up to 3 digits after the decimal place (see the %8.3f format strings below).

#import pylab functions and typesfrom pylab import *
#This line opens the file xva_file.txt for output. The 'w' says that we are permitted to write to the file.
outfile =open('xva_file.txt','w')#define a time array
t = arange(0,20.5,0.5)#terminal speed
s_t =20.0#time constant
tau =5.0#define a speed array
v = -s_t*(1-exp(-t/tau))#define a position array
x = -s_t*(t+tau*(1-exp(-t/tau)))#define an acceleration array
a = -s_t*exp(-t/tau)/tau
str="%8s %8s %8s %8s"%("t","x(t)","v(t)","a(t)")#print the string to the screen. A return is automatically generated.printstr#print the string to the file, followed by a return
outfile.write(str+'\n')for i inrange(len(t)):
str="%8.3f %8.3f %8.3f %8.3f"%(t[i], x[i], v[i], a[i])#print the string to screenprintstr#print the string to the file
outfile.write(str+'\n')#for loop is done. Close the file
outfile.close()

Now do the following:

Run the code above and examine the output on the screen and in the output file. A sample output file can be found here:

Activity 9: Reading input from a file:
If you carried out Activity 7, you will have a file called xva_file.txt in your run directory. How can we read the data from that file? The file starts with a header line of text followed by rows of numbers. All entries are separated by white space. For files in this format (tab, whitespace, or comma delimited), pylab provides the very handy loadtxt routine. Now do the following:

Locate the path to the file xva_file.txt. For example, it might reside under your C:\ directory on windows.

Start a new Python shell, and at the command line, type the following: from pylab import * xva_array = loadtxt('<path to xva_file.txt>', skiprows=1)
where in the previous line, <path to xva_file.txt> is the directory path to the file you want, and skiprows=1 tells the loadtxt command to skip the first row, which contains text.

If this step worked, type the following: xva_array

This will output a lot of numbers. If you look carefully, you will see that the numbers are consistent with the output from the xva script.

What is xva_array? It is actually a two-dimensional numpy array of floating point numbers --- a matrix --- with 4 columns and 41 rows. To see this, type shape(xva_array)
This will result in (41, 4)

To understand what to do with xva_array, we will need to further explore the properties of numpy in the next section. We will return to this example in Activity 11.

2. Working with numpy

The numpy module, which is a contraction of "Numerical Python", is a very important part of doing scientific computing with Python. Until we did Activity 8 in this tutorial, we were limited to one dimensional arrays, but numpy is capable of handling multiple dimesion arrays. Numpy one dimensional arrays are lists, numpy two dimensional arrays are lists of lists, numpy 3-d arrays are lists of lists of lists, and so on. To see how this works, do the following:

Activity 10: Creating numpy arrays:

Reproduce the following interactive session, and include some additional commands of your own.

The numpy module allows you to initialize arrays full of zeros or ones.

The shape is determined by a sequence like (3,4,2) --- you can also use [3,4,2] --- and this shape can be accessed using the shape function.

The numpy multidimensional arrays are implemented as nested lists.

Activity 11: Working with numpy arrays:
The numpy module has very useful array indexing features and ways to manipulate the shapes of arrays. Try to reproduce the following session, including the error we made!

>>> array_1 #the original arrayarray([15.,26.,33.,34.,32.,30.])>>> array_1.reshape((2,3))# a reshaped version of the array into a 2 by 3 matrixarray([[15.,26.,33.],[34.,32.,30.]])>>> array_4 = array_1.reshape((2,3))>>> array_4
array([[15.,26.,33.],[34.,32.,30.]])>>> shape(array_4)#what is the new shape of array_4?(2,3)>>> array_4[0,:]#the first row of array_4array([15.,26.,33.])>>> array_4[:,1]#the second column of array_4array([26.,32.])>>> array_4[2,1]#the element at the third row and the second column of array_4. Oops!
Traceback (most recent call last):
File "<pyshell#84>", line 1,in<module>
array_4[2,1]IndexError: index (2) out of range(0<=index<2)in dimension 0>>> array_4[1,2]#the element at the third column and the second row of array_430.0

Note that for a two-dimensional array a, a[:,j] is row j+1 of the array, and a[k,:] is column k+1 of the array.

Given what we know about numpy arrays, we can now better handle pylab's loadtxt command. Here is the procedure to read in a comma or blank-separated ASCII file of values.

Look at the file and identify its overall structure.

Run the loadtxt command, skipping any header rows that contain text.

Check the shape of the resulting array, which will tell you the number of rows and columns.

Extract rows and columns as appropriate.

Here's this procedure in action Activity 12 Loading a blank-separated ascii file, part 2.
Do the following:

Write a script to read in and plot the data from xva_file.txt.

Use Activity 8 to guide you. The xva_array, whose shape is 41 by 4, has columns that correspond to t, x(t), v(t), a(t).

Now plot the data, including the following lines of code in your script: t = xva_array[:, 0] x = xva_array[:, 1] v = xva_array[:, 2] a = xva_array[:, 3]

See if you can reproduce the plots of the position, velocity and acceleration from before.

. Note that the path to the xva_array.txt file will be different for you. If the file you want to load is in the same folder as your script, then identifying the full path is unnecessary (i.e. just 'xva_array.txt' will work).

Here are some questions to practice graphing with Pylab:

Given the velocity, the position and acceleration are obtained by

These equations represent the exact solution to the problem. But we also want to calculate these quantities numerically. To do this, we need to find numerical approximations to derivatives and integrals. We will use the simplest formulas to proceed.

First, to find an approximate solution for the position, we recall from calculus that the velocity v(t) is defined by

This means that

with the approximation becoming more accurate as

gets smaller. So to find the position as a function of time numerically we start at the initial position x(0)=0 and update to the next time according to the approximate formula

and update to subsequent times according to

This is equivalent to finding the integral as a (right) Riemann sum.

Second, to find an approximate solution for the acceleration, we recall that the acceleration a(t) is defined by

This means that

So to find the acceleration as a function of time numerically we use the approximate formula

## Learning Physics with Pylab

This tutorial will provide you with the additional tools you need to use Python in our second year courses. We will explore pylab (also called matplotlib), which uses numpy and the plotting packages of matplotlib.Outline:

## 1. Getting your work done with Pylab

In your second year courses, you will be using Pylab to get work done in a variety of ways. We tested Pylab in the First Steps, Part 2 tutorial.## Plotting in Pylab

Let's start with a look at the basics of plotting. Look at the following code:This code will simply plot the elements of the list

[1, 2, 3, 4]against the elements ofthe list[1, 4, 9, 16]. The final command in all pylab plotting should always be "show()", which activates the plotting window.Activity 1: Copy and save the excerpt above to a new script file, and run it in IDLE using

F5. The result should be a window like thisNotice the row of icons at the bottom of the window. These icons allow you to explore your plots by panning the plots and changing its settings. For more information on these buttons see the interactive navigation documentation at http://matplotlib.sourceforge.net/users/navigation_toolbar.htm.

To learn more about the very powerful plot command, in the Python shell type

from pylab import*help(plot)The following activity will help you explore several useful capabilties of matplotlib.

Activity 2: Do the following:

- Copy and paste the following script into a new IDLE script window and run it. Resize the plotting windows to view them properly, if necessary.
- Close the plotting windows, then quickly read over the code, trying to get an overall understanding of the different commands.
- For guidance on the commands, type

The result will produce a session that looks like this, after rearranging and resizing the windows:from pylab import* in the Python shell and usehelp()on the commands. This is a quick way to learn about the capabilities of pylab.This code demonstrates a lot. It shows examples of:

numpyarrays, defined based on thearange()command.figure()'ro' (red circles) and'g+--'(green plus signs with dashes). For more information on plotting, typehelp(plot)and see the pyplot tutorial at http://matplotlib.sourceforge.net/users/pyplot_tutorial.htmlsavefig(). To save to a pdf, replace the.pngsuffix with.pdf.xlabel,ylabel,title.subplot(rows, cols, num), whererowsis the number of rows in the plot,colsis the number of columns in the plot, andnumis the plot number. Seehelp(subplot)and . The example shows three plots in a single column.show()on the last line of code.Now that we have a basic idea of plotting, let's try plotting

Lissajous figures. Suppose we have two independent sinusoidal time signals x(t) and y(t). A Lissajous figure is a parametric plot in the form (x,y) = (x(t), y(t)). Such figures are useful for plotting the motion of particles under the influence of forces whose x and y components are independent.Activity 3: Do the following

- Write a program that will create two time series x(t) and y(t) over the interval 0≤t≤2π, with

- Our example solution can be found here:
lissajous.py

- Details
- Download
- 1 KB

. The output from this script looks like this:

Now we will look at a different type of plot: the histogram, which has its own commandTry running your code for various values of n and the phase constant. Use subplot to generate multiple windows, and save the output to a plot file.

hist. Histograms are useful to show, for example, how often various outcomes occur in an experiment.Activity 4: Play dice

Suppose you rolled a die 20 times and recorded the results in an array called

roll. Copy and paste the following script into a new IDLE script window and run it to make a histogram:It is showing you: five times you rolled a 1, six times you rolled a 2, twice you rolled a 3, etc.

But there is a problem with this histogram: the binning, which python chooses automatically, is lousy. Inside the

histcommand, you can specify the number of bins immediately after the name of the array, and further change the binning using options such asrange=,align=,bins=. You can then use thextickscommand to control the labels on the bins.You can also assign different weights to each bin, using the

weights=option inside thehistcommand. For example, this allows you to "normalize" the histogram, to show you the probability of each outcome.Try the following script to make a nicely-binned, normalized histogram:

## Formatted output & tuples

In the tutorial "First Steps, Part 1" we calculated the height reached by an object projected up from the surface. We worked with these lines of code:You can execute these lines of code again to remind you of what they did. The last line gives

7.9719387755102034This number is hard to read and has far too many digits after the decimal place. To control this behaviour, we use formatted strings, which in Python requires

tuples. Tuples are another type of sequence, like lists, but have a different structure and purpose. Here are some lines of code that define and explore the properties of a tuple.A tuple is enclosed by parentheses

()instead of brackets[]. The elements can be accessed in a similar way to lists. But tuples are more hardwired than lists: you cannot change the elements of a tuple. For example, the assignment statement "my_tuple[3]=-15" will give an error (try it and see).Now that we know a bit about tuples, let's see an example of a formatted string in Python

Activity 5: Run this code.

This produces

The height is 7.971939 metres or 797.193878 centimetres.In the line beginning

str, the%fare indicators that a floating-point number should be printed. The tuple(h, h_cm)after the%sign indicate the values to be selected. Now if we want to control the number of significant figures, we can use the following line to replace thestr =line above:Activity 6: Replace the string and run the new code.

This produces

The height is 8.0 metres or 797.2 centimetres.The

%4.1fstrings indicate that each number should be at least four characters long and that at most one figure should be included after the decimal place. Other string-formatting codes, besides %f, are %s for strings, %d for integers, %g for floating point numbers in scientific notation exponential format. More information on string format codes can be found in our reference guide.Activity 7: We'll repeat a prior example, produce cleaned up output, and plot the results:

We wrote the following to print out this solution:

You will often need to output results to a file and read other data from an input file. We will show you a couple of tricks to get this done (ASCII output only) without dwelling on the details.

Activity 8: Writing output to a file:

Let's keep working with the falling particle example,

xva.py. The following code outputs the x(t), v(t), and a(t) to a file calledxva_file.txt. Because we will use the data in xva_file.txt as input into another code in the next example, we will output the data with up to 3 digits after the decimal place (see the %8.3f format strings below).Now do the following:

Activity 9: Reading input from a file:

If you carried out Activity 7, you will have a file called xva_file.txt in your run directory. How can we read the data from that file? The file starts with a header line of text followed by rows of numbers. All entries are separated by white space. For files in this format (tab, whitespace, or comma delimited), pylab provides the very handy

loadtxtroutine. Now do the following:pathto the file xva_file.txt. For example, it might reside under your C:\ directory on windows.from pylab import*xva_array = loadtxt('<path to xva_file.txt>', skiprows=1)where in the previous line,

<path to xva_file.txt>is the directory path to the file you want, andskiprows=1tells theloadtxtcommand to skip the first row, which contains text.xva_arrayxvascript.xva_array? It is actually a two-dimensionalnumpyarray of floating point numbers --- a matrix --- with 4 columns and 41 rows. To see this, typeshape(xva_array)This will result in

(41, 4)## 2. Working with numpy

Thenumpymodule, which is a contraction of "Numerical Python", is a very important part of doing scientific computing with Python. Until we did Activity 8 in this tutorial, we were limited to one dimensional arrays, but numpy is capable of handling multiple dimesion arrays. Numpy one dimensional arrays are lists, numpy two dimensional arrays are lists of lists, numpy 3-d arrays are lists of lists of lists, and so on. To see how this works, do the following:Activity 10: Creating numpy arrays:

- Reproduce the following interactive session, and include some additional commands of your own.

Notice the following:Activity 11: Working with numpy arrays:

The numpy module has very useful array indexing features and ways to manipulate the shapes of arrays. Try to reproduce the following session, including the error we made!

a,a[:,j]is row j+1 of the array, anda[k,:]is column k+1 of the array.Given what we know about numpy arrays, we can now better handle pylab's

loadtxtcommand. Here is the procedure to read in a comma or blank-separated ASCII file of values.loadtxtcommand, skipping any header rows that contain text.Here's this procedure in action

Activity 12 Loading a blank-separated ascii file, part 2.

Do the following:

xva_file.txt.xva_array, whose shape is 41 by 4, has columns that correspond tot,x(t), v(t), a(t).t = xva_array[:, 0]x = xva_array[:, 1]v = xva_array[:, 2]a = xva_array[:, 3]xva_array.txtfile will be different for you. If the file you want to load is in the same folder as your script, then identifying the full path is unnecessary (i.e. just 'xva_array.txt' will work).Here are some questions to practice graphing with Pylab:

OTHER MATERIAL

*

*

These equations represent the exact solution to the problem. But we also want to calculate these quantities numerically. To do this, we need to find numerical approximations to derivatives and integrals. We will use the simplest formulas to proceed.

v(t)is defined bywith the approximation becoming more accurate as

gets smaller. So to find the position as a function of time numerically we start at the initial position

x(0)=0and update to the next time according to the approximate formulaand update to subsequent times according to

This is equivalent to finding the integral as a (right) Riemann sum.

a(t)is defined bySo to find the acceleration as a function of time numerically we use the approximate formula