Laboratory 6: Using the GISolve Science Gateway to the TeraGrid | 22S:166, Lab Reports of Statistics

Material Type: Lab; Professor: Cowles; Class: 22S - Computing in Statistics; Subject: Statistics and Actuarial Science; University: University of Iowa; Term: Fall 2007;

Uploaded on 03/10/2009

Computing in Statistics, 22S:166
Fall 2007, Lab 6
Using the GISolve Science Gateway to the TeraGrid
Dec. 4 and 5, 2007
1 Downloading example data and configuration files
Under Handouts on the course web page, there are three different sets of files for use with
GISolve. We will have different groups of students run GISolve on different files. Please
listen for which set your group should download.
You will need to use Firefox (not Internet Explorer) for running GISolve during this lab, so
you might as well use Firefox for these beginning steps, too.
2 Checking the resource reservation status
The following web site shows which TeraGrid resources are available to GISolve users at
what times.
https://www.cigi.uiuc.edu/doku.php/projects/gisolve/tg-resources
Note which sites are available and how many CPUs each has. With multiple groups about to
run jobs, we want to avoid starting jobs that compete for the same CPUs, as this slows
everything down tremendously.
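Before jobs are submitted, the planned requests can be checked against the CPUs listed on the reservation page. The following is a minimal sketch, not part of GISolve: the site names, CPU counts, and group names are made-up placeholders, and the function name is hypothetical.

```python
from collections import defaultdict


def oversubscribed_sites(available, requests):
    """Return {site: (requested, available)} for every site where the
    groups together ask for more CPUs than are available."""
    totals = defaultdict(int)
    for _group, site, cpus in requests:
        totals[site] += cpus
    return {
        site: (total, available.get(site, 0))
        for site, total in totals.items()
        if total > available.get(site, 0)
    }


# Placeholder numbers for illustration only; use the values from the
# reservation status page and from your group's agreed plan.
available = {"site_a": 64, "site_b": 32}
requests = [
    ("group1", "site_a", 32),
    ("group2", "site_a", 48),
    ("group3", "site_b", 16),
]

print(oversubscribed_sites(available, requests))
```

An empty result means that no site is asked for more CPUs than it has, so no two groups need to share the same CPUs.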
3 Logging into GISolve
Go to www.gisolve.org.
Log in using your email address and password. Select the Bayesian Geostatistical Modeling
tab. Examine the screen, but don’t enter anything yet.
We will refer to the plot of speed ups (slide 37 on the TeraGrid talk handout) to determine
how many CPUs should be used for each of the three dataset sizes. We will then decide
how to allocate the available resources among the groups.
• number of CPUs is the total number to be divided among all the chains you are running
  at the site
• make the number of CPUs per chain a perfect square to use PLAPACK efficiently
  - how big a perfect square is determined by the size of the dataset (see the plot of
    speedups, slide 37 of the TeraGrid talk handout)
• run more than one chain on a site if enough CPUs are available
  - helps in assessing convergence
  - generates more samples per unit time if CPUs are available
  - samples from different chains are independent
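The perfect-square rule can be worked through with a little arithmetic. The sketch below, under stated assumptions, takes the total number of CPUs requested at a site and the number of chains, and returns the largest perfect square each chain could receive. The function names are hypothetical and nothing here is part of GISolve. It gives only an upper bound: whether that many CPUs per chain is worthwhile still depends on the dataset size and the speedup plot.

```python
import math


def cpus_per_chain(total_cpus, n_chains):
    """Largest perfect-square number of CPUs that each chain can get
    when total_cpus is split evenly among n_chains."""
    if total_cpus < 1 or n_chains < 1:
        raise ValueError("total_cpus and n_chains must be positive")
    share = total_cpus // n_chains
    if share < 1:
        raise ValueError("more chains than CPUs")
    side = math.isqrt(share)  # integer square root (Python 3.8+)
    return side * side


def plan(total_cpus, n_chains):
    """Return (CPUs per chain, CPUs used, CPUs left idle)."""
    per_chain = cpus_per_chain(total_cpus, n_chains)
    used = per_chain * n_chains
    return per_chain, used, total_cpus - used


print(plan(64, 3))  # (16, 48, 16): 3 chains of 16 CPUs, 16 CPUs idle
print(plan(32, 2))  # (16, 32, 0): 2 chains of 16 CPUs, none idle
```

If many CPUs are left idle, it may be better to request fewer CPUs or to run a different number of chains.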
Please fill in below what is agreed on for your group.
Resource _____________ No of CPUs _______________ No of chains _____________
Resource _____________ No of CPUs _______________ No of chains _____________
4 Uploading data files and configuration files
Edit your configuration file to make sure that it has the right number of lines at the end for
the number of chains that you are going to run. If you are using more than one resource
and want a different number of chains on different resources, you need to prepare more than
one configuration file.
Upload your data file and configuration file, using the “Browse” capability provided.
5 Running the job
• specify maximum wall clock time
  - must be long enough for the number of requested iterations to finish
  - must not run past the end of the reserved time on resource
• submit job
• click “Visualize output” periodically to view plots of accumulating samples
• download zip files of plots and numeric output
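The two constraints on the maximum wall clock time can be checked with a short calculation. The sketch below is illustrative only: the time per iteration, the safety factor, and the start and end times are assumptions, not values from GISolve or from the reservation page. Time one short run, or ask the instructor, to get a realistic time per iteration.

```python
import math
from datetime import datetime, timedelta


def wall_clock_minutes(n_iterations, sec_per_iteration, start,
                       reservation_end, safety_factor=1.25):
    """Return a wall clock limit in whole minutes that leaves some slack
    for the requested iterations, or None if the run cannot finish
    before the reservation ends."""
    needed = timedelta(seconds=n_iterations * sec_per_iteration * safety_factor)
    if start + needed > reservation_end:
        return None
    return math.ceil(needed.total_seconds() / 60)


# Hypothetical example: 10,000 iterations at an assumed 0.5 s each,
# starting at 13:00 on a reservation that ends at 15:00.
start = datetime(2007, 12, 4, 13, 0)
end = datetime(2007, 12, 4, 15, 0)

print(wall_clock_minutes(10_000, 0.5, start, end))  # 105
print(wall_clock_minutes(10_000, 0.5, start,
                         datetime(2007, 12, 4, 14, 30)))  # None
```

A result of None means the job does not fit in the reserved window: reduce the number of iterations or use a different resource.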