ONL NPR FAQ

This FAQ is a compilation of the most frequently asked questions. It is NOT a tutorial. You should still use the tutorial pages find explanations of features and concepts. You may want to use your browsers Edit => Find to search for help or use our index below. The index has been divided into these sections to make it easier to find help:

Tunnels and Connectivity	Problems with the RLI tunnel, connectivity, and ssh.
The Remote Laboratory Interface	Problems with using the basic RLI features but excluding filters, queues, and plugins.
Filters, Queues and Bandwidth	Problems with filters, and queues.
Router Plugins	Problems with using and writing router plugins.
Unix Commands	Problems with Unix commands such as source, ping, netstat, and iperf.

Selecting a link from this table will take you to the Questions section. If you find a potentially helpful question, select the Q-label and that link will take you to the question/answer in the Questions and Answer section.

Index

Tunnels and Connectivity

Q1: I got a dialogue box that contains the error message Unable to connect: couldn't get I/O for 127.0.0.1.

Q2: I have recived this warning when I added the ssh tunnel:

Warning: Permanently added the RSA host key for IP address '10.0.1.3'
to the list of known hosts.

what does it mean and whats wrong?

Q3: I can't ssh into any hosts
Q4: I was trying to make my first trial but i got IO Exception errors. I created the ssh tunnel and then launched the RLI, but it didn´t work. My computer is an iMac with OS X 10.4.
Q5: My wireless connection to ONL seems to go dead periodically. What is going on?

The Remote Laboratory Interface

Q1: Do we have to have a reservation before we define our network topology?
Q2: The RLI complains about my RLI.jar file being out of date. What does that mean?
Q3: I got an "allocation failure" message when I hit commit. What can I do about that?
Q4: I was working on defining my network topology, and I got a message about my reservation about to be canceled? Why did I get that message when I was obviously using the RLI?
Q5: All NPRs were reserved when my NPR failed to commit. I tried committing again, but then I got a message about insufficient resources. What can I do about that?

Filters, Queues and Bandwidth

Q2: Why does the Queue Table for Port 3 show 9.984 Mbps when I set the port rate to 10 Mbps?
Q4: When I send back-to-back 1500-byte packets out of a 10 Mbps port, I noticed that the interarrival times between the first two packets looked like it was for a 1 Gbps link and for a 10 Mbps link for the remaining packets. What did I do wrong?

Router Plugins

Q1: After changing delay.c, I get all of these compile time errors even though when I compile the code on a non-ONL host using g++ I don't. What am I doing wrong?
Q2: If I recompile a plugin, do I have to unload the plugin first?
Q3: "double x;" doesn't seem to work. Why?
Q4: When I use the math functions pow(), log() and ceil(), I get undefined symbols. What is wrong?
Q5: For my plugin I need to use the random() function, but when I include stdlib.h there's a conflicting type compilation error because libkern.h also declares random and malloc.h declares malloc, free and realloc.
Q9: Is there a function to compute the UDP checksum?

Unix Commands

Q1: When I do the following
```
onlusr> source /users/onl/.topology.csh
	
```
I get an unexpected end of file.
Q2: I couldn't get the touchme script to run unless I enterred ./touchme.
Q3: 'ping' hangs

Questions and Answers (Tunnels and Connectivity)

Q1: I got a dialogue box that contains the error message Unable to connect: couldn't get I/O for 127.0.0.1.
A1: It looks like you did not build your RLI tunnel. See the "Getting Started" link in the sidebar of the ONL web page. The least troublesome way to build the RLI tunnel is if you can run the ssh command from the command line:
```
ssh -L 7070:onlsrv:7070 onl.arl.wustl.edu
```
If you are using a graphical tool like PuTTY or SSH client, you will have to follow precisely the steps given in the Getting Started sidebar. The precise steps for building the RLI tunnel are given at the RLI SSH Tunneling link on that page. If you are taking a course that is using ONL, someone should be assigned to help you with this if you have problems.
Q2: I have received this warning when I added the ssh tunnel:
```
Warning: Permanently added the RSA host key for IP address '10.0.1.3'
to the list of known hosts.
```
what does it mean and whats wrong?
A2: The short answer is that there is nothing wrong.
10.0.1.3 is the IP address of the eth0 interface to the host onlusr; i.e., the ONL user host. You can see this by enterring:
```
onlusr> /sbin/ifconfig
```
while logged into onlusr and note the inet addr field in the eth0 entry. You didn't say so, but my guess is that you got this message when you tried to build an SSH tunnel from one ONL host to another. More specifically, you must have enterred the ssh command FROM onlusr to some other onl host.
Whenever you SSH to a remote host X from host Y, the IP address of host Y (the FROM host) is looked up in the file ~/.ssh/known_hosts (a plaintext file with RSA keys) at the remote host X. If it is there, then you are connected to that host. If not, then SSH will add the hostname to the file after authentication. In your case, I noticed that your ~mndd/.ssh/known_hosts contains an entry for 10.0.1.3 as its first entry ... which makes sense. This is why ...
Your onl home directory is NFS mounted on every onl host which also means that the file ~mndd/.ssh/known_hosts is accessible on every onl host. Suppose that you are on onlusr, and you enter something like:
```
onlusr> ssh onl031
```
All of your ONL hosts (given to you through File => Commit) are setup to accept the ssh connection without asking for a password. But the ssh server (daemon) running on onl031 will still do some authentication. One thing it does is look at the file ~/.ssh/known_hosts on onl031 (in your home directory) to see if the IP address of onlusr (10.0.1.3) is a host that you have allowed to login to onl031 before. The first time you do this, there is nothing in the known_hosts file. Since you are allowed to login to your onl hosts from other onl hosts, ssh adds the host to the known_hosts file.
Enter the command "man ssh" and scroll down to the section "Server authentication" for more details.
Q3: I can't ssh into any ONL hosts from my laptop.

A3:
+ You can not log into an ONL host unless you have an ONL account. This means that you must have either registered for an account through the ONL Web page or you received a predefined login name as part of a course/tutorial (and an email).
+ You can only ssh into onl.arl.wustl.edu from outside of the testbed. Once the ssh succeeds, you will end up on the host acting as the user host (currently onlusr).
+ You can only ssh into other hosts after they have been commited to you; i.e., wait for the experiment commit to finish first.
Q4: I was trying to make my first trial but I got IO Exception errors. I created the ssh tunnel and then launched the RLI, but it didn´t work. My computer is an iMac with OS X 10.4.
A4: All requests from the RLI to the testbed go through the ONL Proxy Daemon. This type of error usually means that the connection between the RLI and that Proxy Daemon either was lost or never established. Here are some possibilities:
- The Daemon died.
- Your SSH tunnel was incorrectly created.
- The SSH tunnel is OK but something at your end is causing the problem.
Q5: My wireless connection to ONL seems to go dead periodically. What is going on?

A5: The RLI has a timeout mechanism that does not work well with a wireless network. You should always use a wired network when doing ONL experiments.

Questions and Answers (The Remote Laboratory Interface)

Q1: Do we have to have a reservation before we define our network topology?

A1: No. The reservation should be for those parts where you actually need to commit (bind) actual resources. You can either do that through advanced reservations (see sidebar) or the RLI will pop up a dialogue box that allows you to do it when you commit. But if the testbed is very busy, it is best to make an advanced reservation.
Q2: The RLI complains about my RLI.jar file being out of date. What does that mean?
A2: Yes, the RLI changes every once in a while. And it does complain if the version is old enough. We usually announce new versions to those using ONL as part of a course. The procedure for getting the RLI.jar file is the same as it has always been.
- Use HTTP: Click the "Get RLI" link in the NPR section in the sidebar of the ONL Web page. [[ If the resulting file is not the one above, then perhaps you need to flush your browser cache ... this should not be necessary unless you have a long lived www connection ]]
Q3: I got an "allocation failure" message when I hit commit. What can I do about that?

A3: Normally, this should not happen. But occassionally, an NPR or host can fail to properly initialize. If the NPR initialization fails, then close the experiment (File => Close) and try again. In rare cases when there are catastrophic hardware problems, all NPRs can end up in the repair state leaving no available NPRs. This situation can not be resolved until the staff fixes the underlying problem. If a single host or link fails, you can continue to use the NPR if you don't need that particular part of the setup. An email about the failure is sent to our staff, but the NPR is not placed in the repair state.
Q4: I was working on defining my network topology, and I got a message about my reservation about to be canceled? Why did I get that message when I was obviously using the RLI?

A4: The reservation is not considered to be in use until you commit. Do not ignore the message because indeed your reservation will be canceled because all reservations left unused for the first 30 minutes of the reservation period will be canceled. Some advice:
1) Make the beginning time of the reservation for when you think you will commit; and
2) Do a File => Commit even if you are not done with the network topology.
After the first commit, we assume that you have arrived for your reservation and we will not bother you anymore until near the end of the reservation period when you will get a warning message. But the RLI will pop up a dialogue box that asks if you want to extend your reservation period. If it is possible, the reservation will be extended. Even if the reservation is not extended, you can continue to work as long as no one else makes a reservation that will require your NPR.
Q5: All NPRs were reserved when my NPR failed to commit. I tried committing again, but then I got a message about insufficient resources. What can I do about that?

A5: Nothing. Email is automatically sent to our staff, and someone will look into the problem. But since reservations are now overbooked, we have to look at the NPR, fix the problem and put it back into service before there are sufficient resources. Sometimes the problem can be quickly resolved, but it depends on the nature of the problem.

Questions and Answers (Filters, Queues and Bandwidth)

Q1: Why does the Queue Table for Port 3 show 9.984 Mbps when I set the port rate to 10 Mbps?

A1: Output port rates are controlled by a token bucket regulator that has a granularity of around 64 Kbps; i.e., all port rates are integer multiples of 64 Kbps.
Q2: When I send back-to-back 1500-byte packets through a 10 Mbps egress link, I noticed that the interarrival times between the first two packets looked like it was for a 1 Gbps link and for a 10 Mbps link for the remaining packets. What did I do wrong?

A2: The output port rate is controlled by a token bucket regulator. The current implementation has this behavior. See NPR Tutorial => Filters, Queues and Bandwidth => Setting Port Rate.

Questions and Answers (Router Plugins)

Q1: After changing delay.c, I get all of these compile time errors even though when I compile the code on a non-ONL host using g++ I don't. What am I doing wrong?

A1: The plugins have to be written in Microengine C for the IXP, not C++. All variable declarations in C have to be at the beginning of a block. They can't appear randomly through out the code as they can be in C++.
Q2: If I recompile a plugin, do I have to unload the plugin first?
A2: You should:
- Delete the plugin from the Plugin Table
- Recompile (e.g., "make clean; make ")
- Add the plugin to the Plugin Table
Q3: "double x;" doesn't seem to work. Why?

A3: The microengines don't have floating point. You will have to do it in integer and perhaps use approximations. For example, 0.01 is 1/100. It is a pain when going to smaller fractions. That's why if you look at something like Van Jacobson's RTT estimation calculation it involves powers of 2 so that it can be done using the shift operator ... i.e., x/8 is x>>3.
Q4: When I use the math functions pow(), log() and ceil(), I get undefined symbols. What is wrong?

A4: The IXP library doesn't have these functions. You will have to code them yourself. But pow() and ceil() are trivial. log() is not trivial. But I suggest you approximate log(x). Your application is probably using a limited range of x. So, use small table of log values and use linear interpolation. Or, use the first few terms of a Taylor series scaled to be integer. What a pain. I would just do a very crude approximation using linear interpolation. You can't be expected to write a real kernel version of log(x) for a 2 week project.
Q5: For my plugin I need to use the random() function, but when I include stdlib.h there's a conflicting type compilation error because libkern.h also declares random and malloc.h declares malloc, free and realloc.
A5: There is no such thing as a full stdlib available.
I describe a workaround. Yes, it uses rand() which doesn't generate very good random numbers, but who cares right now.
Here is what you do:
- Copy rand.c from the directory ~onl/stdPlugins/dropdelay-610/ to yours:
```
cp ~onl/stdPlugins/dropdelay-610/rand.c .
		
```
- Insert the code into your plugin source code.
- Compile as before.
Q6: Is there a function to compute the UDP checksum?

A6: Use onl_api_udp_cksum(). See NPR Tutorial => Summary Information => Plugin Functions and the stringSub plugin source code. But remember that all of the remaining fields in the IP and UDP headers must already have their final values; i.e., you don't want to compute the checksum and then decide to change one of the header fields.

Questions and Answers (Unix Commands)

Q1: When I do the following
```
onlusr> source /users/onl/.topology.csh
```
I get an unexpected end of file.
A1: This looks like you are trying to source a c-shell script when you are actually running the bash shell. Yes, when I enter for the user mndd:
```
ls -al ~mndd
```
I see files in your home directory like .bashrc. And when I enter:
```
ypcat passwd | grep mndd
```
I see:
```
sec:x:5261:5005:max nobody:/users/mndd:/bin/bash
```
which indicates (last field) that your shell is bash and not csh. So, you need to do this:
```
onlusr> source /users/onl/.topology
```
i.e., source the file .topology, NOT .topology.csh.
Q2: I couldn't get the touchme script to run unless I enterred ./touchme.

A2: Right. It looks like the default command search PATH for most users does not contain the current directory ("."). That means that if touchme is in the current directory, you will need to enter ./touchme in order for your shell to find the script. Also, since it is a script, check that it has execute permissions.
Q3: 'ping' hangs

A3:
+ Are you pinging from the correct host; i.e., usually not onlusr?
+ Are you pinging to the correct host?
+ Have you installed routes in both the forward and reverse directions? (The brute force method: 'Topology => Generate default routes' will generate default routes on all ports) (Note: This should not be necessary if you are using a predefined configuration file unless your instructor says that you need to define routes.)

Revised: Wed, Jan 21, 2009