DPCL (Dynamic Probe Class Library) is a C++ class library whose application programming interface (API) enables a program to dynamically insert instrumentation code patches, or "probes", into an executing program. The program that uses DPCL calls to insert probes is called the "analysis tool", while the program that accepts and runs the probes is called the "target application".
The DPCL product is an asynchronous software system designed to serve as a foundation for a variety of analysis tools that need to dynamically instrument (insert probes into and remove probes from) target applications. In addition to its API, the DPCL system consists of daemon processes that attach themselves to the target application process(es) to perform most of the actual work, and an asynchronous communication and callback facility that connects the class library to the daemon processes.
This overview describes the various parts of the DPCL system and how they work together to enable analysis tools to instrument target application processes. Before you can understand the individual parts of the DPCL system, however, you need to have a general understanding of the system as a whole. Here is a very high-level description of how an analysis tool uses the DPCL system to instrument a serial or parallel target application. Say you have an executable analysis tool that has been compiled with the DPCL header files and linked with the DPCL library. When you start execution of this program, its code does the following to instrument a target application:
A probe expression is a specific type of data structure called an "abstract syntax tree". Abstract syntax trees (a term used mainly in compiler technology) are data structures that represent instructions abstracted away from any specific syntactic representation; they are a sort of intermediary stage between source code and executable instructions. When the analysis tool calls DPCL functions to insert a probe expression into one or more target application processes, the DPCL system manipulates these abstract syntax trees into executable instructions that will run as part of the target application process(es).
While an analysis tool can create probe expressions that perform some simple logic, the programmatic capabilities of probe expressions are rather limited. For this reason, a probe expression may optionally call functions. Specifically, a probe expression can call:
There are three general ways an analysis tool can execute probes within a target application process. These three general approaches correspond to the three types of probes -- point probes, phase probes, and one-shot probes. The analysis tool can:
That is a very high-level view of how the analysis tool interacts with the DPCL system in order to instrument a target application. The rest of this overview will provide a more-detailed description of this process and the DPCL system. First, it describes dynamic instrumentation in general by answering the following questions:
Next, this overview describes the basic architecture of the DPCL system by answering the questions:
Once these questions have been answered, you will have a clearer idea of the various parts of the DPCL system, and how they are organized. This overview then concludes by explaining why it is advantageous to build analysis tools on the DPCL system.
Dynamic instrumentation refers to a specific type of software instrumentation. Software instrumentation is code that is inserted into a program to gather information regarding the program's run. As the instrumented application executes, this inserted code generates the desired information, which could include performance, trace, test coverage, diagnostic, or other data. Traditionally, software instrumentation has been inserted:
Dynamic instrumentation is distinct from these more traditional methods of software instrumentation because it can be added and removed while the application is running. The application does not need to be terminated and restarted from the beginning in order to add and remove instrumentation.
We chose to base the DPCL product on dynamic instrumentation technology, because dynamic instrumentation offers several key benefits that cannot be realized by traditional software instrumentation approaches. Specifically, a DPCL analysis tool's ability to add and remove instrumentation probes while the target application is running means that the analysis tool can perform run-time analysis. In other words, it can examine the target application's behavior without waiting for its execution to complete. This is especially useful for:
Additional advantages of dynamic instrumentation in general, as well as specific advantages of the DPCL system, are described in Why is it advantageous to build analysis tools on the DPCL system?. Before you can fully appreciate the advantages of building analysis tools on the DPCL system, you need to understand more about the DPCL system (as described next in What is the DPCL system?).
The DPCL system is an asynchronous software system whose client/server architecture enables analysis tools to connect to, and insert instrumentation probes into, one or more target application processes. What's more, the DPCL system encapsulates a parallel infrastructure, making it ideally suited for analyzing parallel programs. The DPCL system consists of several conceptual parts including:
The following figure illustrates how the parts of the DPCL system work together to enable an analysis tool to instrument a serial target application.
Figure 1. Instrumenting a serial target application
The preceding figure illustrates how the parts of the DPCL system work together to enable an analysis tool to instrument a serial target application. For additional explanation of this figure, refer to the following key:
This next figure is similar to the preceding one, except that it shows how the parts of the DPCL system work together to enable an analysis tool to instrument a parallel target application. The preceding figure's key applies to this next figure as well; refer to it for additional information. Note in this figure that only a single DPCL communication daemon runs, per user, on each server machine. If the analysis tool connects to multiple target application processes running on the same server machine, then that server machine's DPCL communication daemon will coordinate the communication with all of the processes.
Figure 2. Instrumenting a parallel target application
The remainder of this section describes each part of the DPCL system (the target application, the analysis tool, the class library, the callbacks, the daemons, and the probes) in greater detail.
A target application is an executable program into which the analysis tool inserts probes. A target application could be a serial or a parallel program. Furthermore, if the target application is a parallel program, it could follow either the Single Program Multiple Data (SPMD) or the Multiple Program Multiple Data (MPMD) model, and may be designed for either a message-passing or a shared-memory system.
An analysis tool (in the context of the DPCL product) is a C++ application that links in the DPCL library and uses the DPCL API calls to instrument (create probes and insert them into) one or more target application processes. In addition to containing code not related to accessing DPCL system functionality (such as defining the user interface), the analysis tool will also contain DPCL callback routines designed to respond to data sent back from the probes it has installed in the target application. Typically, an analysis tool is designed to measure program efficiency, confirm program correctness, or monitor program execution.
The DPCL system is designed to provide you, the creator of analysis tools, with a scalable general-purpose infrastructure for instrumenting target applications. In other words, the DPCL system concentrates on enabling your analysis tool to connect to the target application process(es), and then dynamically insert and remove probes as needed. You concentrate on creating the actual probes, and leverage the DPCL system's ability to insert them into one or more target application processes. What's more, the DPCL system encapsulates a parallel infrastructure, making it ideally suited for analyzing parallel programs. This design affords you a large degree of flexibility. For example, an analysis tool could be a complex and general-purpose tool like a debugger, or it might be a simple and specialized tool designed for only one particular program, user, or situation.
DPCL's Application Programming Interface (API) is the key means by which the analysis tool interacts with the DPCL system to effectively instrument a target application. Along with the DPCL callbacks (which enable an analysis tool to respond asynchronously to data sent from installed probes), the DPCL API is what enables the analysis tool to leverage the capabilities of the DPCL system. The DPCL API contains:
Be aware that the above is just a quick summary of some of the main classes and functional capabilities of the DPCL API. For more information about the DPCL classes, refer to Chapter 2, What are the DPCL classes?. For complete reference information on the DPCL API, refer to the DPCL Class Reference.
There are two types of DPCL API calls -- blocking calls (also referred to as pseudo-synchronous or semi-synchronous service requests) and nonblocking calls (also referred to as asynchronous service requests). Much of the functionality of the DPCL system is available in both blocking and nonblocking versions. For example, to connect to a target application, an analysis tool could call either a blocking function (bconnect) or a nonblocking function (connect). As this example implies, the naming convention for a blocking function is to prefix the letter "b" to the name of the nonblocking function.
To understand why the DPCL API includes both blocking and nonblocking versions of the same functionality, it is important to recall that the DPCL system is an asynchronous system that, by definition, acts upon events that, if they occur, will do so at an undetermined time and in an undetermined order. The nonblocking functions are designed to take advantage of the asynchronous nature of the DPCL system; calls to such functions return immediately without waiting for a response from the DPCL system. When an analysis tool calls an asynchronous, nonblocking function, it can specify the name of a callback routine that will respond to status returned by the DPCL system. The callback not only enables the analysis tool to perform status error checking, but also enables it to be structured in a more event-driven manner.

When analyzing parallel programs, certain performance benefits can usually be realized by leveraging the event-driven nature of the asynchronous functions. The Application class, for example, is a grouping of related Process class objects that enables the analysis tool code to manipulate a set of related UNIX processes as a single unit. For the asynchronous Application class functions, the callback routine that responds to the successful or unsuccessful completion of the operation will execute for each process individually. The callback routine, in addition to performing error checking to ensure that the operation was successful for the particular process, could contain code for the next action to be performed on the process. In other words, the analysis tool could continue work on one process without waiting for the operation to complete on the other processes.
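To make this event-driven flow concrete, here is a minimal sketch of a nonblocking connect across an Application, with an acknowledgment callback that runs once per process. It assumes the generic callback (GCB) parameter types and the Ais_initialize/Ais_main_loop entry points described elsewhere in this overview; treat the exact signatures as illustrative rather than definitive.

```cpp
#include <dpcl.h>   // assumed umbrella header for the DPCL API

// Acknowledgment callback: with Application::connect, this runs once per
// process, so work on one process can proceed without waiting for the rest.
void connect_cb(GCBSysType sys, GCBTagType tag, GCBObjType obj, GCBMsgType msg)
{
    // Check the status carried in the message, then issue the next
    // nonblocking request for this particular process.
}

int instrument(Application &app)
{
    Ais_initialize();                        // set up the DPCL runtime
    app.connect(connect_cb, (GCBTagType)0);  // returns immediately
    Ais_main_loop();                         // event loop; callbacks fire here
    return 0;
}
```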
The blocking functions, on the other hand, are designed to hide the complexity of the asynchronous system; calls to such functions do not return control to the analysis tool until they either succeed or fail in carrying out the requested service. This means that callbacks are not needed; instead, the code to act upon the function return value can be placed right after the call to the function. Keep in mind, however, that these so-called "blocking" functions are not truly synchronous; instead they merely mimic a synchronous system while allowing the DPCL system to continue processing events not related to the blocking request. For example, data being sent from probes could still be processed by other callback routines while execution is supposedly "blocked" and waiting for the function to return. That is why the blocking functions are referred to as "pseudo-synchronous" or "semi-synchronous" service requests. We designed the blocking functions to be pseudo-synchronous in order to provide a simple blocking interface that helps avoid program deadlock by allowing the DPCL system to continue processing other requests while "blocked".
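By contrast, a blocking connect needs no callback; the status check sits directly after the call. A minimal sketch (the Process constructor arguments, host name, and process ID shown are hypothetical):

```cpp
// Blocking-connect sketch: control does not return until the request
// succeeds or fails, so no callback is needed.
void connect_blocking()
{
    Process proc("node01", 12345);     // target process on a remote host
    AisStatus sts = proc.bconnect();   // returns only on success or failure
    if (sts.status() != ASC_success) {
        // act on the failure here, right after the call
    }
}
```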
Figure 3. Blocking function calls (pseudo-synchronous service requests). Calls to blocking functions do not return control to the analysis tool until they either succeed or fail in carrying out the requested service. The blocking functions provide a simpler interface, and so are preferable for applications that don't need to take advantage of the finer level of control available in an asynchronous system. In particular, if the analysis tool is instrumenting a serial application, the blocking functions are probably preferable.
Figure 4. Nonblocking function calls (asynchronous service requests). The nonblocking functions are designed to take advantage of the asynchronous nature of the DPCL system. Calls to nonblocking functions return immediately upon issuing the service request, without waiting for a response to the request from the DPCL system. Instead, a callback routine responds to the return value from the system. The nonblocking functions can be harder to program, but enable the analysis tool to leverage the finer level of programmatic control.
DPCL callbacks are routines called by the DPCL system when certain messages arrive from a DPCL daemon. When an analysis tool initializes itself to use the DPCL system, one of the things it does is enter the DPCL main event loop so that it can interface asynchronously with the DPCL system. The DPCL main event loop listens to file descriptors and sockets for input; there will be one socket for each remote node to which the analysis tool is connected. Remember that the communication to and from each target application process is handled by a DPCL daemon. A DPCL daemon may send two types of messages to the analysis tool. A DPCL daemon may send a message:
When the DPCL main event loop detects input on a file descriptor that is connected to a DPCL daemon, it calls a dispatch routine for the file descriptor or socket. If the input is on a file descriptor representing a socket connection to a DPCL daemon, the message is examined and the appropriate callback for the message type is executed. Since there are two types of messages that can be sent from a DPCL daemon, there are two types of callbacks -- acknowledgment callbacks and data callbacks. Acknowledgment callbacks are callbacks that respond to the success or failure of an asynchronous, nonblocking, function call. Data callbacks are callbacks that respond to probe data forwarded by the DPCL daemon.
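For concreteness, here is a minimal sketch of a data callback. The four GCB parameter types are the generic callback types used throughout the DPCL API; the comments describe their roles, and the assumption that the probe sent a single integer is purely illustrative.

```cpp
#include <cstdio>
#include <cstring>

// Generic DPCL callback shape -- every callback, acknowledgment or data,
// takes these same four parameters.
void data_cb(GCBSysType sys,   // system information (message type, size)
             GCBTagType tag,   // tag supplied when the callback was set up
             GCBObjType obj,   // the object on whose behalf the callback runs
             GCBMsgType msg)   // the message data itself
{
    // For a data callback, msg points at whatever bytes the probe sent
    // back -- here assumed to be a single integer counter value.
    int value;
    memcpy(&value, msg, sizeof(value));
    printf("probe reported %d\n", value);
}
```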
All callback routines have the same parameters. These parameters include:
There are two types of DPCL daemons -- DPCL communication daemons and DPCL superdaemons.
In order for the DPCL daemon processes to be as unobtrusive as possible, we have designed the DPCL system so that only one DPCL communication daemon per user and only one DPCL superdaemon will be running on each host machine at a time. Furthermore, we have ensured that these daemons are not persistent; they will terminate when the analysis tool issues a service request to disconnect from the target application process.
To better understand the purpose and life cycle of the two types of DPCL daemons, it is worthwhile to understand how they are created, used, and destroyed. First, an analysis tool will need to connect to the target application processes. To do this:
If the target application is a parallel application, similar connections and daemons need to be created for each host on which the target application processes are running. Once the analysis tool is connected via the DPCL communication daemon(s) to the target application process(es):
Finally, when the analysis tool is done collecting data, it will need to disconnect from the target application processes. To do this:
Since we have designed the DPCL system so that only one DPCL communication daemon per user will be running on a given host, this means that a single DPCL communication daemon may be coordinating the communication between multiple target application processes and/or multiple analysis tools. The following figure illustrates the possible connection variations that can exist for a single user on a single host machine.
Figure 5. DPCL communication daemon. Each host machine has only one DPCL communication daemon per user. It may coordinate the communication between multiple target application processes and/or multiple analysis tools. Keep in mind that this figure illustrates a single user and a single host machine. Each user connected to a DPCL target application process on the host will have a separate DPCL communication daemon running. Likewise, each user will have a separate DPCL communication daemon for each host that is running target application process(es) to which he or she is connected.
The term probe refers to the software instrumentation code patch that your analysis tool can insert into the target application. Probes are created by the analysis tool, and therefore are able to perform any work required by the tool. For example, depending on the needs of the analysis tool, probes could be inserted into the target application to collect and report performance information (such as execution time), keep track of pass counts for test coverage tools, or report or modify the contents of variables for debuggers.
Probes are created by the analysis tool using a combination of probe expressions and probe modules (described next in What is a probe expression? and What is a probe module?). For the purposes of this book, a probe is defined as "a probe expression that may optionally call functions".
A probe expression is a simple instruction or sequence of instructions that represents the executable code to be inserted into the target application. Probe expressions are abstract syntax trees -- data structures that represent the logic to be performed by the probe within the target application process(es).
The term "abstract syntax tree" is one we have borrowed from compiler technology. These data structures are called "abstract" because they are removed from the syntactic representation of the code. For example, an abstract syntax tree for the expression a + (b x c) is identical to the abstract syntax tree for the expression a + b x c (where only precedence rules force the multiplication operation to be performed first).
Figure 6. Abstract syntax tree. This abstract syntax tree is formed from either the expression a + (b * c) or the expression a + b * c.
Compilers need to create abstract syntax trees from a program's source code as an intermediary stage before manipulating and converting the data structure into executable instructions. Since the DPCL system also needs to create executable instructions (for insertion into one or more target application processes), it also needs to create these abstract syntax trees. When the analysis tool inserts a probe expression into one or more target application processes, the DPCL system uses compilation techniques to manipulate these abstract syntax trees into executable instructions that will run as part of the target application process(es).
From the DPCL programmer's point of view, the procedure for creating a probe expression can be a "building block" task in which smaller probe expressions are eventually combined and sequenced into the full probe expression.
For example, the analysis tool can create probe expressions representing constant or variable values, and then combine these into more complex probe expressions representing simple operations on the values, or function calls that pass the values as parameters to the function. The analysis tool could then take two of these more complex probe expressions and combine them into a single probe expression that represents a sequence of the two existing expressions. Then the analysis tool could join two such sequences into a longer sequence or combine them into a conditional statement. This process of combining and sequencing smaller probe expressions into larger ones would continue, depending on the complexity of the probe logic, until the analysis tool has a single probe expression representing the full probe logic.
The class for creating probe expressions is the ProbeExp class. Constructors of this class enable your code to create probe expressions that represent temporary data variables. Functions of other DPCL classes enable your code to create probe expressions that represent persistent data variables. To create probe expressions to represent operations, the ProbeExp class has overloaded common operators so that expressions written within the context of the class do not execute locally, but instead call member functions designed to create a probe expression that represents the particular operation. Probe expressions to represent arithmetic, bitwise, logical, relational, assignment, and pointer operations can all be created in this way. These probe expressions can then in turn be used as subexpressions in forming other probe expressions -- ones representing more complex operations. Other functions of the ProbeExp class (ones that must be called explicitly) enable your code to create a probe expression to represent a sequence of two existing probe expressions, a conditional statement, or a function call.
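As an illustration of this building-block style, the sketch below combines small probe expressions into a larger one. It assumes a ProbeExp named count was obtained earlier (for example, by allocating a persistent variable in the target process), and that ProbeExp has constant constructors; the member-function names follow the ProbeExp class as described here but should be checked against the DPCL Class Reference.

```cpp
// Building-block style: each step yields a ProbeExp AST node; nothing
// executes locally in the analysis tool.
ProbeExp build_probe_logic(ProbeExp count)   // count: persistent variable
{
    ProbeExp sum       = count + ProbeExp(1);   // node for "count + 1"
    ProbeExp increment = count.assign(sum);     // node for "count = count + 1"

    // Wrap the assignment in a conditional, representing
    // "if (count < 100) count = count + 1;". sequence() would similarly
    // chain two expressions one after the other.
    ProbeExp guarded = (count < ProbeExp(100)).ifelse(increment);
    return guarded;
}
```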
Although an analysis tool can create probe expressions to perform conditional control flow, integer arithmetic, and bitwise operations, the programmatic capabilities of probe expressions are rather limited. When more complicated probe logic is needed (such as iteration, recursion, and complex data structure manipulation), a probe expression can direct the target application to call a function in a probe module.
A probe module is a compiled object file containing one or more functions written in C. Once an analysis tool loads a particular probe module into a target application, a probe is able to call any of the functions contained in the module.
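On the C++ side, this might look like the following sketch: load a compiled module into a connected process, get a reference to one of its functions, and build a call expression. The module path, function index, and argument are hypothetical, and the call forms should be checked against the class reference.

```cpp
// Load a compiled probe module and build a call into it.
void load_and_call(Process &proc)
{
    ProbeModule mod("./counters.o");           // object file of C functions
    AisStatus sts = proc.bload_module(&mod);   // blocking load request
    if (sts.status() == ASC_success) {
        ProbeExp func = mod.get_reference(0);  // first function in the module
        ProbeExp arg  = ProbeExp(1);
        ProbeExp call_probe = func.call(1, &arg);  // AST node for "func(1)"
        // call_probe can now be installed or executed like any other probe.
    }
}
```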
As already stated, a probe is a probe expression that may optionally call functions. There are three types of probes; they are differentiated by the manner in which their execution is triggered. The three types of probes are:
Each probe type has different intended uses, and together the three types are designed to enable an analysis tool to efficiently instrument a target application. By "efficiently instrument", we mean "to collect the necessary data and display it in a timely manner while minimizing the instrumentation's intrusion cost to the target application".
Point probes are probes that the analysis tool places at particular locations within one or more target application processes. When placed in an activated state by the analysis tool, a point probe will run as part of a target application process whenever execution reaches its installed location in the code. The fact that point probes are associated with particular locations within the target application code makes them markedly different from the other two types of probes (which are executed at a particular time regardless of what code the target application is executing).
To install a point probe within one or more target application processes, the analysis tool must navigate the source code structure of the target application to identify locations where it can safely install point probes. The analysis tool navigates the source code structure by means of source objects (represented by instances of the SourceObj class); the locations where point probes can be installed are called instrumentation points (represented by instances of the InstPoint class).
Source objects provide a coarse, source-code-level, view of a target application process, and enable an analysis tool to display or navigate a hierarchical representation of a particular target application process. After connecting to a process, the analysis tool can get the top-level source object (called the "program object") for the process; the analysis tool does this by calling the member function Process::get_program_object. This function returns the top-level source object (an instance of the DPCL class SourceObj). Since applications can be quite large, the initial source object provides only a very coarse view of the source structure; essentially, it is just a list of the modules (compilation units) contained in the target application process. Each of these modules is itself a source object and is considered a child of the program source object.
To navigate down into the source structure of a module, an analysis tool gets a reference to one of these module source objects (using the member function SourceObj::child) and expands it (using SourceObj::expand or its blocking equivalent SourceObj::bexpand). Expanding a module source object reveals the additional structure of the module -- including data, functions, and instrumentation points.
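Putting these calls together, here is a sketch of the navigation flow, using the member functions named above (error handling omitted; treat exact signatures as illustrative):

```cpp
// Walk the module list of a connected process and expand each module.
void expand_all_modules(Process &proc)
{
    SourceObj prog = proc.get_program_object();   // top-level program object
    for (int i = 0; i < prog.child_count(); ++i) {
        SourceObj module = prog.child(i);         // one compilation unit
        AisStatus sts = module.bexpand(proc);     // blocking expand request
        // On success, the module's data, functions, and instrumentation
        // points are now visible as children of this source object.
    }
}
```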
It is important to keep in mind that the program object and all its child module objects reflect the source hierarchy associated with a particular process only. This means that, in some cases, the analysis tool will need to navigate multiple source hierarchies (as described in the following table).
If the target application is: | Then:
---|---
A serial program. | The analysis tool need only navigate the target application's single source hierarchy.
A parallel program that follows the Single Program Multiple Data (SPMD) model. | Each process in the target application has the same source and, therefore, the same source hierarchy. The analysis tool need only navigate a single source hierarchy. It still has the option to either insert identical instrumentation in each of the processes, or else instrument the processes differently.
A parallel program that follows the Multiple Program Multiple Data (MPMD) model. | There are multiple programs and, therefore, the analysis tool will need to navigate multiple source hierarchies.
Instrumentation points are locations within a target application process where an analysis tool can install point probes. Instrumentation points are locations that the DPCL system determines are safe to insert new code. Such locations are:
Instrumentation points are obtained from source objects, at the function level, using the SourceObj::exclusive_point or SourceObj::inclusive_point functions. Both functions take an integer index value as an input value and return an instrumentation point as a result. The difference between the two is that the SourceObj::exclusive_point function gives the analysis tool access only to instrumentation points that are tied to that particular source object in the source object hierarchy, while the SourceObj::inclusive_point function gives the analysis tool access to all instrumentation points associated with the given source object and all of its lower level source objects in the source object hierarchy.
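For example, the sketch below walks the inclusive points of a function-level source object and installs and activates a point probe at each one. It reuses the hypothetical probe_exp and data_cb from the earlier sketches, and the install/activate calls are shown in their blocking forms with signatures that should be checked against the class reference.

```cpp
// Install and activate a point probe at every instrumentation point in
// (and below) a function-level source object.
void install_everywhere(Process &proc, SourceObj func, ProbeExp probe_exp)
{
    for (int i = 0; i < func.inclusive_point_count(); ++i) {
        InstPoint point = func.inclusive_point(i);
        ProbeHandle handle;
        GCBFuncType cb  = data_cb;        // responds to data the probe sends
        GCBTagType  tag = (GCBTagType)i;
        AisStatus sts = proc.binstall_probe(1, &probe_exp, &point,
                                            &cb, &tag, &handle);
        if (sts.status() == ASC_success)
            proc.bactivate_probe(1, &handle);  // probe now runs at this point
    }
}
```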
An analysis tool should use a point probe when it needs to collect data associated with a particular location in the target application's code.
To better understand when it is useful, and when it is not useful, to use a point probe, consider the following hypothetical situation. Say your analysis tool is a profiler that needs to measure the accumulation of floating point counts by function. Say also that this analysis tool is a Java(TM) client that needs to periodically refresh itself to display the newly-collected data to the user. Since the functions are specific locations in the target application code, you would need to use point probes to measure them effectively. In this particular example, you would set up two point probes for each function -- one at the beginning of the function and one at the end. Each time the function starts executing, the first probe would record the number of floating point instructions executed up to that point. Later, the second probe would again determine the number of floating point instructions executed; subtracting the first figure from the second yields the number of floating point instructions executed within the function.
So in this example, you would use a set of point probes to collect the data. Keep in mind, however, that it is not enough to simply accumulate collected data within the target application process(es) as we have in this example. Remember that we also need to communicate this information back to the analysis tool. In this case, we have said that our hypothetical application needs to periodically refresh its Java client to display the newly-collected data. We could have the point probes themselves send their collected data back to the analysis tool, but this would not be an efficient solution. You would not, in this example, want to send data back to the analysis tool using the point probes because such probes located in frequently-executed functions could swamp the network with messages and, in doing so, take a valuable resource away from the target application. This solution would be unacceptable, as it would likely slow the target application appreciably; the instrumented version of the target application would no longer be representative of the actual, uninstrumented, version of the target application. While point probes are useful for collecting the data in this example, the actual communication of that data back to the analysis tool would be better handled using phase probes (as described in What is a phase probe?) or one-shot probes (as described in What is a one-shot probe?).
Phase probes are probes that are executed periodically, upon expiration of a timer, regardless of what part of the target application's code is executing. A phase probe, unlike the other two types of probes, must call a probe module function; it cannot consist of a simple probe expression alone. The control mechanism for invoking these time-initiated phase probes is called a phase.
Phases are the control mechanism for invoking phase probes at set intervals. Represented by instances of the Phase class, phases enable your analysis tool code to specify the particular phase probe(s) to be invoked and the CPU-time interval at which their execution is triggered. The set interval at which a phase is activated to invoke its phase probes is called the phase period. Although the phase period is initially defined when the analysis tool first creates the phase, the analysis tool can later lengthen or shorten the phase period as desired.
A phase can, each time the phase period expires, call up to three phase probes. As already stated, a phase probe must call a probe module function, so the phase is actually triggering calls to up to three probe module functions -- a begin function, a data function, and an end function. While the phase must, in order to be useful, call at least one of these functions, any one of them is optional. At the very least, an analysis tool will usually supply a data function.
When a phase is added to a target application process, it will, once the phase period expires, be activated by the DPCL system. (The DPCL system uses a SIGPROF signal to activate a phase, so be aware that target applications that themselves use the SIGPROF signal cannot be instrumented with phases.) Once the phase is activated, it will call the phase probe module functions that have been associated with it. The first phase probe it calls is the one identifying the begin function (provided one has been specified). Typically, the begin function will perform any setup tasks that may be required. When the begin function completes, the phase calls the phase probe that identifies the data function (provided one has been specified). The data function executes once per datum that the analysis tool will have previously allocated and associated with this phase. Executing once per datum enables the data function to perform the same actions on different data. Each datum, for example, could be a separate counter -- each incremented by the same data function. If the analysis tool does not associate any data with the phase, then the data function will not execute. When the data function finishes executing for the last datum, the phase calls the phase probe that identifies the end function (provided one has been specified). Typically, the end function performs any clean up chores that may be required.
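In code, setting up a phase might look like the sketch below. The Phase constructor form and the add call are assumptions for illustration only; begin_fn, data_fn, and end_fn are ProbeExp references to probe module functions (for example, obtained via ProbeModule::get_reference).

```cpp
// Create a phase that fires every 10 CPU-seconds of the target process,
// triggering the begin, data, and end probe module functions in turn.
void add_sampling_phase(Process &proc, ProbeExp begin_fn,
                        ProbeExp data_fn, ProbeExp end_fn)
{
    Phase phase(10.0, begin_fn, data_fn, end_fn);  // constructor form assumed
    AisStatus sts = proc.badd_phase(phase);        // blocking add (name assumed)
    // The phase period can later be lengthened or shortened if the
    // intrusion cost of the phase probes proves too high.
}
```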
An analysis tool should use phases and phase probes whenever necessary work is best done on a periodic basis. For example, in When should an analysis tool use a point probe?, we described a hypothetical situation in which an analysis tool needed to measure the accumulation of floating point counts by function. While we determined that point probes placed inside functions were the best way to collect the data, we also determined that they were an impractical way to send that data back to the analysis tool, because such probes located within frequently-executed functions would consume too much of the available network communication resource and so slow the target application unacceptably.
In this example, a phase that triggers one or more phase probes at a set interval would be an ideal way to communicate the data collected by the point probes back to the analysis tool. By using a phase, you would be able to tune how often updates are sent back to the analysis tool. While you cannot control how often the point probes are executed to gather the data, you can use a phase to govern how often a phase probe is triggered to send the collected data back to the analysis tool. What's more, since the analysis tool can modify the phase period to trigger execution of the phase probe more frequently or less frequently, the analysis tool could dynamically govern how often the data is sent. For example, our hypothetical analysis tool could also monitor network traffic or the target application's performance to determine if the intrusion cost of the data being sent from the phase probes is too great. If so, the analysis tool could modify the phase period so that the data is sent less often.
A one-shot probe is a type of probe that is executed by the DPCL system immediately upon request, regardless of what the application happens to be doing.
An analysis tool should use a one-shot probe whenever it wants to explicitly and immediately execute code within the target application process on a one-time basis. Most commonly, analysis tools would use one-shot probes to:
For example, in When should an analysis tool use a point probe?, we introduced a hypothetical analysis tool that, in order to measure the accumulation of floating point counts by function, installed a set of point probes to collect this data. We continued this same example in When should an analysis tool use phases to invoke phase probes? by using a phase probe to minimize network traffic by only periodically sending the data back to the analysis tool to be displayed to the operator. Suppose now that, as the creator of this analysis tool, you wanted to add a "Refresh" button to the tool's graphical user interface so that the operator could force the tool to update itself with the most current information. To do this, the analysis tool could, whenever the operator clicks on the "Refresh" button, execute a one-shot probe to send the most recently collected data back to the analysis tool for display.
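A sketch of such a Refresh handler, using the blocking one-shot execution call (signature illustrative; send_data_probe is a hypothetical probe expression, such as a call into a probe module function, and data_cb is the data callback from the earlier sketch):

```cpp
// Executed when the user clicks "Refresh": run the probe in the target
// process immediately, regardless of what code it happens to be executing.
void on_refresh(Process &proc, ProbeExp send_data_probe)
{
    AisStatus sts = proc.bexecute(send_data_probe, data_cb, (GCBTagType)0);
    // data_cb receives the freshly sent data and updates the display.
}
```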
Our original motivation for creating the DPCL system came from the observation that customers were often asking for more application performance analysis tools than tool suppliers had the resources to build. High performance application developers were asking for tools that would provide detailed, accurate information about I/O usage, cache (and other memory) usage, CPU and functional unit usage, message passing and synchronization, and operating system effects. Furthermore, they were asking for application profiles to identify problems, and event traces to determine the root causes of problems.
However, while programming tools were becoming more expensive to build and maintain, available tool development resources were shrinking rapidly. More tools were needed, but fewer tools could be created. So, in creating the DPCL system, our goals were to:
Not only would you have to create an application from scratch to do all that, but, since analysis tools must be very careful not to adversely affect the target application's performance, you must manage to do all these things in such a way that the interference, or "intrusion cost", to the target application is minimal. This is essential, because if the intrusion cost is too great, then the data you're collecting from executing the instrumented version of the target application is no longer representative of the actual, uninstrumented program.
By building your analysis tool on top of the DPCL system, however, you are able to easily leverage its capabilities and thus can spare yourself the burdensome programming chores outlined above. What's more, by saving you the time and effort normally associated with developing analysis tools, the DPCL system effectively reduces the cost of developing new tools.
This ability to make and change data collection decisions during execution is unique to dynamic instrumentation. All other methods of instrumentation require you to make data collection decisions before running the program, and often before compiling or linking the program. Such restrictions often result in one choosing to gather more data than is actually needed, thus increasing the intrusion cost of the instrumentation.