Method of navigating a collection of interconnected nodes6931604Abstract A method and apparatus for organizing and processing interconnected pieces of information ("nodes") using a digital computer is disclosed. Each node has elements that may be text, images, audio, video, and other computer programs. A graph-based user interface presents the individual nodes in spatial arrangements that reflect the relationships among the nodes. User interaction indicating interest in a particular node results in an increase in the "activation" of that node. This leads to an increase in the size of the presentation of that node, as well as an increase in the size of the presentation of closely related nodes. The result is a unique user interaction paradigm that allows for intuitive traversal of complex collections of nodes. Claims 1. A method for organizing and processing a plurality of nodes, comprising (a) Storing said plurality of nodes, (b) Tracking for each node a measure of interest held by the user in said node, said measure being referred to as the "activation" of said node, (c) Presenting said plurality of nodes to said user using an output device, wherein each node is allocated a fraction of the output capacity of said output device, said fraction being substantially proportional to the activation of said node, (d) Interpreting user input indicating interest in a node such that activation of said node is increased, (e) Storing with a first node a plurality of links, each link referring to a second node, and storing with each link a measure of relatedness of said first node to said second node, said measure being referred to as the "weight" of said link, (f) Presenting to said user a plurality of nodes using said output device, wherein the proximity of the presentation of a first node to the presentation of a second node is substantially proportional to the weight of the link between said first node and said second node, Description STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH
BRIEF SUMMARY OF THE INVENTION The present invention solves the stated drawbacks of the prior art in several respects. It provides a selection mechanism that establishes priorities for the nodes as the user navigates through the graph. These priorities change as the user interacts with the graph, reflecting shifts in the user's interest. It also utilizes link priorities to assist in the graph layout. These features of the system are used to create a more meaningful dynamic depiction of the graph. The specific innovations of the present invention over the prior art are as follows.
Rather than a simple selected-node mechanism, the user's actions are translated into an activation value for each node. This activation value is a numerical quantity that represents the amount of interest the user currently has in the node. In the current embodiment, the activation value is a floating point value between 0.0 and 1.0. Because each node has its own activation value, the user's interest can involve more than one node at once. In the preferred embodiment, the user does not directly specify the activation for each node. Instead, the user's actions are interpreted to either increase or decrease the activation of each node. In the absence of actions indicating interest in a node, the activation values for that node will gradually decay over time. In this way, the set of activation values for all of the nodes carries information not only about the user's interest at the current moment, but also the user's interest in the recent past.
The additional space for higher activation nodes is used to display more details about the node, including text, images, video, audio, and any other information that may be stored with the node. The user can see more details about nodes of interest directly on the graph without having to open additional windows that would obscure the graph display.
The link weights are numerical values for each link that indicate how strongly the two linked nodes are related. In the current embodiment, the links are weighted from 0.0 to 1.0. The link weights and activation values together determine which relationships among the nodes should be depicted more accurately. By using this prioritization information, the present invention is able to display and navigate non-hierarchical graphs more effectively than the prior art. The link weights serve a further purpose in navigation. As the user activates a node, the increase in activation is propagated to nodes that are linked to that node. The strength of propagation is determined by the weight of the link. In this way, the user's interest in one node induces activation of other nodes that are strongly linked to it, and the user can more efficiently traverse the network. The present invention is a graph-based user interface that improves the prior art in several respects. These include a novel node selection mechanism, a flexible means of depicting nodes to show more detail for nodes that are more salient, and a sophisticated graph layout algorithm that makes use of information both about the users present interests and the weighted relationships among the nodes. BRIEF DESCRIPTION OF DRAWINGS FIG. 1 is an example of the Cyberbolic user interface, which embodies the Rao method. FIG. 2 is an example of the ThinkMap user interface. FIG. 3 is a system diagram depicting the physical components of the present invention and the connections among those components. FIG. 4 is a class diagram depicting the major classes of the preferred embodiment of the present invention, the attributes and operations of the classes, and the associations among members of the classes. FIG. 5 is a flowchart depicting the operations comprising the CSpaceView::InitNodeViews procedure, which is responsible for creating and initializing the CNodeViews for a collection of CNodes. FIG. 6 is a flowchart depicting the operations comprising the CNodeView::OnMouseClick procedure, which is responsible for responding to a mouse click on a particular CNodeView. FIG. 7 is a flowchart depicting the operations comprising the CNodeView::OnMouseMove procedure, which is responsible for responding to a mouse move event over a particular CNodeView. FIG. 8 is a flowchart depicting the operations comprising the CNodeView::Propagate procedure, which is responsible for propagating changes in activation of a particular CNodeView to linked CNodeViews. FIG. 9 is a flowchart depicting the operations comprising the CSpaceView::LayoutNodeViews procedure, which is responsible for determining the positions of a collection of CNodeViews given the activation values for the CNodeViews and link weights for the CNodes presented by the CNodeViews. FIG. 10 is an example of the present invention's user interface. DETAILED DESCRIPTION FIG. 3 is a block diagram of a system that embodies the present invention. The processor 340 is responsible for the main computations associated with interacting and traversing the graph. The memory 330 contains an internal representation of the graph and also is used for various computations and generating the map diagram. The display 350 is responsible for displaying the generated map diagram to the user. The user input device 360 may be a mouse, trackball, keyboard, or any other suitable means for the user to interact with the system. The external storage device 320 may be a hard disk drive, CD-ROM, optical drive, or any other suitable storage device. The network 310 can be the Internet, a local network, or another means of accessing and retrieving information from another computer system. Additional devices may be added to the system, such as an audio system or a different means of displaying the generated map diagrams. The current embodiment of the present invention has been implemented on a Microsoft Windows™-compatible PC, using C++ as the programming language and the Microsoft Foundation Classes (MFC) as the application framework. MFC provides many of the base functions associated with the operation of the memory, external storage device, network, display, and user input device. The application was designed using object-oriented analysis and design (OOA/D) techniques, so the design will be presented using the preferred notation for these techniques, the Unified Modeling Language (UML). Note that any suitable operating system, programming language, and design methodology could be used to design and build this invention. FIG. 4 is a UML class diagram for the main classes of the preferred embodiment. A class diagram is the main design artifact of an object-oriented system. It shows the major classes of objects within the system, the characteristics of those objects, and the relationships among those objects. Each class has attributes, which are values stored for the objects of the class. Each class also has operations that are performed on objects of the class. Each class also has relationships that are represented in the class diagram as lines connecting to other classes. Note that each class is named starting with a "C", as in "CNode". This is consistent with the naming style used in MFC. Two groups of classes are defined: the Model classes and the View classes. The model classes 410 are the classes that represent the graph itself internally within the system memory. Three classes are used to represent the parts of the graph: the nodes (CNode 450), the links (CNodeLink 440), and the container that holds the nodes (CSpace 430). These three classes together completely represent the graph. The class CNode represents the nodes, and CNode objects contain the node's title, body text, any images, and any additional information that may be associated with the node such as an audio clip. Each CNode also contains a collection of zero or more CNodeLinks, each of which represents a link to another CNode. The CNodeLinks contain the link weight as an attribute. All of the CNodes that are interlinked are collected together into a single CSpace. The CSpace manages various operations for all of the CNodes in the CSpace. All of the nodes in a CSpace must be able to be saved and restored from a persistent storage device. This device may be an external storage device 320 connected to the computer, or it may exist across a network 310. The form of storage may be in a flat file or it may be in a database. The means by which the graph can be retrieved from a persistent store is generally understood in the art. The preferred embodiment utilizes the serialization mechanism built in to MFC in order to retrieve the model classes from a persistent store. This mechanism can be used for either external storage devices or serialization over a network. This function is implemented within the CSpace::Load function. The view classes 420 are classes that represent the model objects on the display device 350. The view classes generally also handle user input from the user input device 360. The two view classes are CSpaceView 460 and CNodeView 470. These represent the views of the entire space of nodes and the individual node, respectively. A CSpaceView contains a plurality of CNodeViews, one for each CNode in the CSpace. The CSpaceView is responsible for arranging the CNodeViews within it according to the layout procedure described below. The CNodeViews are responsible for rendering the contents of the associated CNode, using the drawing procedure described below. The CNodeView contains attributes that represent its position within the CSpaceView. This position is determined by the layout procedure described below. Each CNodeView also stores the current activation value for the node. The activation value is a value from 0.0 to 1.0, and it represents the fraction of the area of the containing CSpaceView that the CNodeView occupies. So a CNodeView with an activation of 0.1 would have an area equal to one-tenth of the total area of the CSpaceView. FIG. 5 is a flowchart showing the CSpaceView::InitNodeViews procedure, which is responsible for creating and initializing the CNodeViews after a new CSpace has been loaded from persistent storage. For each CNode, a new CNodeView is created and added to the CSpaceview 510. Then the root CNode (which is the main entry point into the graph) is located, and the associated CNodeView for the root CNode is located. This CNodeView's activation is then set to 1.0, while the activation for all other CNodeViews are set to 0.0 520. The activation value is propagated from the root CNodeView to the others 530, using the propagation procedure described below. After propagation, the CNodeView activations are normalized 540 so that they sum to a constant value (0.45 in the preferred embodiment, indicating that the total area of the CNodeViews is 0.45 of the total area of the CSpaceView that contains them). Then the CNodeViews are laid out 550 using the layout procedure described below. Finally, if the change in activation for the CNodeViews is larger than a pre-defined threshold, the process is repeated 560. This continues until the activation values converge. Once the activation values converge, the system presents the CNodeViews at the positions determined by the layout procedure, and then waits for user input. After the initial positions and activation levels for the CNodeViews have been determined, the CNodeViews must draw themselves at the proper position on the CSpaceView window. In the current embodiment, a progressive strategy is pursued where incrementally more information about the node is displayed. If the area is very small, only the title or part of the title is displayed. As the area increases, an image (if present) is depicted to the left of the title, growing in size as the area increases further. When the area is large enough to contain more than one line of text, the node's body text is displayed beneath the title (and to the right of the image, if available). The body text is displayed in a smaller font than the title text, and it receives most of the additional area as more area is given to the node. More sophisticated layout algorithms are possible. In particular, using a general markup language such as HTML makes it possible to perform custom layouts based on the size of the display. In this case, statements within the markup code are used to determine which parts of the node should be displayed, what font should be used, and where they should be displayed. This would also provide more flexibility in terms of multiple images per node, background images, etc. Two modes of user input are available. Both operate by interpreting a user action as indicating interest in a particular node. This results in the activation of the CNodeView for that node is increased. FIG. 6 is a flowchart showing the CNodeView::OnMouseClick procedure, which is the first mode of user input. This procedure is called by the system in response to the user positioning the mouse pointer over a CNodeView and clicking the mouse button. Alternatively, a different input device could be used (such as a track ball) that also has a means of positioning a pointer over a CNodeView and a means of indicating interest. When the button click occurs, the system interprets the click by locating the CNodeView that is to receive mouse clicks (the one underneath the current position of the pointer), and then calling the CNodeView::OnMouseClick procedure. The CNodeView::OnMouseClick procedure first computes the new desired activation for the CNodeView 610 using the following formula: where actold is the old activation value and actnew is the new activation value. MAX_ACT is a constant that indicates the maximum activation value any single CNodeView can have. In the preferred embodiment MAX_ACT is defined as 0.33. CLICK_SCALE is a constant that indicates how much the old activation value should be increased toward MAX_ACT for each click. For the preferred embodiment, CLICK_SCALE is defined as 0.33. The CNodeView's activation is then set to the new activation value 620. As can be seen from the formula, the result of clicking multiple times on a CNodeView is that the activation increases, approaching the MAX_ACT value. After the increase in activation, the OnMouseClick function then propagates the activation increase to linked nodes 630 using the propagate procedure described below. Finally, all of the CNodeView activations are normalized 640, layout is done 650, and the CNodeViews are drawn at their new position 660. FIG. 7 is a flowchart showing the CNodeView::OnMouseMove procedure, which is the other mode of user input. This procedure is called by the system in response to the user moving the mouse pointer over a CNodeView. Alternatively, a different input device could be used (such as a track ball) that also has a means of moving a pointer visible on the display screen. When the move event occurs, the system interprets the click by locating the CNodeView that is to receive the move event (the one underneath the current position of the pointer), and then calling the CNodeView::OnMouseMove procedure. The CNodeView::OnMouseMove procedure first computes the new desired activation for the CNodeView 710 using the following formula: where actold is the old activation value and actnew is the new activation value. MAX_ACT is a constant that indicates the maximum activation value any single CNodeView can have. It is identical to the MAX_ACT constant in the CNodeView::OnMouseClick procedure. In the preferred embodiment MAX_ACT is defined as 0.33. MOVE_SCALE is a constant that indicates how much the old activation value should be increased toward MAX_ACT for each move event. Because move events occur more frequently than click events, the MOVE_SCALE constant will generally be significantly smaller than the CLICK_SCALE constant. For the preferred embodiment, MOVE_SCALE is defined as 0.10. The CNodeView's activation is then set to the new activation value 720. As can be seen from the formula, the result of multiple move events over a CNodeView is that the activation increases, approaching the MAX_ACT value. After the increase in activation, the OnMouseMove function then propagates the activation increase to linked nodes 730 using the propagate procedure described below. Finally, all of the CNodeView activations are normalized 740, layout is done 750, and the CNodeViews are drawn at their new position 760. FIG. 8 is a flowchart showing the CNodeView::Propagate procedure. During initialization, click activation, and move activation, activation is propagated from a node to linked nodes. There is a single parameter, called scale, which controls the propagation. The scale parameter is a number in the range 0.0 to 1.0 indicating the amount of activation increase that should occur during the propagation. Propagation proceeds by traversing the list of node links from the link with largest weight to the link with smallest weight 805-835. For each node link, the desired activation for the target node is computed 820 as where weight is the weight of the link connecting the source node to the target node and actold is the targets old activation value. Then a test is performed to determine if the desired activation is greater than the current activation 825. If the desired activation is greater than the current activation, the new activation is computed 830 as where actnew is the target node's new activation and PROP_SCALE is a constant indicating how much the target node's activation should be increased toward the desired activation value. For the preferred embodiment this constant is 0.1. As propagation is performed on each node, a flag is used to indicate that the node has been propagated 840. In this way, the recursive propagation procedure will not result in an endless loop. After all of the links for the source node have been traversed for increasing the target activation values, they are traversed again in order from largest to smallest weight 845-865. This time, the entire propagation procedure is recursively called for each of the target nodes 860. FIG. 9 is a flow chart showing the CSpaceView::LayoutNodeViews procedure. This is the procedure for actually determining optimal positions for the node views, given their activation values and link weights. The layout is a numerical optimization that determines the positions of the CNodeViews based on an energy function. The energy function assumes a larger value for CNodeView positions that are not consistent with the desired mapping. So, by changing the layout positions to minimize the energy function, the system achieves an optimal layout for the node views. Three possible optimizations can be used. The Powell direction-set method is a good general-purpose optimization that can be used for any defined energy function, without requiring gradient information. If gradient information is available, then the conjugate gradient optimization may be used instead of the Powell algorithm. Also, if gradient information is available, gradient descent may be used for small perturbations to the system. The energy functions that are defined for the present invention have gradient information available, so any of the three optimizations can be used to perform the layout. The energy function is built up out of 2-dimensional Gaussian distributions. ##EQU1## One distribution, the spacing distribution Gspacing, is used to create space around each of the nodes. The σ parameter of this distribution is proportional to the size of the node. The size of the node is itself proportional to the activation value. A second Gaussian distribution, the attraction distribution Gattract, is used to pull together nodes with large link weights. This distribution's sigma parameter is also proportional to the size of the node, and its amplitude is proportional to the link weight connecting the two nodes. These two distributions are summed over all pairs of nodes to create the total energy of the current layout. This energy depends on the node activation values, the link weight values, and the current set of node positions. ##EQU2## where i and j are indices over all of the nodes, (xi,yi) is the position of node i, and wij is the weight of the link connecting node i to node j. Other objective functions may also be defined for the layout procedure. For instance, a quadratic penalty function could be employed that was at a minimum for optimal node placements, and increased quadratically as node placements deviated from the optimal placement. Linear combinations of objective functions might also be used as well, allowing distinct factors to influence the layout procedure. The optimization occurs over a state space composed of all of the positions of the nodes. So, for N nodes, the state space is a 2N-dimensional space. The current state of the system is a 2N-dimensional position vector in this space. The CSpaceView::LayoutNodeViews procedure first forms this state vector from the current positions of the CNodeViews 910-930. Then the optimization is called 940 with the energy function described above passed as the objective function and the state vector as the initial point. It optimizes the objective function and returns a new state vector, which is the state that has a minimum energy as defined by the objective function. After the optimization, the new positions of the CNodeViews are extracted from the new state vector 950-960. This results in a new layout that minimizes the energy given by the objective function. This layout must be recomputed any time the CNodeView activations or link weights are changed. Operation of the Preferred Embodiment User operation of the preferred embodiment begins with the specification of the collection of nodes to be traversed. The form of this specification depends on the application configuration: for a stand-alone application, the user may directly specify the file that contains the nodes. For a browser plug-in, browsing to a particular site would trigger launching of the plug-in and loading of the nodes. After the nodes have been loaded, the application determines the initial activations and node positions as described in the detailed description. FIG. 10 shows an example of the user interface for the present invention. The nodes are presented in the appropriate position with the correct size, based on the initial activation value. The presentation may include various elements for each node, including descriptive text, images, audio, and video. The presentation may also include executing a computer program associated with the node. The user can now begin traversing the space by mouse-clicking on a node in which they are interested. As a node is clicked, its activation is increased by a determined amount, leading to an increase in size of the node's presentation on the screen. At the same time, the increase in activation is propagated to linked nodes. The presentations of these linked nodes are increased in size, also, in an amount proportional to the weight of the links. Alternatively, the user can put the software in the move-activation state. In the move-activation state, movement of the mouse pointer over a node will move-activate the node. Propagation to linked nodes also occurs with move-activation. To activate the node even further, the user can continue to move the mouse pointer over the node in a circular or waving motion. Normalization of the node activations occurs after each click- or move-activation to ensure that the total activation for all nodes is at or below a given limit. Also, the layout algorithm is triggered after each click- or move-activation, arranging the nodes based on their new sizes so that nodes which are more related (based on the link weights) are placed in closer proximity than nodes which are less related. The user can continue activating nodes, traversing the space based on interest in the progression of nodes being presented to them. When the user is finished traversing the space, they may either stop the application or browse to a new location. This invention provides a unique and intuitive user interface for browsing collections of interconnected pieces of information. The information may be World-Wide Web sites, book summaries, scientific references, music clips, movie clips, personal interests, descriptions of merchandise, and any of a myriad of other possibilities.
|
Same subclass Same class Consider this |
||||||||||
