Manipulating data structure (e.g., compression, compaction, compilation)

Method and apparatus for performing data collection, interpretation and analysis, in an information platform

6078924

Abstract

An information platform automates the collection of data, provides a method for organizing the library of information and provides analysis using multiple content-types, thereby providing a user with a market understanding necessary to execute rapid and knowledgeable decision making. The information platform collects and integrates data, observations and intelligence; provides controls for multiple methods of information navigation and analysis; and allows details to be digested in the context of other data, regardless of its type. The information platform is a client/server implementation that is subdivided into four major sections, including: (1) Data Retrieval, which provides a sophisticated catalog for finding internal and external information and collection agents which retrieve specific information without user intervention; (2) Data Classification and Storage which handles the storage of the information once it has been gathered from a source; (3) Information Browsing, Query, Analysis, and Report Creation which provides information browsing, reporting, and analysis tools; and (4) Desktop Integration where the information platform takes information from a wide variety of formats (HTML, text, spreadsheet) and combines them all into a single format (HTML, text, spreadsheet).


Claims

What is claimed is:

1. An information platform for performing data collection, interpretation and analysis, comprising:

a data retrieval module comprising:

a catalog including a data store for collecting internal and external information from relevant sources;

a geometry recognition module for analyzing multiple sources and recognizing particular patterns within each source; and

a page analyzer for scanning a source document, breaking said source document into blocks and sub-blocks of information, and returning granular pieces for aggregation in said data store;

a data classification and storage module;

an information browsing, query, analysis, and report creation module said information browsing, query, analysis, and report creation module comprising:

a classification subsystem for classifying data according to a specific language, wherein said classification allows said data to be archived and tracked in an object store, and wherein said classification allows said object store to manage complex relationships between a plurality of items whereby, once classified, an item is associated to several other data types by any of one or more characteristics; and

a desktop integration module.

2. The information platform of claim 1, wherein said classification subsystem uses object-oriented meta data technology.

3. The information platform of claim 1, wherein said information browsing, query, analysis, and report creation module comprises:

visualization controls that include any of a time-span slider bar to allow for comparisons of data over time; a population quadrant to compare data frequency across multiple axis; and a cross tabular chart to analyze information.

4. An information platform for performing data collection, interpretation and analysis, comprising:

a data retrieval module comprising:

a catalog including a data store for collecting internal and external information from relevant sources; and

a parsing engine for interpreting the format of a stream of information and then returning requested elements to a user by reading a source document and determining said source document page geometry, wherein said parsing engine locates an element by finding a specific string or pattern within a source document, where regular expressions are character strings in which plain text indicates that that text must exist in a target string, and special characters are used to indicate what variability is allowed in said target strings, and wherein said parsing engine performs any of:

a simple content match which looks for a sub-string on regular expression in said source document and returns a primary document element containing the match;

a bounded content match in which the search scope is limited to one contiguous part of said source document; and

a simple/bounded content match in element type which is a bounded or document wide search that look for a sub-string within a specific element type; and

a data classification and storage module; and

an information browsing, query, analysis, and report creation module.

5. The information platform of claim 4, wherein said parsing engine subdivides a source document into elements at include any of a section, a sub-section, a paragraph including any of a sentence, phrase, and word; a table including any of a title, column, row, column header, spanned column header, header, row header, spanned row header, and cell; an image, a link, form, and a line, optionally within a range.

6. The information platform of claim 4, wherein said parsing engine identifies an element by either of its ordinal position and by searching for specific content and pattern matching.

7. The information platform of claim 4, wherein said parsing engine parses one contiguous span of information in a source document that is identified by specifying beginning and ending elements, wherein said specified elements can be defined using any coordinate specification techniques.

8. The information platform of claim 7, wherein said parsing engine further comprises:

a parsing subsystem is implemented as an object with interfaces that support any of the following operations:

Set source stream;

Set/get operation bounds (passing begin and ending element coordinates or ids);

Get source stream in specific format (pass desired format type);

Get document tree structure (passing handle to a window tree control);

Get element stream (passing element coordinates, or ID);

Get element ID (passing element coordinates);

Get element count (passing element type);

Get first element stream (passing pattern based element coordinates, optional element type);

Get next element stream;

Get last element stream;

Get table size (passing element coordinates or ID; and

Get header coordinates (passing table coordinates/ID plus col/row of header).

9. The information platform of claim 8, wherein said parsing subsystem further comprises any of the following parsing classes:

CBlock which parses high level blocks with methods to access different kind of information about them and methods to create a fully structured document;

CHtmlParse which has a container of virtual and non-virtual HTML tags and methods to access different kind of structured information about them and where an input stream which is either structured, quasi-structured, or non-structured HTML or Text document is used to create a container of tags and provide methods to access information and to create a new HTML structured document;

CTextParse which takes as an input a segment of a stream document which has no structured information and creates a container of virtual tags and structured information about said stream segment where said container has any of tables, rows, columns, table cells, table headers, paragraphs, section, and subsection headers, and page numbers and where hyper-text links may be added;

CParag which takes a segment of unstructured text that ends with a blank line or end of document and converts it to HTML by deciding if it should be a table, section header, subsection header, paragraph, or page number;

CLine which takes one line of unstructured text and creates a list of phrases in it with information about the phrases and which also provides a line calculator to be able to merge two lines together if needed, or compare lines together to see if they line up in a table, to decide if the line is a header row;

CPhrase which takes one phrase of unstructured text and provides information about it and where it starts and ends in terms of characters and column cells and which also provides a phrase calculator to merge phrases, compare phrases;

CTag which creates an HTML tag structure with information about the attributes of a tag and which is used to create HTML code from the container of the blocks; and

CHtmlTags which provides the latest syntax of the HTML language with its attributes and which is used in the parsing of HTML documents and in the creation of structured HTML streams.

10. A method for performing data collection, interpretation and analysis, in an information platform, said method comprising the steps of:

finding tags starting and ending points in a source document;

identifying tags that have structural information;

providing a text parser for a text segment that has raw text or text with tags that have no structural information;

dividing said segment into paragraphs with said text parser;

parsing each paragraph using a paragraph parser, wherein said paragraph parser studies the lines of a paragraph using a line parser and decides if said paragraph is a regular text paragraph, header, page number, or table, wherein said line parser divides a line into phrases and calls a phrase parser to get information about each phrase, wherein said paragraph parser then uses line and phrase calculators to identify the structure of said paragraph; wherein if said paragraph is a table, said paragraph parser creates a table; wherein said paragraph parser generates virtual tags and returns a list of such tags to said text parser;

combining all of said virtual tags from all of said parsed paragraphs with said text parser and then passing said virtual tags back to an HTML parser;

creating one container for all the virtual and non-virtual tags with said HTML parser and enumerating all the tags properly in said source document; and

creating a list of high level blocks from said virtual and non-virtual tags;

wherein a user can retrieve a generated, fully structured document of said source document; and wherein said user can retrieve information about any block in said source document using regular expressions.

11. An apparatus for performing data collection, interpretation and analysis, in an information platform, comprising:

a parse engine for finding tags starting and ending points in a source document and for identifying tags that have structural information;

a text parser for a text segment that has raw text or text with tags that have no structural information; said text parser dividing said segment into paragraphs; and

a paragraph parser for parsing each paragraph;

wherein said paragraph parser studies the lines of a paragraph using a line parser and decides if said paragraph is a regular text paragraph, header, page number, or table, wherein said line parser divides a line into phrases;

wherein said paragraph parser calls a phrase parser to get information about each phrase;

wherein said paragraph parser then uses line and phrase calculators to identify the structure of said paragraph;

wherein if said paragraph is a table, said paragraph parser creates a table;

wherein said paragraph parser generates virtual tags and returns a list of such tags to said text parser;

wherein said text parser combines all of said virtual tags from all of said parsed paragraphs and then passes said virtual tags back to an HTML parser;

wherein said HTML parser creates one container for all the virtual and non-virtual tags, enumerates all the tags properly in said source document, and creates a list of high level blocks from said virtual and non-virtual tags;

wherein a user can retrieve a generated, fully structured document of said source document; and wherein said user can retrieve information about any block in said source document using regular expressions.


Description

BACKGROUND OF THE INVENTION

1. Technical Field

The invention relates to information management and navigation. More particularly, the invention relates to an information platform that collects and integrates data, observations, and intelligence; provides controls for multiple methods of information navigation and analysis; and allows details to be digested in the context of other data, regardless of its type.

2. Description of the Prior Art

Regardless of their specific job title, knowledge workers in the high-technology and financial service industries perform research and analysis function every day. Their work requires navigation between different types of information, from stock quotes, market capital, product comparisons, and industry commentary to internal sales figures and quotas. The decision making process for such knowledge workers is compounded by the fact that vital market information exists in widely dispersed locations and in many varied formats. Furthermore, when the research is complete, there is a high probability that a significant piece of data was overlooked.

The following examples illustrate these issues:

A research analyst from a large investment bank must make an investment recommendation for a specific company. The analyst must determine how the company's stock might perform in relation to the market, as well as its competitors. The analyst must determine the industry perception of the company and its products, in conjunction with the possibilities for future product launches. The analyst must also understand what other financial analysts are forecasting for this company's future stock, and the markets overall trend.

A VP of marketing must decide what new features to implement in an established software product in light of competitive threats and a raft of emerging technologies. The VP must determine the relevance of the emerging technologies, assess competitive activities, document consumer response to similar product initiatives, and review analyst predictions. The new plan must determine a prior plan's effectiveness by comparing internal initiatives with media coverage during the following months.

An executive team wants to compare the company's productivity with that of the competition. The team must research revenues and expenses within specific departments, identify sales per employee, and unit costs and compare them with the top five competitors.

Like a maze with doors and hallways, analysis may require a return trip backward through historical information with each new finding or insight. The traversing of isolated information, from paper reports, to internal databases to external Internet news sources and back to a Microsoft Excel spreadsheet is arduous and requires significant time investment.

H. Pajak, Electronic Library, European Patent Application No. 472070 (filed Aug. 9, 1991) discloses an electronic library metaphor on a computer workstation that includes shared books with data base metaphor, a reference books metaphor, and a card catalog metaphor in one system that allows large object oriented data base be organized and accessed in an exclusive environment and that allows access to screen icons, creates a visual hierarchy of related and shared objects, and allows mutually exclusive access to the metaphors within the library.

S -Y Hsu, Process For Identifying Simple and Complex Objects From Fused Images And Map Data, U.S. Pat. No. 5,631,970 (May 20, 1997) discloses a method of identifying and/or extracting an object from multiple fused information sources, such as amps and images. The systems allows a user to integrate information freely from multiple sources, such as maps, socioeconomic data, and various types of images. Data is first retrieved and then processed to transform its pixel representation to a region representation. The region representation is then refined by merging mechanisms, such as segmentation. The identified and/or extracted object can then be visualized by the user. Features or objects can then be extracted using the user's expert rules and confidence levels, which confidence levels may be derived by fuzzy logic.

An Information Retrieval System, European Patent Application No. 774722 (filed Nov. 15, 1996) provides a system in which design and content are separated. In this system, an information retrieval server (IR) indexes and searches stories in titles. INdexing takes place when a title is released to the network by a publisher workstation. The IR server inter-relates title, section, and story objects by their globally unique identifiers and creates a routing table which is used to located objects across multiple database partitions. The IR search service is requested in two different way at customer runtime. The first way is the resolution of the search objects to retrieve matching stories. The retrieved stories are concatenated and poured into the area defined by the dynamic control when the title is viewed. In the second way, the IR search service is requested when a search is initiated by a customer using a "find" dialog to search across all stories in one or more titles, both dynamic and static.

It would be advantageous to provide an application that automated the collection of data, provided a method for organizing the library of information and provided analysis using multiple content-types, and thereby provide a market understanding necessary to execute rapid and knowledgeable decision-making.

SUMMARY OF THE INVENTION

The invention provides information platform that automates the collection of data, provides a method for organizing the library of information and provides analysis using multiple content-types, thereby providing user with a market understanding necessary to execute rapid and knowledgeable decision-making.

The information platform is a combination of desktop and server technologies that raise the decision making abilities of business professionals to the highest possible level.

The information platform:

Collects and integrates data, observations and intelligence;

Provides controls for multiple methods of information navigation and analysis;

Allows details to be digested in the context of other data, regardless of its type.

The information platform is a client/server implementation that is subdivided into four major sections.

Section 1--Data Retrieval.

The entire information platform relies upon reliable, predictable access to data, regardless data source. The data retrieval section of the platform provides a sophisticated catalog for finding internal and external information. It provides collection agents, which retrieve specific information without user intervention. The information platform provides users with multiple ways for collecting vast amounts and varied types of information in the shortest time possible.

Using the invention, there are several approaches a user can adopt for collecting information:

The user can comb a data source catalog, looking for a specific nugget(s) of information, such as the names of companies selling virus-protection software. Collection profiles can gather bodies of information with minimal input. A user, for example, enters a single company name, and the information platform gathers financial, organizational, product and manufacturing information about the company and its competitors. If the information is not available in the catalog, the user can find the information using traditional means (such as a search engine, or by combing through files on the network, browsing) and the information platform can automatically add the new information source to the catalog. An analysis template guides a user through the data collection, interpretation and analysis process for a specific topic, e.g. a template comparing the effectiveness of corporate communications on press coverage. By selecting an analysis template, the user is walked through the data collection cycle.

A user analyzing data in Excel wants to add an additional row to a grid. By extending the Microsoft Office suite with the information platform menus and buttons, a user browses, selects and imports information using the catalog from within the desktop application.

The control center for the information collection is the catalog. The catalog contains details about how to access and retrieve data located throughout e.g. the Internet, as well as knowledge for collecting information from major market research companies, such as Gartner Group, Dun and Bradstreet, and AdScope. A corporation can quickly add all internal data sources to the catalog, providing a user with a one-stop place to pull information from internal and external sources. Similarly, a user can add external information sources, create or modify collection profiles, and customize the catalog. Any catalog can receive updates from the master catalog via the Internet.

There are two core technologies that make the catalog superior to traditional indexes and contents lists.

First, the information platform provides a document analyzer that can reliably extract individual elements from any document type, even when the document is unstructured. This allows the catalog to contain a much greater level of detail than a simple document catalog. For example, the Securities Exchange Commission I0Q Report (Source: EDGAR database) contains more than 100 elements, each can be individually retrieved and processed. Using the catalog, a user finds that they can retrieve operating expenses for Research and Development from the I0Q, something they might not have realized was buried in the EDGAR SEC filings.

Second, the catalog is built upon an object-oriented database, referred to as a `store.` The object-oriented store, used in conjunction with a rich set of classifications for each information source, provides unprecedented flexibility for catalog navigation. Users can find an information source using a variety of different navigation paths.

A scripting engine works in conjunction with the catalog to automate repeated information retrieval from internal and external sources. Having captured a nugget of information once, the information platform returns to the location repeatedly retrieving updated information without user interaction.

Section 2--Data Classification and Storage.

The second section of the application platform handles the storage of the information once it has been gathered from a source. As with the data source catalog, the information store uses an object-oriented database and exhibits the same flexible access behaviors, i.e. users can navigate the store and find information using a variety of navigation paths.

Whenever a user pulls data from a source the data is tagged, metadata attributes are assigned, and the data is saved in the store.

There are many benefits to maintaining an information store:

Information can be retrieved once and used multiple times--reducing access and subscription fees, connection time and network traffic. The information platform automatically gathers information updates and stores them without client intervention--data are always updated and historically tracked.

The store maintains a growing base of information--the system becomes increasingly valuable over time. The store manages complex relationships between information elements--allowing for sophisticated comparisons and context when viewing data.

Information about the origin of the data is permanently recorded--data are always associated with an author, organization or other source. Data are volatile and may not be permanently accessible from the source--long after the original source of the information has expired, the user can access the information from the store. The store allows unique time dimensions to be applied to both structured and unstructured information--items are always historically monitored. Custom data visualization controls can be developed because the information is retrievable in a consistent way--allowing for cross-analysis of multiple information types.

Key to a valuable information store is the assignment of quality metadata to the individual information elements. Metadata are sets of specific details about data, similar to an abstract of a library book. The metadata falls into two major categories:

Base information about the specific information element (from where it was obtained, the data format); and

The relationship of the information to other information in the store.

Relationships are classified according to business-specific taxonomies. `Number of Employees,` for example, is classified in relation to a year, a company, a market, and the number's source. Because the store already has a relationship between a company and operating expenses, the information platform can construct a derivative relationship and report `employee numbers` as related to a company's `net income` for a certain `year.`

Although metadata is the lifeblood of a complex data store, it is important that the burden of metadata assignment does not rest solely with the user. The information platform uses a variety of methods of automatically assigning metadata. The first source of metadata is the data source catalog, which contains a wealth of metadata about known information sources. The invention gathers the majority of metadata from the data source catalog and automatically assigns classifications to new pieces of information as they are collected. Another way of automatically assigning metadata is by analyzing the information content. Also, the source of the information, the individual who requested the collection and the data format (e.g. HTML, Excel) is automatically assigned.

Section 3--Information Browsing, Query, Analysis, and Report Creation.

Because all the structured and unstructured information from internal and external sources (captured over time) is collected in a consistent object-oriented store, the information platform can provide unprecedented information browsing, reporting, and analysis tools.

One embodiment of the invention provides a set of pre-built interfaces between the object store and the desktop. These preconfigured interfaces provide users with the opportunity to browse, query, and graph multiple types of information.

A user can choose one of the following ways to peruse and analyze information in the store:

A user can navigate the store by roaming through different topics and subtopics. Information can be accessed in many ways, using varied pathways. Information on a trade show, for example, can be accessed from an Events section, or by selecting a product, then the product's company, and then seeing at which trade shows the company exhibits it products (or is scheduled to exhibit its products).

A sophisticated search engine provides a quick way to find individual information nuggets without having to navigate the store.

Quick Reports provide instant analysis (often in graphical form) of a specific topic. Quick reports compare, for example, companies advertising expenditures and gross sales revenue. The information platform knows what information to retrieve from the store and how to display it. These quick reports are similar to Quicken reports, which take checkbook transactions and display interesting trends and facts gleaned from the source data.

Information Visualization Controls provide unique ways to view multiple data elements in a single control.

Analysis templates extend the notion of Quick Reports and provide a framework for analysis on a specific topic. The template includes a `Score Card` guideline for tracking information (such as revenue numbers and sales figures) that must be gathered for complete analysis. Other templates include a collection of graphs and tables that are germane to the analysis, and skeleton Microsoft Office documents which provide the final report framework.

The information platform client takes advantage of the latest Microsoft Explorer technologies and uses a combination of Java, JavaScript, ActiveX, and dynamic HTML to provide a sophisticated information delivery platform.

Section 4--Desktop Integration.

Rather than compete with major desktop analysis and reporting tools (such as Excel and Word), the information platform exports data to these standard applications for ease of use. The information platform takes information from a wide variety of formats (HTML, text, spreadsheet) and combines them all into a single format (HTML, text, spreadsheet).

Information provided on the user desktop can be saved in all of the standard Office file formats (such as Word, Excel, Access, and PowerPoint), and text or HTML files. Information can also be `pushed` into an Active Desktop applications using OLE automation.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of the architecture of an information platform according to the invention;

FIG. 2 is a block diagram of information platform task analysis according to the invention;

FIG. 3 is a block diagram of information platform source information breakdown according to the invention;

FIG. 4 is a block diagram of information platform manipulate source information breakdown according to the invention;

FIG. 5 is a block diagram of information platform source user set breakdown according to the invention;

FIG. 6 is a block diagram of information platform favorites breakdown according to the invention;

FIG. 7 is a block diagram of information platform user account breakdown according to the invention;

FIG. 8 is a block diagram of information platform editing a catalog breakdown according to the invention;

FIG. 9 is a flow diagram of information platform parser showing the relationship of the parsing classes according to the invention;

FIG. 10 is a block diagram of information platform parser operation according to the invention;

FIG. 11 is a more detailed block diagram of the information platform parser operation of FIG. 10.

DETAILED DESCRIPTION OF THE INVENTION

The herein disclosed information platform is a combination of desktop and server technologies that raise the decision making abilities of business professionals to the highest possible level.

The information platform:

collects and integrates data, observations and intelligence;

provides controls for multiple methods of information navigation and analysis;

allows details to be digested in the context of other data, regardless of its type.

Some of the benefits of invention include decreased time spent searching, collecting, integrating, analyzing, and reporting information. The platform provides the widest coverage of data sources and eliminates `Information isolation`--expensive, yet useless, information that is never analyzed in conjunction with other internal and external data sources.

The information platform is a client/server implementation that is subdivided into four major sections, as illustrated on FIG. 1.

Section 1 (10)--Data Retrieval.

The entire information platform relies upon reliable, predictable access to data, regardless data source. The data retrieval section of the platform provides a sophisticated catalog 19 for finding internal and external information. It provides collection agents, which retrieve specific information without user intervention. The information platform provides users with multiple ways for collecting vast amounts and varied types of information in the shortest time possible.

Using the invention, there are several approaches a user can adopt for collecting information:

The user can comb a data source catalog, looking for a specific nugget(s) of information, such as the names of companies selling virus-protection software. Collection profiles can gather bodies of information with minimal input. A user, for example, enters a single company name, and the information platform gathers financial, organizational, product and manufacturing information about the company and its competitors. If the information is not available in the catalog, the user can find the information using traditional means (such as a search engine, or by combing through files on the network, browsing) and the information platform can automatically add the new information source to the catalog. An analysis template guides a user through the data collection, interpretation and analysis process for a specific topic, e.g. a template comparing the effectiveness of corporate communications on press coverage. By selecting an analysis template, the user is walked through the data collection cycle.

A user analyzing data in Excel wants to add an additional row to a grid. By extending the Microsoft Office suite with the information platform menus and buttons, a user browses, selects and imports information using the catalog from within the desktop application.

The control center for the information collection is the catalog 19. The catalog contains details about how to access and retrieve data located throughout e.g. the Internet, as well as knowledge for collecting information from major market research companies, such as Gartner Group, Dun and Bradstreet, and AdScope. A corporation can quickly add all internal data sources to the catalog, providing a user with a one-stop place to pull information from internal and external sources. Similarly, a user can add external information sources, create or modify collection profiles, and customize the catalog. Any catalog can receive updates from the master catalog via the Internet.

There are two core technologies that make the catalog superior to traditional indexes and contents lists.

First, the information platform provides a document analyzer 22 that can reliably extract individual elements from any document type, even when the document is unstructured. This allows the catalog to contain a much greater level of detail than a simple document catalog. Thus, source documents may be obtained from the World Wide Web (web) 14, a user desktop 15, an SQL query to a database 16, an OLAP query 17, or from Lotus Notes 18. For example, the Securities Exchange Commission I0Q Report (Source: EDGAR database) contains more than 100 elements, each can be individually retrieved and processed. Using the catalog, a user finds that they can retrieve operating expenses for Research and Development from the I0q, something they might not have realized was buried in the EDGAR SEC filings. Second, the catalog is built upon an object-oriented database, referred to as a `store` 20. The object-oriented store, used in conjunction with a rich set of classifications for each information source, provides unprecedented flexibility for catalog navigation. Users can find an information source using a variety of different navigation paths.

A scripting engine 21 works in conjunction with the catalog to automate repeated information retrieval from internal and external sources. Having captured a nugget of information once, the information platform returns to the location repeatedly retrieving updated information without user interaction.

Section 2 (11)--Data Classification and Storage.

The second section of the application platform handles the storage of the information once it has been gathered from a source. As with the data source catalog, the information store 23 uses an object-oriented database and exhibits the same flexible access behaviors, ie. users can navigate the store and find information using a variety of navigation paths.

Whenever a user pulls data from a source the data is tagged, metadata attributes are assigned, and the data is saved in the store.

There are many benefits to maintaining an information store:

Information can be retrieved once and used multiple times--reducing access and subscription fees, connection time and network traffic. The information platform automatically gathers information updates and stores them without client intervention--data are always updated and historically tracked.

The store maintains a growing base of information--the system becomes increasingly valuable over time. The store manages complex relationships between information elements--allowing for sophisticated comparisons and context when viewing data.

Information about the origin of the data is permanently recorded--data are always associated with an author, organization or other source. Data are volatile and may not be permanently accessible from the source--long after the original source of the information has expired, the user can access the information from the store. The store allows unique time dimensions to be applied to both structured and unstructured information--items are always historically monitored. Custom data visualization controls can be developed because the information is retrievable in a consistent way--allowing for cross-analysis of multiple information types.

The key to a valuable information store is the assignment of quality metadata to the individual information elements. Metadata are sets of specific details about data, similar to an abstract of a library book. The metadata falls into two major categories:

Base information 26 about the specific information element (from where it was obtained, the data format); and

The relationship 27 of the information to other information in the store.

Relationships are classified according to business-specific taxonomies. `Number of Employees,` for example, is classified in relation to a year, a company, a market, and the numbers source. Because the store already has a relationship between a company and operating expenses, the information platform can construct a derivative relationship and report `employee numbers` as related to a company's `net income` for a certain `year.`

Although metadata is the lifeblood of a complex data store, it is important that the burden of metadata assignment does not rest solely with the user. The information platform uses a variety of methods of automatically assigning metadata. The first source of metadata is the data source catalog, which contains a wealth of metadata about known information sources. The invention gathers the majority of metadata from the data source catalog and automatically assigns classifications to new pieces of information as they are collected. Another way of automatically assigning metadata is by analyzing the information content. Also, the source of the information, the individual who requested the collection and the data format (e.g. HTML, Excel) is automatically assigned.

Section 3 (12)--Information Browsing, Query, Analysis, and Report Creation.

Because all the structured and unstructured information from internal and external sources (captured over time) is collected in a consistent object-oriented store, the information platform can provide unprecedented information browsing, reporting, and analysis tools.

One embodiment of the invention provides a set of pre-built interfaces between the object store and the desktop. These preconfigured interfaces provide users with the opportunity to browse, query, and graph multiple types of information.

A user 24 can choose one of the following ways to peruse and analyze information in the store:

A user can navigate the store by roaming through different topics and subtopics. Information can be accessed in many ways, using varied pathways. Information on a trade show, for example, can be accessed from an Events section, or by selecting a product, then the product's company, and then seeing at which trade shows the company exhibits it products (or is scheduled to exhibit its products).

A sophisticated search engine provides a quick way to find individual information nuggets without having to navigate the store.

Quick Reports provide instant analysis (often in graphical form) of a specific topic. Quick reports compare, for example, companies advertising expenditures and gross sales revenue. The information platform knows what information to retrieve from the store and how to display it. These quick reports are similar to Quicken reports, which take checkbook transactions and display interesting trends and facts gleaned from the source data.

Information Visualization Controls provide unique ways to view multiple data elements in a single control.

Analysis templates extend the notion of Quick Reports and provide a framework for analysis on a specific topic. The template includes a `Score Card` guideline for tracking information (such as revenue numbers and sales figures) that must be gathered for complete analysis. Other templates include a collection of graphs and tables that are germane to the analysis, and skeleton Microsoft Office documents which provide the final report framework.

Active Desktop users can instruct the information platform to push information, using Microsoft's Channel Definition Format (CDF), to an active desktop.

The information platform client takes advantage the latest Microsoft Explorer technologies and uses a combination of Java, JavaScript, ActiveX, and dynamic HTML to provide a sophisticated information delivery platform.

Section 4 (13)--Desktop Integration.

Rather than compete with major desktop analysis and reporting tools (such as Excel and Word), the information platform exports data to these standard applications for ease of use. The information platform takes information from a wide variety of formats (HTML, text, spreadsheet) and combines them all into a single format (HTML, text, spreadsheet).

Information provided on the user desktop 25 can be saved in all of the standard Office file formats (such as Word 27, Excel 28, Access, and PowerPoint) 29, and text or HTML files. Information can also be `pushed` into an Active Desktop 30 applications using OLE automation.

All of the visualization controls can be embedded directly into Office applications, providing the user with the flexibility to develop custom templates.

Core technologies

Document Analyzer 22.

A major step in building a data store filled with inter-related objects from multiple locations, begins by collecting information from relevant sources. However, web pages, documents, and database outputs are too large to be classified accurately with a high degree of confidence and specification. The information platform provides a geometry recognition technology that analyzes multiple sources and recognize particular patterns within each site. A page analyzer scans a source document, e.g. a SEC 10K fiscal-year--revenue filing, and breaks it into blocks and sub-blocks of information, returning the granular pieces for aggregation in the data store 20. With the 10K filing, the page analyzer separates different data types, such as net income, expenses, products lists, company descriptions, and commentary about profits and losses. The page analyzer can recognize and break down input, for example, from HTML pages, text documents, charts, tables, Excel spreadsheets.

Metadata implementation.

Once data are located and dissected into its constituent parts, A subsystem classifies the data according to business-specific language. The classification allows for the data to be archived and tracked in an object store, similar to a book in a library. More importantly, the classification allows the object store to manage complex relationships between thousands of items. Once classified, an item is associated to several other data types by time, company name, revenue, product, product manager, and age, for example. The management of complex relationships allows the system and the user to find and inspect data through multiple paths. One benefit of maintaining data inter-relationships is the speed with which a particular fact, statistic, or analyst quote can be located and referenced.

The classification subsystem uses object-oriented metadata technology, and complies with the Metadata Council's Metadata Interchange Specification. As the information platform identify and collect new data over time, the metadata subsystem 26, 27, 31 constantly redefines and updates these relationships. In this way, industry perception of a particular product, company, or CEO can be monitored throughout the course of weeks, months, or years.

Cross-Information Visualization.

The importance of complex data relationship management is realized when users are able to view multiple types of information in a single visual interface. Valuable insight is gained from comparing unique statistics, such as the number of employees per year to net profits to industry perception over a time span to customer buying patterns of a flagship product.

The information platform provides users with a variety of methods for displaying both structured and unstructured information.

Structured information includes data from tables, SQL query results, and financial information in a spreadsheet.

Unstructured information includes analyst reports, product comparisons, corporate profiles and SEC 10K filings.

Visualization controls include:

Time-span slider bars to allow for comparisons of data over time;

A population quadrant to compare data frequency across multiple axis;

Cross tabular charts to analyze products across geographic regions, company dimensions, product dimensions, officer dimensions, product sales analysis, promotion analysis, market share analysis, product feature analysis, industry revenue forecast analysis, industry opinion analysis, competitive expenditure analysis, events/effects analysis.

Discussion.

The information platform integrates structured and unstructured information from internal and external data sources into visual applications for the purpose of allowing a user to make intelligent business decisions. The information platform consists of components that aid in several steps in the analysis process, namely information gathering. These steps are: the formation of a plan for an analysis; collection of external and internal information for the analysis; separation of large informational pages into component elements; review and manipulation of these component elements; and seamless transport of the collected information into various reporting desktop applications.

The information platform provides users with multiple ways for collecting vast amounts of information in the shortest time possible and provides central data access (import/export) from popular desktop reporting, and analysis products.

Import Formats

Web (*.html)

Word (*.doc)

Excel (*.xls)

Postscript (*.pdf)

SQL

Lotus Notes

OLAP

Export Formats

Web (*.html)

Word (*.doc)

Excel (*.xls)

Postscript (*.pdf)

PowerPoint (*.ppt)

ACT

OLAP

The information platform provides a set of sophisticated information catalogs for finding internal and external information, along with collection profiles that retrieve requested information unattended.

The following discussion describes the feature set of the information platform.

Setting Analysis Objectives

General Discussion<Case Creation; Case Building>

Once the purpose of a report is determined, the breadth of information required for analysis is reduced, and the needs for particular data nuggets are prioritized.

Every analysis project has specific data gathering and analysis requirements: a `market opportunity analysis` uses different data than a `product feature audit.` As such, guidelines for collecting and analyzing data need to be established for most projects. Some guidelines are general `one day marketing plan` items such as `Determine Market share,` which have associated tactics of determining the companies in a market, gathering their revenue and unit sales, and finally comparing them. Most reports, however, are more far more complex and unique to market of research firms and marketing organizations. For each company, their analysis methodology provides them and their customers a sense of competitive advantage.

The information platform case feature taps into this business function by providing users with the ability to create and share objectives and tactics for each analysis project. Not only does the information platform provide users the power to create their own Analysis Cases, but the information platform provides several industry standard Case templates, that can be used and edited to perform the most thorough market research and analysis possible quickly and efficiently. Through the use of Cases and Case templates, organizations can determine what objectives as well as the types of information vital to their decision making process. The information platform automates the collection of that information and ensures that the gathered results are reviewed and digested by the appropriate individual and or groups.

Identify objectives<Case Creation: New (Blank/Template/Default

With this feature the user determines the goals of a project, which directly relates to the types and quantities of data required for collection. The user can selects from one of several Case templates (i.e. Pricing analysis, Product Feature Audit, Competitive Week-in-Review) and the system recommends objectives, including analyses and the corresponding list of data that needs to be collected.

If a user desires, he may add/edit/delete objectives as well as data elements.

Identifying and Collecting Data

Definitions

The following definitions are applied for their respective terms within this document:

Catalog Folder: The folders creating the Catalog hierarchy, these can be nested within one another to create various levels of hierarchy.

Catalog Collection Item: The item that defines the information collection node, contains information about the specific piece of information that is returned to the user when requested. May consist of one or more Info Items.

Source Topic Folder: The folders creating the Source hierarchy, these organize the sources into topics and subtopics. These can be nested within one another to create various levels of hierarchy.

Source Folder: The folder defining a specific source of information. This source can be a web site, CD-ROM, database, or subscription service source, for example. A source may have sub-source folders if the information space is complex enough to warrant further subdivision.

Document Folder: The individual documents that can be collected from a source are each represented by a Document Folder. The Document Folder can contain one or many Info Items.

Info Item: The specific data element that is returned to the user upon request for retrieval. Associated with one or more Catalog Collection Items. For each document, there are one or more Info Items that can be collected from it. At a minimum, the full document is an Info Item. Additional, finer grained Info Items can be added which represent extracted portions of a document.

General Discussion

When users build their own Cases, they have several methods for identifying the type and scope of information to be collected.

As well, through pre-built Case Templates, the information platform assists users with analysis projects by highlighting collections of data elements that satisfy a suggested set of analysis objectives. Users may view the list of suggested data to be collected, as well as add their own data elements for the system to collect.

Knowing the sources that the system collects data from, as well as the ability to add new sources of data, is a high priority to users.

Coverage of Sources <Viewing fulfilled/unfulfilled Case Items and Sources>

For any given Case, the user is able to review and output a summary of the Case which includes a bibliography of sources that are, or will be, used in the Case.

Users are concerned that they gather data from as wide a range and variety of sources as possible. An example of this is collecting data from multiple news sources, analyst firms and web sites for a Case.

Adding data elements to a Case:

Users can add additional data elements to any case either through the systems automated collection or through the systems `Source-Specific Search` feature or from documents they located via browsing.

<add items from Information Source Catalog>

An Information Source Catalog contains details about the locations and types of data that can be retrieved from internal and external sources. The catalog combines tree view and list view controls that allow users to identify entire documents or just important parts of a document for automated collection. As such the Catalog allows users to collect:

A single data element;

A single document; and/or

A set of either data elements or documents or both, called packages.

Browsing the Information Catalog: Users navigate the catalog by using one of the taxonomies (1: Business language, 2: Sources, 3: Input types). Users may filter and sort through the catalog based on the catalog items' properties.

Searching the Catalog: Users are able to enter simple search requests to find individual catalog entries.

<add items from Source-Specific Search Catalog>

While the Information Source Catalog contains several thousand entries for the automated collection of data, users have the power to perform targeted searches--of content located on both the Internet and intranets. A Source-Specific Search catalog contains details and access scripts for limited scope searches--searching only the sites with valuable content to the project, or rather Case, at hand. Users identify the sites or sites they wish to search, including internal systems and the results are returned in a list display.

Browsing the Site-Specific Search Catalog: Users navigate this catalog in a similar fashion to the Information Source Catalog. They may navigate one of the taxonomies (1 Business language, 2 Sources, 3 Input types). Users may also filter and sort through the catalog based on sets of properties associated to data elements.

Search Results Display: The results of the limited scope search are displayed in a list view. The list displays properties such as description, source, input criteria and relevancy ranking.

<add items from viewing other documents: htm, doc, xls, pdf>

Users who identify valuable documents and/or information while in other popular desktop applications can add their `finds` to the system by way of an active Case. (Using some type of `send-to` or `copy/past` feature entire documents or unique data elements can be added to an active Case.)

Automated Retrieval of Data

General Discussion<User Sets>

The information platform automates the collection of Information Source Catalog items. As well, the information platform automates the limited scope searching across high value sites in its Site-Specific search Catalog.

The information platform maintains access scripts for collection information from each source, both the Information Source Catalog and the Specific-Site Search Catalog. As such, users may be required to enter various criteria based on the needs and intricacies of each site's search engine. Rather than have the user input criteria for each site, the information platform maintains User Sets.

<User Sets>

User Sets store and maintain frequently used criteria for collection and search. User sets contain information including, for example:

Companies to watch list; and

Product to watch list.

User Sets Display: User Sets are displayed using a tree view list view control. Sets are presented in a hierarchy with folders and sub-folders for organization.

Creating Sets: Users sets may be created from scratch by the user, or compiled as the result of a result list.

Editing sets: Items may be added to a set, such as a new company name.

Sharing Sets: Users are able to share sets with other users.

Tracking retrieval status.

Users can view the progress of a Case collection at any time. The Case maintains properties that informs the user as to the percent/status of the gathering process. As well, users can check the server queue to see the status of multiple Cases. Lastly, users are notified of various system and Case progress via iconic representations on the task bar.

Viewing Information

General Discussion

Users have the ability to view Case items and Search Items whether a Case has been fulfilled or only partially completed. Cases are located on a corporate server, and depending on access privileges, they may (or not) be shared with a group, a department or an entire company.

<Viewing Case Items>

When a user opens a fulfilled Case, they can view the Case items using the tree view list view controls. Case items are organized in folders and sub-folders that display their Case Items contents in the list view. When a user wants to view a particular item, they select an item from the list view and the information it is displayed in a browser window. Case items are displayed using OLE automation as .doc, xls, .pdf and .htm files depending on the individual Case item. Users can edit the case items from within the information platform.

<Viewing Search Results >

Search results are displayed to the users as a list of documents with additional properties such as description, source, input criteria, date of search, and date of document. A user selects a document to view, and that item is displayed in the Document Viewer.

<Viewing information with the Document Analyzer>

Document Viewer: a window that instantiates Internet Explorer, Excel, Word, or Acrobat to view the document.

Page Geometry: a window displays the elements of a document in a thumbnail view, showing items such as title, headline, paragraph, quote, chart, and table. Clicking on a particular item on thumbnail displays that item in the document viewer. In a reverse manner, when the user is using the Document viewer, the thumbnail identifies the where they are in the document.

Data nugget display: this window displays the particular data nugget value that is added to a Case

Binning: the system parses the document and allows data nuggets to be displayed in information platform Bins--preconfigured bins that identify items such as charts, tables, statistics, or quotes. Users can create custom bins. When a user selects a bin--i.e. analyst quote--only analyst quotes are displayed in the Document Viewer.

Adding other data elements to a Case

When users identify valuable data elements through browsing, searching, or using the Analyzer, they can easily add those items to a case. The user is prompted to assist with the categorization of that item.

Working with Popular Desktop Applications

General Discussion

Case items are collected and stored in popular desktop application formats, such as .doc, xls, .pdf and .htm. Each Case item has been parsed and given structure. As such, case items can easily be exported to desktop applications such as Word or Excel.

Panning nuggets from Office.

Menu items can be optionally installed in Word and Excel, supporting catalog browsing, and information retrieval. All jobs are submitted high priority, and the client task is held waiting for the information to be gathered; the user is presented with a progress dialog (containing a Cancel button). The information is inserted at the current cursor position.

Note: this feature requires a custom retrieval client that pushes the data to the host desktop application using OLE automation.

Fill-me-up templates (For routine reports, or company specific reports)

The information platform provides, for example pre-built and custom documents and spreadsheets that are filled with predetermined data nuggets once they are collected. This is valuable for companies that perform several `routine` reports.

E-mail.

A retrieval job can be e-mailed to the client (using MAPI-compliant transport) as an e-mail with each information nugget as an attachment.

Administration Issues

Sharing Fulfilled Cases

<Case>

Cases reside on a server. This leads to at least three benefits. 1) Cases vary in memory footprint from a few megabytes to several gigabytes, keeping these on the server limits storage issues. 2) Cases may be fulfilled while the user(s) perform other activities that require processing power on their client PCs. 3) Fulfilled Cases can be shared company wide and access privileges can be administrated using standard NT users and groups policies.

Sharing Case Templates

As organizations mature in their use of Cases and Case template, they want to share their frameworks across a group, a department or company. As such, the information platform provides methods for sharing Case templates, editing Case templates and maintenance of a Template Master Set.

Sharing User Sets

As organizations mature in the use User Sets, they want to share them across a group, a department or company. As such, the information platform provides methods for sharing User Sets and the maintenance of a Master User Set(s).

Sharing Favorites

Similar to sharing Case Templates and User Sets, the information platform provides for the sharing of Favorites across a group, a department or company. The information platform provides methods for sharing Favorites and the maintenance of a Favorite Master Set.

Information Source Catalog

Catalog Editing

Users with security access may Change Classifications and relationships in the catalog. The catalog is extendible, which allows users the opportunity to add nodes, classifications and packages.

Catalog Updates--Knowledge Network

Catalog information pushed to customers using TCP/IP.

The updates are synchronized with the customers catalog to ensure that customer modifications and additions are retained, while the virgin catalog entries are updated.

Source Specific Search Catalog

Search Catalog Editing.

Users with security access may Change Search Classifications and relationships in the catalog. The catalog is extendible.

Search catalog Updates. Search Catalog information can be (perhaps by subscription) pushed to customers using TCP/IP.

The updates are synchronized with the customer's search catalog to ensure that customer modifications and additions are retained, while the virgin catalog entries are updated.

Subscription Management

Subscription Assistant

There are times when users need to either collect a Case Item, or perform a Site-Specific Search at a source requiring a subscription. Subscription details are maintained in the User Sets.

However, there may be times when a user does not have a subscription to a source. If users desire to subscribe to a site, the information platform assists them.

Security

The information platform operates in conjunction with standard NT users, groups and policies, avoiding the effort of maintaining a custom set of users and groups.

General Architecture Evaluation

The system is implemented as a multi-tiered client-server application. The primary benefits of implementing the product as a client-server platform are as follows:

Multi-User features such as security, shared configurations and system administration are accommodated and centrally located.

Slow, lengthy or resource intensive tasks are assigned to the server for completion, allowing the user to proceed with other activities.

Data retrieval tasks can be performed unattended--i.e. collection can be scheduled and automatically collected once a month.

By serving the information as HTTP/HTML compliant information, users can access the system from a kiosk, a NC workstation or from a remote system via a web browser.

By storing the catalog on a server, all users can share a common catalog without the need to copy catalogs to client workstations.

Updates made to the data source catalog are `pushed` directly to one location: the server.

The drawbacks of using a client-server architecture are as follows:

The system must have a "local" server option to easily download and install as a single user evaluation version.

Implementation of a disconnected (i.e. use while on a airplane) model is complex and can be time consuming.

Profiles

Users

The broad base of users of the information platform are business decision makers who currently collect, maintain, and utilize information from internal and external resources. These resources include, but are not limited to, documentation and databases, marketing materials, financial data, subscription and professional information services information, Internet & Intranet information, and press publications.

A target segment, for example, of this user base is defined by the following characteristics:

Profession.

The user is a financial analyst focused on the high technology industry. The user completes analyses of data from internal, external, and subscription sources on a regular basis. The user collects and analyzes the data and creates reports and presentations based on the results of the data collection and analysis.

Professional Experience.

The analyst's professional experience ranges from 5 to 25 years. The analyst's age ranges from 25 to 45 years.

Computer related Experience.

The user's computer experience ranges from intermediate to expert with the Windows 95 or Windows NT operating systems. An intermediate user has general knowledge of the system, is able to switch between applications, and has experience using general office applications such as word processing, spreadsheet, and presentation software. An expert user has in depth knowledge of the system, and be able to configure and maintain the system.

On-line Experience.

The users on-line experience ranges from novice to expert. A novice user has little or no experience using on-line information. An intermediate user has several weeks to months of experience using on-line information, with most experience derived from email and web browser usage. An expert user has several months to years of experience using on-line information.

Education.

The user has at minimum a Bachelor's degree in Finance.

Knowledge Gathering

Table 1 below describes the knowledge gathering aspect of the information platform.

                                      TABLE 1
    __________________________________________________________________________
    Knowledge Gathering
    User Processes
           User Tasks
                     Tasks & Functions
                                 Components Used
                                          Functions Available
    __________________________________________________________________________
    Client needs &
           Determine needs and
                     <none>
    objectives
           objectives
           Create new Case
                     Start Case, set aside
                                 New Case dialog
                                          Create blank case
                     space                Create case from template
                                          Set title for new case
           <none>    Set pre-built Objectives,
                                 New Case dialog;
                                          Set of pre-built objectives
                     collection Case Items,
                                 Case     Set of pre-built case items
                     and Search Items for case
                                          and search items
                     template, if used
           Open existing Case
                     Show available Cases,
                                 Open Case dialog
                                          Filter and sort Case listing
                     allow opening of selected
                                          based on properties
                     Case                 Select Case to open
                                          Opening a Case closes
                                          current Case
           Set or edit Case
                     Record properties
                                 Case     Set Case properties: Title;
           properties                     Description; Owner;
                                          Categories; Security/Access
                                          Rights; Sharing
           Edit Case Objectives
                     Enter or edit objectives
                                 Case     View, edit any predefined
                                          needs for case template, if
                                          used
                                          View, add, edit, organize
                                          objectives
                                          Edit Objectives properties:
                                          Fulfillment Status; Descrip-
                                          tion; Due Date; Status;
                                          Priority; Percentage Com-
                                          plete; Owner
           Edit Case Items
                     Enter or edit case items
                                 Case; Catalog;
                                          View, edit any predefined
                                 User Sets; Favorites
                                          case items from template
                                          View, add, edit, organize
                                          case items
           Create new case
                     Save settings as new
                                 Save As dialog
                                          Edit objectives
           template  template             View, add, remove case
                                          items
                                          Save as Case template
           Create new Catalog
                     Store settings as new
                                 Case; Catalog
                                          View, add, remove
           collection item
                     collection item      collection items
                                          Save as collection item
                                          package
    Data collection
           Review case items     Case; Catalog
                                          View Case Item informa-
                                          tion and properties
                                          Edit Case Items properties:
                                          Name; Description;
                                          Criteria; Categories;
                                          Collection Timing; Author
           Determine or refine
                     Update Case Items
                                 Case; Catalog;
                                          Add, remove, and edit
           information needs for User Sets; Favorites
                                          Case Items
           case      Provide sources, informa-
                                          Select categories, sources,
                     tion categories, inputs
                                          common inputs; select
                                          Catalog Items to add to the
                                          Case
                                          Enter criteria for collection
                                          of Case Item
                     Provide gathering    Collect the specified infor-
                     structures           mation
           Filter the Catalog for
                     Filter and sort the
                                 Catalog  Set filters
           collection items
                     Catalog display by   Select sort property
                     property             Update Catalog display to
                                          match filtering and sorting
                                          settings
           Review Case Item
                     Display Case Item and
                                 Case     View properties
           progress  progress information Edit properties for
                                          collection
           Search for information
                     Display Search Catalog
                                 Case; Catalog;
                                          Add, remove Search Items
           at sources
                     for information retrieval
                                 User Sets
                                          Edit Search Items for the
                                          Case and their properties:
                                          Name; Description;
                                          Criteria; Search Timing;
                                          Author
           Browse for informa-
                     Navigate to source
                                 Case; Catalog;
                                          Add information to Case
           tion at sources
                     information Favorites;
                                          from source data item
                                 Document Analyzer
                                          Edit properties for collected
                                          Case Item
           Edit catalog structure
                     Add catagories
                                 Catalog  Add category
                     Add collection packages
                                          Add package
                     Assign items to      Add items
                     categories
                     Categorize items     Choose existing items to
                                          associate with category
                                          Choose categories to
                                          associate with item
           Add sources to
                     Access source and
                                 Catalog  Select source
           catalog, select
                     information
           source and
                     Categorization of source
                                          Categorize source infor-
           categorize
                     and items available  mation
                     Add source and relation-
                     ships
    Think, review
           Review retrieved data
                     Display data elements
                                 Case     Select Case to view
    data, analyze                         Select items within the
    the data                              Case to view
                                          Cut, copy, paste infor-
                                          mation
                                          Edit data
                                          Output information
           Analyze collected item
                     Display thumbnail
                                 Case; Document
                                          View components
                                 Analyzer Select components
                     Display components   Select component types to
                                          view
                                          Zoom in (parse selected
                                          components)
                     Show components by   Zoom out (parse document)
                     type                 Edit data or component
                     Parse and display select-
                                          Add-Remove from Case
                     ed components        Output information
           Analyze across data
                     <none>
           items
           Filter & sort retrieved
                     Allow filtering and sort-
                                 Case     Enter filter criteria
           information elements
                     ing by property      Sort by property
                                          Update display to items
                                          matching settings
                                          Select item for manipula-
                                          tion
    Create report
           Select or create outline
                     <none>
    outline
           in reporting tool of
           choice
    Write report
           Move data information
                     Export data to correct
                                 Case     Select item or data elements
                     format for reporting tool
                                          within item
                                          Copy selection
                                          Output/export selection
           Edit gathered informa-
                     Edit information with
                                 In-place editing
                                          Edit item
           tion      compatible tools
                                 tools    Save, output, export item
           Edit report information
                     <none>
    Publish report
           Output draft
                     <none>
    draft for feed-
           Select recipients
                     <none>
    back   Edit report based on
                     Edit and reexport data to
                                 Browser  Select item or data elements
           feedback  correct format for report-
                                          within item
                     ing tool             Copy selection
                                          Output/export selection
    Publish final
           Output final report
                     <none>
    report Select recipients
                     <none>
    __________________________________________________________________________


The following discussion provides an outline of a full task analysis of the information platform.

FIG. 2 is a block diagram of information platform task analysis according to the invention. FIG. 2 shows top level client application operation, in which a client application is launched 40. The client (user) may then view and manipulate case information 42, view and manipulate source information 44, view and manipulate user sets 46, view and manipulate favorites 48, view user account information 50, or edit the catalog 52. Each of these operations is discussed in greater detail below.

FIG. 3 is a block diagram of information platform source information breakdown according to the invention. More specifically, FIG. 3 identifies the flow with regard to client viewing and manipulation of case information 60. At the highest level, a user may navigate a case 99, view case properties 65, view a case summary 63, or create a case template. Each of these operations is discussed in greater detail below.

When navigating a case 99, the user may view objectives 100, view case items 101, or view search items. For each of these operations, the user may sort a list of such information on properties 103 or may filter the list on properties 104. Thereafter, the user may choose an objective 105, choose a case item 112, or choose a search item 118.

If an objective is chosen 105, the user may view the item 106 and either copy, delete, or add the item to a favorites list 107, select a property 110 and edit properties 111, or add the item to a case 108 and specify the case 109.

If a case item is chosen 112, the user may select a collection item 113 or an information item 114. If a collection item is chosen 112, the user may view the item 106 and either copy, delete, or add the item to a favorites list 107, select a property 110 and edit properties 111, or add the item to a case 108 and specify the case 109. If an information item is chosen 114, the user may view the item 117 or organize the information 121 and move the information to a folder or copy the information to a folder 122.

If the user views the item 117, the user may copy, delete, or add the item to a list of favorites 123, select a property 126 and edit the property 127, edit the information 124, and specify a tool for such editing 125, add the item to a case 140, specify the case to which the information is to be added 141, and specify the associated properties 142, export or output the information 143, specifying the format 144, or analyze the information 128 using a document analyzer 129. Document analysis can proceed by viewing bin items 130, geometry elements 131, or multiple elements 132. An item is then selected 133 and the item may be edited 134 using a specified tool 135, added to a case 136, where the case is specified 137, or exported or output 138 using a specific format 139.

If the user chooses a search item 118, the user may view either a search item 119, where the user may view the item 106 and either copy, delete, or add the item to a favorites list 107, select a property 110 and edit properties 111, or add the item to a case 108 and specify the case 109; or the user may search a results item, in which case the user may view the item 117 or organize the information 121 and move the information to a folder or copy the information to a folder 122. An explanation of the user view selection 117 is provided above.

When viewing case properties 65, the user may select a property 66 and edit the property 67.

When viewing a case summary 63, the user may edit associated properties, copy, or import/output the summary 64.

When creating a case template 61, the user may save a current case as a template 62.

When creating a search item 68, the user may create a folder 69 in a specific location 70, create a case item 71 from a specific category 72, where the user may navigate the catalog 73 and either sort a list of items on a specific property 74 or filter the list on properties 75, choose an item 76, set properties and criteria for the item 77, and add the item to the case 78; or the user may create the case from another case 79, where information may be pasted from the system clipboard 80. Further, the user may create a search item 81 from another case 79, where information may be pasted from the system clipboard 80; or the search item may be created from the search catalog 82 by first navigating the catalog 83 and then either sorting a list on a specific property 84 or filtering the list on a properties 85. Thereafter, the user chooses the item 86, sets properties and criteria 87, and add the item to the case 88.

Finally, when selecting a different case 89, the user may either begin a new case 90, where the case is either blank 91 and a new case is created 92, or from a template 93, where the user first views the available templates 94 and either sorts the templates on a specific property 95 or filters the templates on properties 96, and then chooses the template 97, and creates the new case 98; or the user may open an existing case 145, first viewing the available cases 146, either by sorting the cases on a specific property 147 or by filtering the cases on properties 148, and thereafter choosing the case 149 and opening the case 150.

FIG. 4 is a block diagram of information platform manipulate source information breakdown according to the invention. More specifically, FIG. 4 identifies the flow with regard to client viewing and manipulation of source information 200. This operation is discussed in greater detail below.

The source information is obtained from any of case items, search items, history, an entered location, an open file dialog, a favorite, or a link from a source (201). The user navigates to the source 202 and may view the item 203. The use may then copy, delete, or add the item to a list of favorites 204, add the item to the catalog 205, specifying item properties 209, add the item to a case 206, specifying the case 212 and case properties 213, export or output the item 207, specifying the format 214, edit information 210, specifying a tool therefor 211, or link the item to other source information 208.

Importantly, the user may analyze the information 215 using a document analyzer 216. Such analysis may involve viewing bin elements 217, viewing geometry elements 218 or viewing multiple elements 219. An item is then selected 220 and may be edited 221 using a specified tool 22, added to a case 223, where the case is specified 224, or exported/output 225 in a specified format 226.

FIG. 5 is a block diagram of information platform source user set breakdown according to the invention. More specifically, FIG. 5 identifies the flow with regard to client viewing and manipulation of user sets 250. This operation is discussed in greater detail below.

The user may either view user sets 260 or create/add an item 270. If the user views user sets 260, the user may sort the sets on a specific property 261 or filter the sets on properties 262, choose and item 263 and either view the sets, edit set properties, import or output a set, duplicate a set, or share a set 264; or organize the set 265 and either move the set to a folder or copy the set to a folder 266.

If the user creates/adds an item 270, the item may be either a folder 267 or a set item 268, both of which allow the user to set properties and/or locations for the item 269.

FIG. 6 is a block diagram of information platform favorites breakdown according to the invention. More specifically, FIG. 6 identifies the flow with regard to client viewing and manipulation of favorites 2880. This operation is discussed in greater detail below.

The user may either vie favorites 281 or create/add and item 282. If the user views favorites 281, an item is chosen 283 and the user may either view the sets, edit set properties, import or output a set, duplicate a set, or share a set 284; or organize the set 285 and either move the set to a folder or copy the set to a folder 286.

If the user creates/adds an item 282, the item may be either a folder 287 or a favorite item 288, both of which allow the user to set properties and/or locations for the item 291. With regard to the favorite item, the user also may select from a blank favorite 287 or a current favorite 290.

FIG. 7 is a block diagram of information platform user account breakdown according to the invention. More specifically, FIG. 7 identifies the flow with regard to client viewing of user account information 300. This operation is discussed in greater detail below.

The user account information includes account information 301 and subscriptions 302. For the account information, the user chooses a property 303, specifying such items as name, server, log-in, password, group(s), and rights 304, each of which may be edited 305.

For subscriptions 302, the subscription may be a group subscription 306, where the group account properties may be viewed 308; or it may be an individual subscription 307. For an individual subscription, the user may create a subscription 309, for example using an account wizard 310, providing set-up information 311, including subscription type, connection type, location, user name, log-in, password, access limits, and areas or topics (312). The user may also view an existing account 313 by choosing the account 314, which allows the user to view the account properties 315, such as subscription name, connection type, location, user name, log-in, password, access limits, and areas or topics (316). the information may be edited 317 as appropriate.

FIG. 8 is a block diagram of information platform editing a catalog breakdown according to the invention. More specifically, FIG. 8 identifies the flow with regard to catalog editing 320. This operation is discussed in greater detail below.

When editing a catalog, a user may either add an item 321 or select an item 322. Items that may be added include folders 323, collection items 324 which have specified properties 325 and locations 326, and package items 327.

When selecting an item 322, the user may either copy the item or duplicate the item 328, view item properties 330 and edit those properties 331, move the item 329, or view a script 332, and also edit the script 333.

Table 2 below lists some of the tasks performed by the users.

                                      TABLE 2
    __________________________________________________________________________
    User Tasks
    Domain
          Task         Description  Steps & Entries
    __________________________________________________________________________
    Basic <Example>    Retrieve all the press
                                    Open catalog
    Information
          Use the information platform
                       releases issued in June by
                                    Choose documents (press
    Retrieval
          client to get a specific
                       Apple, Microsoft, Intel and
                                    releases
          nugget of information
                       Aeneid.      Enter company names
                                    Select get now
                                    View collected information
          Select a profile for rapid
                       For example, the user may
          comprehensive data gathering
                       enter a company name and
                       then be presented with a rich
                       body of information that was
                       (semi)automatically gathered.
          Use the information platform
                       Retrieve, for example, all the
          client to get a specific nugget
                       press releases issued in June
          of information
                       by four companies.
          Add a piece of data to an
                       While working in Excel, the
          Excel spreadsheet.
                       user may add some data from
                       a SEC 10K, for example. The
                       user chooses "Insert
                       Information" from the
                       information platform menu
                       and then select what informa-
                       tion to retrieve, and where to
                       place it. The metadata (iden-
                       tifying information source) is
                       added as a note to the work-
                       sheet.
          Add a piece of data to a Word
          document
          View the information down-
          load log, and use it to search
          for a specific piece of infor-
          mation.
          View the Retrieval Queue.
                       This is similar to viewing a
                       network printer queue. The
                       queue identifies what jobs
                       are pending, who submitted
                       them, the priority, and the
                       data to be retrieved.
    Advanced
          Run an analysis
    Information
          Create a bibliography/sources
    Retrieval
          document as foundation of a
          research project
          Create an information
                       Controls what information is
          retrieval job
                       retrieved, how frequently it
                       should be retrieved, and
                       where the information is
                       stored.
    Catalog
          Add an internal data source
    Management
          to the catalog
          Modify the classification of a
          data source
          Browse the catalog looking
          for the types of information
          Add a new set of classifica-
          tions to the catalog
          From Explorer, add a site to
          the catalog
          From Explorer, add a page to
          the catalog
          Define the geometry of an
          unknown page to allow for the
          intelligent gathering of
          elements of the page
          Configure the team's custom
                       Setting the competitor list by
          relationships and interests.
                       market segment, configure the
                       most influential companies in
                       each market segment, rating
                       of value, influence and
                       accuracy of all information
                       sources.
          Create a custom analysis
          Configure the preferred sites
                       The customer is able to define
          to gather specific information.
                       where to go for information
                       when the system includes
                       multiple potential sources.
                       The user should also be able
                       to set a "get from the
                       cheapest" option when
                       multiple are available
    User/System
          Configure individual user
    Management
          rights
          Configure rights for a group
                       Rights include the ability to
                       add/edit/delete sites, users,
                       groups, profiles; access to
                       sites; controls which elements
                       of the resource catalog are
                       accessible.
          Add a user to a group
                       Using NT services
          Add/View/Modify/Delete the
                       Site location, user, password,
          site list which details the
          configuration and logon
          details for secure/subscription
          sites.
          View the activity log
          Control which activities are
          logged
          Control how often catalog
          updates are accepted.
    __________________________________________________________________________


Client Design

Introduction.

The various components that make up the information platform offer the user of the platform a host of functionality in an integrated, seamless information environment. To discuss the concepts and functionality of these components, the following discussion maps out the various capabilities of each.

Client Interface Design.

The client application must be able to incorporate the various controls in an integrated manner. In addition, there must be document display support, navigation, searching, and printing capabilities.

Installation.

There is a Windows installation application that installs all the necessary components on the user's system as well as locate the database that is used for Catalog information storage. The user is able to specify where the components are located and various installation settings during the installation process.

Upon launch of the installer, the application determines if current or previous versions of the application or components are installed on the user's system. If components are detected, the user is informed that some components are installed and the application installs or updates the required components.

The user has the choice of completing a Typical or Custom installation.

User Accounts.

During installation, the user selected a server that stores the Catalog and Case information. The user also may have selected an existing user account that was created on the server for them to use. If the account is not available at installation, the user is prompted to select an account when the application is first launched.

Once the user has installed the appropriate components, the application can be launched and the user is automatically logged into their user account using the stored user information. The user enters their user password each time the system is launched to access the system.

Global Elements.

Navigation.

There is a navigation support built into the information platform client.

History Tracking and Access.

Viewing and selecting from history of interactions;

History includes location, controls, entries;

Integration with the Windows' system history; and

Viewing history across sessions.

ARL Encoding and Display.

Navigation to specific elements;

Show the path, ARL, URL, other information about path;

Use a Open (Point interaction or allow free form typing, or both.

Progress Information Feedback.

Navigation to a location or data item;

Anytime the system is busy in a foreground task, animation and status feedback is necessary;

Background tasks, like queued retrieval or updating, should be viewable but not prominent

Menus

The menu consists of File, Edit, Tools, History, Favorites, and Help. In the following tables, menu separators are shown as double lines between table cells.

                  TABLE 3
    ______________________________________
    File Menu
    Menu Item
            Function
    ______________________________________
    New (   Cascades to the following items . . .
    Case . . .
            Displays the New Case dialog, allowing the user to create a
            new blank Case or a new Case from a template.
    Objective
            Creates a new Objective in the current case. If the user has
            an Objective folder selected, then the new Objective is
            created in that folder, otherwise it is created at the top
            level.
    Case Item
            Creates a new Case Item from the currently displayed item.
    Search Item
            Creates a new Search Item, opening the Search Catalog for
            the user to choose a search.
    User Set
            Creates a new User Set item, allowing the user to enter the
    Item    item properties.
    Favorite
            Creates a new Favorite for the currently selected item, this
            item can be a location (like a web site, database, document,
            local folder), Case element, or Catalog element.
    Folder  Creates a new folder in the item that has focus. The items
            that can have the focus are: in the Case control, Objectives,
            Case Items, Search Items; in the User Sets control, the
            control itself (top level) or a Set Folder; in the Favorites
            control, the control itself or a Favorites folder; in the
            Catalog, the various levels within the hierarchy.
    Open (  Cascades to the following items . . .
    Item    Opens the currently selected item, usually a folder or file.
    Case . . .
            Displays the Open Case dialog box.
    File . . .
            Displays an Open File dialog box, which allows the user to
            open a file.
    Location . . .
            Displays an ARL entry dialog box for the user to enter a
            location to open.
    Save (  Cascades to the following items . . .
    Item As . . .
            Displays a Save As dialog for the currently displayed
            information. The user can save the item as a file in the
            following formats: MS Word (*.doc); MS Excel (*.xls);
            plain text (*.txt); Web page (*.html); or MS PowerPoint
            (*.ppt).
    Case As . . .
            Displays the Save Case As dialog, allowing the user to save
            the Case as a new Case of as a Case template.
    Page    Displays the system Page Setup . . . dialog to allow the user
    Setup . . .
            to set printing options.
    Print . . .
            Prints the currently displayed item using the system Print
            dialog and print drivers.
    Delete  Deletes the currently selected file or item.
    Rename  Selects the name of the currently selected item for editing.
    Properties
            Displays the properties of the currently selected item.
    Exit    Exists the application.
    ______________________________________


TABLE 4 ______________________________________ Edit Menu Menu Item Function ______________________________________ Undo Reverses the last N actions by the user. The user can undo all the actions performed from the launch of the product, up to 100 actions. Cut This is enabled at any time the user can edit text. The currently selected text is copied to the clipboard and deleted from its current location. Copy This is enabled at any time the user can edit text. Copies the currently selected text to the clipboard. Paste This is enabled at any time the user can edit text. Pastes the text from the clipboard to the selection entry point. Delete This is enabled at any time the user can edit text. Removes the selected text from its current location. Select All Selects all the items or text displayed in the control that has focus. ______________________________________

TABLE 5 ______________________________________ Tools Menu Menu Item Function ______________________________________ Cases Displays the Cases control. Catalog Displays the Catalog control. Document Displays the Document Analyzer control. Analyzer User Sets Displays the User Sets control. Favorites Displays the Favorites control. <Supplemental List of applications that the user can evoke to manipulate Applications the data. Examples are Microsoft Office applications List> (Word, Excel, PowerPoint, Outlook) or reporting soft- ware (ACT). (option 1) Selecting an item from the list uses the application to edit or manipulate the currently displayed element in place. (option 2) Selecting an item from the list launches the application and open the currently selected element as a new untitled document within the application. ______________________________________

TABLE 6 ______________________________________ History Menu Menu Item Function ______________________________________ Back Navigates to the previously item in the History list. Disabled if the current item is the first item in the list. Forward Navigates to the next item in the History list. Disabled if the current item is the last item in the History list. <History List> Displays the list of items in the History list. Selecting an item navigates to that item. ______________________________________

TABLE 7 ______________________________________ Favorites Menu Menu Item Function ______________________________________ Add . . . Adds the current element to the user's Favorites, allows the user to specify a name and other properties with a dialog box. The item can be a location (e.g. a web site, database, document, local folder), Case element, or Catalog element. Open . . . Opens the Favorites control. <Favorites List> Displays the list of user Favorites, if there are folders and sub-folders, they are displayed as cascading menu items. Internet Cascades to a list of the Favorites for Internet Explorer Explorer ( of for the Windows system. Netscape Cascades to a list of the Favorites for Navigator. Nagivator ( ______________________________________

TABLE 8 ______________________________________ Help Menu Menu Item Function ______________________________________ Contents Opens the help file to the Contents. Index Opens the help file to the Index section. About Opens the About box for the application. ______________________________________


Toolbars

Controls.

The various components that make up the information platform offer the user of the platform a host of functionality in an integrated, seamless information environment. To discuss the concepts and functionality of these components, the following maps out the various capabilities of each.

Cases Control.

The Case is center of the information platform environment, it is the home base for the user to compile and review the information space that satisfies their business intelligence needs. The Cases control allows the user to set objectives, collect and review information for an individual Case, and manage multiple Cases. The elements of a Case are: Title, uniquely identifying the Case; Properties, defining the Case; Objectives, items to be fulfilled for the Case; Case Items, the information items that fulfill the Objectives; and Search Items, the results from source searches. The Cases Control uses the Hierarchy and Filtering List controls to display some of its elements. The Objectives, Case Items, and Search Items are all displayed in a combination hierarchy and list.

Case Elements

Case Title.

The Case Title is a user defined text field, up to 256 characters, titling the Case. The user can directly enter the title of the case in the field, and the user can retitle the Case at any time. When the user retitles a Case, if there is a Case that already has the specified title, the user is prompted to replace the old Case with the current one or title the current Case using a different title.

In a multi-user environment, the titling of the Case is limited to those with Ownership or Authorship status only.

Properties.

Each Case has a set of properties that can be reviewed by the user. The user can edit some of these properties (*), others are produced and updated by the system. Table 9 below identifies these properties.

                  TABLE 9
    ______________________________________
    Case Properties
    Property
            Description
    ______________________________________
    Icon    Iconic representation for the Case <may be different for
            shared, user is owner, user is author>
    Title*  Title for Case, user defined
    Description*
            User defined text
    Date & time
            System set time stamp
    created
    Date & time
            System updated time stamp
    last modified
    Owner*  User name of current Case owner
    Last    User name of last person to modify the Case
    modifier
    Base    Template used in creation, if any
    Template
    Case    Set of Case types that the Owner can select from when
    Categories
            initiating or editing the case; is predefined for templates,
            but still can be edited by Owner*
    Security/
            The security, access rights setting for this user
    access
    rights*
    Sharing Sharing settings for other users; if this user is the Owner,
            then can be edited
    ______________________________________


The user can edit the Title, Description, Owner, Categories, and Security settings. In a multi-user environment, editing these properties is limited to user with Ownership or Authorship status, except for Owner and Security properties, which can only be edited by the Owner or Aeneid Platform Administrator. If the current user is the Owner, then the Sharing settings for other users can be edited. If the user is not the Owner, it is disabled.

Objectives.

The objectives are akin to a list of tasks that the user completes to gather the appropriate information to satisfy a Case. These objectives are represented as a list of items, which are user defined or can be initially from a template. The status for each objective, and set of objectives, is user controlled, and not automated or updated by system.

Each objective has detailed properties associated with it; these properties are set forth in Table 10 below.

                  TABLE 10
    ______________________________________
    Objective Properties
    Property    Description
    ______________________________________
    Fulfillment Status
                A check box which indicates the task is fulfilled
    Icon        Iconic representation for the Objective
    Description A description of the task up to 250 characters
    Due Date    The date the task should be completed
    Status      Selection from a combo box with the following
                values: Not Started; In Progress; Completed;
                Waiting on Someone Else; Deferred
    Priority    Selection of: High; Medium; Low
    Percentage Complete
                0-100
    Owner       One of more Troy users, or Not Assigned
    ______________________________________


These fields correspond with the Microsoft Outlook Tasks. The information platform supports the transfer of items to Outlook; this allows the user to see the case tasks in the context of other cases, and tasks which are not information platform tasks. This mitigate the need for a user to see a list of all their cross case objectives within the information platform itself.

The user maintains each of these properties. In a multi-user environment, all users with Ownership or Authorship status can edit these items. The items can be viewed, edited, added, or removed from a Case. The user can also move objectives from one Case to another.

Organizing Objectives.

The user can create folders and sub-folders to organize the Case Objectives.

These are viewed in a hierarchical tree of items, which the user can navigate.

Selection of a folder displays the contained sub-folders and Objectives in the list.

When a sub-folder is displayed in the list, its associated properties are displayed. The properties for an objective folder are shown below in Table 11.

                  TABLE 11
    ______________________________________
    Objective Folder Properties
    Property    Description
    ______________________________________
    Fulfillment Status
                A check box which indicates that all the
                objectives within the folder are fulfilled
    Icon        Iconic representation <folder>
    Description A description of the folder up to 250 characters
    Due Date    <no entry>
    Status      <no entry>
    Priority    <no entry>
    Percentage Complete
                <no entry>
    Owner       <no entry>
    ______________________________________


The user can choose to view all the Objectives for a Case, or all the Objectives below a specified level, by right clicking on a level and choosing List All Contained Items (the specific implementation of this is to be determined). The selected folder remains selected, however, the hierarchy control is disabled and the list displays all the items in that folder and all contained sub-folders. The user can re-enable the hierarchy navigation by right clicking on the hierarchy and choosing List This Level Only.

Filtering & Sorting the Objectives.

The display of the Case Objectives uses the Filtering List control. This control allows the user to view a list of items and their associated properties, sort the list in an ascending or descending order based on a selected property, and filter the list on property criteria.

The Case Objectives (see Table 12 below) are listed with their properties. Each property can be used for sorting the list of Objectives and can be used to filter the list.

                  TABLE 12
    ______________________________________
    Case Objectives
    Property
            Logical operators
                        Filter options
    ______________________________________
    Fulfillment
            Is          Selection of: Fulfilled; Not fulfilled
    Status  Is not
            Contains*
            Starts*
            Ends*
            (*text entry only)
    Icon                <no filtering allowed>
    Description         Text entry
    Due Date            Date entry; fielded entry box <or
                        Selection of: Past Due; Not Yet Due;
                        Due Today; Due Tomorrow; Due This
                        Week>
    Status              Selection of: Not Started; In Progress;
                        Completed; Waiting on Someone Else;
                        Deferred
    Priority            Selection of: High; Medium; Low
    Percentage          Numeric entry of 0-100 <or Selection
    Complete            of: 0%; 0-25%; 26-50%; 51-75%; 76-
                        100%>
    Owner               Text entry
    ______________________________________


Case Items.

The Case Items are the information items that satisfy the Case Objectives. These items can represent one of two things: a collection item from the Catalog that needs to be fulfilled; or an information item that contains data. The collection items have an unchecked Collection Status property, while the information items have a checked Collection Status. Additionally, there is a different iconic representation for each type of item.

The Case Items are displayed in a list with each item's associated properties. The properties that are associated with a particular Case Item are shown below in Table 13.

                  TABLE 13
    ______________________________________
    Case Item Properties
    Property  Description
    ______________________________________
    Collection Status
              A check box, indicates if item is collection item or
              information item (information item is checked); the
              application manages the status of the item
    Icon      Iconic representation of the item
    Name      The name of the Case Item
    Description
              The item description, inherited from the Catalog
              collection item it was created from
    Criteria  The criteria for fulfillment
    Progress  Selection from a combo box with the following values:
              Collected; In Collection Queue; Waiting for Collection
              Timing; Deferred; Paused; Partially fulfilled; Error -
              unable to fulfill.
    Categories
              The selected item categories: for example, financial;
              press release; corporate; product review.
    Information Type
              Retrieved type of data: document; financial data; table.
    Input Type
              The type of input needed to retrieve: company name;
              SIC code; product name; product type.
    Source    The source the information is, or will be, collected
              from
    Source Type
              The type of source: web site; subscription; local;
              intranet.
    Collection Timing
              Setting showing when the information is fulfilled, for
              example: immediate; cheap rate; after 5 PM; weekend.
    Date & Time
              When the item was collected, if fulfilled
    Author    User who set the collection item or added the informa-
              tion item
    ______________________________________


Organizing Case Items.

Case Items can be organized into folders and sub-folders, using the Hierarchy and Filtering List controls. The folders and sub-folders are viewed in a hierarchical tree, which the user can navigate. Selection of a folder displays the contained sub-folders and Items in the list.

When a sub-folder is displayed in the list, its associated properties are displayed. The properties for a Case Item folder are listed below in Table 14.

                  TABLE 14
    ______________________________________
    Case Item Folder Properties
    Property  Description
    ______________________________________
    Collection Status
              A check box, indicates if items in the folder are all
              fulfilled; the application manages the status of the item
    Icon      Iconic representation <folder>
    Name      The name of the folder
    Description
              <no entry>
    Criteria  <no entry>
    Progress  <no entry>
    Categories
              <no entry>
    Information Type
              <no entry>
    Input Type
              <no entry>
    Source    <no entry>
    Source Type
              <no entry>
    Collection Timing
              <no entry>
    Date & Time
              When the folder's items were completely fulfilled
    Author    <no entry>
    ______________________________________


The user can choose to view all the Items for a Case, or all the Items contained in a specified level, by right clicking on the level and choosing List All Contained Items. The selected folder remains selected and the hierarchy control disables and the list displays all the items in that folder and all contained sub-folders. The user can re-enable the hierarchy navigation by right clicking on the hierarchy control and choosing List This Level Only.

Filtering & Sorting Case Items.

The display of the Case Items uses the Filtering List control. This control allows the user to view a list of items and their associated properties, sort the list in an ascending or descending order based on a selected property, and filter the list on property criteria.

The Case Items are listed with their properties. Each property can be used for sorting the list of Items and can be used to filter the list (see Table 15 below).

                  TABLE 15
    ______________________________________
    Case Item Properties
    Property
            Logical operators
                        Filter options
    ______________________________________
    Collection
            Is          Selection of: Collected; Not Collected
    Status  Is not
            Contains*
            Starts*
            Ends*
            (*text only)
    Icon                <no filtering>
    Name                Text entry
    Description         Text entry
    Criteria            Text entry
    Progress            Selection of: Collected; In Collection
                        Queue; Waiting for Collection
                        Timing; Deferred; Paused; Partially
                        fulfilled; Error - unable to fulfill
    Categories          Selection of items' categories, for
                        example: financial; press release;
                        corporate; product review
    Information         Selection of items' types: document;
    Type                financial data; table
    Input Type          Selection of items' types: company
                        name; SIC code; product name; product
                        type
    Source              Text entry
    Source Type         Selection of items' types: web site;
                        subscription; local; intranet
    Collection          Selection of items' timings: immediate;
    Timing              cheap rate: after 5 pm; weekend
    Date & Time         Date entry; fielded entry box
    Author              Text entry
    ______________________________________


Viewing Case Items.

Case Items are either collection items or information items. Collection items are added from the Catalog or are supplied from the Catalog as part of a Case template. These items have collection criteria associated with them that the user can view and edit. Selecting the Case Item in the list displays the collection criteria for the user to review and edit. These include some of the properties that are associated with the Case Item.

Information items can be items that were added from the Catalog as collection items and are now fulfilled, items that were added directly from information sources while browsing, or items transferred from other Cases. These items have the same properties as the collection items, however, the Status is fulfilled, Progress is collected, and Collection Timing cannot be changed. Selecting the Case Item in the list displays the information that was collected.

Editing Case Items.

The user can edit the Case Item properties as well as the content. When the Case Item represents an unfulfilled collection item, the content is the collection settings. When the Case Item represents a fulfilled information item, the content is the collected data.

Editing Properties.

The user can edit the properties of a Case Item directly in the list control or by selecting the item and choosing Properties from the menu. In the list control, the user can choose a property setting for an item and change its setting. If the property is a free from text entry, the user can select and edit the text. If the property has a limited number of possible settings, those settings can be chosen from a drop down list. If the setting requires complex interaction, the user can choose the setting and click on an ellipse (. . . ) button to open a dialog to change the settings.

If the user chooses Properties from the menu, the user is presented with a Property Sheet dialog for the item. Within this dialog, the user can change the property settings for the item.

Editing Collection Settings.

If the Case Item is a collection item that has not yet been fulfilled, clicking on it displays the collection settings associated with that item. The user can change the settings and the item is automatically updated. This interaction is the same as when the user first added the item to the Case.

Editing Collected Data.

Once the Case Item has been fulfilled, clicking on it displays the collected information. The user can edit this information directly using in-place activation.

Adding Case Items.

Case Items are added the user from interaction with the Catalog, by selecting New (Case Item from the menu, adding directly from a source, or by copying and pasting from another Case.

Search Items

The Search Items are source searches using the Search Catalog, and are intended to allow the user to perform limited scope searching. Similar to the Case Items, these items can represent one of two things: a search item from the Search Catalog that in the process of fulfillment; or a results item that contains returned search results. The former have an unchecked Completion Status property, while the latter have a checked Completion Status. Each has a different iconic representation.

The Search Items are displayed in a list with each item's associated properties. The properties that are associated with a Search Item are shown in Table 16 below.

                  TABLE 16
    ______________________________________
    Search Item Properties
    Property  Description
    ______________________________________
    Completion
              A check box, indicates if item is a search item or a
    Status    search results item (results are checked); the applica-
              tion manages the status of the item
    Icon      Iconic representation of the item
    Name      The name of the search item
    Description
              The item description, inherited from the Search
              Catalog item it was created from
    Criteria  The criteria for fulfillment
    Progress  Selection from a combo box with the following values:
              Collected; In Collection Queue; Waiting for Collection
              Timing; Deferred; Paused; Partially fulfilled; Error -
              unable to fulfill.
    Source    The source(s) the information was, or will be,
              collected from
    Source Type
              The type(s) of source(s): web site; subscription; local;
              intranet
    Search Timing
              Setting showing when the information is fulfilled, for
              example: immediate; cheap rate; after 5 pm; weekend
    Date & Time
              When the search was completed, if fulfilled
    Author    User who set the collection item or added the informa-
              tion item
    ______________________________________


Organizing Search Items.

Search Items can be organized into folders and sub-folders, using the Hierarchy and Filtering List controls. The folders and sub-folders are viewed in the hierarchical tree, which the user can navigate. Selection of a folder displays the contained sub-folders and search items in the list.

When a sub-folder is displayed in the list, its associated properties are displayed. The properties for a search folder are shown in Table 17 below.

                  TABLE 17
    ______________________________________
    Search Folder Properties
    Property   Description
    ______________________________________
    Completion Status
               A check box, indicates if item if all contained
               items are completed; the application manages the
               status of the item
    Icon       Iconic representation <folder>
    Name       The name of the folder or sub-folder
    Description
               <no entry>
    Criteria   <no entry>
    Progress   <no entry>
    Source     <no entry>
    Source Type
               <no entry>
    Search Timing
               <no entry>
    Date & Time
               When the folder search items were all completed
    Author     <no entry>
    ______________________________________


The user can choose to view all the Search Items contained in a specified level and its sub-folder, or all the Search Items in a Case, by right clicking on a folder and selecting View All Contained Items from the pop up menu. The hierarchy control disables and all the items at that level and all contained levels are displayed in the list. The user can right click on the hierarchy control and select

List This Level Only to re-enable folder by folder navigation of the items.

Filtering & Sorting Search Items.

The display of the Search Items uses the Filtering List control. This control allows the user t