Document detection system with improved document detection efficiency5907841Abstract A document detection system capable of detecting a desired document from a large number of documents easily and accurately in which the user can make a judgement concerning the appropriateness of the detection result quickly. In the system, those documents which contain a semantic structure of a detection command containing natural language expressions entered by a user are detected. Also, the keywords of each document can be extracted from the summary of each document and those documents whose keywords match with detection keywords specified by a user can be detected. Also, the summary of each detected document can be automatically generated according to text structures of each detected document and displayed along with the detected document itself. Also, the detection processing can be carried out with respect to the summaries of the documents instead of the documents themselves. Claims What is claimed is: Description BACKGROUND OF THE INVENTION
______________________________________
SERIAL RELATION right stressed type
DIRECTION right stressed type
EXEMPLIFICATION left stressed type
______________________________________
such that the relationship table can indicate that the relationship of "serial relation" is the right stressed type, for example. In a case the relationship indicated by the "text structure Rel" is the right stressed type, the node on the right hand side is more important, so that the penalty value obtained by adding the entered penalty value P and a predetermined penalty P1 is given to the "text structure L" at the step 2104, while the entered penalty value P itself is given to the "text structure R" at the step 2105. On the contrary, in a case the relationship indicated by the "text structure Rel" is the left stressed type, the node on the left hand side is more important, so that the entered penalty value P itself is given to the "text structure L" at the step 2107, while the penalty value obtained by adding the entered penalty value P and a predetermined penalty P1 is given to the "text structure R" at the step 2108. On the other hand, in a case the relationship indicated by the "text structure Rel" is neither one of the right stressed type and the left stressed type, the entered penalty value P itself is given to both of the "text structure L" and the "text structure R" at the steps 2109 and 2110. In this manner, when the penalty calculation is made recursively for all the nodes, the sentence with a higher importance will have a smaller penalty value, while the sentence with a lower importance will have a larger penalty value. Also, in this key sentence judgement unit 143, the manner of setting the penalty value can be altered by changing the data in the relationship table, so that by allowing the user to define each relationship in the relationship table as either one of the right stressed type and the left stressed type freely, the selections of the key sentences at the key sentence judgement unit 143 can be adjusted such that it becomes possible for the text re-construction unit 144 to generate the summary from a point of view desired by the user. The text re-construction unit 144 operates according to the flow chart of FIG. 22, as follows. First, at the step 2201, all the sentences with the penalty obtained at the key sentence judgement unit 143 less than a predetermined threshold P2 are selected. Then, the connective expressions in the sentences selected at the step 2201 are changed to generate the summary at the step 2202. To illustrate the operation of the text reconstruction unit 144 more concretely, consider a simple exemplary case of handling the following three sentences. (1) In order to build a nice building, nice materials are necessary. (2) However, nice tools are also necessary. (3) In other words, the materials alone are not sufficient. In this case, the text structure of these three sentences can be expressed as follows. (1<NEGATIVE RELATION> (2<REPHRASE>3) Now, suppose the sentences (1) and (3) are selected as the key sentences by the key sentence judgement unit 143. Then, if the sentences (1) and (3) are simply connected, the following sentences which are logically different from the original sentences would be generated. "In order to build a nice building, nice materials are necessary. In other words, the materials alone are not sufficient." In order to avoid such an erroneous generation of an inaccurate summary, at the step 2202 described above, the connective expression between the sentences is replaced by that which corresponds to the most superior relationship among the selected key sentences. Namely, in a case of the sentences (1) and (3) above, the most superior relationship between these sentences (1) and (3) is that of "negative relation" according to the text structure described above, so that the connective expression "In other words" in the sentences generated above is replaced by the connective expression "However" corresponding to this "negative relation" relationship, so as to obtain the following sentences, which can be considered as the summary of the original sentences (1) to (3). "In order to build a nice building, nice materials are necessary. However, the materials alone are not sufficient." In this manner, the text re-construction unit 144 can generate the summary which is logically consistent with the original sentences. In addition, by changing the value of the threshold P2 used at the step 2201 described above, the size of the summary to be generated can be controlled. Consequently, by allowing the user to change this threshold P2 freely, it becomes possible to obtain and display the summary of the desired size. In this first embodiment, the individual data storage unit 16 stores the correspondences between the summaries and the original sentences in a form of a summary sentence memory shown in FIG. 23. In this case, each summary sentence memory data includes an original sentence pointer, a summary pointer, and a summary sentence number, which are stored in a continuous memory region to indicate their correspondences. The summary sentence number indicates the sentence numbers of the sentences forming the summary. The detection control unit 18 also determines the display priority order among the detected documents by using the detection result obtained from the detection key and the syntactic and semantic analyses results for the input sentences which are obtained by the detection processing unit 13 and stored in the individual data storage unit 16, and the summary data obtained by the summary generation unit 14 and stored in the individual data storage unit 16 as follows. Namely, the display priority order is determined according to the prescribed priority order conditions provided as a display priority rule dictionary in the detection control unit 18, as shown in FIG. 24. The detection control unit 18 then controls the detection result output unit 17 to display the titles of the detected documents in the determined display priority order as the detection result. As a concrete example, FIG. 25 shows an exemplary data content of the individual data storage unit 16, and FIG. 26 shows an exemplary display priority order determined for the exemplary data content of FIG. 25. Here, the detection processing unit 13 counts a number of times each document has been detected as the detection result and stores this number of detections for each document in the document storage unit 15. Then, in a case the detection result includes more than one documents with the same display priority order, the document with the greater number of detections is displayed before the document with the lesser number of detections, as indicated in FIG. 26. As a concrete example, FIG. 27 shows an exemplary data content for the detection result stored in the individual data storage unit 16, and FIG. 28 shows an exemplary screen display of the detection result according to the exemplary data content of FIG. 27. Now, various variations of the above described first embodiment of a document detection system according to the present invention will be described. First, the first variation concerning the ambiguity in the analysis results of the morphological, syntactic, and semantic analyses will be described. Namely, in the first embodiment described above, it has been assumed that there is no ambiguity in the analysis results of the morphological, syntactic, and semantic analyses. In contrast, in this first variation, a case of dealing with the ambiguity in the analysis results of the morphological, syntactic, and semantic analyses will be described. In this case, the detection control unit 18 possesses an analysis result learning dictionary for storing the analysis results for the input character string selected by the user in the past detection operations, in a form shown in FIG. 29. In addition, the semantic structure index memory in the document storage unit 15 stores all the analysis results obtained by syntactic and semantic analyses of the sentences in each document in the document database in correspondences, whenever a plurality of analysis results are obtained. The other features are substantially identical to those in the first embodiment described above. In this first variation, the ambiguity in the analysis result is handled by the following operation according to the flow chart of FIG. 30. Namely, in a case the input analysis unit 12 obtained the analysis results for the input sentence as shown in FIG. 31 which includes an ambiguity at the step 2801, the input analysis unit 12 stores the obtained plurality of analysis results into the individual data storage unit 16, and transmits a signal indicating an occurrence of the ambiguity to the detection control unit 18. In response, the detection control unit 18 takes out an ambiguous part of the input character string corresponding to the plurality of analysis results stored in the individual data storage unit 16, and looks up the analysis result learning dictionary for the ambiguous part at the step 2802, and then looks up the semantic structure index memory for each one of the syntactic and semantic analysis results looked up from the analysis result learning dictionary at the step 2803. Then, at the step 2804, the detection result output unit 17 displays the detection results obtained at the step 2803 along with the plurality of analysis results and appropriate messages. Here, in a case the character string coinciding with the ambiguous part of the input character string is detected in the analysis result learning dictionary at the step 2802, the appropriate message which notifies the fact that this analysis result had been selected in the past is attached to the analysis result corresponding to the coinciding character string. On the other hand, in a case the coinciding semantic structure is detected in the semantic structure index memory at the step 2803, the appropriate message which notifies the fact that the document containing this semantic structure is present in the document storage unit 15 is attached to the detected coinciding semantic structure. Next, at the step 2805, a completion of the selection of an appropriate analysis result from the displayed plurality of analysis results to be made by the user is awaited. Then, whether the selected analysis result is that which has an ambiguity or not is determined at the step 2806. If the selected analysis result is an unambiguous one, the operation proceeds to the step 2810 described below, whereas otherwise next at the step 2807, a message for inquiring whether or not to delete the unselected analysis results from the semantic structure index memory to the user is displayed. Then, whether the user selected "delete" as an answer to the above inquiry is determined at the step 2808, and only when the "delete" is selected, the unselected analysis results are deleted from the semantic structure index memory at the step 2809. Finally, at the step 2810, the selected analysis result is stored into the analysis result learning dictionary as well as the individual data storage unit 16. Thus, in this first variation, the semantic structure index memory in a state shown in FIG. 32A in which a plurality of analysis results causing the ambiguity are present can be turned into a state shown in FIG. 32B in which the ambiguity is resolved as only the semantic structure of "COMPUTER-object.fwdarw.DESIGN" is remaining according to the selection made by the user. After the above described ambiguity resolving operation of FIG. 30, the detection control unit 18 operates according to the flow chart of FIG. 33 as follows. Namely, after the selected analysis result is stored into the analysis result learning dictionary and the individual data storage unit 16, the detection control unit 18 transmits a signal indicating a completion of the ambiguity resolving operation to the input analysis unit 12. In response, the input analysis unit 12 produces the detection key from the syntactic and semantic analysis results stored in the individual data storage unit 16 at the step 3101 which are the syntactic and semantic analysis results detected from the semantic structure index memory. Then, the keyword index memory is looked up for the keywords in the detection key at the step 3102, and the set calculation for the detected documents according to the detection key and its result is stored in the individual data storage unit 16 at the step 3103. Next, the second variation concerning the input sentence using the logic operators will be described. Namely, in the first embodiment described above, it has been assumed that the input sentence is given in the natural language. In contrast, in this second variation, a case of dealing with the input sentence using the logic operators in conjunction with the natural language will be described. In this case, the input analysis unit 12 is going to carry out the logic operator interpretation processing for the input sentence containing the logic operators. For example, FIG. 34 shows an exemplary case of the input sentence using the logic operator "+", in which the logic operator interpretation processing interprets the meaning of this logic operator "+" to yield four semantic structures. The final detection result will then be obtained as a sum set of all the document sets obtained by the detection processing using each of these four semantic structures. In FIG. 34, "niyoru" indicates a Japanese expression meaning "by". As another example, FIG. 35 shows an exemplary case of the word "something" in the input sentence is converted into a symbol "?", which will be regarded as matching with arbitrary word in the detection processing. In FIG. 35, a symbol "goal" indicates the objective case relationship. Next, the third variation concerning the detection processing using the bibliographical matters in the input sentence. Namely, in the first embodiment described above, the detection processing has been carried out for all the documents without utilizing their bibliographical matters at all. In contrast, in this third variation, a case of utilizing the bibliographical matters of the desired document such as the title, the author, and the issue date that can be specified in the input sentence will be described. For example, FIG. 36 shows an exemplary case of the input sentence specifying the bibliographical matters of the desired document as those written by "M. Tanaka" and issued "since 1980". In response, the detection processing unit 13 carries out the detection processing for only those documents which have "M. Tanaka" as the author and the issue year not less than "1980", according to the bibliographical matter analysis rules shown in FIG. 37 which are provided in the detection processing unit 13. In this case, the detection processing is carried out by looking up the keyword index memory for the keyword "machine translation" obtained as the analysis result as indicated in FIG. 36. Here, the word "papers" is not used as a keyword because of the application of the unnecessary expression extraction rule as shown in FIG. 38. The procedure for carrying out the detection processing is substantially similar to that in the first embodiment described above. Next, the fourth variation concerning the display priority order setting will be described. Namely, in the first embodiment described above, the display priority order is determined by weighting the detected documents in accordance with their summaries. In contrast, in this fourth variation, the detected documents are weighted according to the display priority scores determined in accordance with the document structure analysis result such as the title, table of content, index, and references of each document. For example, the display priority scores can be assigned as indicated in FIG. 39. In this case, the highest score for which each document qualify under the conditions concerning the keyword and the highest score for which each document qualify under the conditions concerning the semantic structure are added together as the score of each document, and the detected documents are displayed in a descending order of the assigned score. Next, the fifth variation concerning the detection result display will be described. Namely, in the first embodiment described above, the detection result displayed by the detection result output unit 17 only contained the titles, etc. In contrast, in this fifth variation, a case of enabling the user to request the display of various data concerning the detected documents will be described. In this case, in response to the command from the user entered through the input unit 11, the detection processing unit 13 controls the detection result output unit 17 to display a list containing each set of two keywords in the detection key which are coinciding with the target word and the source word in the semantic structure index memory along with a relation symbol registered between these two keywords. For example, when the semantic structure index memory is in a state as shown in FIG. 40 which indicates the presence of the documents having different relationships between two keywords "computer" and "design", according to the command from the user entered through the input unit 11, the detection processing unit 13 controls the detection result output unit 17 to display a list for these two semantic structures, whenever one keyword in the detection key coincides with either the target word or the source word for these two semantic structures in the semantic structure index memory. On the other hand, when the semantic structure index memory is in a state as shown in FIG. 41, the relationships between the keyword "example" and the other keywords can be displayed whenever the keyword "example" in the detection key is detected in the semantic structure index memory, such that the user can immediately recognize the semantic structures contained In the documents stored in the document storage unit 15. In this case, after the keyword detection operation using the detection key is carried out, according to the command from the user entered through the input unit 11, the detection processing unit 13 looks up the semantic structure index memory for the file name of each document obtained by the keyword detection operation, and controls the detection result output unit 17 to display the document name of each document along with all the semantic structures contained in each document in a format of the target word, the relation symbol, and the source word. At this point, a list display of all the documents containing each semantic structure can be provided for each semantic structure separately along with a display of the semantic structures, such that the user can Immediately recognize the semantic structures contained in the documents obtained by the keyword detection operation. Similarly, after the semantic structure detection operation using the syntactic and semantic analysis results for the input character string is carried out, according to the command from the user entered through the input unit 11, the detection processing unit 13 looks up the semantic structure index memory for the file name of each document obtained by the semantic structure detection operation, and controls the detection result output unit 17 to display the document name of each document along with all the semantic structures contained in the document in a format of the target word, the relation symbol, and the source word. At this point, a list display of all the documents containing each semantic structure can be provided for each semantic structure separately along with a display of the semantic structures, such that the user may even imagine the text contents of the detected documents. In addition, after the summary generation operation by the summary generation unit 14 is carried out, the detection control unit 18 can control the detection result output unit 17 to display the summaries of the detected documents as well as the correspondences between the summaries and the semantic structures, such that the user can actually know the text contents of the detected documents. Next, the sixth variation concerning the use of more than one input sentences will be described. Namely, in the first embodiment described above, only one input sentence has been used. In contrast, in this sixth variation, a case of using more than one input sentences in the detection processing will be described. In this case, it is possible for the user to enter more than one input sentences or a text. It is also possible to analyze the relationships among more than two keywords for the purpose of the detection processing by the input analysis. As an example, FIG. 42 shows an exemplary semantic structure index memory in this sixth variation in which one semantic structure is specified by the relationship among three keywords. In addition, in this sixth variation, the input sentence may be given in a form of a character string such as a file name of a particular document rather than the natural language sentence aimed for commanding the desired detection processing as in the first embodiment described above, such that the detection of the documents similar to the particular document specified by the input character string can be carried out as follows. In this case, the morphological, syntactic, and semantic analyses are applied to all the sentences in the particular document specified by the input character string and then the appropriate detection keys are produced from the obtained analysis result. Then, the detection processing operation is carried out by using the semantic structures and the detection keys obtained from that particular document, to detect the other documents having the similar semantic structures and keywords as that particular document. Here, each semantic structure may be associated with a count indicating a number of times at which each semantic structure appears in that particular document, and the detected documents similar to that particular document may be displayed in an order starting from those detected documents containing the semantic structure associated with the larger count. Next, the seventh variation concerning the procedure for the input analysis and the detection processing will be described. Namely, in the first embodiment described above, the detection processing including the keyword detection operation and the semantic structure detection operation is carried out only after the input analysis including the morphological, syntactic, and semantic analyses of the input character string is completed. In contrast, in this seventh variation, a case of carrying out the keyword detection operation immediately after the morphological analysis of the input character string, which is followed by the syntactic and semantic analyses of the input character string and the semantic structure detection operation, will be described. In this case, the input analysis unit 12 possesses a keyword extraction rule dictionary for specifying rules for extracting the content words, which has an exemplary form as shown in FIG. 43. Here, the user can freely modify, delete, and add the rules to this keyword extraction rule dictionary. In this seventh variation, the input analysis and the detection processing are carried out according to the flow chart of FIG. 44 as follows. First at the step 4401, the input analysis unit 12 carries out the morphological analysis for the input character string, and stores its result into the individual data storage unit 16. Then, the input analysis unit 12 extracts the content words from the input character string as the detection target keywords according to the morphological analysis result by using the keyword extraction rule dictionary and the unnecessary word dictionary. Here, according to the keyword extraction rule dictionary shown in FIG. 43, the word whose part of speech is noun or verb is extracted as a content word at the step 4402, and the unnecessary word dictionary is looked up for each extracted content word, and those content words not coinciding with the words registered in the unnecessary word dictionary are set as the detection target keywords at the step 4403. Then, at the step 4404, the input analysis unit 12 produces the detection key by using the obtained keywords and the appropriate logic operators, and store the produced detection key in the individual data storage unit 16. Also, at the step 4405, the related word dictionary is looked up for the keywords, and the detection keys in which the keywords are replaced by the looked up related words are also produced by the input analysis unit 12. Next, the detection processing unit 13 looks up the keyword index memory for the keywords in the detection key at the step 4406, and then carries out the set calculation for the detected documents according to the detection key and stores its result into the individual data storage unit 16 at the step 4407. Then, whether there is at least one document is stored in the individual data storage unit 16 and there is a sentence containing more than one keywords in said at least one document, or not is judged at the step 4408. Only when there is at least one document is stored in the individual data storage unit 16 and there is a sentence containing more than one keywords in said at least one document at the step 4408, the input analysis unit 12 takes out the morphological analysis result stored in the individual data storage unit 16 and carries out the syntactic analysis and the semantic analysis at the steps 4409 and 4410, respectively. Then, at the step 4411, the structure coinciding with that registered in the unnecessary expression extraction rule is deleted. Then, whether the semantic structure is obtained or not is judged at the step 4412, and only when the semantic structure is obtained, the detection processing unit 13 looks up the semantic structure index memory for the obtained semantic structure at the step 4413, and stores the result obtained at the step 4413 into the individual data storage unit 16 at the step 4414. Next, the eighth variation concerning the procedure or the treatment of the documents stored in the document storage unit 15 will be described. Namely, in the first embodiment described above, the morphological, syntactic, and semantic analyses are carried out on all the documents stored in the document storage unit 15 in advance to prepare the semantic structure index memory. In contrast, in this eighth variation, a case of not carrying out the morphological, syntactic, and semantic analyses on the documents stored in the document storage unit 15 in advance will be described. In this case, the detection processing unit 13 carries out the keyword detection operation according to the detection key produced by the input analysis unit 12 and stored in the individual data storage unit 16. Then, when more than one documents are obtained by this keyword detection operation, whether there is a document having at least one sentence containing more than one keywords or not is judged according to the file name of each obtained document and the sentence numbers of the sentences containing the keywords in the obtained document. Then, when such a document exists, the morphological, syntactic, and semantic analyses are carried out for the sentences contained in such a document, and the analysis results are stored in the individual data storage unit 16. On the other hand, the Input analysis unit 12 takes out the morphological analysis result for the input character string stored in the individual data storage unit 16, and carries out the syntactic and semantic analyses for the input character string. Then, in a case the syntactic and semantic analysis results are obtained, the obtained syntactic and semantic analysis results are matched with the syntactic and semantic analysis results for the documents stored in the individual data storage unit 16, and the matching result is stored in the individual data storage unit 16 as the detection result. Here, the detection result contains those which are not completely matching, and those which are not completely matching are accompanied by information indicating this fact in the individual data storage unit 16. This information can be used at a time of setting the display priority orders, such that the document without this information is displayed before the document with this information among the documents in the same display priority order. For example, when the semantic structure obtained from the input sentence is as indicated in (a) of FIG. 45, and there is a document containing an expression "design automation system using computer" which has the semantic structure as indicated in (b) of FIG. 45, such a document will be included in the detection result even though this semantic structure of (b) of FIG. 45 does not match the semantic structure of (a) of FIG. 45 completely, because this semantic structure of (b) of FIG. 45 contains all of the keyword "computer", the relation "instrument", and the keyword "design"0 relevant to the semantic structure of (a) of FIG. 45. Next, the ninth variation concerning the procedure for the detection processing and the summary generation will be described. Namely, in the first embodiment described above, the detection processing including the keyword detection operation and the semantic structure detection operation and the summary generation are carried out continuously. In contrast, in this ninth variation, the user is allowed to specify the order for carrying out the keyword detection operation, the semantic structure detection operation, and the summary generation operation, and to command whether or not to continue the subsequent processing after each of the keyword detection operation, the semantic structure detection operation, and the summary generation operation is completed. In this case, the user can specify the order of executions of the operations by various modules, and command whether or not to continue the subsequent processing, according to the selected document storage unit or the particular detection target document. Next, the tenth variation concerning the display of the summary will be described. Namely, in the first embodiment described above, the summary generation unit 14 does not provide any information concerning the manner in which the summary had been produced. In contrast, in this tenth variation, the reliability of the summary is indicated by displaying a number or a rate of the rhetorical expressions used as clues for producing the summary. In this case, the display of the detection result or the summary includes a number of rhetorical expressions such as "for example", "as a background", etc., or a rate of these rhetorical expressions with respect to a total number of clauses in the entire document, or the rate of a number of characters in these rhetorical expressions with respect to a total number of words in the entire document. For example, FIG. 46 shows an exemplary screen display of the detection result containing the summary reliability given by a percentage indicating the rate of a number of characters in the rhetorical expressions with respect to a total number of words in the entire document. Next, the eleventh variation concerning the treatment of data used in the detection processing will be described. Namely, in the first embodiment described above, the keywords and/or the semantic structures used in the detection processing are not retained after the detection processing is completed. In contrast, in this eleventh variation, a case of retaining the keywords and/or the semantic structures used in the already completed detection processing will be described. In this case, the detection control unit 18 stores the keywords and/or the semantic structures contained in the summary generated by the summary generation unit 14, in relation to the document from which the summary has been generated, in the document storage unit 15. Then, in a case the keywords and/or the semantic structures stored in relation to the document exist, the detection processing unit 13 carries out the keyword detection operation and/or the semantic structure detection operation by using these stored keywords and/or semantic structures. Here, instead of storing the keywords and/or semantic structures in relation to the document, the summary keyword index memory and/or the summary semantic structure index memory may be produced and utilized in the detection processing. Next, the twelfth variation concerning the use of the summary will be described. Namely, in the first embodiment described above, the summary is generated only after the detection processing is completed. In contrast, in this twelfth variation, the summaries are generated for all the documents stored in the document storage unit 15 in advance, and the keyword index memory and the semantic structure index memory used in the detection processing are formed by only those keywords and semantic structures which are contained in the generated summaries. In this case, the detection processing unit 13 carries out the keyword detection operation and the semantic structure detection operation for only those keywords and semantic structures which are contained in the generated summaries. Next, the thirteenth variation concerning the expression of the semantic structure will be described. Namely, in the first embodiment described above, the semantic structure is expressed by using words. In contrast, in this thirteenth variation, a case of expressing the semantic structure by using the symbols or the numbers assigned to the meanings of words will be described. In this case, the input analysis unit 12 possesses an analysis word dictionary for storing the description of the meaning of each word and a symbol or a number assigned to it. Here, in a case the word used in the input character string is a multivocal word, the user is allowed to select a desired meaning from a plurality of meanings displayed by the detection result output unit 17. Also, the input analysis unit 12 stores the symbol or the number corresponding to the selected meaning along with the word in the analysis result memory and the individual data storage unit 16. In this case, the detection processing by the detection processing unit 13 and the summary generation operation by the summary generation unit 14 are also carried out in terms of the symbols or the numbers instead of the words. Next, the fourteenth variation concerning the manner of displaying the detection result will be described. In this fourteenth variation, the display means 4 in the overall configuration of FIG. 1 uses a bit map display in which the display screen can be divided such that a list of identifiers of the detected documents can be displayed simultaneously along the text content of the original document and the summary for a selected one of the detected documents. The user enters the input for selecting the desired one of the detected documents as well as command input for controlling the manner of detection result display, through the input means 6. Here, the detection processing unit 13 stores either the detected documents themselves or their identifiers as the detection result obtained by the detection processing operation in the individual data storage unit 16. Also, in the first embodiment described above, the input from the user is given in a natural language and the morphological, syntactic, and semantic analyses are applied to the input given in a natural language, but in this fourteenth variation, the keywords or the detection keys may be entered directly. When the input is given by the keywords or the detection keys directly, the operation at the input analysis unit 12 can be skipped and the detection processing unit 13 can be activated immediately. Also, in the first embodiment described above, all the detected documents obtained by the detection processing operation by the detection processing unit 13 are stored in the individual data storage unit 16. However, in this fourteenth variation, all the detected documents are stored in the individual data storage unit 16 only when a number of detected documents is relatively small, and when a number of detected documents is relatively large, only the identifiers of the detected documents are stored as the detection result in the individual data storage unit 16, and the summary generation unit 14 and the detection result output unit 17 obtains the detected documents themselves from the document storage unit 15 according to the identifiers stored in the individual data storage unit 16. It is also possible for the individual data storage unit 16 to utilize an externally connected disk memory to store the detected documents temporarily in a case a number of detected documents is large. Also, in this fourteenth variation, the summary generated by the summary generation unit 14 contains various information such as a correspondence relationship between the original document and the summary, the title, author, chapter and section headers, etc. of the document, as well as the abstract of each chapter of the document. After the operation by the summary generation unit 14 is completed, the detection control unit 18 controls the detection result output unit 17 to display a list of the identifiers of the detected documents in a prescribed display priority order, while simultaneously displaying the summary and the text content of the original document for the detected document with the highest display priority order on the divided display screen. When the user enters the input for selecting the document identifier for the document other than the document with the highest display priority order, the display of the summary and the original document are changed to those for the selected document identifier. When the user enters the input for requesting the change of the displayed content of either one of the summary and the original document, the displayed content of the requested one of the summary and the original document is changed accordingly, and the displayed content of the other one of the summary and the original document is also changed in correspondence. Here, it is also possible for the summary generation unit 14 to generate the summary only for the document with the highest display priority order at first, and whenever the request to change the displayed document is entered by the user, the summary generation unit 14 is activated to generate the summary for the requested new document to be displayed. In this fourteenth variation, the individual data storage unit 16 stores the correspondences between the summaries and the original documents in a form of a summary information memory shown in FIG. 47. In this case, each summary information memory data includes an original document pointer, a summary pointer, and a document structure pointer to the document structure of the original document, which are stored in a continuous memory region to indicate their correspondences. Here, the document structure of the original document contains various information such as the sentence numbers and the sentence positions in the original document, the sentence title, author, chapter and section headers, etc. corresponding to the sentence numbers. Also, the summary contains summary sentences as well as the sentence numbers in the original document indicating the correspondences between the summary and the original document. Here, the detection control unit 18 also determines the display priority order among the detected documents according to the prescribed priority order conditions provided as a display priority rule dictionary in the detection control unit 18, similar to the first embodiment described above. The detection control unit 18 then controls the detection result output unit 17 to display a list of the identifiers of the detected documents in the determined display priority order as the detection result. As a concrete example, FIG. 48 shows an exemplary detection result display in which the title and the author of the document are used as the document identifier, and in which a plurality of document identifiers are arranged in the display priority order determined by the detection control unit 18. In FIG. 48, a frame enclosing the title of one document indicates the currently selected document. Thus, in the initial detection result display, the frame is located around the title of the first document in the list which has the highest display priority order as shown in FIG. 48. In addition, the detection result display also include the display of the summary as shown in FIG. 49 for the first document in the list shown in FIG. 48 as an example, and the display of the original document as shown in FIG. 50 for the first document in the list shown in FIG. 48 as an example. Here, Just as in the first embodiment described above, the detection processing unit 13 counts a number of times each document has been detected as the detection result and stores this number of detections for each document in the document storage unit 15. Then, in a case the detection result includes more than one documents with the same display priority order, the document with the greater number of detections is displayed before the document with the lesser number of detections. Next, the fifteenth variation concerning the detailed manner for changing the displayed content in the detection result display will be described. FIG. 51 shows an exemplary screen display for allowing the user to change the displayed content in the detection result display. In this fifteenth variation, the commands for changing the displayed content of the detection result display are entered by selecting appropriate one of the displayed buttons by using the mouse. To this end, in FIG. 51, the display changing buttons are provided for commanding a top, previous page, next page, bottom, previous chapter, next chapter, and character string search. For example, by selecting the next page button, the displayed content of the summary and the original document as shown in FIGS. 49 and 50 can be changed to those shown in FIGS. 52 and 53, for example. In further detail, the operation to change the displayed content in the detection result display can be carried out according to the flow chart of FIGS. 54A and 54B, as follows. First, when the display change request from the user is detected at the step 5401, which one of the display changing buttons has been selected by the user is judged by sequentially comparing the requested code with the codes assigned to the top button, the previous page button, the next page button, the bottom button, the previous chapter button, the next chapter button, and the character string search button, at the steps 5402, 5404, 5406, 5408, 5410, 5412, and 5419, respectively. When it is judged that the top button has been selected at the step 5402, next at the step 5403, the summary display pointer is shifted to the top position. When it is judged that the previous page button has been selected at the step 5404, next at the step 5405, the summary display pointer is shifted backward for a prescribed amount proportional to one screen display size. Here, the summary display pointer is shifted for a prescribed amount proportional to one screen display size such that the user can set up the size of each page freely by changing the size of the summary display. Similarly, when it is judged that the next page button has been selected at the step 5406, next at the step 5407, the summary display pointer is shifted forward for a prescribed amount proportional to one screen display size. When it is judged that the bottom button has been selected at the step 5408, next at the step 5409, the summary display pointer is shifted to the bottom position. When it is judged that the previous chapter button has been selected at the step 5410, next at the step 5411, the summary display pointer is shifted to the top of the previous chapter. When it is judged that the next chapter button has been selected at the step 5412, next at the step 5413, the summary display pointer is shifted to the top of the next chapter. When it is judged that the character string search button has been selected at the step 5419, next at the step 5420, the character string search for the specified character string to be searched is carried out on the summary and the summary display pointer is shifted to a position of the searched out character string. After the shifting of the summary display pointer is completed at the step 5403, 5405, 5407, 5409, 5411, 5413, or 5420, next at the step 5414, the displayed content of the summary is changed according to the shifted summary display pointer. Then, the sentence number in the original document registered at a position pointed by the summary display pointer is taken out at the step 5415, and the corresponding position in the original document is determined by sequentially comparing the taken out sentence number with the sentence numbers registered in the document structure of the original document at the step 5416. Then, at the step 5417, the original document display pointer is shifted to the corresponding position determined at the step 5416, and the displayed content of the original document is changed according to the shifted original document display pointer at the step 5418. In the display changing operation described above, in a case the previous chapter button has been selected, the shifting of the summary display pointer and the original document display pointer can be achieved by utilizing the document structure of the original document according to the flow chart of FIG. 55 as follows. Namely, the sentence number in the original document registered at a present position pointed by the summary display pointer is taken out at the step 3201, and the taken out sentence number is sequentially compared with the sentence numbers registered in the document structure of the original document, to search out the sentence number of the immediately previous chapter at the step 3202. Then, whether the previous chapter actually exists or not is determined at the step 3203. If the previous chapter does not exist, next at the step 3204, the message indicating the non-existence of the previous chapter is displayed as a reply, whereas otherwise, the position of the searched out sentence number of the previous chapter is set to the original document display pointer at the step 3205. Then, the sentence which has the same sentence number as the searched out sentence number of the previous chapter is searched in the summary at the step 3206, and the position of the searched out sentence which has the same sentence number as the searched out sentence number of the previous chapter is set to the summary display pointer at the step 3207. The similar procedure can also be used in a case the next chapter button has been selected. In the display changing operation described above, the character string search operation to be carried out at the step 5420 is already well known so that its detail will be omitted. Also, this character string search operation may be carried out with respect to the original document instead of the summary as described above, by simply changing the target of the comparison. In this fifteenth variation, when the change of the displayed content is requested for the original document, the displayed content of the summary is changed in correspondence. As a concrete example, by selecting the next page button for the original document, the displayed content of the summary and the original document as shown in FIGS. 52 and 53 can be changed to those shown in FIGS. 56 and 57, for example. Also, when the user enters the input for selecting the document identifier for the document other than the document with the highest display priority order, the display of the summary and the original document are changed to those for the selected document identifier. As a concrete example, FIG. 58 shows an exemplary detection result display in which the third document on the list is selected. In this case, the displayed contents of the summary and the original document are changed accordingly from those shown in FIGS. 56 and 57 to those shown in FIGS. 59 and 60, for example. Here, the correspondence between the selected identifier and the summary and the original document corresponding to the selected identifier can be recognized easily by utilizing the fact that the summary information memory contains the pointer to the original document, the pointer to the summary, and the pointer to the document structure for each document in a continuous memory region. Next, the sixteenth variation concerning the manner for controlling an amount of summary sentences in each summary to be displayed on the detection result display will be described. In this sixteenth variation, there is provided a screen display shown in FIG. 61 for indicating and changing the items to be contained in the summary display. In FIG. 61, those items with markings entered in accompanying boxes are the currently selected items to be contained in the summary display, so that the title, the chapter header, the abstracts of the chapters are currently contained in the displayed summary. However, as for the abstracts of the chapters, only the chapters "Introduction" and "Conclusion" are marked, so that the abstracts of only these two chapters are included. As a concrete example, when the settings of the items for the summary display is as indicated in FIG. 61, an exemplary summary display appears as shown in FIG. 62, for the original document shown in FIG. 60. Here, the items to be contained in the summary display can be judged according to the document structure of document registered in the summary information memory. In this sixteenth variation, the display of the desired portion of the original document can be obtained by specifying a corresponding portion in the summary by using the mouse, according to the flow chart of FIG. 63 as follows. Here, for the sake of explanation, a specific case of obtaining the display of a portion in the original document corresponding to "3. System function" in the summary shown in FIG. 62 will be described. In this case, the user moves the mouse to any character within "3. System function" portion of the summary display and clicks the mouse button. Then, at the step 4001, whether the character in the summary sentence has been selected or not is checked in order to distinguish this operation from the other input processing. When the character on the summary document is selected, next at the step 4002, the character position of the selected character is obtained. In this example, the character position is specified by a number of characters from the first character in the summary display to the selected character. Next, at the step 4003, the obtained character position is converted into the summary sentence number. This conversion can be carried out by using a summary sentence table shown in FIG. 64, in which the corresponding character positions and the sentence number in the original document are enlisted for each displayed summary sentence number. Thus, the summary sentence number can be obtained by sequentially comparing the obtained character position with the character position ranges in this summary sentence table to find out the character position range containing the obtained character position. Then, at the step 4004, the corresponding original document sentence number is obtained from this summary sentence table. In a case the character in "3. System function" is selected, the character position is within the range of 95 to 102, so that the summary sentence number can be determined as "5", and the corresponding original document sentence number can be determined as "16" according to the summary sentence table of FIG. 64. Then, at the step 4005, the position of the obtained original document sentence number is determined, and set to the original document display pointer. Here, the position of the obtained original document sentence number can be determined by sequentially comparing the obtained original document sentence number with the sentence numbers registered in the logical structure of the original document to find out the corresponding position. Finally, at the step 4006, the original document is displayed according to the original document display pointer set at the step 4005. As a concrete example, FIG. 65 shows the original document display for the original document corresponding to the summary shown in FIG. 62, when "3. System function" is selected. It is to be noted here that, apart from the various variations described above, it is also possible to modify the first embodiment described above such that, at a time of generating the summaries at the summary generation unit 14, the generated summaries may be stored in the document storage unit 15 in order to construct the system with spare memory capacity. It is also possible to modify the first embodiment described above such that, instead of storing the generated summaries as the text data, only the document structure and the text structure obtained by the document structure analysis unit 141 and the text structure analysis unit 142 in a process of the summary generation are stored, and the summary is reproduced whenever necessary by the key sentence Judgement unit 143 and the text re-construction unit 144 from the stored document structure and text structure. As described in detail above, according to the first embodiment and its variations, the desired documents can be detected according to the detection key produced from the natural language input sentence, and the detection result for those documents containing the identical syntactic and semantic structures as the natural language input sentence in the text contents or the summaries can be presented to the user, so that those documents which are likely to be desired by the user can be displayed at the higher priority from a large number of documents in the document database, and consequently it becomes possible for the user to obtain the desired document easily, accurately, and efficiently. Now, the second embodiment of a document detection system according to the present invention will be described in detail. In the following, those elements which are substantially equivalent to the corresponding elements in the first embodiment described above will be given the same reference numerals in the drawings, and their detailed descriptions are omitted. Also, those elements which are similar but not identical to the corresponding elements in the first embodiment described above will be given the same reference numerals with apostrophe attached in the drawings. In this second embodiment, the document detection system has an overall configuration similar to that shown in FIG. 1 for the first embodiment described above. Here, however, in further detail, a main portion of the document detection system of this second embodiment has a functional configuration as shown in FIG. 66, which comprises: an input unit 11' for entering an input sentence containing keywords and natural language sentences for a detection command from the user; an input analysis unit 12' for analyzing the input sentence entered at the input unit 11' and converting it into the detection command; a detection processing unit 13' for carrying out the detection processing for detecting documents according to the detection command; a summary generation unit 14 for producing a summary of each detected document; an internal document relation data analysis unit 21 for analyzing internal relations within each document; an external document relation data analysis unit 22 for analyzing external relations of each document with external documents; a detection result output unit 17' for outputting various results obtained by the detection processing unit 13', the summary generation unit 14, the internal document relation data analysis unit 21, and the external document relation data analysis unit 22, while managing user interactions; a document storage unit 15, connected with the detection processing unit 13', the summary generation unit 14, the internal document relation data analysis unit 21, the external document relation data analysis unit 22, and the detection result output unit 17', for storing the document database; an individual data storage unit 16, connected with the detection processing unit 13', the summary generation unit 14, the internal document relation data analysis unit 21, the external document relation data analysis unit 22, and the detection result output unit 17', for storing individual data including the detected documents and the generated summaries; a detection control unit 18' for controlling the operations of the detection processing operation by the input unit 11', the input analysis unit 12', and the detection processing unit 13'; and a detection result processing control unit 20 connected with the detection control unit 18' for controlling detection result processing operation by the summary generation unit 14, the internal document relation data analysis unit 21, the external document relation data analysis unit 22, and the detection result output unit 17'. In this FIG. 66, the document storage unit 15 and the individual data storage unit 16 belong to the memory means 2 in the overall configuration of FIG. 1, while the input unit 11', the input analysis unit 12', the detection processing unit 13', the summary generation unit 14, the detection result output unit 17', the internal document relation data analysis unit 21, the external document relation data analysis unit 22, the detection control unit 18', and the detection result processing control unit 20 belong to the central processing means 1 in the overall configuration of FIG. 1. Also, in this FIG. 66, the thick arrows indicate data lines while thin lines indicate control lines. In this functional configuration of FIG. 66, the detection control unit 18' controls each processing module to realize the following detection processing operation sequence. Namely, the input sentence entered at the input unit 11' is transferred to the input analysis unit 12' at which a list of keywords in the input sentence is constructed as the detection command to be handed over to the detection processing unit 13'. The detection processing unit 13' then detects a set of relevant documents according to the detection commands from the document data stored in the document storage unit 15, and stores a set of detected documents into the individual data storage unit 16. After this detection processing operation sequence is completed, the detection control unit 18' transmits an activation signal to the detection result processing control unit 20 to hand over the control. In response, the detection result processing control unit 20 takes out the original documents stored in the document storage unit 15 according to the detected documents stored in the individual data storage unit 16, and controls the detection result processing operations of the summary generation unit 14, the internal document relation data analysis unit 21, and the external document relation data analysis unit 22, for each detected document, as follows. The summary generation unit 14 generates a summary for each detected document from the original document, and stores the generated summary along with a correspondence data indicating a correspondence to the original document into the individual data storage unit 16. The internal document relation data analysis unit 21 and the external document relation data analysis unit 22 carry out appropriate analyses to obtain the external and internal document relation data and store them into the individual data storage unit 16. After these detection result processing operations are completed, the detection result processing control unit 20 activates the detection result output unit 17' to display or change the detection results, the summaries, and the external and internal document relation data stored in the individual data storage unit 16, according to the command input entered from the user through the input unit 11'. Then, the detection result processing control unit 20 transmits the activation signal to the detection control unit 18' to return the control. Now, the detailed operation of each processing module in this second embodiment will be described in detail. First, the detection control unit 18' operates according to the flow chart of FIG. 67, as follows. At first, the detection control unit 18' awaits for the input at the input unit 11' at the step 6701. Then, when the input at the input unit 11' is detected, the input analysis unit 12' is activated at the step 6702. Next, the detection control unit 18' awaits for the end of the processing at the input analysis unit 12' at the step 6703. Then, when the end of the processing at the input analysis unit 12' is detected, the detection processing unit 13' is activated at the step 6704. Next, the detection control unit 18' awaits for the end of the processing at the detection processing unit 13' at the step 6705. Then, when the end of the processing at the detection processing unit 13' is detected, the activation signal is transmitted to the detection result processing control unit 20 so as to hand over the control at the step 6706. Next, the detection control unit 18' awaits for the transmission of the activation signal from the detection result processing control unit 20 at the step 6707. Then, when the activation signal is received from the detection result processing control unit 20, the detection control unit 18' recovers the control and the operation returns to the step 6701 to repeat the process of the steps 6701 to 6707 for the next input. Next, the input analysis unit 12' has a detailed functional configuration as shown in FIG. 68, which comprises: a morphological analysis unit 41, a content word extraction unit 42, and an unnecessary word dictionary 43 utilized by the content word extraction unit 42. With this functional configuration of FIG. 68, the input analysis unit 12' operates according to the flow chart of FIG. 69, as follows. First, at the step 6901, the morphological analysis is carried out on the input sentence at the morphological analysis unit 41 to divide the input sentence into words. Here, the details of the morphological analysis to be carried out at this step 6901 is not essential to the present invention, and any known schemes can be adopted. Next, at the step 6902, the content word is extracted from the input sentence at the content word extraction unit 42 according to the morphological analysis result. Then, at the step 6903, whether the content word extracted at the step 6902 exists in the unnecessary word dictionary 43 or not is determined, and only when the extracted content word is not in the unnecessary word dictionary 43, the extracted content word is set as the detection target keyword at the step 6904, whereas otherwise the step 6904 is skipped. Then, whether there is any other content word in the input sentence or not is determined at the step 6905, and only when there is another content word in the input sentence, the operation returns to the step 6902 above to repeat the steps 6902 to 6904 for the next content word in the input sentence, until all the content words in the input are extracted. As a concrete example of the result of the operation by this input analysis unit 12', FIG. 70 shows an exemplary input sentence, and various results obtained at various stages in the operation of this input analysis unit 12'. More specifically, for the input sentence indicated in (a) of FIG. 70, the morphological analysis result appears as indicated in (b) of FIG. 70 in which the input sentence is divided into words. Then, the content word extraction result appears as indicated in (c) of FIG. 70 in which the content words "topics", "translation", and "examples" are extracted from the morphological analysis result. Finally, the detection target keywords to be handed over to the detection processing unit 13' appear as indicated in (d) of FIG. 70 in which only "translation" and "examples" are set as the keywords, assuming that "topics" is on the unnecessary word dictionary 43. Next, the detection processing unit 13' has a detailed functional configuration as shown in FIG. 71, which comprises: a keyword index matching unit 71 connected with the input analysis unit 12' and the individual data storage unit 16, and a document file set calculation unit 72 connected with the keyword index matching unit 71. The keyword index matching unit 71 carries out the detection operation for each keyword entered from the input analysis unit 12' on the document data in the document storage unit 15 to obtain a set of documents containing the same keyword. The document file set calculation unit 72 then carries out a logical set calculation for the documents obtained by the keyword index matching unit 71 for all the keywords to obtain the final document file set for the detected documents. Here, the dictionary storage unit 15 possesses the keyword index memory similar to that shown in FIG. | ||||||
