Computer based system, method, and computer program product for selectively displaying patent text and images
5991780
Abstract
A system, method, and computer program product for displaying a patent document and a patent image. The method operates by preparing for display a patent document comprised of at least one patent text file, and a patent image document comprised of at least one patent image file. The patent text file is generated from at least one source text file. The patent text file includes equivalency information detailing an equivalency relationship between the patent image file and the source text file. The equivalency information comprises linking information indicative of a correspondence between at least one portion of the source text file and at least one portion of the patent image file. The patent image file is at least one data file having stored therein one or more image pages associated with a patent. Each of the image pages is an electronic image of at least a portion of a page of the patent or at least a portion of a page of a document related to the patent. The source text file is at least one data file having stored therein text data representing textual data in the patent. A user interface is generated for display on the display. The user interface selectively displays the patent text file and the patent image file on the display, such that at least a portion of the patent text file is displayed in a first window and at least a portion of the patent image file is displayed in a second window and the windows may be selectively viewed simultaneously or individually on the display.
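As an illustration only, the relationship the abstract describes between a patent text file, its linking information, and the image pages could be modeled roughly as follows. This is a minimal sketch, not the patented implementation: the names `Link`, `PatentTextFile`, and `pages_for_span`, and the use of character offsets to identify a portion of text, are all assumptions introduced here.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Link:
    """Hypothetical linking record: a span of text mapped to one image page."""
    text_start: int  # inclusive character offset in the text
    text_end: int    # exclusive character offset in the text
    image_page: int  # 1-based page number in the patent image file


@dataclass
class PatentTextFile:
    """Text plus equivalency information (only linking information is modeled)."""
    text: str
    links: List[Link] = field(default_factory=list)

    def pages_for_span(self, start: int, end: int) -> List[int]:
        """Return the sorted image pages whose linked spans overlap [start, end)."""
        pages = {
            link.image_page
            for link in self.links
            if link.text_start < end and start < link.text_end
        }
        return sorted(pages)


# Assumed example data: two text spans, each linked to one image page.
doc = PatentTextFile(
    text="Abstract text. Claims text.",
    links=[Link(0, 14, 1), Link(15, 27, 2)],
)
print(doc.pages_for_span(0, 5))    # [1]
print(doc.pages_for_span(10, 20))  # [1, 2]
```

Keeping the links inside the same object as the text mirrors the statement that the patent text file itself includes the equivalency information.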
Claims
We claim:
1. A computer controlled display system including at least one central processing unit (CPU), said CPU coupled to a display for displaying a patent document and a patent image on said display, comprising:
means for preparing for display at least one patent document comprised of at least one patent text file, and at least one patent image document comprised of at least one patent image file, said at least one patent text file generated from at least one source text file, said patent text file including equivalency information detailing an at least partial equivalency relationship between said at least one patent image file and said at least one source text file, said equivalency information comprising linking information indicative of a correspondence between at least one portion of said at least one source text file and at least one portion of said at least one patent image file, said equivalency information also comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in said at least one patent image file,
(B) item location information identifying locations in said patent image file of items referred to or contained in said source text file, said items including any combination of figures, drawing sheets, figure elements, equations, non-text tables, structures, diagrams, and text objects,
(C) formatting information representing at least an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file,
said at least one patent image file being at least one data file having stored therein one or more image pages associated with a patent, each of said image pages being an electronic image of at least a portion of a page of said patent or at least a portion of a page of a document related to said patent, said at least one source text file being at least one data file having stored therein text data representing at least a portion of textual data in said patent; and
a user interface generated by said CPU for display on said display, said user interface selectively displaying said patent text file and said patent image file on said display, such that at least a portion of said at least one patent text file is displayed in a first window and at least a portion of said at least one patent image file is displayed in a second window and said windows may be selectively viewed simultaneously or individually on said display.
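By way of example only, special character information of the kind recited in item (A) can be pictured as a lookup table from character groups in the source text to the special characters shown in the page image. The sketch below is hypothetical: the codes in `SPECIAL_CHARS` and the function `apply_special_chars` are illustrative assumptions, not the encoding of any particular patent office.

```python
# Hypothetical mapping from character groups in a source text file to the
# special characters shown in the page image; the codes are illustrative.
SPECIAL_CHARS = {
    ".alpha.": "\u03b1",   # Greek small letter alpha
    ".beta.": "\u03b2",    # Greek small letter beta
    ".degree.": "\u00b0",  # degree sign
}


def apply_special_chars(source_text: str, mapping: dict) -> str:
    """Replace each character group with its mapped special character.

    Longer groups are replaced first so that a shorter group contained in a
    longer one cannot break it up.
    """
    for group in sorted(mapping, key=len, reverse=True):
        source_text = source_text.replace(group, mapping[group])
    return source_text


print(apply_special_chars("heated to 90.degree. C in the .alpha.-phase", SPECIAL_CHARS))
```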
2. The system of claim 1, wherein said equivalency information comprises a plurality of (A)-(M).
3. The system of claim 1, wherein said equivalency information comprises at least all of (A)-(M).
4. The system of claim 1, wherein said at least one patent text file is at least one equivalent text file.
5. The system of claim 1, wherein said at least one patent image file is a PTO Image file and said at least one source text file is a PTO Text file, wherein said at least one patent image file is stored in a compressed format.
6. The system of claim 1, further comprising:
means for displaying a list of sections contained in said at least one patent text file;
means for enabling a user to select one of said sections from said list; and
means for displaying in said second window at least a part of said at least one patent image file containing at least a portion of said selected section.
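The three means recited in claim 6 (list the sections, accept a selection, show the matching image page) can be sketched as below. This is one possible reading under stated assumptions: the table `SECTION_PAGES`, its page numbers, and the functions `list_sections` and `page_for_section` are hypothetical and stand in for whatever section information the patent text file actually carries.

```python
from typing import Dict, List

# Hypothetical section information: section name -> first image page (1-based).
SECTION_PAGES: Dict[str, int] = {
    "Abstract": 1,
    "Background": 12,
    "Detailed Description": 14,
    "Claims": 40,
}


def list_sections(section_pages: Dict[str, int]) -> List[str]:
    """Return section names ordered by the image page on which they start."""
    return sorted(section_pages, key=section_pages.get)


def page_for_section(section_pages: Dict[str, int], name: str) -> int:
    """Return the image page to show in the second window for a selection."""
    if name not in section_pages:
        raise KeyError(f"unknown section: {name!r}")
    return section_pages[name]


for section in list_sections(SECTION_PAGES):
    print(section, page_for_section(SECTION_PAGES, section))
```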
7. The system of claim 1, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
8. A computer controlled display method for displaying a patent document and a patent image on a display, comprising the steps of:
(1) preparing for display at least one patent document comprised of at least one patent text file, and at least one patent image document comprised of at least one patent image file, said at least one patent text file generated from at least one source text file, said patent text file including equivalency information detailing an at least partial equivalency relationship between said at least one patent image file and said at least one source text file, said equivalency information comprising linking information indicative of a correspondence between at least one portion of said at least one source text file and at least one portion of said at least one patent image file, said equivalency information also comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in said at least one patent image file,
(B) item location information identifying locations in said patent image file of items referred to or contained in said source text file, said items including any combination of figures, drawing sheets, figure elements, equations, non-text tables, structures, diagrams, and text objects,
(C) formatting information representing at least an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file,
said at least one patent image file being at least one data file having stored therein one or more image pages associated with a patent, each of said image pages being an electronic image of at least a portion of a page of said patent or at least a portion of a page of a document related to said patent, said at least one source text file being at least one data file having stored therein text data representing at least a portion of textual data in said patent; and
(2) generating a user interface for display on the display, said user interface selectively displaying said patent text file and said patent image file on said display, such that at least a portion of said at least one patent text file is displayed in a first window and at least a portion of said at least one patent image file is displayed in a second window and said windows may be selectively viewed simultaneously or individually on said display.
9. The method of claim 8, wherein said equivalency information comprises a plurality of (A)-(M).
10. The method of claim 8, wherein said equivalency information comprises at least all of (A)-(M).
11. The method of claim 8, wherein said at least one patent text file is at least one equivalent text file.
12. The method of claim 8, wherein said at least one patent image file is a PTO Image file and said at least one source text file is a PTO Text file, wherein said at least one patent image file is stored in a compressed format.
13. The method of claim 8, further comprising the steps of:
displaying a list of sections contained in said at least one patent text file;
enabling a user to select one of said sections from said list; and
displaying in said second window at least a part of said at least one patent image file containing at least a portion of said selected section.
14. The method of claim 8, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
15. A computer program product comprising control logic recorded therein, said control logic, when executed, enabling a computer to display a patent document and a patent image on a display, said control logic comprising:
means for enabling the computer to prepare for display at least one patent document comprised of at least one patent text file, and at least one patent image document comprised of at least one patent image file, said at least one patent text file generated from at least one source text file, said patent text file including equivalency information detailing an at least partial equivalency relationship between said at least one patent image file and said at least one source text file, said equivalency information comprising linking information indicative of a correspondence between at least one portion of said at least one source text file and at least one portion of said at least one patent image file, said equivalency information also comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in said at least one patent image file,
(B) item location information identifying locations in said patent image file of items referred to or contained in said source text file, said items including any combination of figures, drawing sheets, figure elements, equations, non-text tables, structures, diagrams, and text objects,
(C) formatting information representing at least an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file,
said at least one patent image file being at least one data file having stored therein one or more image pages associated with a patent, each of said image pages being an electronic image of at least a portion of a page of said patent or at least a portion of a page of a document related to said patent, said at least one source text file being at least one data file having stored therein text data representing at least a portion of textual data in said patent; and
means for enabling the computer to generate a user interface for display on said display, said user interface selectively displaying said patent text file and said patent image file on said display, such that at least a portion of said at least one patent text file is displayed in a first window and at least a portion of said at least one patent image file is displayed in a second window and said windows may be selectively viewed simultaneously or individually on said display.
16. The computer program product of claim 15, wherein said equivalency information comprises a plurality of (A)-(M).
17. The computer program product of claim 15, wherein said equivalency information comprises at least all of (A)-(M).
18. The computer program product of claim 15, wherein said at least one patent text file is at least one equivalent text file.
19. The computer program product of claim 15, wherein said at least one patent image file is a PTO Image file and said at least one source text file is a PTO Text file, wherein said at least one patent image file is stored in a compressed format.
20. The computer program product of claim 15, wherein said control logic further comprises:
means for enabling the computer to display a list of sections contained in said at least one patent text file;
means for enabling the computer to enable a user to select one of said sections from said list; and
means for enabling the computer to display in said second window at least a part of said at least one patent image file containing at least a portion of said selected section.
21. The computer program product of claim 15, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
22. A method of displaying patent text and images, comprising the steps of:
(1) preparing for display at least one patent document comprised of at least one patent text file and at least one patent image file, said at least one patent text file obtained from at least one source text file, said at least one patent text file including equivalency information comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in said at least one patent image file,
(B) linking information effectively providing an association between at least one portion of said at least one source text file and at least one portion of said at least one patent image file,
(C) formatting information representing an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in bibliographic page images of said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file,
said at least one patent image file being at least one data file having stored therein one or more image pages associated with a patent or a document related to said patent, each of said one or more image pages being an electronic image of a portion of said patent or said document related to said patent, said at least one source text file being at least one data file having stored therein at least text data representing at least a portion of textual data in said patent; and
(2) selectively displaying at least a portion of said at least one patent text file and at least a portion of said at least one patent image file, such that said at least a portion of said at least one patent text file is displayed in a first window and said at least a portion of said at least one patent image file is displayed in a second window and said windows may be selectively viewed individually or simultaneously.
23. The method of claim 22, wherein said patent text file is an equivalent text file.
24. The method of claim 22, wherein said at least one patent image file is a PTO Image file and said at least one source text file is a PTO Text file, wherein said at least one patent image file is stored in a compressed format.
25. The method of claim 22, wherein said patent is a U.S. patent.
26. The method of claim 22, wherein said linking information comprises item location information specifying locations in said at least one patent image file of items contained or referred to in said at least one source text file.
27. The method of claim 26, wherein said items comprise any of figures, drawing sheets, figure elements, equations, tables, structures, diagrams, text, text pages, portions of text pages, and sections.
28. The method of claim 22, wherein said at least a portion of textual data in said patent includes at least patent bibliography information and patent text.
29. The method of claim 22, further comprising the step of:
(3) displaying, upon receipt of appropriate user command, said at least one patent text file in said first window and said at least one patent image file in said second window in a synchronized fashion or an unsynchronized fashion.
30. The method of claim 29, wherein step (3) comprises the steps of:
receiving a command to display text and images in said first and second windows in a synchronized fashion; and
displaying in said second window at least a part of said at least one patent image file that corresponds to at least a part of said at least one patent text file being displayed in said first window.
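The synchronized display recited in claim 30 amounts to finding the image page that corresponds to the text currently shown in the first window. A minimal sketch follows, assuming the linking information has been reduced to a sorted list of text offsets at which each image page begins. The list `PAGE_STARTS` and the function `synchronized_page` are hypothetical names, and the binary search is an implementation choice made here, not something the claim requires.

```python
from bisect import bisect_right
from typing import List

# Hypothetical linking information: PAGE_STARTS[i] is the text offset at which
# image page i + 1 begins. Offsets are assumed sorted, with the first equal to 0.
PAGE_STARTS: List[int] = [0, 1800, 4200, 6900]


def synchronized_page(page_starts: List[int], text_offset: int) -> int:
    """Return the 1-based image page containing the given text offset."""
    if text_offset < 0:
        raise ValueError("text_offset must be non-negative")
    # bisect_right counts how many pages start at or before the offset,
    # which is exactly the 1-based number of the page containing it.
    return bisect_right(page_starts, text_offset)


print(synchronized_page(PAGE_STARTS, 0))     # 1
print(synchronized_page(PAGE_STARTS, 1800))  # 2
print(synchronized_page(PAGE_STARTS, 7000))  # 4
```

A viewer could call such a function with the offset of the first visible character whenever the first window scrolls, then display the returned page in the second window.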
31. The method of claim 29, wherein step (3) comprises the steps of:
(a) receiving a command to display text and images in said first and second windows in an unsynchronized fashion; and
(b) displaying in said second window at least a part of said at least one patent image file that relates to but does not correspond to at least a part of said at least one patent text file being displayed in said first window.
32. The method of claim 31, wherein step (b) comprises the step of:
displaying in said second window said at least a part of said at least one patent image file that contains an item referenced in said at least a part of said at least one patent text file being displayed in said first window.
33. The method of claim 32, wherein said item comprises a figure, drawing sheet, figure element, equation, table, structure, or diagram.
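For the unsynchronized case of claims 31 to 33, the second window shows an image page containing an item that the visible text references rather than the page the text itself appears on. The sketch below assumes figure references are written as "FIG. 3A" and that item location information is available as a figure-to-page table. `FIGURE_PAGES`, `FIG_REF`, and `referenced_figure_pages` are hypothetical, and the pattern deliberately ignores ranges such as "FIGS. 1 and 2" beyond the first number.

```python
import re
from typing import Dict, List

# Hypothetical item location information: figure label -> image page (1-based).
FIGURE_PAGES: Dict[str, int] = {"1": 2, "2": 3, "3A": 4}

# Matches "FIG. 3A" or "FIGS. 1" and captures the figure label.
FIG_REF = re.compile(r"\bFIGS?\.\s*(\d+[A-Z]?)")


def referenced_figure_pages(visible_text: str, figure_pages: Dict[str, int]) -> List[int]:
    """Return pages of figures referenced in the text, in order of first mention."""
    pages: List[int] = []
    for label in FIG_REF.findall(visible_text):
        page = figure_pages.get(label)
        # Skip labels with no known location and pages already collected.
        if page is not None and page not in pages:
            pages.append(page)
    return pages


text = "As shown in FIG. 3A and FIG. 1, the viewer differs from FIG. 9."
print(referenced_figure_pages(text, FIGURE_PAGES))  # [4, 2]
```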
34. The method of claim 29, wherein step (3) comprises the steps of:
receiving a command from a user to display text and images in said first and second windows in an unsynchronized fashion; and
displaying in said second window at least a part of said at least one patent image file that is unrelated to at least a part of said at least one patent text file being displayed in said first window, said at least a part of said at least one patent image file having been identified by said user.
35. The method of claim 22, further comprising the steps of:
(3) displaying a list of sections contained in said at least one patent text file;
(4) enabling a user to select one of said sections from said list; and
(5) displaying in said second window at least a part of said at least one patent image file containing at least a portion of said selected section.
36. The method of claim 22, wherein said equivalency information in said at least one patent text file includes linking information that effectively provides an association between at least one portion of said at least one source text file and at least one portion of said at least one patent image file, and wherein said textual data in said patent and said linking information are contained in a single patent text file.
37. The method of claim 22, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
38. A system for displaying patent text and images, comprising:
means for preparing for display at least one patent document comprised of at least one patent text file and at least one patent image file, said at least one patent text file obtained from at least one source text file, said at least one patent text file including equivalency information comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in said at least one patent image file,
(B) linking information effectively providing an association between at least one portion of said at least one source text file and at least one portion of said at least one patent image file,
(C) formatting information representing an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in bibliographic page images of said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file,
said at least one patent image file being at least one data file having stored therein one or more image pages associated with a patent or a document related to said patent, each of said one or more image pages being an electronic image of a portion of said patent or said document related to said patent, said at least one source text file being at least one data file having stored therein at least text data representing at least a portion of textual data in said patent; and
means for selectively displaying at least a portion of said at least one patent text file and at least a portion of said at least one patent image file, such that said at least a portion of said at least one patent text file is displayed in a first window and said at least a portion of said at least one patent image file is displayed in a second window and said windows may be selectively viewed individually or simultaneously.
39. The system of claim 38, wherein said patent text file is an equivalent text file.
40. The system of claim 38, wherein said at least one patent image file is a PTO Image file and said at least one source text file is a PTO Text file, wherein said at least one patent image file is stored in a compressed format.
41. The system of claim 38, wherein said patent is a U.S. patent.
42. The system of claim 38, wherein said linking information comprises item location information specifying locations in said at least one patent image file of items contained or referred to in said at least one source text file.
43. The system of claim 42, wherein said items comprise any of figures, drawing sheets, figure elements, equations, tables, structures, diagrams, text, text pages, portions of text pages, and sections.
44. The system of claim 38, wherein said at least a portion of textual data in said patent includes at least patent bibliography information and patent text.
45. The system of claim 38, further comprising:
synchronized/unsynchronized display means for displaying, upon receipt of appropriate user command, said at least one patent text file in said first window and said at least one patent image file in said second window in a synchronized fashion or an unsynchronized fashion.
46. The system of claim 45, wherein said synchronized/unsynchronized display means comprises:
means for receiving a command to display text and images in said first and second windows in a synchronized fashion; and
means for displaying in said second window at least a part of said at least one patent image file that corresponds to at least a part of said at least one patent text file being displayed in said first window.
47. The system of claim 45, wherein said synchronized/unsynchronized display means comprises:
means for receiving a command to display text and images in said first and second windows in an unsynchronized fashion; and
image display means for displaying in said second window at least a part of said at least one patent image file that relates to but does not correspond to at least a part of said at least one patent text file being displayed in said first window.
48. The system of claim 47, wherein said image display means comprises:
means for displaying in said second window said at least a part of said at least one patent image file that contains an item referenced in said at least a part of said at least one patent text file being displayed in said first window.
49. The system of claim 48, wherein said item comprises a figure, drawing sheet, figure element, equation, table, structure, or diagram.
50. The system of claim 45, wherein said synchronized/unsynchronized display means comprises:
means for receiving a command from a user to display text and images in said first and second windows in an unsynchronized fashion; and
means for displaying in said second window at least a part of said at least one patent image file that is unrelated to at least a part of said at least one patent text file being displayed in said first window, said at least a part of said at least one patent image file having been identified by said user.
51. The system of claim 38, further comprising:
means for displaying a list of sections contained in said at least one patent text file;
means for enabling a user to select one of said sections from said list; and
means for displaying in said second window at least a part of said at least one patent image file containing at least a portion of said selected section.
52. The system of claim 38, wherein said equivalency information in said at least one patent text file includes linking information that effectively provides an association between at least one portion of said at least one source text file and at least one portion of said at least one patent image file, and wherein said textual data in said patent and said linking information are contained in a single patent text file.
53. The system of claim 38, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
54. A computer program product comprising control logic recorded therein, said control logic, when executed, enabling a computer to display patent text and images, said control logic comprising:
means for enabling the computer to prepare for display at least one patent document comprised of at least one patent text file and at least one patent image file, said at least one patent text file obtained from at least one source text file, said at least one patent text file including equivalency information comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in said at least one patent image file,
(B) linking information effectively providing an association between at least one portion of said at least one source text file and at least one portion of said at least one patent image file,
(C) formatting information representing an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in bibliographic page images of said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file,
said at least one patent image file being at least one data file having stored therein one or more image pages associated with a patent or a document related to said patent, each of said one or more image pages being an electronic image of a portion of said patent or said document related to said patent, said at least one source text file being at least one data file having stored therein at least text data representing at least a portion of textual data in said patent; and
means for enabling the computer to selectively display at least a portion of said at least one patent text file and at least a portion of said at least one patent image file, such that said at least a portion of said at least one patent text file is displayed in a first window and said at least a portion of said at least one patent image file is displayed in a second window and said windows may be selectively viewed individually or simultaneously.
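For illustration only, the equivalency information recited in items (A)-(M) of claim 54 might be held in a record such as the one sketched below. The class names, field names, and the use of character offsets are assumptions made for this sketch; the claims do not prescribe any particular data layout, and only a subset of items (A)-(M) is modeled.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Link:
    """Hypothetical linking record: a span of source text tied to an image page."""
    text_start: int  # character offset into the source text (assumed unit)
    text_end: int
    image_page: int  # index of an image page in the patent image file


@dataclass
class EquivalencyInfo:
    """Hypothetical container for a subset of items (A)-(M)."""
    special_chars: Dict[str, str] = field(default_factory=dict)         # (A)
    links: List[Link] = field(default_factory=list)                     # (B)
    column_breaks: List[int] = field(default_factory=list)              # (D)
    line_breaks: List[int] = field(default_factory=list)                # (E)
    sections: Dict[str, int] = field(default_factory=dict)              # (G)
    bold_spans: List[Tuple[int, int]] = field(default_factory=list)     # (L)
    italic_spans: List[Tuple[int, int]] = field(default_factory=list)   # (M)


def has_equivalency(info: EquivalencyInfo) -> bool:
    """True when at least one item is populated ("one or more of (A)-(M)")."""
    return any(bool(value) for value in vars(info).values())
```

Keeping every item optional mirrors the "one or more of" wording: a text file carrying only linking information would still satisfy the check.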
55. The computer program product of claim 54, wherein said patent text file is an equivalent text file.
56. The computer program product of claim 54, wherein said at least one patent image file is a PTO Image file and said at least one source text file is a PTO Text file, wherein said at least one patent image file is stored in a compressed format.
57. The computer program product of claim 54, wherein said patent is a U.S. patent.
58. The computer program product of claim 54, wherein said linking information comprises item location information specifying locations in said at least one patent image file of items contained or referred to in said at least one source text file.
59. The computer program product of claim 58, wherein said items comprise any of figures, drawing sheets, figure elements, equations, tables, structures, diagrams, text, text pages, portions of text pages, and sections.
60. The computer program product of claim 54, wherein said at least a portion of textual data in said patent includes at least patent bibliography information and patent text.
61. The computer program product of claim 54, wherein said control logic further comprises:
synchronized/unsynchronized display means for enabling the computer to display, upon receipt of appropriate user command, said at least one patent text file in said first window and said at least one patent image file in said second window in a synchronized fashion or an unsynchronized fashion.
62. The computer program product of claim 61, wherein said synchronized/unsynchronized display means comprises:
means for enabling the computer to receive a command to display text and images in said first and second windows in a synchronized fashion; and
means for enabling the computer to display in said second window at least a part of said at least one patent image file that corresponds to at least a part of said at least one patent text file being displayed in said first window.
63. The computer program product of claim 61, wherein said synchronized/unsynchronized display means comprises:
means for enabling the computer to receive a command to display text and images in said first and second windows in an unsynchronized fashion; and
image display means for enabling the computer to display in said second window at least a part of said at least one patent image file that relates to but does not correspond to at least a part of said at least one patent text file being displayed in said first window.
64. The computer program product of claim 63, wherein said image display means comprises:
means for enabling the computer to display in said second window said at least a part of said at least one patent image file that contains an item referenced in said at least a part of said at least one patent text file being displayed in said first window.
65. The computer program product of claim 64, wherein said item comprises a figure, drawing sheet, figure element, equation, table, structure, or diagram.
66. The computer program product of claim 64, wherein said synchronized/unsynchronized display means comprises:
means for enabling the computer to receive a command from a user to display text and images in said first and second windows in an unsynchronized fashion; and
means for enabling the computer to display in said second window at least a part of said at least one patent image file that is unrelated to at least a part of said at least one patent text file being displayed in said first window, said at least a part of said at least one patent image file having been identified by said user.
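As a rough sketch of the distinction drawn in claims 61-66, the function below chooses which image page to place in the second window. The mode strings, the tuple layout of the linking data, and the item-to-page dictionary are all assumptions introduced for this example, not features stated in the claims.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical linking data: (text_start, text_end, image_page) per text span.
TextLinks = List[Tuple[int, int, int]]


def page_for_offset(links: TextLinks, offset: int) -> Optional[int]:
    """Return the image page whose linked text span contains the offset."""
    for start, end, page in links:
        if start <= offset < end:
            return page
    return None


def image_page_to_show(
    mode: str,
    links: TextLinks,
    text_offset: int,
    item_pages: Dict[str, int],
    referenced_item: Optional[str] = None,
    user_page: Optional[int] = None,
) -> Optional[int]:
    """Pick the image page for the second window."""
    if mode == "synchronized":
        # Claim 62: the image part that corresponds to the displayed text.
        return page_for_offset(links, text_offset)
    if mode == "unsynchronized":
        if referenced_item is not None:
            # Claim 64: the page containing an item referenced in the text.
            return item_pages.get(referenced_item)
        # Claim 66: a page identified by the user, unrelated to the text.
        return user_page
    raise ValueError(f"unknown mode: {mode}")
```

For example, with text at offset 150 linked to page 4, synchronized mode yields page 4, while unsynchronized mode with a reference to "FIG. 2" yields whatever page the item table records for that figure.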
67. The computer program product of claim 54, wherein said control logic further comprises:
means for enabling the computer to display a list of sections contained in said at least one patent text file;
means for enabling the computer to enable a user to select one of said sections from said list; and
means for enabling the computer to display in said second window at least a part of said at least one patent image file containing at least a portion of said selected section.
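The section navigation of claim 67 could be sketched as follows, assuming (hypothetically) that section information is kept as a mapping from section name to the image page on which that section begins. The function names and the mapping are illustrative only.

```python
from typing import Dict, List


def list_sections(section_pages: Dict[str, int]) -> List[str]:
    """Return the section names to present to the user, in stored order."""
    return list(section_pages.keys())


def image_page_for_section(section_pages: Dict[str, int], selected: str) -> int:
    """Return the image page holding at least part of the selected section."""
    if selected not in section_pages:
        raise KeyError(f"no such section: {selected}")
    return section_pages[selected]
```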
68. The computer program product of claim 54, wherein said equivalency information in said at least one patent text file includes linking information that effectively provides an association between at least one portion of said at least one source text file and at least one portion of said at least one patent image file, and wherein said textual data in said patent and said linking information are contained in a single patent text file.
69. The computer program product of claim 54, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
70. A method of displaying patent text and images, comprising the steps of:
(1) receiving a first command from a user to display a patent;
(2) displaying, in response to receipt of said first command, at least a portion of textual data contained in at least a portion of one or more patent text files associated with said patent, said textual data being from at least one source text file corresponding to said patent, said one or more patent text files including equivalency information having one or more of (A)-(L):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in at least one patent image file associated with said patent,
(B) formatting information representing an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in bibliographic page images of said at least one patent image file,
(C) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(D) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(E) column line number information representing approximate line numbers of lines of text,
(F) section information representing at least approximate positions of patent sections,
(G) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(H) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(I) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(J) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(K) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(L) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file;
(3) receiving from a user a second command to display image data of said patent;
(4) referencing, in response to receipt of said second command, linking information that effectively provides an association between at least a portion of said at least one source text file and at least a portion of said at least one patent image file;
(5) using said referenced linking information to retrieve at least said at least a portion of said at least one patent image file; and
(6) displaying at least a portion of image information retrieved in step (5).
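Steps (1)-(6) of claim 70 can be pictured with the toy viewer below. It is a minimal in-memory sketch: the class, its attributes, the dictionary-based storage, and the file names used in the usage note are assumptions, and the two "windows" are modeled simply as lists of displayed items.

```python
from typing import Dict, List


class PatentViewer:
    """Toy viewer; text and image 'windows' are plain lists."""

    def __init__(
        self,
        texts: Dict[str, str],
        links: Dict[str, List[int]],
        images: Dict[str, Dict[int, str]],
    ) -> None:
        self.texts = texts    # patent id -> patent text
        self.links = links    # patent id -> linked image page numbers
        self.images = images  # patent id -> {page number: image page}
        self.current = ""
        self.text_window: List[str] = []
        self.image_window: List[str] = []

    def show_patent(self, patent_id: str) -> None:
        """Steps (1)-(2): receive the first command and display the text."""
        self.current = patent_id
        self.text_window = [self.texts[patent_id]]

    def show_images(self) -> List[str]:
        """Steps (3)-(6): receive the second command and display image data."""
        pages = self.links[self.current]                  # step (4)
        retrieved = [self.images[self.current][p] for p in pages]  # step (5)
        self.image_window = retrieved[:1]                 # step (6)
        return self.image_window
```

A usage example under the same assumptions: constructing the viewer with one patent whose linking information points to page 2, calling `show_patent`, and then calling `show_images` leaves the text in the first window and the page-2 image in the second.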
71. The method of claim 70, wherein said one or more patent text files comprises at least one equivalent text file.
72. The method of claim 70, wherein said at least one patent image file is a PTO Image file and said at least one source text file is a PTO Text file, wherein said at least one patent image file is stored in a compressed format.
73. The method of claim 70, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
74. A system for displaying patent text and images, comprising:
means for receiving a first command from a user to display a patent;
means for displaying, in response to receipt of said first command, at least a portion of textual data contained in at least a portion of one or more patent text files associated with said patent, said textual data being from at least one source text file corresponding to said patent, said one or more patent text files including equivalency information having one or more of (A)-(L):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in at least one patent image file associated with said patent,
(B) formatting information representing an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in bibliographic page images of said at least one patent image file,
(C) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(D) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(E) column line number information representing approximate line numbers of lines of text,
(F) section information representing at least approximate positions of patent sections,
(G) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(H) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(I) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(J) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(K) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(L) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file;
means for receiving from a user a second command to display image data of said patent;
means for referencing, in response to receipt of said second command, linking information that effectively provides an association between at least a portion of said at least one source text file and at least a portion of said at least one patent image file;
means for using said referenced linking information to retrieve at least said at least a portion of said at least one patent image file; and
means for displaying at least a portion of said retrieved image information.
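The special character information of item (A) describes a mapping from a group of characters in the source text to a single special character as it appears in the image. A minimal sketch follows; the particular character groups in the table (such as ".degree.") are assumed for illustration and are not specified in the claims.

```python
from typing import Dict

# Hypothetical mapping table: character group in source text -> special character.
SPECIAL_CHARS: Dict[str, str] = {
    ".alpha.": "α",
    ".degree.": "°",
    ".mu.": "μ",
}


def render_special_chars(source_text: str, mapping: Dict[str, str] = SPECIAL_CHARS) -> str:
    """Replace each mapped character group with its special character."""
    for group, char in mapping.items():
        source_text = source_text.replace(group, char)
    return source_text
```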
75. The system of claim 74, wherein said one or more patent text files comprises at least one equivalent text file.
76. The system of claim 74, wherein said at least one patent image file is a PTO Image file and said at least one source text file is a PTO Text file, wherein said at least one patent image file is stored in a compressed format.
77. The system of claim 74, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
78. A computer program product comprising control logic recorded therein, said control logic, when executed, enabling a computer to display patent text and images, said control logic comprising:
means for enabling the computer to receive a first command from a user to display a patent;
means for enabling the computer to display, in response to receipt of said first command, at least a portion of textual data contained in at least a portion of one or more patent text files associated with said patent, said textual data being from at least one source text file corresponding to said patent, said one or more patent text files including equivalency information having one or more of (A)-(L):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in at least one patent image file associated with said patent,
(B) formatting information representing an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in bibliographic page images of said at least one patent image file,
(C) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(D) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(E) column line number information representing approximate line numbers of lines of text,
(F) section information representing at least approximate positions of patent sections,
(G) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(H) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(I) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(J) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(K) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(L) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file;
means for enabling the computer to receive from a user a second command to display image data of said patent;
means for enabling the computer to reference, in response to receipt of said second command, linking information that effectively provides an association between at least a portion of said at least one source text file and at least a portion of said at least one patent image file;
means for enabling the computer to use said referenced linking information to retrieve at least said at least a portion of said at least one patent image file; and
means for enabling the computer to display at least a portion of said retrieved image information.
79. The computer program product of claim 78, wherein said one or more patent text files comprises at least one equivalent text file.
80. The computer program product of claim 78, wherein said at least one patent image file is a PTO Image file and said at least one source text file is a PTO Text file, wherein said at least one patent image file is stored in a compressed format.
81. The computer program product of claim 78, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
82. A method of displaying patent text and images, comprising the steps of:
(1) receiving a first command from a user to display a patent;
(2) displaying, in response to receipt of said first command, at least a portion of textual data in at least a portion of one or more patent text files associated with said patent, said one or more patent text files comprising equivalency information comprising one or more of (A)-(L):
(A) information reflecting an approximate arrangement of at least some bibliographic data as represented in at least one patent image file corresponding to said patent,
(B) special character information specifying at least one mapping of a group of characters to at least one special character in said at least one patent image file,
(C) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(D) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(E) column line number information representing approximate line numbers of lines of text,
(F) section information representing at least approximate positions of patent sections,
(G) font information representing font styles of characters as represented in said at least one patent image file,
(H) font size information representing font sizes of characters as represented in said at least one patent image file,
(I) superscript information indicating characters that are represented using superscripts in said at least one patent image file,
(J) subscript information indicating characters that are represented using subscripts in said at least one patent image file,
(K) bold attribute information indicating characters that are bolded in said at least one patent image file, and
(L) italicized attribute information indicating characters that are italicized in said at least one patent image file;
(3) receiving from a user a second command to display image data of said patent;
(4) referencing, in response to receipt of said second command, linking information that effectively provides an association between at least a portion of said one or more patent text files and at least a portion of said at least one patent image file;
(5) using said referenced linking information to retrieve at least said at least a portion of said at least one patent image file; and
(6) displaying at least a portion of image information retrieved in step (5).
83. The method of claim 82, wherein said patent text files are equivalent text files.
84. The method of claim 82, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
85. A system for displaying patent text and images, comprising:
means for displaying, in response to appropriate command, at least a portion of textual data in at least a portion of one or more patent text files associated with said patent, said one or more patent text files comprising equivalency information comprising one or more of (A)-(L):
(A) information reflecting an approximate arrangement of at least some bibliographic data as represented in at least one patent image file corresponding to said patent,
(B) special character information specifying at least one mapping of a group of characters to at least one special character in said at least one patent image file,
(C) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(D) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(E) column line number information representing approximate line numbers of lines of text,
(F) section information representing at least approximate positions of patent sections,
(G) font information representing font styles of characters as represented in said at least one patent image file,
(H) font size information representing font sizes of characters as represented in said at least one patent image file,
(I) superscript information indicating characters that are represented using superscripts in said at least one patent image file,
(J) subscript information indicating characters that are represented using subscripts in said at least one patent image file,
(K) bold attribute information indicating characters that are bolded in said at least one patent image file, and
(L) italicized attribute information indicating characters that are italicized in said at least one patent image file;
means for referencing, in response to appropriate command, linking information that effectively provides an association between at least a portion of said one or more patent text files and at least a portion of at least one patent image file;
means for using said referenced linking information to retrieve at least said at least a portion of said at least one patent image file; and
means for displaying at least a portion of said retrieved image information.
86. The system of claim 85, wherein said patent text files are equivalent text files.
87. The system of claim 85, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
88. A computer program product comprising control logic recorded therein, said control logic, when executed, enabling a computer to display patent text and images, said control logic comprising:
means for enabling the computer to display, in response to appropriate command, at least a portion of textual data in at least a portion of one or more patent text files associated with said patent, said one or more patent text files comprising equivalency information comprising one or more of (A)-(L):
(A) information reflecting an approximate arrangement of at least some bibliographic data as represented in at least one patent image file corresponding to said patent,
(B) special character information specifying at least one mapping of a group of characters to at least one special character in said at least one patent image file,
(C) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(D) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(E) column line number information representing approximate line numbers of lines of text,
(F) section information representing at least approximate positions of patent sections,
(G) font information representing font styles of characters as represented in said at least one patent image file,
(H) font size information representing font sizes of characters as represented in said at least one patent image file,
(I) superscript information indicating characters that are represented using superscripts in said at least one patent image file,
(J) subscript information indicating characters that are represented using subscripts in said at least one patent image file,
(K) bold attribute information indicating characters that are bolded in said at least one patent image file, and
(L) italicized attribute information indicating characters that are italicized in said at least one patent image file;
means for enabling the computer to reference, in response to appropriate command, linking information that effectively provides an association between at least a portion of said one or more patent text files and at least a portion of at least one patent image file;
means for enabling the computer to use said referenced linking information to retrieve at least said at least a portion of said at least one patent image file; and
means for enabling the computer to display at least a portion of said retrieved image information.
89. The computer program product of claim 88, wherein said patent text files are equivalent text files.
90. The computer program product of claim 88, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
91. A method of displaying patent text and images, comprising the steps of:
(1) receiving a first command from a user to display a patent;
(2) displaying, in response to receipt of said first command, at least a portion of at least one patent document comprising at least text corresponding to said patent, said at least one patent document also comprising equivalency information comprising one or more of (A)-(M):
(A) formatting information representing an approximate arrangement of at least some bibliographic data as represented in at least one patent image file corresponding to said patent,
(B) linking information that effectively provides an association between at least a portion of said at least one patent document and at least a portion of said at least one patent image file,
(C) special character information specifying at least one mapping of a group of characters in said at least one patent document to at least one special character in said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one patent document as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one patent document as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one patent document that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one patent document that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one patent document that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one patent document that are italicized in said at least one patent image file;
(3) receiving from a user a second command to display image data of said patent;
(4) using linking information to access and retrieve at least a portion of said at least one patent image file; and
(5) displaying at least a portion of image information retrieved in step (4).
92. The method of claim 91, wherein said at least one patent document is an equivalent text file.
93. The method of claim 91, wherein said at least one patent document is generated at least partially from one or more electronic files from at least one national patent office, and said at least one patent image file is from at least one national patent office.
94. The method of claim 91, wherein said at least one patent document is at least one equivalent text file.
95. The method of claim 91, wherein said at least one patent document is generated prior to receipt of said first command, and stored in a storage device.
96. The method of claim 91, wherein said at least one patent document is a single document having stored therein said text corresponding to said patent and said linking information.
97. A system for displaying patent text and images, comprising:
means for displaying, in response to appropriate command, at least a portion of at least one patent document comprising at least text corresponding to said patent, said at least one patent document also comprising equivalency information comprising one or more of (A)-(M):
(A) formatting information representing an approximate arrangement of at least some bibliographic data as represented in at least one patent image file corresponding to said patent,
(B) linking information that effectively provides an association between at least a portion of said at least one patent document and at least a portion of said at least one patent image file,
(C) special character information specifying at least one mapping of a group of characters in said at least one patent document to at least one special character in said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one patent document as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one patent document as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one patent document that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one patent document that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one patent document that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one patent document that are italicized in said at least one patent image file;
means for using linking information to access and retrieve at least a portion of said at least one patent image file; and
means for displaying at least a portion of said retrieved image information.
98. The system of claim 97, wherein said at least one patent document is at least one equivalent text file.
99. The system of claim 97, wherein said at least one patent document is generated prior to receipt of said first command, and stored in a storage device.
100. The system of claim 97, wherein said at least one patent document is a single document having stored therein said text corresponding to said patent and said linking information.
101. The system of claim 97, wherein said at least one patent document is generated at least partially from one or more electronic files from at least one national patent office, and said at least one patent image file is from at least one national patent office.
102. The system of claim 97, wherein said at least one patent document is an equivalent text file.
103. A computer program product comprising control logic recorded therein, said control logic, when executed, enabling a computer to display patent text and images, said control logic comprising:
means for enabling the computer to display, in response to appropriate command, at least a portion of at least one patent document comprising at least text corresponding to said patent, said at least one patent document also comprising equivalency information comprising one or more of (A)-(M):
(A) formatting information representing an approximate arrangement of at least some bibliographic data as represented in at least one patent image file corresponding to said patent,
(B) linking information that effectively provides an association between at least a portion of said at least one patent document and at least a portion of said at least one patent image file,
(C) special character information specifying at least one mapping of a group of characters in said at least one patent document to at least one special character in said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one patent document as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one patent document as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one patent document that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one patent document that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one patent document that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one patent document that are italicized in said at least one patent image file,
means for enabling the computer to use linking information to access and retrieve at least a portion of said at least one patent image file; and
means for enabling the computer to display at least a portion of said retrieved image information.
104. The computer program product of claim 103, wherein said at least one patent document is at least one equivalent text file.
105. The computer program product of claim 103, wherein said at least one patent document is generated prior to receipt of said first command, and stored in a storage device.
106. The computer program product of claim 103, wherein said at least one patent document is a single document having stored therein said text corresponding to said patent and said linking information.
107. The computer program product of claim 103, wherein said at least one patent document is generated at least partially from one or more electronic files from at least one national patent office, and said at least one patent image file is from at least one national patent office.
108. The computer program product of claim 103, wherein said at least one patent document is an equivalent text file.
109. A method of displaying patent text and images, comprising the steps of:
(1) receiving a first command from a user to display a patent;
(2) displaying, in response to receipt of said first command, at least a portion of textual data contained in at least a portion of one or more patent text documents associated with said patent, and also displaying at least one icon representative of a link between at least one source text file and at least one patent image document, wherein said at least one source text file and said at least one patent image document are associated with said patent, wherein said at least one patent image document comprises at least one data file having stored therein one or more image pages associated with said patent, each of said image pages being an electronic image of at least a portion of a page of said patent or at least a portion of a page of a document related to said patent, wherein said textual data is from said at least one source text file, wherein said one or more patent text documents comprises equivalency information detailing at least a partial equivalency relationship between said at least one patent image document and said at least one source text file, said equivalency information comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in said at least one patent image document,
(B) item location information identifying locations in said at least one patent image document of items referred to or contained in said at least one source text file,
(C) formatting information representing at least an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in said at least one patent image document,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image document,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image document,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image document,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image document,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image document,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image document,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image document, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image document;
(3) receiving from a user a second command to display image data of said patent, said second command having been issued by selecting said icon;
(4) referencing, in response to receipt of said second command, linking information associated with said icon, wherein said linking information comprises a link to at least a portion of said at least one patent image document;
(5) using said referenced linking information to retrieve at least a portion of said at least one patent image document; and
(6) displaying at least a portion of image information retrieved in step (5).
110. The method of claim 109, wherein step (2) comprises:
displaying at least one of said at least one icon adjacent to said display of said at least a portion of said textual data.
111. The method of claim 109, wherein step (2) comprises:
displaying at least one of said at least one icon embedded within said display of said at least a portion of said textual data.
112. The method of claim 109, wherein said at least one source text file and said at least one patent image document are from at least one national patent office.
113. A system of displaying patent text and images, comprising:
means for receiving a first command from a user to display a patent;
displaying means for displaying, in response to receipt of said first command, at least a portion of textual data contained in at least a portion of one or more patent text documents associated with said patent, and also displaying at least one icon representative of a link between at least one source text file and at least one patent image document, wherein said at least one source text file and said at least one patent image document are associated with said patent, wherein said at least one patent image document comprises at least one data file having stored therein one or more image pages associated with said patent, each of said image pages being an electronic image of at least a portion of a page of said patent or at least a portion of a page of a document related to said patent, wherein said textual data is from said at least one source text file, wherein said one or more patent text documents comprises equivalency information detailing at least a partial equivalency relationship between said at least one patent image document and said at least one source text file, said equivalency information comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in said at least one patent image document,
(B) item location information identifying locations in said at least one patent image document of items referred to or contained in said at least one source text file,
(C) formatting information representing at least an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in said at least one patent image document,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image document,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image document,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image document,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image document,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image document,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image document,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image document, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image document;
means for receiving from a user a second command to display image data of said patent, said second command having been issued by selecting said icon;
means for referencing, in response to receipt of said second command, linking information associated with said icon, wherein said linking information comprises a link to at least a portion of said at least one patent image document;
means for using said referenced linking information to retrieve at least a portion of said at least one patent image document; and
means for displaying at least a portion of said retrieved image information.
114. The system of claim 113, wherein said displaying means comprises:
means for displaying at least one of said at least one icon adjacent to said display of said at least a portion of said textual data.
115. The system of claim 113, wherein said displaying means comprises:
means for displaying at least one of said at least one icon embedded within said display of said at least a portion of said textual data.
116. The system of claim 113, wherein said at least one source text file and said at least one patent image document are from at least one national patent office.
117. A computer program product comprising control logic recorded therein, said control logic, when executed, enabling a computer to display patent text and images, said control logic comprising:
means for enabling the computer to receive a first command from a user to display a patent;
displaying means for enabling the computer to display, in response to receipt of said first command, at least a portion of textual data contained in at least a portion of one or more patent text documents associated with said patent, and also displaying at least one icon representative of a link between at least one source text file and at least one patent image document, wherein said at least one source text file and said at least one patent image document are associated with said patent, wherein said at least one patent image document comprises at least one data file having stored therein one or more image pages associated with said patent, each of said image pages being an electronic image of at least a portion of a page of said patent or at least a portion of a page of a document related to said patent, wherein said textual data is from said at least one source text file, wherein said one or more patent text documents comprises equivalency information detailing at least a partial equivalency relationship between said at least one patent image document and said at least one source text file, said equivalency information comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in said at least one source text file to at least one special character in said at least one patent image document,
(B) item location information identifying locations in said at least one patent image document of items referred to or contained in said at least one source text file,
(C) formatting information representing at least an approximate arrangement of at least some bibliographic data from said at least one source text file as represented in said at least one patent image document,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image document,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image document,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image document,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image document,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image document,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image document,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image document, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image document;
means for enabling the computer to receive from a user a second command to display image data of said patent, said second command having been issued by selecting said icon;
means for enabling the computer to reference, in response to receipt of said second command, linking information associated with said icon, wherein said linking information comprises a link to at least a portion of said at least one patent image document;
means for enabling the computer to use said referenced linking information to retrieve at least a portion of said at least one patent image document; and
means for enabling the computer to display at least a portion of said retrieved image information.
118. The computer program product of claim 117, wherein said displaying means comprises:
means for enabling the computer to display at least one of said at least one icon adjacent to said display of said at least a portion of said textual data.
119. The computer program product of claim 117, wherein said displaying means comprises:
means for enabling the computer to display at least one of said at least one icon embedded within said display of said at least a portion of said textual data.
120. The computer program product of claim 117, wherein said at least one source text file and said at least one patent image document are from at least one national patent office.
121. A method of enabling a computer to display patent text and images, comprising the steps of:
(1) enabling a computer to access at least one patent document comprising at least one patent text file and at least one patent image file, said at least one patent text file comprising equivalency information comprising one or more of (A)-(M):
(A) special character information specifying at least one mapping of a group of characters in at least one source text file to at least one special character in said at least one patent image file,
(B) linking information effectively providing an association between at least one portion of said at least one patent text file and at least one portion of said at least one patent image file,
(C) information representative of an approximate arrangement of at least some bibliographic data in said at least one patent image file,
(D) column information representing at least an approximate arrangement of text in columns of said at least one patent image file,
(E) line information representing at least an approximate arrangement of text in lines of said at least one patent image file,
(F) column line number information representing approximate line numbers of lines of text,
(G) section information representing at least approximate positions of patent sections,
(H) font information representing font styles of characters of said at least one source text file as represented in said at least one patent image file,
(I) font size information representing font sizes of characters of said at least one source text file as represented in said at least one patent image file,
(J) superscript information indicating characters in said at least one source text file that are represented using superscripts in said at least one patent image file,
(K) subscript information indicating characters in said at least one source text file that are represented using subscripts in said at least one patent image file,
(L) bold attribute information indicating characters in said at least one source text file that are bolded in said at least one patent image file, and
(M) italicized attribute information indicating characters in said at least one source text file that are italicized in said at least one patent image file; and
(2) enabling a computer to selectively display at least a portion of said at least one patent text file and at least a portion of said at least one patent image file, such that said at least a portion of said at least one patent text file and said at least a portion of said at least one patent image file may be selectively displayed individually or simultaneously.
122. The method of claim 121, wherein said linking information comprises item location information specifying locations in said at least one patent image file of items contained or referred to in said at least one patent text file.
123. The method of claim 122, wherein said items comprise any of figures, drawing sheets, figure elements, equations, tables, structures, diagrams, text, text pages, portions of text pages, and sections.
124. The method of claim 121, further comprising the step of:
(3) enabling the computer to display, upon receipt of appropriate user command, said at least one patent text file and said at least one patent image file in a synchronized fashion or an unsynchronized fashion.
125. The method of claim 121, wherein said at least one source text file and said at least one patent image file are from at least one national patent office.
126. The method of claim 121, wherein said at least one patent text file is an equivalent text file.
Description
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to the fields of publishing, document editing and manipulation, and displaying documents and images. More particularly, the present invention relates to paginating, extracting, synchronizing, and displaying a document in electronic form.
2. Art Background
As the development of multimedia computer display systems continues to advance, more computing power and features are available to computer users. For example, information which has historically been limited to published paper documents is now being made available through on-line computing services from publishers and information vendors. As an increasing market share of the data and computing capacity is provided through low cost high performance personal computers, some of the on-line information is also being made available in compact disks (CD) and magnetic media formats. Compact disk and magnetic media technology offer cost effective mass storage of documents, images and other data, in a format readily accessible for use with personal computers in a home or office environment. The combination of personal computers, compact disk technology and multimedia interactive graphic user interfaces, permits the access and display of textual and graphic information by personal computer (PC) users in a manner not previously known in the industry. The type of information potentially available to a PC user includes professional and technical publications, newspapers, magazines, and other scientific and literary data and images.
However, much of the information which is published through, for example, government sources, newspapers and magazines is not in machine readable form, but rather is printed on paper. Because of the amount of work and effort required to convert the printed information into a machine readable form, only a small portion of the total published information is currently available for use by PC users using magnetic disks, CDs and the like. In addition, the information which is in machine readable form is typically available either as an image of the original document or as a stream of text data. An image of a document has the advantage of presenting the information in its original format as published, including non-text material, such as drawings, equations, symbols, diagrams, etc. The viewer is familiar with the format, and the information is easily recognized and understood. However, since a document image is often stored as a bitmap, the content of the document cannot be easily searched or manipulated. Alternatively, a text data stream format has the advantage of presenting the information in a manipulable and searchable format. Unfortunately, in many cases, the format of presentation is not the format in which the information was originally published in print. Thus, the users are often unfamiliar with the format, which inhibits easy navigation of the document and makes information difficult to find and use.
One example of the problem of reproducing originally published documents stored in machine readable form is the storage and display of United States patent documents by the United States Government. The United States Patent Office (herein referred to as the "PTO") provides magnetic tapes of issued U.S. patents and other documents, in the form of a scanned-in image, and as a separate stream of text data. The magnetic tape storing the text data does not include graphical illustrations such as drawings, charts, textual tables, or much in the way of formatting data. Thus, the reproduction of a United States patent from PTO Text Files stored on magnetic tape does not result in the display of a U.S. patent as originally published by the U.S. Government. An example of a well known system for displaying text files provided by the PTO is that of the LexPat® system provided by Mead Data offered in conjunction with the Lexis® display system. Using the LexPat® system, the display of a U.S. patent on a terminal, such as a PC, results in a display of text only, and does not include drawings, charts, graphs, or original formatting information. The text of a selected patent appears in ASCII format, but does not appear as the original patent issued by the PTO, and may not be referenced by the original column and line numbers from the published patent. Other systems display text files of periodicals such as the Wall Street Journal or legal documents such as contracts. However, the text files do not appear as the original documents.
The U.S. Patent Office also provides magnetic tapes with image files comprising a scanned in image of the original U.S. patent issued by the PTO and published by the U.S. Government. The image files provided on magnetic tape by the PTO simply represent a bitmap image of the original published patent. As a scanned in image, the entire patent is provided including drawings, charts, graphs, text and the original format, since it represents a simple bitmap of the scanned original document. However, a scanned document may not be easily searched, edited, navigated or otherwise manipulated as can a text file.
As will be described, the present invention provides a method and apparatus for extracting, synchronizing, displaying, navigating and manipulating text and image documents simultaneously in electronic form. The present invention is described with particular reference for use with U.S. patent documents, and includes the process of extracting patent text and image data from magnetic tapes provided by the PTO, synchronizing the text and image data for recovering the original format (i.e., columns and lines) of the original published patent, and displaying the formatted text along with images using a unique graphical user interface (GUI) workbench. Although the present invention is described with reference to patent documents, it will be appreciated that the invention has application to a variety of different types of documents and applications.
The present invention's graphical user interface permits a user to selectively view ASCII text documents as well as bitmapped scanned images simultaneously on a display. When used in conjunction with U.S. patent documents, the graphic user interface of the present invention allows a user, such as a patent attorney, to display and manipulate both textual as well as graphic portions of patents. The text of a patent may be viewed on the display as it was originally published by the PTO, including column and line numbers. Simultaneously, the user may view the figures of a patent in the form of an image comprising a bitmap. Various functions are provided by the present invention for viewing, manipulating and displaying the patent documents. In order to assist the reader in understanding graphic user interface (GUI) technology, it is suggested that certain references be considered for background. Many user interfaces utilize metaphors in the design of the interface as a way of maximizing human familiarity, and conveying information between the user and the computer. Through the use of familiar metaphors, such as desktops, notebooks, spread sheets, and the like, the interface takes advantage of existing human mental structures to permit a user to draw upon the metaphor analogy to understand the requirements of the particular computer system. (See for example, Patrick Chan "Learning Considerations in User Interface Design: The Room Model", Report CS-84-16, University of Waterloo, Computer Science Department, Ontario, Canada, July, 1984 and the references cited therein.) In addition, the reader is referred to the following references which describe various aspects, methods and apparatus associated with prior art graphic user interface design: U.S. Pat. No. Re. 32,632; U.S. Pat. No. 4,931,783; U.S. Pat. No. 5,072,412; and U.S. Pat. No. 5,148,154, and the references cited therein.
As will be described more fully below, the present invention's graphic user interface is based on a desktop "windows" metaphor, and provides the user with the ability to simultaneously display text and image documents in both a synchronized and unsynchronized fashion, as will be more fully described herein.
SUMMARY OF THE INVENTION
The present invention provides a method and apparatus for extracting, synchronizing, displaying, and manipulating text and image documents in machine readable form for display. In the preferred embodiment of the present invention, text and image files for documents, such as for example patent documents, are initially stored on separate magnetic tape media. These data files are extracted from the respective tapes and placed onto a faster medium, such as a hard disk drive. Catalogues are generated of the contents of the tapes and procedures are provided for locating and loading tapes from a tape inventory. The text and image files are synchronized to produce Equivalent Files using heuristic algorithms to create an approximate equivalence relationship between the text and the image files. In the presently preferred embodiment, the automatic pagination of the text and image files provides an equivalence relationship, and a final Equivalent File is obtained through human intervention to correct any inaccuracies still remaining after the automatic process has been completed. However, the present invention also contemplates an entirely automatic pagination process which would require no human intervention to obtain a usable Equivalent File. A word based inverted tree index is created for the text files to allow for very fast text searching using a graphic user interface (GUI) workbench.
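The word based index mentioned above can be illustrated with a minimal sketch. This is not the inverted tree index of the preferred embodiment, whose internal structure is not given here; it is a plain dictionary-based inverted index, and the function names (build_inverted_index, search_all, search_near) and the tokenization rule are assumptions made only for illustration. The Boolean and proximity lookups correspond to the kinds of searches described later in this summary.

```python
import re
from collections import defaultdict


def build_inverted_index(documents):
    """Map each lowercased word to {doc_id: [word positions]}.

    `documents` is a hypothetical dict of doc_id -> plain text.
    """
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in documents.items():
        words = re.findall(r"[a-z0-9]+", text.lower())
        for position, word in enumerate(words):
            index[word][doc_id].append(position)
    return index


def search_all(index, words):
    """Boolean AND: ids of documents that contain every given word."""
    postings = [set(index.get(w.lower(), {})) for w in words]
    return set.intersection(*postings) if postings else set()


def search_near(index, first, second, max_distance):
    """Proximity search: documents where the two words occur
    within `max_distance` word positions of each other."""
    a = index.get(first.lower(), {})
    b = index.get(second.lower(), {})
    hits = set()
    for doc_id in set(a) & set(b):
        if any(abs(p - q) <= max_distance
               for p in a[doc_id] for q in b[doc_id]):
            hits.add(doc_id)
    return hits
```

Storing word positions rather than only document ids is what makes the proximity lookup possible without rescanning the text.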
The Equivalent Files and image files residing on, for example, a hard disk drive or compact disk (CD), are coupled as a resource to a computer display system. The computer display system includes a computer having a central processing unit (CPU) coupled to memory and input/output (I/O) circuitry. The computer is also coupled to a CD ROM, hard disk drive, or other mass memory device onto which the Equivalent File and image file have been stored. The computer is coupled to a display, such as a cathode ray tube (CRT) or liquid crystal display, as well as a keyboard and a cursor control device. The graphic user interface of the present invention is displayed by the computer on the CRT, and includes a menu bar and a tool bar, each bar having a plurality of command options for selection by a user. The graphical user interface of the present invention permits the user to display, manipulate, and navigate the Equivalent File created using the process of the present invention, and to simultaneously view the image file on the display. In accordance with the teachings of the present invention, the Equivalent File may be synchronized with the image file, or alternatively, an Equivalent File may be displayed along with a completely separate and distinct image (for example, viewing the Equivalent File of one patent while viewing the image file of another patent). Once created, and as shown on the display, the Equivalent File is displayed in substantially the same column and line format as a printed patent published by the U.S. Government.
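As a rough illustration of synchronized display, the sketch below looks up which image page corresponds to a position in the text. The ColumnLink record, its fields, and the page_for_offset function are hypothetical; the actual layout of the linking information in an Equivalent File is not specified in this summary, so this is only one way such a lookup might be organized.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class ColumnLink:
    """Hypothetical link record: one printed column of patent text."""
    column: int       # printed column number
    text_start: int   # character offset where the column begins
    image_page: int   # image page on which the column appears


def page_for_offset(links, offset):
    """Return the image page for the column containing `offset`.

    `links` must be sorted by text_start.
    """
    starts = [link.text_start for link in links]
    i = bisect_right(starts, offset) - 1
    if i < 0:
        raise ValueError("offset precedes the first linked column")
    return links[i].image_page
```

In a synchronized mode a viewer could call such a lookup whenever the text position changes; in an unsynchronized mode the image page would simply be chosen independently of the text position.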
Using the graphic user interface of the present invention, a user may create libraries of patent text Equivalent Files and image files, as well as open cases to include a plurality of different patents or other documents. The Equivalent File may be selectively viewed on the display in an equivalent window. The Equivalent File may be navigated, highlighted, searched, and otherwise annotated using highlights, patent notes, and case notes. Simultaneous with the viewing of the Equivalent File of a patent within the equivalent window, the user may view the exact portion of the image file corresponding to the display of the Equivalent File, or any portion of an image file within one or more image windows on the display. The present invention further provides search mechanisms for defining and searching key words chosen by the user or selected from the Equivalent File, or a word list. Boolean and proximity searches may also be performed on the Equivalent File and the results displayed. The search terms may be used to search documents within the equivalent window of a current Equivalent File, current library of documents, document notes (referred to herein as "patent notes" and/or "case notes"), as well as other selected cases. The word list includes an alphabetical list of all words within the selected library, document or the like. The present invention also permits the user to display an image, for example a patent drawing image, within the image window by placing a cursor in the text of a patent Equivalent File and signaling the computer. In response to this signal, the computer displays the last referenced figure drawing within the image window. The interface of the present invention also permits the user to select portions of text and/or drawings within the image window, and enlarge or reduce the selected image for viewing by the user. The interface further permits the user to select any element number appearing on the patent drawings in the image window.
The selection of an element number in a patent drawing results in the automatic highlighting of the first and every subsequent occurrence of that element number in the Equivalent File comprising a specification and claims of the selected patent equivalent displayed in the equivalent window. Additionally, multiple patents, drawings and/or other documents may be viewed simultaneously on the display in accordance with the teachings of the graphic user interface comprising the present invention. A variety of other features and functions are provided by the present invention for the manipulation, navigation and display of patent documents on the user interface. The user may display either a synchronized Image File wherein the image displayed is synchronized with the Equivalent File displayed, or an unsynchronized Image File wherein the image displayed is at some page other than the one containing the column of text in the Equivalent File. A user may also copy and paste a portion of, or the whole, Equivalent File to notes of third party programs, such as word processors or drawing programs, and may import ASCII text into the notes from third party systems, for example importing deposition testimony in ASCII format into patent notes that relate to the topic of the testimony. Particularly when using the present invention with patents, it may be used to facilitate patent searching in the preparation and prosecution of patents, licensing of patents, litigation of patents, conducting infringement and validity studies of patents, producing infringement claim charts, managing and valuing a portfolio or group of patents, conducting 35 U.S.C. § 112 searches on patents or pending applications, and many other uses which are regularly performed by a patent attorney, patent agent or technical personnel.
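By way of illustration only, the element number highlighting described above may be sketched as follows. The function names, the bracket markers used in place of on-screen highlighting, and the rule that an element number must not be embedded in a longer number are all assumptions made for this sketch and are not taken from the specification.

```python
import re


def find_element_occurrences(text: str, element: str) -> list[tuple[int, int]]:
    """Return (start, end) character spans of each standalone occurrence.

    Assumption: an element number such as "42" should not match inside a
    longer token such as "142" or a decimal such as "3.42".
    """
    pattern = re.compile(rf"(?<![\w.]){re.escape(element)}(?!\w)")
    return [match.span() for match in pattern.finditer(text)]


def highlight(text: str, element: str, open_mark: str = "[[", close_mark: str = "]]") -> str:
    """Wrap every occurrence of the element number in marker strings."""
    pieces = []
    last = 0
    for start, end in find_element_occurrences(text, element):
        pieces.append(text[last:start])
        pieces.append(open_mark + text[start:end] + close_mark)
        last = end
    pieces.append(text[last:])
    return "".join(pieces)
```

The first span returned corresponds to the first occurrence, and the remaining spans to every subsequent occurrence, so a viewer could scroll to the first span and highlight the rest.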
NOTATION AND NOMENCLATURE
In some of the detailed descriptions which follow, the present invention is presented partly in terms of interface display images, process steps, and symbolic representations of operations of data bits within a computer memory. These algorithmic descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art.
An algorithm is here, and generally, conceived to be a self-consistent sequence of steps leading to a desired result. These steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities may take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, displayed and otherwise manipulated. It proves convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, images, terms, numbers, or the like. It should be borne in mind, however, that all of these similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities.
In the present invention, the operations referred to are machine operations performed in conjunction with a human operator. Useful machines for performing the operations of the present invention include general purpose digital computers, digitally controlled displays or other similar devices. In all cases, the reader is advised to keep in mind the distinction between the method of operating a computer and/or display system, and the method of computation itself. The present invention relates to methods for operating a computer and interactive display system, and processing electrical or other physical signals to generate other desired physical signals.
The present invention also relates to apparatus for performing these operations. This apparatus may be specially constructed for the required purposes or it may comprise a general purpose computer selectively activated or reconfigured by a computer program stored in the computer. The method steps presented herein are not inherently related to any particular computer or other apparatus. Various general purpose machines may be used with programs in accordance with the teachings herein, or it may prove more convenient to construct specialized apparatus to perform the required method steps. As such, no particular programming language is provided, as any one of a variety of languages may be utilized to implement the invention. The required structure for a variety of these machines and programming environments will be apparent from the description given below.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a block diagram of the production configuration to extract text and image files, paginate the text files with the image files to produce Equivalent Files, and index the Equivalent Files.
FIG. 2 is a flow chart illustrating the sequence of steps utilized by the present invention to extract text and image files, paginate the text files with the image files to produce Equivalent Files, index the Equivalent Files and display the Equivalent Files and/or Image Files on a display.
FIG. 3 is a functional block diagram illustrating a computer display system incorporating the teachings of the present invention.
FIG. 4 illustrates an enlarged portion of an image file comprising the bibliography page of U.S. Pat. No. 5,165,027.
FIG. 5 illustrates a sample portion of a PTO Text File for U.S. Pat. No. 5,165,027 illustrated in FIG. 4.
FIG. 6 illustrates an example of the column information listed in the PTO Text File for the U.S. Pat. No. 5,165,027 illustrated in FIGS. 4 and 5.
FIG. 7 illustrates the paragraph shown in FIG. 6 as it is stored in the PTO Image File for U.S. Pat. No. 5,165,027.
FIG. 8 illustrates the column line number information provided by a published United States patent.
FIG. 9 illustrates a flow chart block diagram of the extraction process utilized by the present invention to extract PTO Text Files and PTO Image Files from magnetic tapes provided by the PTO for use by the processing system of the present invention to synchronize and index the text and image files.
FIG. 10 is a flow chart illustrating the pagination process of the present invention to synchronize the PTO Text File and the PTO Image File to produce an Equivalent File.
FIG. 11 illustrates the user interface of the present invention upon system start including the title, menu and tool bars.
FIG. 12 illustrates the selection by a user of a down arrow function to open a list of available cases.
FIG. 13 illustrates the present invention's use of information arrows to direct the user to currently available options for execution.
FIG. 14 illustrates the patent text toolbox of the present invention and the display of a menu of patent section headings to assist the user in navigating a selected patent.
FIG. 15 illustrates the sub-command items available for selection by a user upon activating the Library menu option.
FIG. 16 illustrates the Set Library Directories dialog box, displayed after selection of the Set Library Directories sub-command item on the Library menu.
FIG. 17 illustrates the New Library dialog box.
FIG. 18 illustrates the Open Library dialog box.
FIG. 19 illustrates the present invention's Library dialog box for working with the library currently in use.
FIG. 20 illustrates the selection of a patent within the Intel® Library.
FIG. 21 illustrates the present invention's minimization of a library to an icon.
FIG. 22 illustrates the present invention's Update Library dialog box for updating the library currently in use, which in the present example is the Intel® Library.
FIG. 23 illustrates the present invention's Search Library dialog box which is displayed upon selection of the Search sub-command item from the library menu.
FIG. 24 illustrates the present invention's Word List dialog box which is displayed upon the activation of the Word List button function within the Search Library dialog box.
FIG. 25 illustrates the operation of the present invention's Word List dialog box for selecting an alphabetical tab and viewing the corresponding list of words from the library patents.
FIG. 26 illustrates the present invention's Search Results dialog box identifying the number of occurrences of the search term defined by the user in each of the library patents.
FIG. 27 illustrates the present invention's Library to Case Cross Reference dialog box.
FIG. 28 illustrates the present invention's Patent Text Toolbox for operating upon Equivalent Files displayed in an equivalent window.
FIG. 29 further illustrates the present invention's Patent Text Toolbox for operating upon the Equivalent File within the equivalent window.
FIG. 30 illustrates the present invention's simultaneous display of an equivalent window and an image window, as well as the display of a Patent Image Toolbox for operating upon images displayed within the image window.
FIG. 31 illustrates the present invention's simultaneous and synchronized display of an Equivalent File in an equivalent window and enlarged image displayed in an image window on the display screen.
FIG. 32 illustrates the display of patent section headings and the ability of a user to navigate the patent sections displayed within the equivalent window through the selection of section headings.
FIG. 33 illustrates the present invention's synchronization of an Equivalent File displayed in the equivalent window with the drawings of a patent disposed in an image file displayed in an image window on the display screen. The present invention links references to the figure numbers in the Equivalent File to the figures in the image file displayed in the image window.
FIG. 34 illustrates the present invention's use of an outline box to identify an area of the patent image to be enlarged.
FIG. 35 illustrates the present invention's user interface in which an Equivalent File is displayed in an equivalent window, and simultaneously, an enlarged portion of a figure from the image file is displayed in the image window on the display screen.
FIG. 36 illustrates the present invention's Select Element Number dialog box, which permits a user to input a drawing element and locate the first occurrence and the subsequent occurrences of the drawing element in the Equivalent File displayed in the equivalent window.
FIG. 37 illustrates the present invention's use of highlighting to highlight desired portions of the Equivalent File in various colors.
FIG. 38 illustrates the present invention's display of two equivalent windows and one image window on the display screen.
FIG. 39 illustrates the Import Patents dialog box of the present invention.
FIG. 40 illustrates the Import Patents dialog box after the selection of an Equivalent File to be imported.
FIG. 41 illustrates sub-command items available for selection upon the activation of the Case menu option.
FIG. 42 illustrates the Open Case dialog box which is displayed once the Open Case sub-command item illustrated in FIG. 41 is selected.
FIG. 43 illustrates the New Case dialog box which is displayed upon the selection of the New Case sub-command item illustrated in FIG. 41.
FIG. 44 illustrates the patent number drop down menu which permits a user to select a patent within a case for displaying.
FIG. 45 illustrates the Update Case dialog box which is displayed upon the activation of the Update Case sub-command item illustrated in FIG. 41.
FIG. 46 illustrates the Search Case dialog box which is displayed upon the selection of the Search sub-command item of the Case menu illustrated in FIG. 41.
FIG. 47 illustrates the Set Case Directories dialog box which is displayed upon the activation of the Set Case Directories sub-command item illustrated in FIG. 41.
FIG. 48 illustrates the Copy to Case dialog box which is displayed upon the selection of the Copy Case sub-command item illustrated in FIG. 41.
FIG. 49 illustrates the Backup Case dialog box which is displayed upon the activation of the Backup Case sub-command item of FIG. 41.
FIG. 50 illustrates the Delete dialog box which is displayed upon the selection of the Delete Case sub-command item illustrated in FIG. 41.
FIG. 51 illustrates the Print dialog box of the present invention which is displayed upon the activation of the Print sub-command item illustrated in FIG. 41.
FIG. 52 illustrates the Print Setup dialog box which is displayed upon the activation of the Print Setup sub-command item illustrated in FIG. 41.
FIG. 53 illustrates the sub-command items available for selection upon the activation of the Edit command option.
FIG. 54 illustrates the sub-command items available for selection by a user upon the activation of the View command option.
FIG. 55 illustrates the Preferences dialog box displayed upon the activation of the Preferences sub-command item of FIG. 54.
FIG. 56 illustrates the Screen Layout dialog box which is displayed upon the selection of a Screen Layout sub-command item of FIG. 54.
FIG. 57 illustrates the user interface of the present invention upon the selection of the Screen Layout of the Screen Layout dialog box illustrating one equivalent window and one image window on the display screen.
FIG. 58 illustrates the user interface of the present invention in which two equivalent windows are displayed side by side on the display screen after selection of Screen Layout of the Screen Layout dialog box.
FIG. 59 illustrates the graphic user interface of the present invention in which two equivalent windows and two image windows are displayed on the display screen subsequent to the selection of Screen Layout of the Screen Layout dialog box.
FIG. 60 illustrates the sub-command items available for selection upon the activation of the Window command option.
FIG. 61 illustrates the patent note menu of the present invention which displays all patent notes which have been generated by a user.
FIG. 62 illustrates a patent note of the present invention.
FIG. 63 illustrates the present invention's use of multi-notes wherein multiple patent notes may be created within a single patent note.
FIG. 64 illustrates the present invention's case note.
FIG. 65 illustrates the minimization of exemplary documents, such as search results and the like on the display of the present invention.
FIG. 66 illustrates the present invention's Go To Section dialog box which permits a user to input a patent column number and upon activation, results in the display of the column in the Equivalent File corresponding to the desired patent column.
FIG. 67 illustrates the present invention's Go To Section dialog box which permits a user to select a section of the patent and upon activation, results in the display of the selected section in the equivalent window.
FIG. 68 illustrates the sub-command items available for selection by a user upon the activation of the Help command option.
FIG. 69 illustrates the About dialog box which is displayed upon the activation of the About sub-command item illustrated in FIG. 68.
FIG. 70 illustrates the sub-command items which are available for selection by a user upon the activation of the Note command option.
FIG. 71 illustrates the case notes in Case dialog box which is displayed upon the selection of the View Case Note sub-command option illustrated in FIG. 70.
FIG. 72 illustrates the patent notes in Case dialog box which is displayed upon the selection of the View Patent Note sub-command item illustrated in FIG. 70.
DETAILED DESCRIPTION OF THE INVENTION
In the following description, numerous specific details are set forth such as functional blocks, representative data processing devices, window configurations, specific patent documents, text and drawings, etc., to provide a thorough understanding of the present invention. However, it will be apparent to one skilled in the art that the present invention may be practiced without these specific details. In other instances, well known circuits and structures are not described in detail in order not to obscure the present invention unnecessarily.
The present invention will be described in various sections including a discussion of the general system configuration, the tape extraction process, the pagination process, the indexing process, and the graphic user interface. It is to be understood that although the following description is directed to U.S. patent documents, the present invention is not limited to patents, and has application to a variety of documents and images, as may be required by a particular application, such as for example, legal contracts, the Wall Street Journal, The Los Angeles Times, etc.
General Overview of the Invention
The general system configuration of the present invention discloses one possible implementation of the present invention for the display, navigation, manipulation and editing of text and image data in a graphical user interface. As will be described, the general system configuration describes a computer display system which may be in the form of a personal computer, workstation, or dedicated processor system to permit the user to utilize the teachings of the present invention. No particular computer hardware is described within this specification, and the general system configuration description is intended to encompass a broad range of possible data processing systems in which the present invention may be implemented.
A general overview of the system of the present invention is shown in FIG. 1, and a flow chart of the primary process steps comprising the method of the present invention is illustrated in FIG. 2.
The tape extraction process of the present invention extracts data files from PTO text and PTO Image File magnetic tapes provided by the PTO. The data files are extracted from these tapes onto a faster medium (such as a hard disk drive) to provide access times which are useful in modern data processing systems. As will be described, the process of extraction involves appropriately generating catalogues and inventories of the contents of the tapes, as well as procedures for selecting and loading tapes from the newly created tape inventories.
The process of paginating the PTO Text Files and the PTO Image Files to produce "Equivalent Files" is performed by using a heuristic set of algorithms to automatically create an approximate equivalent relationship between the text and image files. A human operator verifies the results to finalize the Equivalent File, such that the original formatting of the published patent document is reflected in the Equivalent File.
As will be described, a process for creating an inverted tree index for the text contained in the PTO Text Files is disclosed. This indexing process results in a pre-built index for very fast text searching when using the graphic user interface of the present invention. Although the present invention describes an inverted tree index, other types of text searching methods may be employed instead of the inverted tree index.
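By way of illustration only, a word based inverted index of the general kind referred to above may be sketched as follows. The specification does not give the layout of the tree, so this sketch substitutes an ordinary dictionary keyed by word; the tokenizer, the function names, and the document identifiers are likewise assumptions made for the example.

```python
import re
from collections import defaultdict


def tokenize(text: str) -> list[str]:
    """Split text into lowercase alphanumeric words (a simplifying assumption)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents: dict[str, str]) -> dict[str, dict[str, list[int]]]:
    """Map each word to the documents containing it and its word positions."""
    index: dict[str, dict[str, list[int]]] = defaultdict(lambda: defaultdict(list))
    for doc_id, text in documents.items():
        for position, word in enumerate(tokenize(text)):
            index[word][doc_id].append(position)
    return index


def search_all(index, *words: str) -> set[str]:
    """Boolean AND: identifiers of documents that contain every given word."""
    postings = [set(index.get(word.lower(), {})) for word in words]
    return set.intersection(*postings) if postings else set()


def occurrences(index, word: str) -> dict[str, int]:
    """Number of occurrences of a word in each document that contains it."""
    return {doc_id: len(pos) for doc_id, pos in index.get(word.lower(), {}).items()}


def word_list(index) -> list[str]:
    """Alphabetical list of every indexed word."""
    return sorted(index)
```

Because word positions are retained, a proximity search could be layered on the same structure by comparing position lists. The index is built once, ahead of time, so a search reduces to dictionary lookups rather than a scan of the text.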
The graphic user interface ("GUI") of the present invention displays the Equivalent File and the PTO Image File, and allows the user to perform analysis on the displayed files or other stored files. The Equivalent File is formatted and displayed with a similar appearance to the PTO Image File, having the same column and line formatting as the published patent. The user may then, for example, use the GUI to perform text searches to generate accurate column and line citations, navigate the Equivalent File via section headings to locate desired sections of text, as well as to view the figures or text images in the displayed files or other stored files. Images and equivalent patent text may be viewed either in a synchronized or unsynchronized fashion using the teachings of the present invention.
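By way of illustration only, the generation of column and line citations from a text search may be sketched as follows. The representation of an Equivalent File as a mapping from column number to a list of lines is an assumption made for this sketch; the actual file layout is described elsewhere in this specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Citation:
    """A column and line position in the published patent."""
    column: int
    line: int

    def __str__(self) -> str:
        return f"col. {self.column}, line {self.line}"


def cite(columns: dict[int, list[str]], term: str) -> list[Citation]:
    """Return a citation for every line containing the term.

    Lines are numbered from 1 within each column. A term that is split
    across a line break is not found by this simple sketch.
    """
    needle = term.lower()
    hits = []
    for column_number in sorted(columns):
        for line_number, line in enumerate(columns[column_number], start=1):
            if needle in line.lower():
                hits.append(Citation(column_number, line_number))
    return hits
```

Because the Equivalent File preserves the column and line breaks of the published patent, the position of a hit within this structure can be reported directly as a citation.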
General System Configuration
FIG. 1 illustrates a block diagram of the present invention's production configuration to extract text and image files, to paginate the text files to produce Equivalent Files, and to index the Equivalent Files. The process begins with the PTO magnetic tapes 1 that are of type 3480 from the PTO. There are three different categories of PTO magnetic tapes: PTO text tapes, PTO image tapes and PTO assignment tapes. A UNIX machine 2 reads the data in the PTO tapes 1 into a large file buffer. The data is then parsed to find each of the documents that are on the tapes. Parsing creates a table which contains patent numbers, the physical locations of the patent files on the tapes, the total number of bytes and other control information about each document that appears on the tape. A document can be either a patent, a certificate of correction, a reissued patent disclaimer or any other post-issuance document. The data can then be either stored in a digital linear tape (DLT) 3 or in any other suitable data storage medium. Because the amount of disk storage space required for the total active set of patents is greater than 1 terabyte (TB), currently the data is stored into libraries 5. The libraries may contain PTO Text Files 6, PTO Image Files 7 and post issuance documents 9. If a disk drive system with a large enough storage is available, the data can be stored in a disk drive. At present, the PTO image tapes are left in their original medium, namely the 3480 magnetic tapes.
Continuing to refer to FIG. 1, when an order 10 requesting a list of patents is entered into a UNIX database 11, the UNIX database 11 sorts the request list by patent location to minimize the number of different tapes that need to be mounted, and sends to the staging machine 8 the list of patents and other pertinent information such as the volume serial number of the tapes, and location information that allows the staging machine 8 to fast forward to the individual patent files that are requested. The staging machine 8 creates a file on its disks of all the text and image portions of each patent that has been requested for processing. When the staging machine 8 has the text and image files available, it sends the text and image files to the pagination machine 13.
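By way of illustration only, the sorting of a request list by patent location may be sketched as follows. The record fields, the catalogue mapping, and the patent identifiers are hypothetical; the specification states only that the volume serial number and location information are sent to the staging machine.

```python
from collections import defaultdict
from typing import NamedTuple


class TapeLocation(NamedTuple):
    """Where a patent file sits: which tape, and how far along it (assumed fields)."""
    volume_serial: str
    offset: int


def plan_staging(requested: list[str], catalogue: dict[str, TapeLocation]) -> list[tuple[str, list[str]]]:
    """Group requested patents by tape, ordered by position on each tape.

    Each tape then needs to be mounted only once and can be read in a
    single forward pass.
    """
    by_tape: dict[str, list[tuple[int, str]]] = defaultdict(list)
    for patent in requested:
        location = catalogue[patent]
        by_tape[location.volume_serial].append((location.offset, patent))
    return [
        (volume, [patent for _, patent in sorted(entries)])
        for volume, entries in sorted(by_tape.items())
    ]
```

In this sketch an order touching two tapes results in exactly two mounts, regardless of the order in which the patents were requested.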
Further referring to FIG. 1, at present, the pagination machine 13 utilizes one or more DOS based machines 16 to paginate the text and image files and to create Equivalent Files as described more fully in the Terminology and Definition section in this Specification. After pagination, an index machine 19 adds post issuance documents 9 and indexes the Equivalent Files. The index machine 19 incorporates one or more DOS based machines 20. Next, the manufacturing machine 23 creates a CD ROM image of the Equivalent Files and the Image Files and writes the image to a CD ROM and digital linear tapes 28. The manufacturing machine 23 may utilize one or more DOS based machines 27, a CD ROM writer 25 and digital linear tapes 28. The CD ROM with the Equivalent Files and the Image Files is delivered to a user who then uses a system, such as the one illustrated in FIG. 3, to display and manipulate the files. The digital linear tapes with the finished patents are stored in a library 30, and the database 11 is updated so that when a particular patent in the library 30 is requested, the staging machine 8 mounts the finished patent from the library 30, and the database flags that the patent has already been paginated and indexed, so that pagination and indexing steps can be skipped for a faster process. Although in the present invention, specific machines such as UNIX machines and DOS machines are disclosed, these are mere examples of different types of computer systems that can be incorporated and not limitations upon the present invention.
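By way of illustration only, the skipping of pagination and indexing for patents already held in the finished library may be sketched as follows. The function name, the use of a set to stand in for the database flag, and the callables representing the pagination and index machines are assumptions made for this sketch.

```python
from typing import Callable


def process_order(
    order: list[str],
    finished: set[str],
    paginate: Callable[[str], str],
    index: Callable[[str], None],
) -> list[str]:
    """Return the patents ready for manufacturing, in the order requested.

    Patents flagged as finished are taken as-is; all others are paginated
    and indexed, then flagged so that later orders can skip both steps.
    """
    ready = []
    for patent in order:
        if patent not in finished:
            equivalent_file = paginate(patent)  # stand-in for the pagination machine
            index(equivalent_file)              # stand-in for the index machine
            finished.add(patent)                # stand-in for updating the database flag
        ready.append(patent)
    return ready
```

A second order containing the same patent would pass straight through, which corresponds to the faster process noted above.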
Referring now to FIG. 3, an exemplary computer display system for use in accordance with the teachings of the present invention is shown. The computer system includes a display 40, such as a CRT monitor or a liquid crystal display (LCD), and further includes a cursor control device 42, such as a mouse of the type shown in U.S. Pat. No. Re. 32,632, a track ball, joy stick, keyboard or other device for selectively positioning a cursor 44 on a display screen 68 of the display 40. Typically, the cursor control device 42 includes a signal generation means, such as a switch 46 having a first position and a second position. For example, the mouse shown and described in U.S. Pat. No. Re. 32,632 includes a switch which the user of the computer system uses to generate signals directing the computer to execute certain commands. As illustrated, the cursor control means 42 (hereinafter all types of applicable cursor control devices, such as mice, track ba |