Method and system for redirection to arbitrary front-ends in a communication system7043563Abstract A system and method for providing network resources from an origin server to a client. A set of intermediary servers is topologically dispersed throughout a network. An enhanced communication channel is provided between the set of intermediary servers and the origin server. A redirector receives address resolution requests for the origin server, selects one of the intermediary servers in response to the request, and provides a network address of the selected intermediary servers to an entity generating the address resolution request. Claims We claim: Description BACKGROUND OF THE INVENTION
A particular advantage of the architecture shown in FIG. 2 is that it is readily scaled. Any number of client machines 205 may be supported. In a similar manner, a web site owner may choose to implement a site using multiple web servers 210 that are co-located or distributed throughout network 101. To avoid congestion, additional front-end servers 201 may be implemented or assigned to particular web sites. Each front-end server 201 is dynamically re-configurable by updating address parameters to serve particular web sites. Client traffic is dynamically directed to available front-end servers 201 to provide load balancing. Hence, when quality of service drops because of a large number of client accesses, an additional front-end server 201 can be assigned to the web site and subsequent client requests directed to the newly assigned front-end server 201 to distribute traffic across a broader base. In the particular examples, this is implemented by a front-end manager component 207 that communicates with multiple front-end servers 201 to provide administrative and configuration information to front-end servers 201. Each front-end server 201 includes data structures for storing the configuration information, including information identifying the IP addresses of web servers 210 to which they are currently assigned. Other administrative and configuration information stored in front-end server 201 may include information for prioritizing data from and to particular clients, quality of service information, and the like. Similarly, additional back-end servers 203 can be assigned to a web site to handle increased traffic. Backend manager component 209 couples to one or more back-end servers 203 to provide centralized administration and configuration service. Back-end servers 203 include data structures to hold current configuration state, quality of service information and the like. In the particular examples front-end manager 207 and back-end manager 209 serve multiple web sites 210 and so are able to manipulate the number of front-end servers and back-end servers assigned to each web site 210 by updating this configuration information. When the congestion for the site subsides, the front-end server 201 and back-end server 203 can be reassigned to other, busier web sites. These and similar modifications are equivalent to the specific examples illustrated herein. In the case of web-based environments, front-end server 201 is implemented using custom or off-the-shelf web server software. Front-end server 201 is readily extended to support other, non-web-based protocols, however, and may support multiple protocols for varieties of client traffic. Front-end server 201 processes the data traffic it receives, regardless of the protocol of that traffic, to a form suitable for transport by TMP 202 to a back-end server 203. Hence, most of the functionality implemented by front-end server 201 is independent of the protocol or format of the data received from a client 205. Hence, although the discussion of the exemplary embodiments herein relates primarily to front-end server 201 implemented as a web server, it should be noted that, unless specified to the contrary, web-based traffic management and protocols are merely examples and not a limitation of the present invention. As shown in FIG. 2, in accordance with the present invention a web site is implemented using an originating web server 210 operating cooperatively with the web server of front-end server 201. More generally, any network service (e.g., FTP, VoIP, NNTP, MIME, SMTP, Telnet, DBMS) can be implemented using a combination of an originating server working cooperatively with a front-end server 201 configured to provide a suitable interface (e.g., FTP VoIP, NNTP, MIME, SMTP, Telnet, DBMS, WAP) for the desired service. In contrast to a simple front-end cache or proxy software, implementing a server in front-end server 201 enables portions of the web site (or other network service) to actually be implemented in and served from both locations. The actual web pages or service being delivered comprises a composite of the portions generated at each server. Significantly, however, the web server in front-end server 201 is close to the browser in a client 205 whereas the originating web server is close to all resources available at the web hosting center at which web site 210 is implemented. In essence the web site 210 is implemented by a tiered set of web servers comprising a front-end server 201 standing in front of an originating web server. This difference enables the web site or other network service to be implemented so as to take advantage of the unique topological position each entity has with respect to the client 205. By way of a particular example, assume an environment in which the front-end server 201 is located at the location of an ISP used by a particular set of clients 205. In such an environment, clients 205 can access the front-end server 205 without actually traversing the network 101. In order for a client 205 to obtain service from a front-end server 201, it must first be directed to a front-end server 201 that can provide the desired service. Preferably, client 205 does not need to be aware of the location of front-end server 201, and initiates all transactions as if it were contacting the originating server 210. FIG. 3 illustrates a domain name server (DNS) redirection mechanism that illustrates how a client 205 is connected to a front-end server 201. The DNS systems is defined in a variety of Internet Engineering Task Force (IETF) documents such as RFC0883, RFC 1034 and RFC 1035 which are incorporated by reference herein. In a typical environment, a client 205 executes a browser 301, TCP/IP stack 303, and a resolver 305. For reasons of performance and packaging, browser 301, TCP/IP stack 303 and resolver 305 are often grouped together as routines within a single software product. Browser 301 functions as a graphical user interface to implement user input/output (I/O) through monitor 311 and associated keyboard, mouse, or other user input device (not shown). Browser 301 is usually used as an interface for web-based applications, but may also be used as an interface for other applications such as email and network news, as well as special-purpose applications such as database access, telephony, and the like. Alternatively, a special-purpose user interface may be substituted for the more general purpose browser 301 to handle a particular application. TCP/IP stack 303 communicates with browser 301 to convert data between formats suitable for browser 301 and IP format suitable for Internet traffic. TCP/IP stack also implements a TCP protocol that manages transmission of packets between client 205 and an Internet service provider (ISP) or equivalent access point. IP protocol requires that each data packet include, among other things, an IP address identifying a destination node. In current implementations the IP address comprises a 32-bit value that identifies a particular Internet node. Non-IP networks have similar node addressing mechanisms. To provide a more user-friendly addressing system, the Internet implements a system of domain name servers that map alpha-numeric domain names to specific IP addresses. This system enables a name space that is more consistent reference between nodes on the Internet and avoids the need for users to know network identifiers, addresses, routes and similar information in order to make a connection. The domain name service is implemented as a distributed database managed by domain name servers (DNSs) 307 such as DNS_A, DNS_B and DNS_C shown in FIG. 3. Each DNS relies on <domain name:IP> address mapping data stored in master files scattered through the hosts that use the domain system. These master files are updated by local system administrators. Master files typically comprise text files that are read by a local name server, and hence become available through the name servers 307 to users of the domain system. The user programs (e.g., clients 205) access name servers through standard programs such as resolver 305. Resolver 305 includes an address of a DNS 307 that serves as a primary name server. When presented with a reference to a domain name (e.g., http://www.circadence.com), resolver 305 sends a request to the primary DNS (e.g., DNS_A in FIG. 3). The primary DNS 307 returns either the IP address mapped to that domain name, a reference to another DNS 307 which has the mapping information (e.g., DNS_B in FIG. 3), or a partial IP address together with a reference to another DNS that has more IP address information. Any number of DNS-to-DNS references may be required to completely determine the IP address mapping. In this manner, the resolver 305 becomes aware of the IP address mapping which is supplied to TCP/IP component 303. Client 205 may cache the IP address mapping for future use. TCP/IP component 303 uses the mapping to supply the correct IP address in packets directed to a particular domain name so that reference to the DNS system need only occur once. In accordance with the present invention, at least one DNS server 307 is owned and controlled by system components of the present invention. When a user accesses a network resource (e.g., a web site), browser 301 contacts the public DNS system to resolve the requested domain name into its related IP address in a conventional manner. In a first embodiment, the public DNS performs a conventional DNS resolution directing the browser to an originating server 210 and server 210 performs a redirection of the browser to the system owned DNS server (i.e., DNC_C in FIG. 3). In a second embodiment, domain:address mappings within the DNS system are modified such that resolution of the of the originating server's domain automatically return the address of the system-owned DNS server (DNS_C). Once a browser is redirected to the system-owned DNS server, it begins a process of further redirecting the browser 301 to the best available front-end server 201. Unlike a conventional DNS server, however, the system-owned DNS_C in FIG. 3 receives domain:address mapping information from a redirector component 309. Redirector 309 is in communication with front-end manager 207 and back-end manager 209 to obtain information on current front-end and back-end assignments to a particular server 210. A conventional DNS is intended to be updated infrequently by reference to its associated master file. In contrast, the master file associated with DNS_C is dynamically updated by redirector 309 to reflect current assignment of front-end server 201 and back-end server 203. In operation, a reference to web server 210 (e.g., http://www.circadence.com) may result in an IP address returned from DNS_C that points to any selected front-end server 201 that is currently assigned to web site 210. Likewise, web site 210 can identify a currently assigned back-end server 203 by direct or indirect reference to DNS_C. Front-end server 201 typically receives information directly from front-end server manager 207 about the address of currently assigned back-end servers 203. Similarly, back-end server 203 is aware of the address of a front-end server 201 associated with each data packet. Hence, reference to the domain system is not required to map a front-end server 201 to its appropriate back-end server 203. FIG. 6 shows an exemplary implementation of redirector 309 in accordance with the present invention. Redirector 309 is implemented as a multi-level set of redirector servers that cooperate to determine an IP address for a particular front-end server 201 out of a pool of otherwise arbitrary or generic front-end servers 201. When a global redirector server 309 receives a request for domain name resolution, it estimates the global region from which the request came using knowledge about IP address space allocated to particular global region. Third-party services (such as NetGeo) are available that provide such IP address "maps". The user will be redirected to a selected regional redirector server 603 which serves that region. The term "region" refers to portion of the Internet, which could be associated with a part of a country (e.g. southern United States), a country (e.g. Japan), or an entire geographic area (e.g. Western Europe). Such a determination is based on the Internet topology, not the global population. Once the user has been redirected to regional redirector server 603 in their home region, regional redirector server 603 takes over the process of finding the single best gateway front-end server 201 for the user. Regional redirector 603 identifies a network where the user is located. The "network" in which the user is located refers to a set of one or more front-end servers 201 connected to the same network redirector 604. On any given "network", for any given back-end server 203 (shown in FIG. 2), there must be at least one front-end server 201 connected to that back-end server 203. It is possible that regional redirector 603 might miscalculate the user's location and redirects that user to the wrong network redirector 604. In that situation, the selected network redirector 604 will not reject the user, but will do its best to calculate and return the address of its best available front-end server 201. In the preferred implementations, each front-end server 201 supports a communication channel with each back-end server 203. Hence, there exist a large number of alternative channels that may be selected to provide suitable service for a given connection. The best available front-end server 201 is determined based on estimates and actual measurements of quality of service that can be provided by the various alternative channels. In accordance with the present invention, network redirector 604 determines a quality of service index or factor for each of a plurality of the available alternative channels. This quality of service index takes into account a variety of component factors such as latency, lost packets, server load (e.g., CPU usage) within the various front-end servers 201 and back-end servers 203, and the like. This index also attempts to account for components that affect the connection between a client 205 and each front-end server 201, although these factors may not be determinable with precision. Based upon the client's IP address, redirector 604 can estimate the geographic or topological distance between each front-end server 201 and estimate the quality of service of the alternate links. For example, it might often be true that a front-end server 201 that is close to client 205 will provide superior service. The various component factors are combined arithmetically or logically to generate an index. The combination might be a simple addition or multiplication operation, or may involve weighting each of the component factors. Based upon the determined index, a front-end server 201 can be selected from the arbitrary set of available front-end servers 201. The network address (e.g., IP address) for the selected front-end server 201 is communicated to client 205 in response to the domain resolution request issued by client 205. Henceforth, client 205 maintains the association between the requested domain (www.abc.com) and the returned IP address of the selected front-end server 201. From the client's perspective, the server found at the returned IP address is within the address domain of the requested web site 210. If a regional redirector 603 were to be off-line, the user's request would fail. To avoid this situation, it is desirable that each regional redirector 603 will be served by at least two computers, to be known as primary and secondary. For regions with significant traffic, it may be advisable to deploy not only primary and secondary computers, but tertiary computers as well. Front-end servers 201 report their status on a regular basis to their network redirector 604. It is possible that a specific front-end server 201 might go down without the network redirector 604 becoming immediately aware. In that situation, a user's request would not fail. Network redirectors 604 will return not one, but two or three IP addresses of different front-end server computers 201. As a result, if one front-end server is down, the user will be automatically connected to another front-end server 201. It is expected that redirector mechanism 309 will usually realize the unavailability of a front-end server 201 in less than one minute, although the time in a particular implementation may be selected to meet the needs of that implementation. In cases where regional redirector 603 is unable to identify the network that the user belongs to, it will try to find an alternate network closest to that user, and will redirect that users to the network redirector 604 that serves this alternate network. The selected network redirector 604 completes the process of selecting the most suitable front-end server 201 for a user. The redirector makes a decision by considering several factors that may include:
Finding the absolute best available front-end server 201 may, in some cases, take too long. If this is the case for a given search, the search process could be curtailed in several ways:
Having selected a front-end server 201, the user is connected to the selected front-end server and the job of redirector 309 is complete. In a particular example, front-end servers 201 communicate only with their corresponding network redirector servers 604. As a result, the user will almost always be sent to a front-end server 201 that is topologically close, for example in their home region, even if not the first "best" choice. Communications between the front-end server computers 201 and the redirector 309 are preferably secure and fault-tolerant. A special case exists for secure communications. When a client sends a packet using secure HTTP, for example, the payload is encrypted in a manner that only the origin server 210 can decrypt. While the origin server could share its decryption keys with front-ends 201, this may reduce the integrity of the keys to an unacceptable degree owing to the wider distribution of the keys. Hence, a front-end 201 will be unable to parse and process secure packets in the same manner as it is able to work with conventional packets. In many cases, secure communications use a specially designated port on the sending and receiving machines. For example, conventional HTTP traffic uses port 80 whereas secure HTTP (HTTPS) uses port 443. While other ports can be designated, standard-compliant TCP/IP processes will listen for secure communication on port 443 by default. In these cases, when the client sends a communication on the secure port the back-end 203 should deliver the packet on the secure port. In accordance with an embodiment of the present invention, redirector 309 specifies a secure port (e.g., port 443) of the selected front-end 201 in the IP address returned to a client 205. Further, front-end 201 implements processes that listen to the secure port and handle packets received on the secure port specially. These packets may be blended with non-secure packets over TMP link 202. However, packets received on a secure port, or any other non-standard port, are marked or tagged with the port number on which they should be delivered. This enables a back-end 203 to address the server 210 on the same port that client 205 intended to send the packet. Each front-end server 201 regularly reports its status to its network redirector server 604. Once a new front-end server 201 comes online, the redirector 309 would immediately become aware of this new front-end server 201, as well as all back-end servers 203 connected to that front-end server. Should a front-end server 201 become unavailable, the redirector 309 will find out quickly, preferably in less than one minute. This small potential downtime can be reduced even further by means of an optional, third layer of DNS servers. It is important to prevent the network redirector servers from being misled. Communications between redirector 309 and the front-end servers 201 is preferably secure. Secure connections can be ensured in multiple ways:
The redirection system in accordance with the present invention is usefully understood in contrast with a conventional DNS system. A conventional DNS is intended to be updated infrequently by reference to its associated master file. In contrast, the master file associated with DNS_C is dynamically updated by redirector 309 to reflect current assignment of front-end server 201 and back-end server 203. In operation, a reference to web server 210 (e.g., http://www.abc.com) may result in an IP address returned from redirector 309 that points to a selected front-end server 201. Front-end server 201 typically receives information directly from front-end manager 207 about the address of currently assigned back-end servers 203. Similarly, back-end server 203 is aware of the address of a front-end server 201 associated with each data packet. Management utilities will be required to administer the redirector system 309. The global, regional and network DNS servers may require separate utilities. The following is an exemplary list of tasks that the management utilities for redirector system 309 may perform:
Returning now to FIG. 4, principle functional components of an exemplary front-end server 201 are illustrated in greater detail. Primary functions of the front-end server 201 include translating transmission control protocol (TCP) packets from client 205 into TMP packets used in the system in accordance with the present invention. It is contemplated that the various functions described in reference to the specific examples may be implemented using a variety of data structures and programs operating at any location in a distributed network. For example, a front-end server 201 may be operated on a network appliance 107 or server within a particular network 102, 103, or 104 shown in FIG. 1. The present invention is readily adapted to any application where multiple TCP clients are coupling to a centralized TCP resource. Moreover, other transport control protocols may be used, including proprietary transport protocols, so long as the transport protocols supply the functionality of the TCP protocol. TCP component 401 includes devices for implementing physical connection layer and Internet protocol (IP) layer functionality. Current IP standards are described in IETF documents RFC0791, RFC0950, RFC0919, RFC0922, RFC792, RFC1112 that are incorporated by reference herein. For ease of description and understanding, these mechanisms are not described in great detail herein. TCP component 401 communicates TCP packets with one or more clients 205. Received packets are coupled to parser 402 where the Internet protocol (or equivalent) information is extracted. TCP is described in IETF RFC0793 which is incorporated herein by reference. Each TCP packet includes header information that indicates addressing and control variables, and a payload portion that holds the user-level data being transported by the TCP packet. The user-level data in the payload portion typically comprises a user-level network protocol datagram. Parser 402 analyzes the payload portion of the TCP packet. In the examples herein, HTTP is employed as the user-level protocol because of its widespread use and the advantage that currently available browser software is able to readily use the HTTP protocol. In this case, parser 402 comprises an HTTP parser. More generally, parser 402 can be implemented as any parser-type logic implemented in hardware or software for interpreting the contents of the payload portion. Parser 402 may implement file transfer protocol (FTP), mail protocols such as simple mail transport protocol (SMTP), structured query language (SQL) and the like. Any user-level protocol, including proprietary protocols, may be implemented within the present invention using appropriate modification of parser 402. To improve performance, front-end server 201 optionally includes a caching mechanism 403. Cache 403 may be implemented as a passive cache that stores frequently and/or recently accessed web pages or as an active cache that stores network resources that are anticipated to be accessed. In non-web applications, cache 403 may be used to store any form of data representing database contents, files, program code, and other information. Upon receipt of a TCP packet, HTTP parser 402 determines if the packet is making a request for data within cache 403. When the request can be satisfied from cache 403, the data is supplied directly without reference to web server 210 (i.e., a cache hit). Cache 403 implements any of a range of management functions for maintaining fresh content. For example, cache 403 may invalidate portions of the cached content after an expiration period specified with the cached data or by web sever 210. Also, cache 403 may proactively update the cache contents even before a request is received for particularly important or frequently used data from web server 210. Cache 403 evicts information using any desired algorithm such as least recently used, least frequently used, first in/first out, or random eviction. When the requested data is not within cache 403, a request is forwarded to web server 210, and the returned data may be stored in cache 403. Several types of packets will cause parser 404 to forward a request towards web server 201. For example, a request for data that is not within cache 403 (or if optional cache 403 is not implemented) will require a reference to web server 210. Some packets will comprise data that must be supplied to web server 210 (e.g., customer credit information, form data and the like). In these instances, HTTP parser 402 couples to data blender 404. Optionally, front-end 201 implements security processes, compression processes, encryption processes and the like to condition the received data for improved transport performance and/or provide additional functionality. These processes may be implemented within any of the functional components (e.g., data blender 404) or implemented as separate functional components within front-end 201. Also, parser 402 may implement a prioritization program to identify packets that should be given higher priority service. A prioritization program requires only that parser 402 include a data structure associating particular clients 205 or particular TCP packet types or contents with a prioritization value. Based on the prioritization value, parser 402 may selectively implement such features as caching, encryption, security, compression and the like to improve performance and/or functionality. The prioritization value is provided by the owners of web site 210, for example, and may be dynamically altered, statically set, or updated from time to time to meet the needs of a particular application. Blender 404 slices and/or coalesces the data portions of the received packets into a more desirable "TMP units" that are sized for transport through the TMP mechanism 202. The data portion of TCP packets may range in size depending on client 205 and any intervening links coupling client 205 to TCP component 401. Moreover, where compression is applied the compressed data will vary in size depending on the compressibility of the data. Data blender 404 receives information from front-end manager 207 that enables selection of a preferable TMP packet size. Alternatively, a fixed TMP packet size can be set that yields desirable performance across TMP mechanism 202. Data blender 404 also marks the TMP units so that they can be re-assembled at the receiving end. Data blender 404 also serves as a buffer for storing packets from all clients 205 that are associated with front-end server 201. Blender 404 mixes data packets coming into front-end server 201 into a cohesive stream of TMP packets sent to back-end server 203 over TMP link 202. In creating a TMP packet, blender 404 is able to pick and choose amongst the available packets so as to prioritize some packets over others. In an exemplary implementation, a "TMP connection" comprises a plurality of "TCP connection buffers", logically arranged in multiple "rings". Each TCP socket maintained between the front-end server 201 and a client 205 corresponds to a TCP connection buffer. When a TCP connection buffer is created it is assigned a priority. For purposes of the present invention, any algorithm or criteria may be used to assign a priority. Each priority ring is associated with a number of TCP connection buffer having similar priority. In a specific example, five priority levels are defined corresponding to five priority rings. Each priority ring is characterized by the number of connection buffers it holds (nsockets), the number of connection buffers it holds that have data waiting to be sent (nReady) and the total number of bytes of data in all the connection buffers that it holds (nBytes). When composing TMP data packets, the blender goes into a loop comprising the steps: 1) Determine the number of bytes available to be sent from each ring (nBytes), and the number of TCP connections that are ready to send (nReady) 2) Determine how many bytes should be sent from each ring. This is based on a weight parameter for each priority. The weight can be thought of as the number of bytes that should be sent at each priority this time through the loop. 3) The nsend value computed in the previous step reflects the weighted proportion that each ring will have in a blended TMP packet, but the values of nSend do not reflect how many bytes need to be selected to actually empty most or all of the data waiting to be sent a single round. To do this, the nSend value is normalized to the ring having the most data waiting (e.g., nbytes=nSendNorm). This involves a calculation of a factor: S=nBytes/(Weight*nReady) for the ring with the greatest nReady. Then, for each ring, calculate nReady*S*Weight to get the normalized value (nSendNorm) for each priority ring. 4) Send sub-packets from the different rings. This is done by taking a sub-packet from the highest priority ring and adding it to a TMP packet, then adding a sub-packet from each of the top two queues, then the top three, and so on. 5) Within each ring, sub-packets are added round robin. When a sub-packet is added from a TCP connection buffer the ring is rotated so the next sub-packet the ring adds will come from a different TCP connection buffer. Each sub-packet can be up to 512 bytes in a particular example. If the connection buffer has less than 512 bytes waiting, the data available is added to the TMP packet. 6) When a full TMP packet (roughly 1.5 kB in a particular example) is built, it is sent. This can have three or more sub packets, depending on their size. The TMP packet will also be sent when there is no more data ready. TMP mechanism 405 implements the TMP protocol in accordance with the present invention. TMP is a TCP-like protocol adapted to improve performance for multiple channels operating over a single connection. Front-end TMP mechanism 405 and corresponding back-end TMP mechanism 505 shown in FIG. 5 are computer processes that implement the end points of TMP link 202. The TMP mechanism in accordance with the present invention creates and maintains a stable connection between two processes for high-speed, reliable, adaptable communication. TMP is not merely a substitute for the standard TCP environment. TMP is designed to perform particularly well in environments such as the Internet. TMP connections are made less often than TCP connections. Once a TMP connection is made, it remains up unless there is some kind of direct intervention by an administrator or there is some form of connection breaking network error. This reduces overhead associated with setting up, maintaining and tearing down connections normally associated with TCP. Another feature of TMP is its ability to channel numerous TCP connections through a single TMP pipe 202. The environment in which TMP resides allows multiple TCP connections to occur at one end of the system. These TCP connections are then mapped into a single TMP connection. The TMP connection is then broken down at the other end of the TMP pipe 202 in order to traffic the TCP connections to their appropriate destinations. TMP includes mechanisms to ensure that each TMP connection gets enough of the available bandwidth to accommodate the multiple TCP connections that it is carrying. Another advantage of TMP as compared to traditional protocols is the amount of information about the quality of the connection that a TMP connection conveys from one end to the other of a TMP pipe 202. As often happens in a network environment, each end has a great deal of information about the characteristics of the connection in one direction, but not the other. By knowing about the connection as a whole, TMP can better take advantage of the available bandwidth. In contrast with conventional TCP mechanisms, the behavior implemented by TMP mechanism 405 is constantly changing. Because TMP obtains bandwidth to host a variable number of TCP connections and because TMP is responsive information about the variable status of the network, the behavior of TMP is preferably continuously variable. One of the primary functions of TMP is being able to act as a conduit for multiple TCP connections. As such, a single TMP connection cannot behave in the same manner as a single TCP connection. For example, imagine that a TMP connection is carrying 100 TCP connections. At this time, it loses one packet (from any one of the TCP connections) and quickly cuts its window size in half (as specified for TCP). This is a performance reduction on 100 connections instead of just on the one that lost the packet. Each TCP connection that is passed through the TMP connection must get a fair share of the bandwidth, and should not be easily squeezed out. To allow this to happen, every TMP becomes more aggressive in claiming bandwidth as it accelerates. Like TCP, the bandwidth available to a particular TMP connection is measured by its window size (i.e., the number of outstanding TCP packets that have not yet been acknowledged). Bandwidth is increased by increasing the window size, and relinquished by reducing the window size. Up to protocol specified limits, each time a packet is successfully delivered and acknowledged, the window size is increased until the window size reaches a protocol specified maximum. When a packet is dropped (e.g., no acknowledge received or a resend packet response is received), the bandwidth is decreased by backing off the window size. TMP also ensures that it becomes more and more resistant to backing off (as compared to TCP) with each new TCP connection that it hosts. A TMP should not go down to a window size of less than the number of TCP connections that it is hosting. In a particular implementation, every time a TCP connection is added to (or removed from) what is being passed through the TMP connection, the TMP connection behavior is altered. It is this adaptation that ensures successful connections using TMP. Through the use of the adaptive algorithms discussed above, TMP is able to adapt the amount of bandwidth that it uses. When a new TCP connection is added to the TMP connection, the TMP connection becomes more aggressive. When a TCP connection is removed from the TMP connection, the TMP connection becomes less aggressive. TMP pipe 202 provides improved performance in its environment as compared to conventional TCP channels, but it is recognized that TMP pipe 202 resides on the open, shared Internet in the preferred implementations. Hence, TMP must live together with many protocols and share the pipe efficiently in order to allow the other transport mechanisms fair access to the shared communication bandwidth. Since TMP takes only the amount of bandwidth that is appropriate for the number of TCP connections that it is hosting (and since it monitors the connection and controls the number of packets that it puts on the line), TMP will exist cooperatively with TCP traffic. Furthermore, since TMP does a better job at connection monitoring than TCP and TMP is better suited to throughput and bandwidth management than TCP. Also shown in FIG. 4 are data filter component 406 and HTTP reassemble component 407 that process incoming (with respect to client 205) data. TMP mechanism 405 receives TMP packets from TMP pipe 202 and extracts the TMP data units. Using the appended sequencing information, the extracted data units are reassembled into HTTP data packet information by HTTP reassembler 407. Data filter component 406 may also implement data decompression where appropriate, decryption, and handle caching when the returning data is of a cacheable type. FIG. 5 illustrates principle functional components of an exemplary back-end 203 in greater detail. Primary functions of the back-end 203 include translating transmission control protocol (TCP) packets from web server 210 into TMP packets as well as translating TMP packets received from a front-end 201 into the one or more corresponding TCP packets to be send to server 210. TMP unit 505 receives TMP packets from TMP pipe 202 and passes them to HTTP reassemble unit 507 where they are reassembled into the corresponding TCP packets. Data filter 506 may implement other functionality such as decompression, decryption, and the like to meet the needs of a particular application. The reassembled data is forwarded to TCP component 501 for communication with web server 210. TCP data generated by the web server process are transmitted to TCP component 501 and forwarded to HTTP parse mechanism 502. Parser 502 operates in a manner analogous to parser 402 shown in FIG. 4 to extract the data portion from the received TCP packets, perform optional compression, encryption and the like, and forward those packets to data blender 504. Data blender 504 operates in a manner akin to data blender 404 shown in FIG. 3 to buffer and prioritize packets in a manner that is efficient for TMP transfer. Priority information is received by, for example, back-end manager 209 based upon criteria established by the web site owner. TMP data is streamed into TMP unit 505 for communication on TMP pipe 202. FIG. 7 illustrates a conceptual diagram showing entity relationships maintained by the system in accordance with the present invention. In FIG. 7, client 205 is an HTTP client of the web server or servers implementing web site 210. Virtual network 200 is defined by the network of front-end servers 201 and back-end servers 203 in accordance with the present invention that are illustrated by stars along the periphery of virtual network 200. There may be many hundreds or thousands of these front-end servers 201 and back-end servers 203 making the reach of virtual network 200 substantially coextensive with Internet 101 itself. The present invention involves two conceptual redirection activities. First, a client request for the web site at domain "abc.com" must be redirected to virtual network 200. This first level of redirection does not select a particular front-end server 201, but instead involves the redirection to the virtual network 200 which comprises the collection of arbitrary front-end servers 201. For purposes of discussion, virtual network 200 is within a domain "redir.abc.com" although it should be understood that virtual network 200 is shared across every domain that has a web site 210 coupled to a back-end server 203. A second redirection activity involves selecting a particular channel 202 such that all subsequent communication between client 205 and web site 210 is conducted over the selected channel. One or more back-end servers 203 have a persistent relationship or assignment to web site 210. In contrast, any of the available front-end servers 201 may be dynamically connected to the back-end 203 assigned to a particular site. The first level of redirection can be accomplished in a number of ways. As described hereinbefore, the public DNS administered by InterNIC can be informed that the domain "abc.com" is to be administered by redirector 601 rather than a conventional DNS. This requires a web site owner to turn over control of all or a portion of its domain name to the administrator of redirector 601. It is contemplated that only a portion may be turned over by the owner of abc.com. (e.g., abc.def.com is turned over, but ghi.def.com is retained). The first level of redirection is then handled by the upper level public DNS system as shown in FIG. 6 where InterNic 610 references redirector 601 during the resolution process. Alternatively, web site 210 can retain control over the domain name "abc.com" and explicitly redirect as desired by responding to the initial (or any subsequent) HTTP request with a reference to "redir.abc.com". The domain "redir.abc.com" refers only to a virtual network 200 and not to a physical machine. However, "redir.abc.com" is registered with Internic 610 to be resolved by redirector 601. In this manner, the administrator of web site 210 can determine at what point a client 205 is redirected into the enhanced communication channels 202 made available by virtual network 200. Although the invention has been described and illustrated with a certain degree of particularity, it is understood that the present disclosure has been made only by way of example, and that numerous changes in the combination and arrangement of parts can be resorted to by those skilled in the art without departing from the spirit and scope of the invention, as hereinafter claimed. For example, while devices supporting HTTP data traffic are used in the examples, the HTTP devices may be replaced or augmented to support other public and proprietary protocols including FTP, NNTP, SMTP, SQL and the like. In such implementations the front-end server 201 and/or back end 203 are modified to implement the desired protocol. Moreover, front-end server 201 and back-end server 203 may support different protocols such that the front-end server 201 supports, for example, HTTP traffic with a client and the back-end server supports a DBMS protocol such as SQL. Such implementations not only provide the advantages of the present invention, but also enable a client to access a rich set of network resources with minimal client software.
|
Same subclass Same class Consider this |
||||||||||
