Plant sugar transport proteins6383776Abstract This invention relates to an isolated nucleic acid fragment encoding a sugar transport protein. The invention also relates to the construction of a chimeric gene encoding all or a portion of the sugar transport protein, in sense or antisense orientation, wherein expression of the chimeric gene results in production of altered levels of the sugar transport protein in a transformed host cell. Claims What is claimed is: Description FIELD OF THE INVENTION
TABLE 1
Sugar Transport Proteins
Enzyme Clone Plant
Sugar Transport Protein (Arabidopsis-like) p0032.crcba66r Corn
p0097.cqran41r Corn
crln.pk0143.h10 Corn
p0128.cpict38 Corn
p0106.cjlpm67r Corn
ci1lc.pk001.f21 Corn
p0072.comgi92r Corn
p0114.cimm181r Corn
p0002.cgevb73r Corn
rdslc.pk007.n17 Rice
rlr12.pk0013.d11 Rice
rls6.pk0003.d5 Rice
sgs4c.pk005.c9 Soybean
sfl1.pk0079.a4 Soybean
sdp3c.pk012.i1 Soybean
ss1.pk0022.f1 Soybean
wlk8.pk0001.a12 Wheat
wlm96.pk043.e19 Wheat
wre1n.pk0062.g6 Wheat
wre1n.pk0006.b4 Wheat
Sugar Transport Protein cc1.mn0002.h1 Corn
(Beta vulgaris-like) cepe7.pk0018.g3 Corn
rlr6.pk0005.b10 Rice
rl0n.pk102.p24 Rice
rl0n.pk107.p2 Rice
sr1.pk0061.g8 Soybean
sfl1.pk0058.h12 Soybean
sgs2c.pk004.o17 Soybean
sre.pk0032.h6 Soybean
wlk8.pk0001.a11 Wheat
wlml.pk0012.h1 Wheat
The nucleic acid fragments of the instant invention may be used to isolate cDNAs and genes encoding homologous proteins from the same or other plant species. Isolation of homologous genes using sequence-dependent protocols is well known in the art. Examples of sequence-dependent protocols include, but are not limited to, methods of nucleic acid hybridization, and methods of DNA and RNA amplification as exemplified by various uses of nucleic acid amplification technologies (e.g., polymerase chain reaction, ligase chain reaction). For example, genes encoding other Arabidopsis thaliana-like sugar transport proteins or Beta vulgaris-like sugar transport proteins, either as cDNAs or genomic DNAs, could be isolated directly by using all or a portion of the instant nucleic acid fragments as DNA hybridization probes to screen libraries from any desired plant employing methodology well known to those skilled in the art. Specific oligonucleotide probes based upon the instant nucleic acid sequences can be designed and synthesized by methods known in the art (Maniatis). Moreover, the entire sequences can be used directly to synthesize DNA probes by methods known to the skilled artisan such as random primer DNA labeling, nick translation, or end-labeling techniques, or RNA probes using available in vitro transcription systems. In addition, specific primers can be designed and used to amplify a part or all of the instant sequences. The resulting amplification products can be labeled directly during amplification reactions or labeled after amplification reactions, and used as probes to isolate fall length cDNA or genomic fragments under conditions of appropriate stringency. In addition, two short segments of the instant nucleic acid fragments may be used in polymerase chain reaction protocols to amplify longer nucleic acid fragments encoding homologous genes from DNA or RNA. The polymerase chain reaction may also be performed on a library of cloned nucleic acid fragments wherein the sequence of one primer is derived from the instant nucleic acid fragments, and the sequence of the other primer takes advantage of the presence of the polyadenylic acid tracts to the 3' end of the MnRNA precursor encoding plant genes. Alternatively, the second primer sequence may be based upon sequences derived from the cloning vector. For example, the skilled artisan can follow the RACE protocol (Frohman et al., (1988) PNAS USA 85:8998) to generate cDNAs by using PCR to amplify copies of the region between a single point in the transcript and the 3' or 5' end. Primers oriented in the 3' and 5' directions can be designed from the instant sequences. Using commercially available 3' RACE or 5' RACE systems (BRL), specific 3' or 5' cDNA fragments can be isolated (Ohara et al., (1989) PNAS USA 86:5673; Loh et al., (1989) Science 243:217). Products generated by the 3' and 5' RACE procedures can be combined to generate full-length cDNAs (Frohman, M. A. and Martin, G. R., (1989) Techniques 1:165). Availability of the instant nucleotide and deduced amino acid sequences facilitates immunological screening of cDNA expression libraries. Synthetic peptides representing portions of the instant amino acid sequences may be synthesized. These peptides can be used to immunize animals to produce polyclonal or monoclonal antibodies with specificity for peptides or proteins comprising the amino acid sequences. These antibodies can be then be used to screen cDNA expression libraries to isolate full-length cDNA clones of interest (Lemer, R. A. (1984) Adv. Immunol. 36:1; Maniatis). The nucleic acid fragments of the instant invention may be used to create transgenic plants in which the disclosed Arabidopsis thaliana-like sugar transport proteins or Beta vulgaris-like sugar transport proteins are present at higher or lower levels than normal or in cell types or developmental stages in which they are not normally found. This would have the effect of altering the level of sugar transport in those cells. Overexpression of the Arabidopsis thaliana-like sugar transport proteins or Beta vulgaris-like sugar transport proteins of the instant invention may be accomplished by first constructing a chimeric gene in which the coding region is operably linked to a promoter capable of directing expression of a gene in the desired tissues at the desired stage of 35 development. For reasons of convenience, the chimeric gene may comprise promoter sequences and translation leader sequences derived from the same genes. 3' Non-coding sequences encoding transcription termination signals may also be provided. The instant chimeric gene may also comprise one or more introns in order to facilitate gene expression. Plasmid vectors comprising the instant chimeric gene can then constructed. The choice of plasmid vector is dependent upon the method that will be used to transform host plants. The skilled artisan is well aware of the genetic elements that must be present on the plasmid vector in order to successfully transform, select and propagate host cells containing the chimeric gene. The skilled artisan will also recognize that different independent transformation events will result in different levels and patterns of expression (Jones et al., (1985) EMBO J. 4:2411-2418; De Almeida et al., (1989) Mol. Gen. Genetics 218:78-86), and thus that multiple events must be screened in order to obtain lines displaying the desired expression level and pattern. Such screening may be accomplished by Southern analysis of DNA, Northern analysis of MRNA expression, Western analysis of protein expression, or phenotypic analysis. For some applications it may be useful to direct the instant sugar transport proteins to different cellular compartments, or to facilitate its secretion from the cell. It is thus envisioned that the chimeric gene described above may be further supplemented by altering the coding sequence to encode Arabidopsis thaliana-like sugar transport proteins or Beta vulgaris-like sugar transport proteins with appropriate intracellular targeting sequences such as transit sequences (Keegstra, K. (1989) Cell 56:247-253), signal sequences or sequences encoding endoplasmic reticulum localization (Chrispeels, J. J., (1991) Ann. Rev. Plant Phys. Plant Mol. Biol. 42:21-53), or nuclear localization signals (Raikhel, N. (1992) Plant Phys. 100:1627-1632) added and/or with targeting sequences that are already present removed. While the references cited give examples of each of these, the list is not exhaustive and more targeting signals of utility may be discovered in the future. It may also be desirable to reduce or eliminate expression of genes encoding Arabidopsis thaliana-like sugar transport proteins or Beta vulgaris-like sugar transport proteins in plants for some applications. In order to accomplish this, a chimeric gene designed for co-suppression of the instant sugar transport proteins can be constructed by linking a gene or gene fragment encoding an Arabidopsis thaliana-like sugar transport protein or Beta vulgaris-like sugar transport protein to plant promoter sequences. Alternatively, a chimeric gene designed to express antisense RNA for all or part of the instant nucleic acid fragment can be constructed by linking the gene or gene fragment in reverse orientation to plant promoter sequences. Either the co-suppression or antisense chimeric genes could be introduced into plants via transformation wherein expression of the corresponding endogenous genes are reduced or eliminated. The instant Arabidopsis thaliana-like sugar transport proteins or Beta vulgaris-like sugar transport proteins (or portions thereof) may be produced in heterologous host cells, particularly in the cells of microbial hosts, and can be used to prepare antibodies to the these proteins by methods well known to those skilled in the art. The antibodies are useful for detecting Arabidopsis thaliana-like sugar transport proteins or Beta vulgaris-like sugar transport proteins in situ in cells or in vitro in cell extracts. Preferred heterologous host cells for production of the instant sugar transport proteins are microbial hosts. Microbial expression systems and expression vectors containing regulatory sequences that direct high level expression of foreign proteins are well known to those skilled in the art. Any of these could be used to construct a chimeric gene for production of the instant Arabidopsis thaliana-like sugar transport proteins or Beta vulgaris-like sugar transport proteins. This chimeric gene could then be introduced into appropriate microorganisms via transformation to provide high level expression of the encoded sugar transport protein. An example of a vector for high level expression of the instant Arabidopsis thaliana-like sugar transport proteins or Beta vulgaris-like sugar transport proteins in a bacterial host is provided (Example 7). All or a substantial portion of the nucleic acid fragments of the instant invention may also be used as probes for genetically and physically mapping the genes that they are a part of, and as markers for traits linked to those genes. Such information may be useful in plant breeding in order to develop lines with desired phenotypes. For example, the instant nucleic acid fragments may be used as restriction fragment length polymorphism (RFLP) markers. Southern blots (Maniatis) of restriction-digested plant genomic DNA may be probed with the nucleic acid fragments of the instant invention. The resulting banding patterns may then be subjected to genetic analyses using computer programs such as MapMaker (Lander et al., (1987) Genomics 1:174-181) in order to construct a genetic map. In addition, the nucleic acid fragments of the instant invention may be used to probe Southern blots containing restriction endonuclease-treated genomic DNAs of a set of individuals representing parent and progeny of a defined genetic cross. Segregation of the DNA polymorphisms is noted and used to calculate the position of the instant nucleic acid sequence in the genetic map previously obtained using this population (Botstein, D. et al., (1980) Am. J. Hum. Genet. 32:314-331). The production and use of plant gene-derived probes for use in genetic mapping is described in R. Bernatzky, R. and Tanksley, S. D. (1986) Plant Mol. Biol. Reporter 4(1):37-41. Numerous publications describe genetic mapping of specific cDNA clones using the methodology outlined above or variations thereof. For example, F2 intercross populations, backcross populations, randomly mated populations, near isogenic lines, and other sets of individuals may be used for mapping. Such methodologies are well known to those skilled in the art. Nucleic acid probes derived from the instant nucleic acid sequences may also be used for physical mapping (i.e., placement of sequences on physical maps; see Hoheisel, J. D., et al., In: Nonmammalian Genomic Analysis: A Practical Guide, Academic press 1996, pp. 319-346, and references cited therein). In another embodiment, nucleic acid probes derived from the instant nucleic acid sequences may be used in direct fluorescence in situ hybridization (FISH) mapping (Trask, B. J. (1991) Trends Genet. 7:149-154). Although current methods of FISH mapping favor use of large clones (several to several hundred KB; see Laan, M. et al. (1995) Genome Research 5:13-20), improvements in sensitivity may allow performance of FISH mapping using shorter probes. A variety of nucleic acid amplification-based methods of genetic and physical mapping may be carried out using the instant nucleic acid sequences. Examples include allele-specific amplification (Kazazian, H. H. (1989) J. Lab. Clin. Med. 114(2):95-96), polymorphism of PCR-amplified fragments (CAPS; Sheffield, V. C. et al. (1993) Genomics 16:325-332), allele-specific ligation (Landegren, U. et al. (1988) Science 241:1077-1080), nucleotide extension reactions (Sokolov, B. P. (1990) Nucleic Acid Res. 18:3671), Radiation Hybrid Mapping (Walter, M. A. et al. (1997) Nature Genetics 7:22-28) and Happy Mapping (Dear, P. H. and Cook, P. R. (1989) Nucleic Acid Res. 17:6795-6807). For these methods, the sequence of a nucleic acid fragment is used to design and produce primer pairs for use in the amplification reaction or in primer extension reactions. The design of such primers is well known to those skilled in the art. In methods employing PCR-based genetic mapping, it may be necessary to identify DNA sequence differences between the parents of the mapping cross in the region corresponding to the instant nucleic acid sequence. This, however, is generally not necessary for mapping methods. Loss of function mutant phenotypes may be identified for the instant cDNA clones either by targeted gene disruption protocols or by identifying specific mutants for these genes contained in a maize population carrying mutations in all possible genes (Ballinger and Benzer, (1989) Proc. Natl. Acad. Sci USA 86:9402; Koes et al., (1995) Proc. Natl. Acad. Sci USA 92:8149; Bensen et al., (1995) Plant Cell 7:75). The latter approach may be accomplished in two ways. First, short segments of the instant nucleic acid fragments may be used in polymerase chain reaction protocols in conjunction with a mutation tag sequence primer on DNAs prepared from a population of plants in which Mutator transposons or some other mutation-causing DNA element has been introduced (see Bensen, supra). The amplification of a specific DNA fragment with these primers indicates the insertion of the mutation tag element in or near the plant gene encoding the Arabidopsis thaliana-like sugar transport protein or Beta vulgaris-like sugar transport protein. Alternatively, the instant nucleic acid fragment may be used as a hybridization probe against PCR amplification products generated from the mutation population using the mutation tag sequence primer in conjunction with an arbitrary genomic site primer, such as that for a restriction enzyme site-anchored synthetic adaptor. With either method, a plant containing a mutation in the endogenous gene encoding an Arabidopsis thaliana-like sugar transport protein or Beta vulgaris-like sugar transport protein can be identified and obtained. This mutant plant can then be used to determine or confirm the natural function of the Arabidopsis thaliana-like sugar transport protein or Beta vulgaris-like sugar transport protein gene product. EXAMPLES The present invention is further defined in the following Examples, in which all parts and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be understood that these Examples, while indicating preferred embodiments of the invention, are given by way of illustration only. From the above discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usages and conditions. Example 1 Composition of cDNA Libraries: Isolation and Sequencing of cDNA Clones cDNA libraries representing mRNAs from various corn, rice, soybean and wheat tissues were prepared. The characteristics of the libraries are described below.
TABLE 2
cDNA Libraries from Corn, Rice, Soybean and Wheat
Library Tissue Clone
cc1 Corn (Zea mays L.) callus stage 1** cc1.mn0002.h1
Cepe7 Corn (Zea mays L.) epicotyl from 7 day old etiolated
cepe7.pk0018.g3
seedling
cil1c Corn (Zea mays L.) pooled immature leaf tissue at V4,
cil1c.pk001.f21
V6 and V8**
cr1n Corn (Zea mays L.) root from 7 day seedlings grown in
cr1n.pk0143.h10
light*
p0002 Corn (Zea mays L.) tassel: premeiotic > early uninucleate
p0002.cgevb73r
p0032 Corn (Zea mays L.) regenernerating callus, 10 and 14 days
p0032.crcba66r
after auxin removal.
p0072 Corn (Zea mays L.) 14 days after planting etiolated p0072.comgi92r
seedling: mesocotyl
p0097 Corn (Zea mays L.) V9, 7 cm whorl section after p0097.cqran4lr
application of European Corn Borer
p0106 Corn (Zea mays L.) 5 days after pollenation whole kernels*
p0106.cjlpm67r
p0114 Corn (Zea mays L.) intercalary meristem of expanding p0114.cimm181r
internodes 5-9 at V10 stage*
p0128 Corn (Zea mays L.) pooled primary and secondary p0128.cpict38
immature ear
Rdslc Rice (Oryza sativa, YM) developing seeds rdslc.pk007.n17
rlr6 Rice (Oryza sativa L.) leaf (15 days after germination)
rlr6.pk0005.b10
6 hrs after infection of Magaporthe grisea strain
4360-R-62 (AVR2-YAMO); Resistant
r10n Rice (Oryza sativa L.) 15 day leaf* r10n.pk102.p24
r10n.pk107.p2
rlr12 Rice (Oryza sativa L.) leaf, 15 days after germination,
rlr12.pk0013.d11
12 hours after infection of Magaporthe grisea strain
4360-R-62 (AVR2-YAMO); Resistant
rls6 Rice (Oryza sativa L.) leaf, 15 days after germination,
rls6.pk0003.d5
6 hrs after infection of Magaporthe grisea strain
4360-R-67 (avr2-yamo); Susceptible
sdp3c Soybean (Glycine max L.) developing pods 8-9 mm sdp3c.pk012.i1
sfl1 Soybean (Glycine max L.) immature flower sfl1.pk0079.a4
sfl1.pk0058.h12
sgs2c Soybean (Glycine max L.) seeds 14 hrs after germination
sgs2c.pk004.o17
sgs4c Soybean (Glycine max L.) seeds 2 days after germination
sgs4c.pk005.c9
srl Soybean (Glycine max L.) root library srl.pk0061.g8
sre Soybean (Glycine max L.) root elongation sre.pk0032.h6
ssl Soybean (Glycine max L.) seedling 5-10 day ssl.pk0022.f1
wlk8 Wheat (Triticum aestivum L.) seedlings 8 hr after wlk8.pk0001.a11
treatment with fungicide*** wlk8.pk0001.a12
wlm1 Wheat (Triticum aestivum L.) seedlings 1 hr after wlm1.pk0012.h1
inoculation with Erysiphe graminis f. sp tritici
wlm96 Wheat (Triticum aestivum L.) seedlings 96 hr after wlm96.pk043.e19
inoculation w/ E. graminis
wre1n Wheat (Triticum aestivum L.) root; 7 day old etiolated
wre1n.pk0006.b4
seedling* wre1n.pk0062.g6
*These libraries were normalized essentially as described in U.S. Pat. No.
5,482,845
**V4, V6 and V8 refer to stages of corn growth. The descriptions can be
found in "How a Corn Plant Develops" Special Report No. 48, Iowa State
University of Science and Technology Cooperative Extension Service Ames,
Iowa, Reprinted February 1996.
***Application of 6-iodo-2-propoxy-3-propyl-4(3H-quinazolinone; synthesis
and methods of using this compound are described in USSN 08/545,827,
incorporated herein by reference.
cDNA libraries were prepared in Uni-ZAP.TM. XR vectors according to the manufacturer's protocol (Stratagene Cloning Systems, La Jolla, Calif.). Conversion of the Uni-ZAP.TM. XR libraries into plasmid libraries was accomplished according to the protocol provided by Stratagene. Upon conversion, cDNA inserts were contained in the plasmid vector pBluescript. cDNA inserts from randomly picked bacterial colonies containing recombinant pBluescript plasmids were amplified via polymerase chain reaction using primers specific for vector sequences flanking the inserted cDNA sequences or plasmid DNA was prepared from cultured bacterial cells. Amplified insert DNAs or plasmid DNAs were sequenced in dye-primer sequencing reactions to generate partial cDNA sequences (expressed sequence tags or "ESTs"; see Adams, M. D. et al., (1991) Science 252:1651). The resulting ESTs were analyzed using a Perkin Elmer Model 377 fluorescent sequencer. Example 2 Identification of cDNA Clones ESTs encoding sugar transport proteins were identified by conducting BLAST (Basic Local Alignment Search Tool; Altschul, S. F., et al., (1993) J. Mol. Biol. 215:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences contained in the BLAST "nr" database (comprising all non-redundant GenBank CDS translations, sequences derived from the 3-dimensional structure Brookhaven Protein Data Bank, the last major release of the SWISS-PROT protein sequence database, EMBL, and DDBJ databases). The cDNA sequences obtained in Example 1 were analyzed for similarity to all publicly available DNA sequences contained in the "nr" database using the BLASTN algorithm provided by the National Center for Biotechnology Information (NCBI). The DNA sequences were translated in all reading frames and compared for similarity to all publicly available protein sequences contained in the "nr" database using the BLASTX algorithm (Gish, W. and States, D. J. (1993) Nature Genetics 3:266-272 and Altschul, Stephen F., et al. (1997) Nucleic Acids Res. 25:3389-3402) provided by the NCBI. For convenience, the P-value (probability) of observing a match of a cDNA sequence to a sequence contained in the searched databases merely by chance as calculated by BLAST are reported herein as "pLog" values, which represent the negative of the logarithm of the reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the cDNA sequence and the BLAST "hit" represent homologous proteins. Example 3 Characterization of cDNA Clones Encoding Arabidopsis thaliana-Like Sugar Transport Proteins The BLASTX search using the EST sequences from several corn, rice, soybean and wheat clones revealed similarity of the proteins encoded by the cDNAs to a sugar transport protein from Arabidopsis thaliana (NCBI Identifier No. gi 3080420). In the process of comparing the ESTs it was found that many of the clones had overlapping regions of homology. Using this homology it was possible to align the ESTs and assemble several contigs encoding unique corn, rice, soybean and wheat sugar transport proteins. The individual clones and the composition of each assembled contig are shown in Table 3. The BLAST results for each of the contigs and individual ESTs and are also shown in Table 3:
TABLE 3
BLAST Results for Clones Encoding Polypeptides Homologous
to Arabidopsis thaliana Sugar Transport Protein
Clone BLAST pLog Score
Contig composed of clones: >250.00
p0032.crcba66r
p0097.cqran41r
crln.pk0143.h10
p0128.cpict38
p0106.cjlpm67r
cillc.pk001.f21
p0072.comgi92r
p0114.cimm181r
p0002.cgcvb73r
Contig composed of clones: 27.70
rlr12.pk0013.d11
rdslc.pk007.n17
rls6.pk0003.d5 54.00
Contig composed of clones: >250.00
sgs4c.pk005.c9
sfl1.pk0079.a4
sdp3c.pk012.i1
ssl.pk0022.fl >250.00
wlk8.pk0001.a12 21.30
Contig composed of clones: 149.00
wlm96.pk043.el9
wreln.pk0062.g6
wreln.pk0006.b4 117.00
The sequence of the corn contig composed of clones p0032.crcba66r, p0097.cqran41r, cr1n.pk0143.h10, p0128.cpict38, p0106.cjlpm67r, cil1c.pk001.f21, p0072.comgi92p0114 .cimm181r and p0002.cgevb73r is shown in SEQ ID NO:1; the deduced amino acid sequence of this contig, which represents 100% of the protein, is shown in SEQ ID NO:2. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:2 and the Arabidopsis thaliana sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:2 is 66% similar to the Arabidopsis thaliana sugar transport protein. The sequence of the rice contig composed of clones rlr12.pk0013.d11 and rds1c.pk007.n17 is shown in SEQ ID NO:3; the deduced amino acid sequence of this contig, which represents 9% of the protein (N-terminal region), is shown in SEQ ID NO:4. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:4 and the Arabidopsis thaliana sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:2 is 86% similar to the Arabidopsis thaliana sugar transport protein. The sequence of the entire cDNA insert from clone rls6.pk0003.d5 is shown in SEQ ID NO:5; the deduced amino acid sequence of this cDNA, which represents 18% of the of the protein (C-terminal region), is shown in SEQ ID NO:6. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:6 and the Arabidopsis thaliana sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:6 is 74% similar to the Arabidopsis thaliana sugar transport protein. The sequence of the soybean contig composed of clones sgs4c.pk005.c9, sfl1.pk0079.a4 and sdp3c.pk012.i1 is shown in SEQ ID NO:7; the deduced amino acid sequence of this contig, which represents 100% of the protein, is shown in SEQ ID NO:8. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:8 and the Arabidopsis thaliana sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:8 is 68% similar to the Arabidopsis thaliana sugar transport protein. The sequence of a portion of the cDNA insert from clone ss1.pk0022.f1 is shown in SEQ ID NO:9; the deduced amino acid sequence of this cDNA, which represents 66% of the of the protein (C-terminal region), is shown in SEQ ID NO:10. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:10 and the Arabidopsis thaliana sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:10 is 66% similar to the Arabidopsis thaliana sugar transport protein. The sequence of a portion of the cDNA insert from clone wlk8.pk0001.a12 is shown in SEQ ID NO:11; the deduced amino acid sequence of this cDNA, which represents 7% of the of the protein (N-terminal region), is shown in SEQ ID NO:12. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:12 and the Arabidopsis thaliana sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:12 is 88% similar to the Arabidopsis thaliana sugar transport protein. The sequence of the wheat contig composed of clones wlm96.pk043.e19 and wre1n.pk0062.g6 is shown in SEQ ID NO:13; the deduced amino acid sequence of this contig, which represents 45% of the protein (C-terminal region), is shown in SEQ ID NO:14. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:14 and the Arabidopsis thaliana sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:14 is 65% similar to the Arabidopsis thaliana sugar transport protein. The sequence of a portion of the cDNA insert from clone wre1n.pk0006.b4 is shown in SEQ ID NO:15; the deduced amino acid sequence of this cDNA, which represents 31% of the of the protein (C-terminal region), is shown in SEQ ID NO:16. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:16 and the Arabidopsis thaliana sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:16 is 76% similar to the Arabidopsis thaliana sugar transport protein. FIG. 1 presents an alignment of the amino acid sequence set forth in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 and 16 with the Arabidopsis thaliana-like sugar transport protein amino acid sequence, SEQ ID NO:29. Alignments were performed using the Clustal algorithm. The percent similarity between the corn, rice, soybean and wheat acid sequences was calculated to range between 16% to 89% using the Clustal algorithm. BLAST scores and probabilities indicate that the instant nucleic acid fragments encode portions of sugar transport proteins. These sequences represent the first corn, rice, soybean and wheat sequences encoding Arabidopsis thaliana-like sugar transport proteins. Example 4 Characterization of cDNA Clones Encoding Beta vulgaris-Like Sugar Transport Proteins The BLASTX search using the EST sequences from several corn, rice, soybean and wheat clones revealed similarity of the proteins encoded by the cDNAs to a sugar transport protein from Beta vulgaris (NCBI Identifier No. gi 1778093). In the process of comparing the ESTs it was found that several of the rice and soybean clones had overlapping regions of homology. Using this homology it was possible to align the ESTs and assemble two contigs encoding unique rice and soybean B. vulgaris-like sugar transport proteins. The individual clones and the assembled composition of each contig are shown in Table 4. The BLAST results for each of the contigs and individual ESTs and are also shown in Table 4:
TABLE 4
BLAST Results for Clones Encoding Polypeptides Homologous
to Beta vulgaris Sugar Transport Protein
Clone BLAST pLog Score
ccl.mn0002.h1 53.70
cepe7.pk0018.g3 164.00
Contig composed of clones: >250.00
rlr6.pk0005.b10
rl0n.pk102.p24
rl0n.pk107.p2
Contig composed of clones: >250.00
srl.pk0061.g8
sfl1.pk0058.h12
sgs2c.pk004.o17
sre.pk0032.h6
wlk8.pk0001.a11 >250.00
wlml.pk0012.h1 >250.00
The sequence of a portion of the cDNA insert from clone cc1.mn0002.h1 is shown in SEQ ID NO:17; the deduced amino acid sequence of this cDNA, which represents 31% of the of the protein (N-terminal region), is shown in SEQ ID NO:18. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:18 and the Beta vulgaris sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:18 is 65% similar to the Beta vulgaris sugar transport protein. The sequence of the entire cDNA insert from clone cepe7.pk0018.g3 is shown in SEQ ID NO:19; the deduced amino acid sequence of this cDNA, which represents 100% of the of the protein, is shown in SEQ ID NO:20. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:20 and the Beta vulgaris sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:20 is 57% similar to the Beta vulgaris sugar transport protein. The sequence of the rice contig composed of clones rlr6.pk0005.b10, r10n.pk102.p24 and r10n.pk107.p2 is shown in SEQ ID NO:21; the deduced amino acid sequence of this contig, which represents 100% of the protein, is shown in SEQ ID NO:22. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:22 and the Beta vulgaris sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:22 is 61% similar to the Beta vulgaris sugar transport protein. The sequence of the soybean contig composed of clones sr1.pk0061.g8, sfl1.pk0058.h12, sgs2c.pk004.o17 and sre.pk0032.h6 is shown in SEQ ID NO:23; the deduced amino acid sequence of this contig, which represents 100% of the protein, is shown in SEQ ID NO :24. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:24 and the Beta vulgaris sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:24 is 66% similar to the Beta vulgaris sugar transport protein. The sequence of the entire cDNA insert from clone wlk8.pk0001.a11 is shown in SEQ ID NO:25; the deduced amino acid sequence of this cDNA, which represents 100% of the of the protein, is shown in SEQ ID NO:26. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:26 and the Beta vulgaris sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:26 is 61% similar to the Beta vulgaris sugar transport protein. The sequence of the entire cDNA insert from clone wlm1.pk0012.h1 is shown in SEQ ID NO:27; the deduced amino acid sequence of this cDNA, which represents 100% of the of the protein, is shown in SEQ ID NO:28. A calculation of the percent similarity of the amino acid sequence set forth in SEQ ID NO:28 and the Beta vulgaris sequence (using the Clustal algorithm) revealed that the protein encoded by SEQ ID NO:28 is 56% similar to the Beta vulgaris sugar transport protein. FIG. 2 presents an alignment of the amino acid sequence set forth in SEQ ID NOS:18, 20, 22, 24, 26 and 28 with the Beta vulgaris-like sugar transport protein amino acid sequence, SEQ ID NO:30. Alignments were performed using the Clustal algorithm. The percent similarity between the corn, rice, soybean and wheat acid sequences was calculated to range between 43% to 81% using the Clustal algorithm. BLAST scores and probabilities indicate that the instant nucleic acid fragments encode portions of sugar transport proteins. These sequences represent the first corn, rice, soybean and wheat sequences encoding Beta vulgaris-like sugar transport proteins. Example 5 Expression of Chimeric Genes in Monocot Cells A chimeric gene comprising a cDNA encoding sugar transport protein in sense orientation with respect to the maize 27 kD zein promoter that is located 5' to the cDNA fragment, and the 10 kD zein 3' end that is located 3' to the cDNA fragment, can be constructed. The cDNA fragment of this gene may be generated by polymerase chain reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites (NcoI or SmaI) can be incorporated into the oligonucleotides to provide proper orientation of the DNA fragment when inserted into the digested vector pML 103 as described below. Amplification is then performed in a standard PCR. The amplified DNA is then digested with restriction enzymes NcoI and SmaI and fractionated on an agarose gel. The appropriate band can be isolated from the gel and combined with a 4.9 kb NcoI-SmaI fragment of the plasmid pML103. Plasmid pML103 has been deposited under the terms of the Budapest Treaty at ATCC (American Type Culture Collection, 10801 University Blvd., Manassas, Va. 20110-2209), and bears accession number ATCC 97366. The DNA segment from pML103 contains a 1.05 kb SalI-NcoI promoter fragment of the maize 27 kD zein gene and a 0.96 kb SmaI-SalI fragment from the 3' end of the maize 10 kD zein gene in the vector pGem9Zf(+) (Promega). Vector and insert DNA can be ligated at 15.degree. C. overnight, essentially as described (Maniatis). The ligated DNA may then be used to transform E. coli XL1-Blue (Epicurian Coli XL-1 Blue.TM.M; Stratagene). Bacterial transformants can be screened by restriction enzyme digestion of plasmid DNA and limited nucleotide sequence analysis using the dideoxy chain termination method (Sequenase.TM. DNA Sequencing Kit; U.S. Biochemical). The resulting plasmid construct would comprise a chimeric gene encoding, in the 5' to 3' direction, the maize 27 kD zein promoter, a cDNA fragment encoding a sugar transport protein, and the 10 kD zein 3' region. The chimeric gene described above can then be introduced into corn cells by the following procedure. Immature corn embryos can be dissected from developing caryopses derived from crosses of the inbred corn lines H99 and LH132. The embryos are isolated 10 to 11 days after pollination when they are 1.0 to 1.5 mm long. The embryos are then placed with the axis-side facing down and in contact with agarose-solidified N6 medium (Chu et al., (1975) Sci. Sin. Peking 18:659-668). The embryos are kept in the dark at 27.degree. C. Friable embryogenic callus consisting of undifferentiated masses of cells with somatic proembryoids and embryoids borne on suspensor structures proliferates from the scutellum of these immature embryos. The embryogenic callus isolated from the primary explant can be cultured on N6 medium and sub-cultured on this medium every 2 to 3 weeks. The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag, Frankfurt, Germany) may be used in transformation experiments in order to provide for a selectable marker. This plasmid contains the Pat gene (see European Patent Publication 0 242 236) which encodes phosphinothricin acetyl transferase (PAT). The enzyme PAT confers resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat gene in p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus (Odell et al. (1985) Nature 313:810-812) and the 3' region of the nopaline synthase gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The particle bombardment method (Klein et al., (1987) Nature 327:70-73) may be used to transfer genes to the callus culture cells. According to this method, gold particles (1 .mu.m in diameter) are coated with DNA using the following technique. Ten .mu.g of plasmid DNAs are added to 50 .mu.L of a suspension of gold particles (60 mg per mL). Calcium chloride (50 .mu.L of a 2.5M solution) and spermidine free base (20 .mu.L of a 1.0M solution) are added to the particles. The suspension is vortexed during the addition of these solutions. After 10 minutes, the tubes are briefly centrifuged (5 sec at 15,000 rpm) and the supernatant removed. The particles are resuspended in 200 .mu.L of absolute ethanol, centrifuged again and the supernatant removed. The ethanol rinse is performed again and the particles resuspended in a final volume of 30 .mu.L of ethanol. An aliquot (5 .mu.L) of the DNA-coated gold particles can be placed in the center of a Kaptonh.TM. flying disc (Bio-Rad Labs). The particles are then accelerated into the corn tissue with a Biolistic.TM. PDS-1000/He (Bio-Rad Instruments, Hercules Calif.), using a helium pressure of 1000 psi, a gap distance of 0.5 cm and a flying distance of 1.0 cm. For bombardment, the embryogenic tissue is placed on filter paper over agarose-solidified N6 medium. The tissue is arranged as a thin lawn and covered a circular area of about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber of the PDS-1000/He approximately 8 cm from the stopping screen. The air in the chamber is then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a helium shock wave using a rupture membrane that bursts when the He pressure in the shock tube reaches 1000 psi. Seven days after bombardment the tissue can be transferred to N6 medium that contains gluphosinate (2 mg per liter) and lacks casein or proline. The tissue continues to grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to fresh N6 medium containing gluphosinate. After 6 weeks, areas of about 1 cm in diameter of actively growing callus can be identified on some of the plates containing the glufosinate-supplemented medium. These calli may continue to grow when sub-cultured on the selective medium. Plants can be regenerated from the transgenic callus by first transferring clusters of issue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the issue can be transferred to regeneration medium (Fromm et al., (1990) Bio/Technology 8:833-839). Example 6 Expression of Chimeric Genes in Dicot Cells A seed-specific expression cassette composed of the promoter and transcription terminator from the gene encoding the .beta. subunit of the seed storage protein phaseolin from the bean Phaseolus vulgaris (Doyle et al. (1986) J. Biol. Chem. 261:9228-9238) can be used for expression of the instant sugar transport proteins in transformed soybean. The phaseolin cassette includes about 500 nucleotides upstream (5' ) from the translation initiation codon and about 1650 nucleotides downstream (3' ) from the translation stop codon of phaseolin. Between the 5' and 3' regions are the unique restriction endonuclease sites Nco I (which includes the ATG translation initiation codon), Sma I, Kpn I and Xba I. The entire cassette is flanked by Hind III sites. The cDNA fragment of this gene may be generated by polymerase chain reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites can be incorporated into the oligonucleotides to provide proper orientation of the DNA fragment when inserted into the expression vector. Amplification is then performed as described above, and the isolated fragment is inserted into a pUC18 vector carrying the seed expression cassette. Soybean embroys may then be transformed with the expression vector comprising a sequence encoding a sugar transport protein. To induce somatic embryos, cotyledons, 3-5 mm in length dissected from surface sterilized, immature seeds of the soybean cultivar A2872, can be cultured in the light or dark at 26.degree. C. on an appropriate agar medium for 6-10 weeks. Somatic embryos which produce secondary embryos are then excised and placed into a suitable liquid medium. After repeated selection for clusters of somatic embryos which multiplied as early, globular staged embryos, the suspensions are maintained as described below. Soybean embryogenic suspension cultures can maintained in 35 mL liquid media on a rotary shaker, 150 rpm, at 26.degree. C. with florescent lights on a 16:8 hour day/night schedule. Cultures are subcultured every two weeks by inoculating approximately 35 mg of tissue into 35 mL of liquid medium. Soybean embryogenic suspension cultures may then be transformed by the method of particle gun bombardment (Kline et al. (1987) Nature (London) 327:70, U.S. Pat. No. 4,945,050). A DuPont Biolistic.TM. PDS1000/HE instrument (helium retrofit) can be used for these transformations. A selectable marker gene which can be used to facilitate soybean transformation is a chimeric gene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell et al. (1985) Nature 313:810-812), the hygromycin phosphotransferase gene from plasmid pJR225 (from E. coli; Gritz et al. (1983) Gene 25:179-188) and the 3' region of the nopaline synthase gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The seed expression cassette comprising the phaseolin 5' region, the fragment encoding the sugar transport protein and the phaseolin 3' region can be isolated as a restriction fragment. This fragment can then be inserted into a unique restriction site of the vector carrying the marker gene. To 50 .mu.L of a 60 mg/mL 1 .mu.m gold particle suspension is added (in order): 5 .mu.L DNA (1 .mu.g/.mu.L), 20 .mu.l spermidine (0.1 M), and 50 .mu.L CaCl.sub.2 (2.5 M). The particle preparation is then agitated for three minutes, spun in a microfage for 10 seconds and the supernatant removed. The DNA-coated particles are then washed once in 400 .mu.L 70% ethanol and resuspended in 40 .mu.L of anhydrous ethanol. The DNA/particle suspension can be sonicated three times for one second each. Five .mu.L of the DNA-coated gold particles are then loaded on each macro carrier disk. Approximately 300-400 mg of a two-week-old suspension culture is placed in an empty 60.times.15 mm petri dish and the residual liquid removed from the tissue with a pipette. For each transformation experiment, approximately 5-10 plates of tissue are normally bombarded. Membrane rupture pressure is set at 1100 psi and the chamber is evacuated to a vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from the retaining screen and bombarded three times. Following bombardment, the tissue can be divided in half and placed back into liquid and cultured as described above. Five to seven days post bombardment, the liquid media may be exchanged with fresh media, and eleven to twelve days post bombardment with fresh media containing 50 mg/mL hygromycin. This selective media can be refreshed weekly. Seven to eight weeks post bombardment, green, transformed tissue may be observed growing from untransformed, necrotic embryogenic clusters. Isolated green tissue is removed and inoculated into individual flasks to generate new, clonally propagated, transformed embryogenic suspension cultures. Each new line may be treated as an independent transformation event. These suspensions can then be subcultured and maintained as clusters of immature embryos or regenerated into whole plants by maturation and germination of individual somatic embryos. Example 7 Expression of Chimeric Genes in Microbial Cells The cDNAs encoding the instant sugar transport proteins can be inserted into the T7 E. coli expression vector pBT430. This vector is a derivative of pET-3a (Rosenberg et al. (1987) Gene 56:125-135) which employs the bacteriophage T7 RNA polymerase/T7 promoter system. Plasmid pBT430 was constructed by first destroying the EcoR I and Hind III sites in pET-3 a at their original positions. An oligonucleotide adaptor containing EcoR I and Hind III sites was inserted at the BamH I site of pET-3a. This created pET-3aM with additional unique cloning sites for insertion of genes into the expression vector. Then, the Nde I site at the position of translation initiation was converted to an Nco I site using ligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this region, 5'-CATATGG, was converted to 5'-CCCATGG in pBT430. Plasmid DNA containing a cDNA may be appropriately digested to release a nucleic acid fragment encoding the protein. This fragment may then be purified on a 1% NuSieve GTGTM low melting agarose gel (FMC). Buffer and agarose contain 10 .mu.g/ml ethidium bromide for visualization of the DNA fragment. The fragment can then be purified from the agarose gel by digestion with GELase.TM. (Epicentre Technologies) according to the manufacturer's instructions, ethanol precipitated, dried and resuspended in 20 .mu.L of water. Appropriate oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase (New England Biolabs, Beverly, Mass.). The fragment containing the ligated adapters can be purified from the excess adapters using low melting agarose as described above. The vector pBT430 is digested, dephosphorylated with alkaline phosphatase (NEB) and deproteinized with phenol/chloroform as described above. The prepared vector pBT430 and fragment can then be ligated at 16.degree. C. for 15 hours followed by transformation into DH5 electrocompetent cells (GIBCO BRL). Transformants can be selected on agar plates containing LB media and 100 .mu.g/mL ampicillin. Transformants containing the gene encoding the sugar transport protein are then screened for the correct orientation with respect to the T7 promoter by restriction enzyme analysis. For high level expression, a plasmid clone with the cDNA insert in the correct orientation relative to the T7 promoter can be transformed into E. coli strain BL21(DE3) (Studier et al. (1986) J. Mol. Biol. 189:113-130). Cultures are grown in LB medium containing ampicillin (100 mg/L) at 25.degree. C. At an optical density at 600 nm of approximately 1, IPTG (isopropylthio-.beta.-galactoside, the inducer) can be added to a final concentration of 0.4 mM and incubation can be continued for 3 h at 25.degree.. Cells are then harvested by centrifugation and re-suspended in 50 .mu.L of 50 mM Tris-HCl at pH 8.0 containing 0.1 mM DTT and 0.2 mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glass beads can be added and the mixture sonicated 3 times for about 5 seconds each time with a microprobe sonicator. The mixture is centrifuged and the protein concentration of the supernatant determined. One .mu.g of protein from the soluble fraction of the culture can be separated by SDS-polyacrylamide gel electrophoresis. Gels can be observed for protein bands migrating at the expected molecular weight.
SEQUENCE LISTING
<100> GENERAL INFORMATION:
<160> NUMBER OF SEQ ID NOS: 30
<200> SEQUENCE CHARACTERISTICS:
<210> SEQ ID NO 1
<211> LENGTH: 2824
<212> TYPE: DNA
<213> ORGANISM: Zea mays
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (29)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (622)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (636)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (638)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (669)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (771)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (822)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (856)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (889)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (896)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (944)
<400> SEQUENCE: 1
cccacccccc tccactccac taccacggng gcacggcctg cctctgcagc tctgccctgc 60
tccgcacccc tcgctctcca accccaacgc gcggcgttgc taaaattcac ctcagcgcgt 120
actccagttt ggccacctca ccacccgccg ccgctgttta agaaggcccc gcgcccgatc 180
ggggatcacg aaccttggcc gccgctgccg gagtgggggc gtagatttcc ggcggccatg 240
gggggcgccg tgatggtcgc catcgcggcc tctatcggca acttgctgca gggctgggac 300
aatgcgacaa ttgctggagc cgtcctgtac ataaagaagg aattcaacct gcagagcgag 360
cctctgatcg aaggcctcat cgtcgccatg ttcctcattg gggcaacagt catcacaaca 420
tctccggggc caagggctga ctgcgttggt aggaggccca tgctggtcgc ctcggctgtc 480
ctctacttcg tcagtgggct ggtgatgctt tgggcgccaa ttgtgtacat cttgctcctc 540
gcaaggctca ttgatgggtt cggtatcggt ttggcggtca cacttgttcc tctctacatc 600
tccgaaactg caccgcacag anattcttgg ggctgntnga acacgttgcc gcagttcatt 660
ggggtcagng gagggatgtt cctctcctac tgcatggtgt ttgggatgtc cctcatgccc 720
aaacctgatt ggaggctcat gcttggagtt ctgtcgatcc cgtcacttat ntactttgga 780
ctgactgtct tctacttgcc tgaatcacca aggtggcttg tnagcaaagg aaggatggcg 840
gaggcgaaga gagtgntgca aaggctgcgg ggaagagaag atgtctcang ggaganggct 900
cttctagttg aaggtttggg ggtcggtaaa gatacacgta tttnagagta catcattgga 960
cctgccaccg aggcagccga tgatcttgta actgacggtg ataaggaaca aatcacactt 1020
tatgggcctg aagaaggcca gtcatggatt gctcgacctt ctaagggacc catcatgctt 1080
ggaagtgtgc tttctcttgc atctcgtcat gggagcatgg tgaaccagag tgtacccctt 1140
atggatccga ttgtgacact ttttggtagt gtccatgaga atatgcctca agctggagga 1200
agtatgagga gcacattgtt tccaaacttt ggaagtatgt tcagtgtcac agatcagcat 1260
gccaaaaatg agcagtggga tgaagagaat cttcataggg atgacgagga gtacgcatct 1320
gatggtgcag gaggtgacta tgaggacaat ctccatagcc cattgctgtc caggcaggca 1380
acaggtgcgg aagggaagga cattgtgcac catggtcacc gtggaagtgc tttgagcatg 1440
agaaggcaaa gcctcttagg ggagggtgga gatggtgtga gcagcactga tatcggtggg 1500
ggatggcagc ttgcttggaa atggtcagag aaggaaggtg agaatggtag aaaggaaggt 1560
ggtttcaaaa gagtctactt gcaccaagag ggagttcctg gctcaagaag gggctcaatt 1620
gtttcacttc ccggtggtgg cgatgttctt gagggtagtg agtttgtaca tgctgctgct 1680
ttagtaagtc agtcagcact tttctcaaag ggtcttgctg aaccacgcat gtcagatgct 1740
gccatggttc acccatctga ggtagctgcc aaaggttcac gttggaaaga tttgtttgaa 1800
cctggagtga ggcgtgccct gttagtcggt gttggaattc agatccttca acagtttgct 1860
ggaataaacg gtgttctgta ctatacccca caaattcttg agcaagctgg tgtggcagtt 1920
attctttcca aatttggtct cagctcggca tcagcatcca tcttgatcag ttctctcact 1980
accttactaa tgcttccttg cattggcttt gccatgctgc ttatggatct ttccggaaga 2040
aggtttttgc tgctaggcac aattccaatc ttgatagcat ctctagttat cctggttgtg 2100
tccaatctaa ttgatttggg tacactagcc catgctttgc tctccaccat cagtgttatc 2160
gtctacttct gctgcttcgt tatgggattt ggtcccatcc ccaacatttt atgtgcagag 2220
atctttccaa ccagggttcg tggcctctgt attgccattt gtgcctttac attctggatc 2280
ggagatatca tcgtcaccta cagccttcct gtgatgctga atgctattgg actggcgggt 2340
gttttcagca tatatgcagt cgtatgcttg atttcctttg tgttcgtctt ccttaaggtc 2400
cctgagacaa aggggatgcc ccttgaggtt attaccgaat tctttgcagt tggtgcgaag 2460
caagcggctg caaaagccta atttctttgg tacctttgtg tgcaactatt gcactgtaag 2520
ttagaaactt gaaggggttt caccaagaag ctcggagaat tactttggat ttgtgtaaat 2580
gttaagggaa cgaacatctg ctcatgctcc tcaaacggta aaaaagagtc cctcaatggc 2640
aaataggagt cgttaagttg tcaatgtcat ttaccatatg ttttacctat ttgtactgta 2700
ttataagtca agctattcaa cgctggttgt tgctagaaat ctttagaaca aagatgataa 2760
tgatctgatc tgatgttata atattcaaat ctcaaataaa gaaaatatcg tttctcaaaa 2820
aaaa 2824
<200> SEQUENCE CHARACTERISTICS:
<210> SEQ ID NO 2
<211> LENGTH: 747
<212> TYPE: PRT
<213> ORGANISM: Zea mays
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (129)
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (133)..(134)
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (144)
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (178)
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (207)
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (218)
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (220)
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (236)
<400> SEQUENCE: 2
Met Gly Gly Ala Val Met Val Ala Ile Ala Ala Ser Ile Gly Asn Leu
1 5 10 15
Leu Gln Gly Trp Asp Asn Ala Thr Ile Ala Gly Ala Val Leu Tyr Ile
20 25 30
Lys Lys Glu Phe Asn Leu Gln Ser Glu Pro Leu Ile Glu Gly Leu Ile
35 40 45
Val Ala Met Phe Leu Ile Gly Ala Thr Val Ile Thr Thr Ser Pro Gly
50 55 60
Pro Arg Ala Asp Cys Val Gly Arg Arg Pro Met Leu Val Ala Ser Ala
65 70 75 80
Val Leu Tyr Phe Val Ser Gly Leu Val Met Leu Trp Ala Pro Ile Val
85 90 95
Tyr Ile Leu Leu Leu Ala Arg Leu Ile Asp Gly Phe Gly Ile Gly Leu
100 105 110
Ala Val Thr Leu Val Pro Leu Tyr Ile Ser Glu Thr Ala Pro His Arg
115 120 125
Xaa Ser Trp Gly Xaa Xaa Asn Thr Leu Pro Gln Phe Ile Gly Val Xaa
130 135 140
Gly Gly Met Phe Leu Ser Tyr Cys Met Val Phe Gly Met Ser Leu Met
145 150 155 160
Pro Lys Pro Asp Trp Arg Leu Met Leu Gly Val Leu Ser Ile Pro Ser
165 170 175
Leu Xaa Tyr Phe Gly Leu Thr Val Phe Tyr Leu Pro Glu Ser Pro Arg
180 185 190
Trp Leu Val Ser Lys Gly Arg Met Ala Glu Ala Lys Arg Val Xaa Gln
195 200 205
Arg Leu Arg Gly Arg Glu Asp Val Ser Xaa Glu Xaa Ala Leu Leu Val
210 215 220
Glu Gly Leu Gly Val Gly Lys Asp Thr Arg Ile Xaa Glu Tyr Ile Ile
225 230 235 240
Gly Pro Ala Thr Glu Ala Ala Asp Asp Leu Val Thr Asp Gly Asp Lys
245 250 255
Glu Gln Ile Thr Leu Tyr Gly Pro Glu Glu Gly Gln Ser Trp Ile Ala
260 265 270
Arg Pro Ser Lys Gly Pro Ile Met Leu Gly Ser Val Leu Ser Leu Ala
275 280 285
Ser Arg His Gly Ser Met Val Asn Gln Ser Val Pro Leu Met Asp Pro
290 295 300
Ile Val Thr Leu Phe Gly Ser Val His Glu Asn Met Pro Gln Ala Gly
305 310 315 320
Gly Ser Met Arg Ser Thr Leu Phe Pro Asn Phe Gly Ser Met Phe Ser
325 330 335
Val Thr Asp Gln His Ala Lys Asn Glu Gln Trp Asp Glu Glu Asn Leu
340 345 350
His Arg Asp Asp Glu Glu Tyr Ala Ser Asp Gly Ala Gly Gly Asp Tyr
355 360 365
Glu Asp Asn Leu His Ser Pro Leu Leu Ser Arg Gln Ala Thr Gly Ala
370 375 380
Glu Gly Lys Asp Ile Val His His Gly His Arg Gly Ser Ala Leu Ser
385 390 395 400
Met Arg Arg Gln Ser Leu Leu Gly Glu Gly Gly Asp Gly Val Ser Ser
405 410 415
Thr Asp Ile Gly Gly Gly Trp Gln Leu Ala Trp Lys Trp Ser Glu Lys
420 425 430
Glu Gly Glu Asn Gly Arg Lys Glu Gly Gly Phe Lys Arg Val Tyr Leu
435 440 445
His Gln Glu Gly Val Pro Gly Ser Arg Arg Gly Ser Ile Val Ser Leu
450 455 460
Pro Gly Gly Gly Asp Val Leu Glu Gly Ser Glu Phe Val His Ala Ala
465 470 475 480
Ala Leu Val Ser Gln Ser Ala Leu Phe Ser Lys Gly Leu Ala Glu Pro
485 490 495
Arg Met Ser Asp Ala Ala Met Val His Pro Ser Glu Val Ala Ala Lys
500 505 510
Gly Ser Arg Trp Lys Asp Leu Phe Glu Pro Gly Val Arg Arg Ala Leu
515 520 525
Leu Val Gly Val Gly Ile Gln Ile Leu Gln Gln Phe Ala Gly Ile Asn
530 535 540
Gly Val Leu Tyr Tyr Thr Pro Gln Ile Leu Glu Gln Ala Gly Val Ala
545 550 555 560
Val Ile Leu Ser Lys Phe Gly Leu Ser Ser Ala Ser Ala Ser Ile Leu
565 570 575
Ile Ser Ser Leu Thr Thr Leu Leu Met Leu Pro Cys Ile Gly Phe Ala
580 585 590
Met Leu Leu Met Asp Leu Ser Gly Arg Arg Phe Leu Leu Leu Gly Thr
595 600 605
Ile Pro Ile Leu Ile Ala Ser Leu Val Ile Leu Val Val Ser Asn Leu
610 615 620
Ile Asp Leu Gly Thr Leu Ala His Ala Leu Leu Ser Thr Ile Ser Val
625 630 635 640
Ile Val Tyr Phe Cys Cys Phe Val Met Gly Phe Gly Pro Ile Pro Asn
645 650 655
Ile Leu Cys Ala Glu Ile Phe Pro Thr Arg Val Arg Gly Leu Cys Ile
660 665 670
Ala Ile Cys Ala Phe Thr Phe Trp Ile Gly Asp Ile Ile Val Thr Tyr
675 680 685
Ser Leu Pro Val Met Leu Asn Ala Ile Gly Leu Ala Gly Val Phe Ser
690 695 700
Ile Tyr Ala Val Val Cys Leu Ile Ser Phe Val Phe Val Phe Leu Lys
705 710 715 720
Val Pro Glu Thr Lys Gly Met Pro Leu Glu Val Ile Thr Glu Phe Phe
725 730 735
Ala Val Gly Ala Lys Gln Ala Ala Ala Lys Ala
740 745
<200> SEQUENCE CHARACTERISTICS:
<210> SEQ ID NO 3
<211> LENGTH: 443
<212> TYPE: DNA
<213> ORGANISM: Oryza sativa
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (193)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (388)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (435)
<220> FEATURE:
<221> NAME/KEY: unsure
<222> LOCATION: (439)
<400> SEQUENCE: 3
gaagagctca cccccccccc ctcggccctg gactccctcc tccaaatctc ccctaaaagc 60
ttcccaattt ggcgagaatt ccccatatat ttgccccatc tcggcgtccc aacgagccct 120
tccagattcc cagccgcctc tcttcttgtt aggggatccg aaatctcggt ggacgagaga 180
cttggtggta atnattcgcc ggccatggcg ggcgccgtgc tggtcgccat cgcggcctcc 240
atcggcaact tgctgcaggg ctgggataat gcaaccattg caggtgcggt actgtacatc 300
aagaaggaat tcaacttgca tagcgacccc cttatcgaag gtctgatcgt ggccatgtcg 360
ctcattgggg ccaccatcat cacgacgntc tctgcgagca ggtggctgac tcttttggta 420
tggcggccca tgctnatcnc ttc 443
<200> SEQUENCE CHARACTERISTICS:
<210> SEQ ID NO 4
<211> LENGTH: 131
<212> TYPE: PRT
<213> ORGANISM: Oryza sativa
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (65)
<220> FEATURE:
<221> NAME/KEY: UNSURE
<222> LOCATION: (130)
<400> SEQUENCE: 4
Glu Glu Leu Thr Pro Pro Pro Ser Ala Leu Asp Ser Leu Leu Gln Ile
1 5 10 15
| ||||||
