![]() Nucleic acids that enhance the synthesis of 2-acetyl-1-pyrroline in plants and fungi
专利摘要:
公开号:AU2005203356A1 申请号:U2005203356 申请日:2005-07-29 公开日:2006-08-10 发明作者:Wintai Kamolsukyunyong;Theerayut Toojinda;Somvong Tragoonrung;Apichart Vanavichit;Samart Wanchana 申请人:Kasetsart University;National Science and Technology Development Agency; IPC主号:A01H5-00
专利说明: AUSTRALIA PATENTS ACT 1990 COMPLETE SPECIFICATION NAME OF APPLICANT(S):: National Science and Technology Development Agency AND Kasetsart University ADDRESS FOR SERVICE: DAVIES COLLISON CAVE Patent Attorneys 1 Nicholson Street, Melbourne, 3000, Australia INVENTION TITLE: Nucleic acids that enhance the synthesis of 2-acetyl-1-pyrroline in plants and fungi The following statement is a full description of this invention, including the best method of performing it known to me/us:- 5102 C* FIELD OF THE INVENTION [0001] The present invention relates generally to plant molecular genetics. In particular, it relates to non-naturally occurring plants and fungi that have elevated levels of 2-acetyl-1-pyrroline, C methods for making such plants and fungi, and nucleic acids involved in the synthesis of 2- C' acetyl- 1 -pyrroline. BACKGROUND OF THE INVENTION [0002] Grain aroma is the most attractive characteristic of high quality rice increasingly demanded not only by the Asian market but also widely recognized in Europe and all over the world. Cooked rice fragrance is composed of more than one hundred volatile compounds such as hydrocarbons, alcohols, aldehydes, ketones, acids, esters, phenols, pyridines, pyrazines, and other compounds (Yajima et al., 1978; Maga, 1984; Takashi et al., 1980; Paule and Power, 1989). The "popcorn-like" aromatic compound, 2-acetyl-l-pyrroline (2AP), was discovered as the major potent flavor component of all aromatic rice, crust of bread wheat and rye bread (Buttery et al., 1982, 1983). 2-acetyl-1-pyrroline is chiefly responsible for the characteristic fragrance of many aromatic rice varieties (Tanchotikul and Hsieh, 1991). Surprisingly, this rice fragrance has also been isolated and identified from pandan leaves (Buttery et al., 1983), bread flowers (Vallaris Glabra Ktze.) (Wongpornchai et al., 2003), wet millet (Seitz et al., 1993), popcorn (Schieberle, 1991), Bacillus ceres (Romanczyk et al., 1995) and fungi (Nagsuk et al., 2004). 2-acetyl-l-pyrroline is present in all parts of the aromatic rice plant (stems, leaves, grains) except roots (Lorieux et al., 1996). While this fragrance is present in aromatic grains, it is not present in all grains. [0003] The aromatic compound 2-acetyl-l-pyrroline has a pyrroline ring similar to the amino acid proline (FIGURE The first evidence linking the amino acid proline as the precursor synthesizing 2-acetyl-1-pyrroline was found in experiments in cell and callus culture (Suprasanna et al., 1998; Suprasanna et al., 2002). That conclusion was supported by experiments using isotopic labeling showing that the precursor of the grain 2-acetyl-1-pyrroline S is most likely the amino acid proline in Thai Hom Mali (THM) Rice (Yoshihashi et al., 2002) and probably other aromatic rice. However, the exact biosynthetic pathway of 2-acetyl-1- Spyrroline is yet to be elucidated. Thus, there is a need to identify genes involved in 2-acetyl-1- C pyrroline synthesis and provide a method to increase 2-acetyl-l-pyrroline levels in plants and fungi to increase aroma. IND r SUMMARY OF THE INVENTION N [0004] The present invention meets these needs by providing non-naturally occurring plants and O fungi in which the compound 2-acetyl-l-pyrroline is produced at an elevated level compared to (N control plants and fungi. Control plants and fungi refer to plants and fungi of a similar or related genotype that have lower levels of the compound 2-acetyl-l-pyrroline. Such control plants can be non-naturally occurring. The present invention further provides methods for screening for and creating such non-naturally occurring plants and fungi, and seeds produced from said plants. [0005] The exact biosynthetic pathway of the compound 2-acetyl-1-pyrroline is not known, but it is a derivative of proline. It was hypothesized that proline can be converted either to 2-acetyl- 1-pyrroline or to glutamic acid, so inhibition of the glutamic acid synthesis pathway will increase the availability of proline (or intermediates) for 2-acetyl-1-pyrroline synthesis (FIGURE 4A). A gene encoding a protein controlling aroma in rice, named Os2AP (Oryza sativa 2-acetyl-1pyrroline) was identified as a member of the aldehyde dehydrogenase family that may play a key role in the conversion of proline to glutamic acid. All aromatic rice varieties tested have an eight nucleotide deletion in this gene. The deletion creates a premature stop codon that leads to nonsense mediated degradation against its own mRNA, leading to a loss-of-function phenotype. RNA interference (RNAi) studies showed that disruption of transcription of the Os2AP gene led to elevated levels of 2-acetyl-l-pyrroline in plants, along with increased aroma. [0006] The present invention provides non-naturally occurring plants and fungi with an elevated level of 2-acetyl-1-pyrroline created by inhibiting the expression of the 2-acetyl--pyrroline (2AP) gene, reducing the mRNA levels of the 2AP gene, and/or reducing the activity of the 2AP protein. The level of 2AP protein can be decreased by 25 percent, 50 percent, or 100 percent compared to a control plant. The inhibition of expression of the 2AP gene or reduction of mRNA levels of the 2AP gene can be accomplished by: a) expression of the 2AP gene or a S fragment thereof in the antisense orientation; b) cloning part of the gene into an RNA interference construct and expression of this construct in transgenic plants; or c) mutagenesis by Svarious methods (including Targeting Induced Local Lesions IN Genomes (TILLING) and C tDNA insertion mutagenesis) followed by screening by PCR or other methods for an aromatic variant. ic* [00071 The present invention further provides a transgenic rice plant having an elevated level of 0 the compound 2-acetyl-l-pyrroline compared to its level in a control non-transgenic rice plant t) wherein the level of the compound is increased in the transgenic plant by reducing the mRNA or 0 protein levels encoded by the 2AP gene in the transgenic plant compared to the mRNA or protein levels encloded by the 2AP gene in the control non-transgenic plant. In one format, the MRNA and protein levels are reduced by RNA interference or by antisense. The invention is further directed to transgenic rice seed produced from the rice of the invention. [0008] Although the examples to follow describe experiments performed in rice, the invention relates to other plants and fungi, including but not limited to wheat, barley, rye, coconut, sorghum, and oats. [0009] The present invention further provides an isolated nucleic acid encoding the 2AP gene, where said nucleic acid includes a nucleic acid which hybridizes to the nucleic acid sequence depicted in SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 5 or its complement under hybridization conditions that include at least one wash in 0.1 X SSC and 0.1% SDS at °C for thirty minutes, a nucleic acid which is at least 70% identical, 80% identical, identical, 95% identical, or greater than 95% identical to the sequence depicted in SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 5, and a nucleic acid that encodes a polypeptide that is at least 70%, at least 80%, at least 90%, at least 95%, or greater than identical to the amino acid sequence depicted in SEQ ID NO: 3 or SEQ ID NO: 6. [0010] The amount of 2-acetyl-l-pyrroline in rice can vary depending on harvest conditions and soil type. A non-aromatic variety, Nipponbare, has 2-acetyl-1-pyrroline levels in the range of 0 to 0.1 ppm (parts per million). In contrast an aromatic rice variety, Thai Hom Mali, has an amount of 2-acetyl-l-pyrroline in the range of 1 to 2.5 ppm. Example 2 details an RNA interference experiment against the rice Os2AP gene, which increased the 2-acetyl-1-pyrroline O levels in Nipponbare rice up to 2.5 ppm. The present invention thus provides methods for increasing the level of fragrance in a non-aromatic plant to aromatic levels. [0011] The invention further provides recombinant constructs and expression vectors containing the nucleic acids of the invention, where the nucleic acid can be operably linked to a promoter. ,O One useful promoter is a cauliflower mosaic virus (CaMV) promoter, which confers high levels Cc of expression in most plant tissues. r [0012] The invention further provides host cells that contain the nucleic acids, constructs, and O expression vectors of the invention. [0013] The invention also provides methods for screening plants and nucleic acids for a mutation in a 2AP gene leading to a decrease in 2AP protein expression or activity and a consequent increase in aroma resulting from increased production of the 2AP compound. One specific mutation is an eight nucleotide deletion associated with an aromatic phenotype in the rice 2AP gene. Screening for this and other mutations can be done by a variety of methods, such as PCR, sequencing, hybridization, or microarray experiments. Another option is to look for the reduction of 2AP protein levels or a change in structure or activity of the 2AP protein. This could be done, for example, by using an antibody which binds to the active 2AP protein, or by an assay for 2AP protein activity. [0014] The sequences described herein can be used as probes or primers in nucleic acid hybridization experiments. Nucleic acid segments that include at least a 14 nucleotide long contiguous sequence that has the same sequence as, or is complementary to, a 14 nucleotide long contiguous DNA segment of the nucleotide sequence depicted in SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 5 can be used. [0015] Longer contiguous identical or complementary sequences, those of about 20, 30, 100, 200, 500, 1,000, 2,000, 5,000, 10,000 etc. (including all intermediate lengths and up to and including full-length sequences will also be of use in certain embodiments. [0016] The ability of such nucleic acid probes to specifically hybridize to 2AP gene sequences will enable them to be of use in detecting the presence of complementary sequences in a given sample. S [0017] However, other uses are envisioned, including the use of the sequence information for the preparation of mutant species primers, or primers for use in preparing other genetic Sconstructions. BRIEF DESCRIPTION OF THE DRAWINGS [0018] FIGURE 1 depicts the chemical structure of 2-acetyl-l-pyrroline from GC-MS extracted Cc from Khoa Dawk Mali (KDML105), the Thai Jasmine rice (aromatic), and Koshihikari, a non- C, aromatic rice. [0019] FIGURE 2 depicts fine-scale mapping of the aroma gene in rice using 186 F9 plants derived from a single F6 plant segregating for grain aroma and the construction of a physical map encompassing the gene for grain aroma. [0020] The first part of Figure 2 shows the graphical genotypes of eight F 11 plants from the single F6 plant. The second part shows ultrascale mapping using 1116 F12 plants derived from a single F6 plant to narrow down the critical region to 27 kb in a single BAC. The third part shows the annotation of genomic sequence from KDML 105 which revealed three open-reading frames. The fourth part shows that using 177 F6 plants from the cross between KDML105 and JHN identified three double recombinants within exon 7 of the unknown protein gene that significantly affects grain aroma and 2-acetyl-l-pyrroline contents. The unknown protein gene was named Os2AP, Orzya saliva 2-acetyl-1-pyrroline. "Aromarker" is the PCR-based marker defining the 8 base pair deletion and 3 SNPs (single nucleotide polymorphisms) specific to grain aroma. [0021] FIGURE 3A depicts expression of seven candidate genes using RT-PCR from total RNA isolated from 10, 15 and 20 days after pollination between aromatic and non-aromatic isogenic lines. In this figure Os2AP is referred to as betaine aldehyde dehydrogenase. [0022] FIGURE 3B depicts differential expression of Os2AP transcripts in leaves, stems and roots from total RNA isolated 15 days after pollination between aromatic and non-aromatic isogenic lines. S[0023] FIGURE 3C depicts analysis of 2-acetyl-l-pyrroline levels in grains of THM (KDML105), aromatic isogenic line 117, non-aromatic isogenic line 10, and their F2 (ISL117 x In Fl, analysis of 2-acetyl-l-pyrroline levels was conducted in leaves. [0024] FIGURE 3D, upper panel, depicts expression of the Os2AP homolog from chromosome C 4 and actin in transgenic Nipponbare carrying Os2AP RNAi construct. FIGURE 3D, lower rcn panel, depicts the Os2AP gene (here referred to as BADH) on chromosome 8, not chromosome 0 4, showing differential expression in aromatic and non-aromatic isogenic lines of rice. O [0025] FIGURE 4A shows the metabolic pathway between proline and glutamic acid. L-proline C can be synthesized from glutamic acid using enzyme P5C synthase and P5C reductase, and glutamic acid can be synthesized from proline using proline dehydrognase (ProDH), and dehydrogenase (P5C DH). The proposed metabolic shift was mediated by nonsense mutation of the Os2AP. [0026] FIGURE 4B shows the predicted protein structure of the Os2AP enzyme created by using RASMOL, a program available through the website of the Microbiology Department of the University of Massachusetts, Amherst, USA (new version is Protein Explorer). [0027] FIGURE 5A shows the genomic sequence comparison of Os2AP genes from KDML105 (aromatic) and Nipponbare (non-aromatic). [0028] FIGURE 5B depicts a DNA alignment showing an eight base pair deletion in an aromatic strain, Thai Hom Mali (THM) as compared to Nipponbare, a non-aromatic strain, and the amino acid sequence comparison between the two strains. The nucleotide sequences in the alignment include nucleotides 701 to 765 from SEQ ID NO: 5 and nucleotides 701 to 757 form SEQ ID NO: 2. [0029] FIGURE 6 depicts graphical genotypes of eight F1 1 plants from the single F6 plant and the analysis of 2-acetyl-1-pyrroline levels from the rice grains. [0030] FIGURE 7A depicts an RNA interference construct and vector for transformation. The lower panel shows confirmation of expression of the construct. BADH here is referring to Os2AP S [0031] FIGURE 7B depicts expression of the RNAi construct, GFP, and the endogenous Os2AP. The lower panel shows 2-acetyl-l-pyrroline levels and aroma. BADH-RNAi2 is an SRNAi construct against the Os2AP gene. [0032] FIGURE 8A depicts the results from screening F6 progeny for the Os2AP variant ,O correlated with increased aroma using the "Aromarker" primer set. C [0033] FIGURE 8B depicts the results from screening various varieties of rice for the aromatic O CN Os2AP allele. [0034] FIGURE 9A shows the DNA sequences of the Os2AP orthologs from various strains of rice. The nucleotide sequences in the alignment include nucleotides 701 to 765 from SEQ ID NO: 5 and nucleotides 701-757 form SEQ ID NO: 2. [0035] FIGURE 9B shows the phylogenetic tree constructed using Neighbor-Joint tree of 22 BADH (Os2AP) orthologs. BRIEF DESCRIPTION OF SOME OF THE SEQUENCES IN THE SEQUENCE LISTING [0036] SEQ ID NO: 1 is the genomic nucleotide sequence of the Os2AP gene from an aromatic rice strain, Thai Hom Mali. SEQ ID NO: atggccacgg ctcggccgcc ttcaccctct ctgctgtttt gacgcggcgg cccgcgccgg tgactaccct gggaggagct tagttttaag tagcttcggt ctgttctgga tggaataagt ttcaagctct ggctcacggc tgcttcgtta ttgtgccttg 1 cgatcccgca gcctccccgt ccaccctctg ttgcgtgtcg tggcggcggc gcgccgtccg tatcagcctg ccgcgtggct tttgctgctt ataagccccc cta ggaattg tggaagcatg tgttaattga catgggtgtg gttggcaggt acgatgggac gcggcagctc cgtcaacccc cttctgcctc gtgcgcaagc gcgggaggcg ggccaagtac cccgttttaa ctccagtagg aaattcctca catatgatgt gccataggtt gct'gattgtg tgggttcaag cacatttgat cctagt cact tgg attact a ttcgtcgccg gccaccgagt tgattagcct gagatcccgg cttaaaaaga attcgcgcaa cgggagqctt tttttgagcc agggtttagt gcgtgcgtcg gatcgacagt tatctggatg cggagagggt tggtgcgcaa aaatcactat gccttttggt gcgagtggcg cccccatcgg ttttgttgtt cgggcacggc acccgggccg tcgctgacaa gtgcgtgtgt. ccaaatcgat ttgcaaccaa gcatcggaag tcgagtattc ctgtttttgt gcgcaacaac caacaagtgt tggattggta tgcctttgtg cgcccccgcg taccctcctc gttgttgttg ggaggacgtg cgactgggcg agtagggtgg tccgtacagg cgatatgctc atccttattt tgcgtatcct tgcttctgtt ggtgattcgt aagtgtatat atattgtgtg ctagctactt gtattccgtt SEQ ID NO: gttatgggcc tctcacagtt tttcatgtat cccatattat aaacagtgac ggggatgcta ggaaatatgc ccttttccct ttgattgtgg ctgtatgaat. atattccatt atccttggac tcttcggaaa ctctcatcct catgcatata gacatttgag cctagcagct tgtgttaagt gttttagtgt taatataatg taatgataca atctattttt ttatttatgt ttctaatttg tcatgttcgg ggtagctcct cgtgtaagtt gattttaaaa tcattaatta tcaggtgtgc caccctggtg aaatgatatt tatgaaactg ggatattttt cttttaactc ccagttctat tggaacttgg catgccactt agctggatgc ggtcccagac agaagttaca gtcctgtatt gaggattata atgaatatat aaggactgat ttcctaacta atgaattctg taacgttttg tttcattgtt gaccaatggc tatccgtcaa agattattat catgtatttt ttttgtgtcc gaactctatg tagctttcag ttcaagaaag gttgcaggct ttcaaaaaag tgcactgttt tgctagttgc cattgatgaa attttaattt tgtatatatt attgccttgt gctaccattc 1 (continue) tgttgatgga tggatccctt tttctcttca ggttatattt ,tttctggcat atataggtta ggatcactag tgtacacttt cagt gcacac tttacttgaa gccttccagt ctaaatgaca taatttctat cataaaaatt ttttcttcag ataatcgaga gaagcctctt gatgaagc:ag atgtagttgc ctacacagca tgtcaggacg atg .ttgctgg aaaaggcaaa atgcacctgt gagcctatcg gtgtagttgg gcgcttatat ttatttatga ctgaatgagt ttacaacttt attttattcc tcttctcttg actctctccg tttcatatta ttgaccaagt ttatagaaaa tttttcaggc Tctcatgtaa atacacagct cttgtgtatt cacctctcct ccgtgttaat ctcatatgtt, gtcagcatga ctggaatctt actgcagaag ttgtttcttt gttatgttct gatgtgtatt tctactgcag gccctggctg ctggctgtac taacatgtta acttgttaat tgttgtggca tgtccatgct ccaggacttg tttggagctt taaacatagt gactggatta. tagacaaggt-acagctattc cctctcaata catggtttat gtatatattt cagctgctcc tgttctcttt ctactaactcttttactttt tagaattgtg cctgtttctt acctttttat tggaaaaagt cctatagtgg gctatgatta actaattctg tagaccccca aaagcccttt gagcaggatg caagaaaatt cctcaaggat tagcaatact attttgagcA gatcttgtac atgccatgcc aactgagtaa atgtgatttc atttttcctt gtgaaaagca caagtactat caattcacct tgAttagtca tttatatttc actaattttg tctggttatg gactctgttt ggtccttctt cttgtacagc cagatttgca gtgcaacatc tcataatcta ttgttgtact atgatgccat tactattgca gagtaatata tgcattgcct atgtctctat gatgtttatt ttgatcttgd actaaaattg cttcttgctc ctgaccatga gatggttgca tgggccaaaa tgggcccgtt gttagtgaag aaatgccact gatcaatggt tgtcaatttt gtgtaatata tagtatgtgg tacagggtct aaggccacct ttccgtaggt tttgttttga ccccatggca ttcaaagcac cagaatgttt gtcagtatga gaagattaag tgactggtgg ggttagaccc taatttctag ctcgtatttc ttattattat tttggatga ctgtcaaacc gtaaActact ttcatggcat ggaaatctga catgggacat atctttcctg atgctttgag ctctcttcca gttgatcaca cccaagcatg acatgatttt tgcaaacata taagtcgttt atttagcaac gcaattttga caaagg aata. gcagctcatt ttcacttaat ggaaaacaat tatcgacaat gaactatcct agctgtacta gtcataccca gcaagcaatg gctgatgtgt ggttctgaag .ctcctgtaat gttttctgtt tatggttaag tctattatca atcaagacac ggttcgtttt tgtttgatga aagtgqggga ttggtgtctt aactactitt ttcttaaaat tggttgAtcg -agaaaagagt tttcttttcg tcttagtt'ct gtaacttgat aatctttaat gtAttcactc tgttgagtgg gcgtcttatt tgattttttt cagag cgaat attatttaat actttattat aaattaattg aagttttgca atattaaggt gacaggtacc aggtccttcc gttctgaatg tcctact tt g ttatcttgtc ctttaatgaa agatggtttg caatttgtat aaggtaataa tgccaaatgc cttttcctaa tattctccag aaacctactt atcaattttc atacttttgt atgcgagcat gctggctaga ggtatgtggc aacatgaatc tactttgcag atggaaaact ccttggtatt .gtactaaata tttgaactat ttattgtctt gacttttttc atctaaaata tgtgccctct tatatatata tgttctgtcc ttagtatata tgataacgga tacaaatttg ctcctgatgg aaaccatctg tgctagttgc taatttgaaa gtaaagAggt .ccggtgctcc ca tgtatacc aggttgcatt gtttgtttcc attctcaatg tttgagcatc ttcttgacag t gttgat gt t ctttgtaaag gggcttgttg gccactgatt gtgctattga gcttgcatga tgtaaaatat tggcaaggaa ggaaaactgt attggcaatt tacattttat aatttggatc actctctttg cttcatgtaa tctgatcaac aaaattgtat atatagattg tgccggcatg gcttattgct gaAaaaatc gtcagatcca acatgtaaac aagccttatt aattagttgg agcaaattcg ataatttaaa at cacttag c, ttg gaaat ct ctaccgccaa tctactacac atgctagatt 1020 aggattgctt 1080 aacaagatta 1140 actgaaagta 1200 ta gcaaagca 1260 ccgtaggttt 1320 tttatgatca 1380 ctagagacgc 1440 cagttatcca 1500 ctgatgtatg 1560 atcttgcaga 1620 ttaaatgcta 1680 tcacattttt 1740 gtactagtaa 1800 gaaagttgaa 1860 acaaattgta 1920 ctagtcaaaa 1980 tcaaagtcat 2Q40 cctttcttct 2100 tatatatata 2160 cggttcaaat 2220 gaagatgcca 2280 attgattgca 2340 attctgagaa 2400 caacatggaa 2460 aattggcttc 2520 aatgacattt 2580 tctctctcta 2640 tggtcttcct 2700 tttgtcatca 2760 ccatcaatgg .2820 tactgggagt 2880 aaatttctgt 2940 ttgtcctttt 3000 attctagtag 3060 cctgtttcac 312.0 gaaaaaggta 3180 c-acttaactg 3240 cagaaatact 3300 agtatttctt* 3360 ttaaaaagat 3420 aaatattgtt 3480 gttatgcaac 3540 ggcagttagg 3600 gttctttatt 3660 ctagctgatt 3720 ggttgaaatt, 3,780 ttccattaga 3840 gttgcttttg 3900 gcattgaata 3960 tcctgagttc, 4020 ttatgcacag 4080 tagcacttaa 4140 aagcaacttt 4200 attaatgata 4260 gttaaagaat 4320 cttgaagagg 4380 tttttctaaa 4440 gctggattgt 4500 tgtatgctct 4560 tgttaaaatg 4620 ccccaataaa 4680 catgagcttt 4740 tacacatcct 4800 aagccaaggt 4860 ggttgtatat 4920 Z SEQ ID NO: _n ataggtaccc gagaaaggtt tggagggaag gccattgaat ttcaatcttg ttttgatagt IND attaactgag kn gcatcaagct rn tccctggact rn agttgctctt caaaagagag tctgggtgaa gcggctttgg accatttcga tttcaggggc gccgtgggga 1 (continuE acatatcatt tctatattga, aagtttttgg tggccaacga cagcatatgt tatggtctgg gtatatccaa gatattcaag ccggtcaatg cacttccaca ggaaatactt ctgctcgcaa acgcgagctc tcgtctgatg at t gacaa ct tggtacaaat atgaagtaga acccacaatc tccagtgctc tactcagtga atatactctg ctggtgctgt gtgaaggggg aatctcattt ttaatatagt tcatctgatc aacgaaaatt ccctgcttCt ggagaagggt. atctcgacct acctaagcgt ccccttccaa aataatcttg .attactgatg tgtgtgaaag gttttttttt tggcatatga gcTttccggt ttggcattgt attacttgca ttgtttgcta catgacttta tcctactgca gccaagctcc gggtagcaca gacatcatgc caagcaagtg gctgtaa tatgtttttg tcgatacatc aatttagcac taatacagtt acttattctg gaccgcgagc ttgattcata ttctatgcat gtagtatgct tatttgaccc ggagatcgat atggggcggg caacaatctc ctttggtatt acggagtacg tcagcatctg aatgcaaatt tgaagaagaa cattgtcctg ctactactac gatgccagag tgacatggtt ctccagttct actccaatta gccggaatta aacaagcgca actttaaaac ttcattcact c ct ccgat ga 4980 5040 5100 5160 5220 5280 5340 5400 5460 5520 5580 5640 5700 5760 5820 5857 [00371 SEQ ID NO: 2 is the protein-encoding nucleotide sequence of Thai Horn Mali Os2AP, SEQ ID NO: atggccacgg ctcggccgcc gcgggcacgg aacccgggcc atcgctgaca gggaagcctc tttgcagatc gaaaacttta tggaactatc acagctgtac tgtaaagagg gccggtgctc gaaactggta aaaaagtcct ctttggttgc taaaaaaatc gtcagatcca gaagattaag ggttagaccc cgatacatca atttagcact tgctgtgctt aattatctgg gcgcagcggc gcaagtgacg gtaa 2 cgatcccgca gcctccccgt cggaggacgt gcgactgggc aaataatcga ttgatgaagc ttgcagaatc aatgctatct ctctcctgat taaaaccatc ttggtcttcc ctttgtcatc tatatttcag atagtggtgt ttttggacca gctaaagaat cttgaagagg caatttgtat aagcatctgg atgcaaattt gaagaagaag tccggtgacc gtgaactgct tttggacgcg gagtacgcct gcggcagctc cgtcaacccc ggacgcggcg gcccgcgccg gaggaaatct agcatgggac cttggacaaa tcggaaagag ggcaacatgg tgaattggct ttcaggtgtg acaccctggt ctgctcctat ttgatgatgt atggccagat ttcaagaaag gttgcaggct ctaccg ccaa agaaaggttt ggagggaaga ccattgaatt gcgagcgatg cgcaaccctg agctcggaga ccgatgagcc ttCgtCgCCg gccaccgagt gtggcggcgg ggcgccgtcc gagctggcta atggacgatg. aggcaaaatg cctatcggtg aaggtagctc tccgtgactt ctaaacatag gtagacaagg ggttaagcct tgatgttgaa ttgcagtgca gatggttgca tgggcccgtt. aagccaaggt c ta tat t ga a agtttttggt ggccaacgat. ccagagatta cttctgccaa agggggcatt gtggggatgg gcgagtggcg cccccatcgg cgcgggaggc gggccaagta gactagagac ttgctggatg cacctgtctc tagttgggtt ctgccctggc gtttggagct tgactggatt ttgcatttac gtttCaCtgg aaagctgttg acatcgcgtc tgggccaaaa gttagtgaag gctaccattc cccacaatca ccagtgctct actcattatg actgaggaga gctccatggg gacaactacc tacaaatccc Cgcccccgcg cg agat cccg gcttaaaaag cattcgcgca gcttgattgt ctttgagtac tcttccaatg gatcacacc tgctggctgt tgctgatgtg aggttctgaa tgggagttat aacttggtgg agtggactct ttattcttca atattaaggt gacagtatga tgactggtgg ttactgatgt gtgtgaaaga gtctggctgg tcgatgccgg gcgggaacaa taagcgtcaa cttccaagct 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1504 Z [0038] SEQ ID NO: 3 is the amino acid sequence of the Thai Hom Mali Os2AP protein. C1 SEQ ID NO: 3 Met Ala Thr Ala Ile Pro Gin Arg Gin Leu Phe Val Ala Gly Glu Trp 1 5 10 ND Arg Ala Pro Ala Leu Gly Arg Arg Leu Pro Val Val Asn Pro Ala Thr mn 20 25 cn Glu Ser Pro Ile Gly Glu Ile Pro Ala Gly Thr Ala Glu Asp val Asp m 35 40 Ala Ala Val Ala Ala Ala Arg Glu Ala Leu Lys Lys Asn Pro Gly Arg (N 50 55 Asp Trp Ala Pro Ala Pro Gly Ala val Arg Ala Lys Tyr Ile Arg Ala 70 75 SIle Ala Asp Lys Ile Ile Glu Arg Lys Ser Glu Leu Ala Arg Leu Glu 1 85 90 Thr Leu Asp Cys Gly Lys Pro Leu Asp Glu Ala Ala Trp Asp Met Asp 100 105 110 Asp Val Ala Gly Cys Phe Glu Tyr Phe Ala Asp Leu Ala Glu Ser Leu 115 120 125 Asp Lys Arg Gin Asn Ala Pro val Ser Leu Pro Met Glu Asn Phe Lys 130 135 140 Cys Tyr Leu Arg Lys Glu Pro Ile Gly val val Gly Leu Ile Thr Pro 145 150 155 160 Trp Asn Tyr Pro Leu Leu Met Ala Thr Trp Lys val Ala Pro Ala Leu 165 170 175 Ala Ala Gly Cys Thr Ala Val Leu Lys Pro Ser Glu Leu Ala Ser Val 180 185 190 Thr Cys Leu Glu Leu Ala Asp Val Cys Lys Glu val Gly Leu Pro Ser 195 200 205 Gly val Leu Asn Ile val Thr Gly Leu Gly Ser Glu Ala Gly Ala Pro 210 215 220 Leu Ser Ser His Pro Gly val Asp Lys val Ala Phe Thr Gly Ser Tyr 225 230 235 240 Glu Thr Gly Ile Tyr Phe Ser Cys Ser Tyr Gly 245 250 100391 SEQ ID NO: 4 is the genomic nucleotide sequence of the Os2AP gene from a nonaromatic rice strain, Nipponbare. SEQ ID NO: atggccacgg cttggccgcc ttcaccctct ctgctgtttt gacgcggcgg cgcgcgccgg tgactacccc gggaggagct tagttttaag tagcttcggt ctgttctgga tggaataagt ttcaagctct ggctcacggc tgtgcttcgt ttttgtgcct ttgttatggg tttctcacag tttttcatgt tacccatatt taaaacagtg caggggatgc ttggaaatat c acctt tt cc gcttgattgt cactgtatga tgatattcca gaatccttgg tatcttcgga ttctctcatc aacatgcata aagacatttg tacctagcag aatgtgttaa atgttttagt cttaatataa tacacacctc ttttctcata atgtctggaa tttgttgttt tcgggatgtg tcctgccctg agtttaacat aaaatgttgt attaccagga gtgctaaaca ggtgtagaca tattcctctc actggtaaaa gtggatattt ttcttttaac agccagttc~t actggaactt tacatgccac tgagctggat ctggtcccag cgatcccgca gcctccccgt ccaccctctg ttgcgtgtcg tggcggCc gcgccgtccg cacccccccc ccg cgtg gct tttgctgctt ataag cc ccc ctaggaattg tggaagcatg tgttaattga catgggtgtg tagttggcag tgacgatggg cctgttgatg attttctggc atggatcact accagtgcac tagccttcca gctaatttct ctttttcttc gggaagcctc atatotagtt tttgtcagga acAaaaggca aagagcctat ctgcgcttat tactgaatga agattttatt ctactctctc gtttgaccaa gttttttcag tgatacacag tcctccgtgt tgttgtcagc tcttactgca ctttgttatg tatttctact gctgctggct gttaacttgt ggcatgtcca cttgtttgga tagtgactgg aggtacagct aatacatgg t agattatggc tttgttctct tcctttactt atcctgtttc ggtggaaaaa ttgctatgat gctagacccc acgagcagga gcggcagctc cgtcaacccc cttctgcctc gtgcgcaggc gcgggaggcg ggccaagtac ccccccccaa ctccagtagg aaattcctca catatgatgt gccataggtt gctgattgtg tgggttcaag cacatttgat gtcc Itagtca actggattac gatggatccc caggttatat atatataggt agtgtacact actttacttg gtctaaatga atcataaaaa agataatcga ttgatgaagc gcctacacag cgatgttgct aaatgcacct cggtgtagtt atttatttat gtttacaact cctcttct Ct cgtttcatat gtttatagaa gctctcttt ctct tg t gta taatgcagct atgattcact gaagggaaaa ttcttatcga gcaggaacta gtacagctgt taatgtcata tgCtgcaagC gcttgctgat attaggttct attcct cctg ttatgttttc ttcagctgct ttctactaac tttagaattg ttaccttttt gtcctatagt -taactaattc caaaagccct tgcaagaaaa ttcgtcgccg gccaccgagt tgattagcct gagatcccgg ctgaagagga ctccgcgcaa cgcgACCtgc tttttgagcc agggtt tagt, gcgtgcgtcg gatcgacagt tatctggatg cggagagggt tggtgcgcaa ctaaatcact tagccttttg ttt aattt ct ttctcgtatt tattattatt tttttggatg aactgtcaaa cagtaaacta ttttcatgg c gajgaaatct agcatgggac caatctttcc ggatgctttg gtctctcttc gggttgatca gacccaagca ttacatgatt tgtgcaaaca tataagtcgt aaatttagca AagcAatttt ttcaaaggaa catttgttct taatttagta caattgataa caattacaaa tcctCtCCtg actaAaacca cccatgctag 'aatg taattt gtgtgtaaag gaagccggtg taatcatgta tgttaggttg cctatggtta .tacctattat .tgat cAagac atggttcgtc ,ggtgtttgat tgaagtg cgg ttttggtgtc ttaactactt gcgagtggcg cccccatcgg ttttgttgtt cgggcacggc accggggccg tcgcggcca gtgcgtgtgt ccaaatcgat ttgcaaccaa gcatcggaag tcgagtattc ctgtttttgt gcgcaacaac caacaagtgt. attggattgg gttgcctttg agtgccaaat tccttttcct attattctcc aaaaacctac ccatcaattt ctatacttt t atatgcgagc gagctggcta atggtatgtg tgaacatgaa agtactttgc caatggaaa~a caccttggta tggtactaaa tttttgaact tattattgtc ttgacttttt. acat ctaaaa gatgtgccct aatatatata gtCCCggttc tatagaagat cggaattgat. tttgattctg atggcaacat tctgaattgg ttgcaatgac gaaatctctc' aggttggtct ctcctttgtc taccccatca catttactgg aggtttgttt caattctcaa actttgagca ttttcttgac gatgttgatg gactttgtaa ttgggcttgt ttgccactga cgcccccgcg taccctcctc gttgttgttg ggaggacgtg cgactgggcg ggtagggtgg tccgtacagg cgatatgctc atccttattt tgcgtatcct tgcttctgtt ggtgattcgt aagtgtatat atattgtttg tactagttac tggtattccg gcatgctaga aaaggattgc agaacaagat ttactgaaag tctagcaaag gtccgtaggt attttatgat gactagagac gccagttatc tcctgatgta agatcttgca ctttaaatgc tttcacattt tagtactagt atgaaagttg tcacaaattg tcctagtcaa tatcaaagtc ctcctttctt tatataatgaaaat'at ctat gccattattt tgcattctaaagaatcatgt ggaaggtagc cttccgtgt.a atttgatttt tctatcatt'a tccttcaggt atcacaccct. atggaaatga gagttatgaa ccaaatttct tgttgtcctt t cat-tctagt agcctgtttc ttgaaaaagg agcacttaAc tgcagaaata ttagtatttc 120 180 240 300 360 420 480 540 600 '660 720 780 840 .900 960 1080 1140 1200 .1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 ,2460 2-520 2580 2640. .2700 2760 2820 .2880 2940 3000 3060 3120 3180 3240 3300 3360 SEQ ID NO: ttagaagtta atgtcctgta ttgaggatta acatgaatat ggaaggactg ttttcctaac ttatgaattc tttaacgttt gatttcattg tggaccaatg tatatccgtc t cagattatt agcatgtatt aattttgtgt ttgaactcta tatagctttc atttcaagaa gggttgcagg aattcaaaaa gttgcactgt cttgctagtt tgcattgatg aaattttaat tttgtatata ctattgcctt gtgctaccat atataggtac tggagaaagg tttggaggga aagccattga tgttcaatct acttttgata agattaactg ttgcatcaag cttccctgga taagttgctc tgcaaaagaa tatctgggtg cagcggcttt acaccatttc cttttcaggg gagccgtggg 4 (continue cacctcaagg ttattttgag taatgccatg atatgtgatt atgtgaaaag tacaattcac tgtttatatt tgtctggtta ttggtccttc gccagatttg aat cataatc atatgatgcc ttgagtaata ccatgtctct tgttgatctt agctt cttg c aggatggttg cttgggcccg agaaatg cca tttgtcaatt gctagtatgt aaaaggccac tttttgtttt ttttcaaagc gtgtcagtat tctgactggt ccacatatca tttctatatt agaagttttt attggccaac tg cag cat at gttatggtct aggtatatcc ctgatattca ctccggtcaa ttcacttcca agggaaatac aactgctcgc ggacgcgagc gatcgtctga gcattgacaa gatggtacaa attagcaata cagatcttgt ccaactgagt tcatttttcc cacaagtAct cttgattagt tcactaattt tggactctgt ttcttgtaca cagtgcaaca tattgttgta attactattg tatgcattgc atgatgttta gaactaaaat tcctgaccat catggg ccaa ttgttagtga ctgatcaatg ttgtgtaata ggtacagggt ct'ttccgtag gaccccatgg accagaatgt gagaagatta ggggttagac ttatgaagta gaacccAcaa ggtccagtgc gatactcagt gtatatactc ggctggtgct. aagtgaaggg agaatctcat tgttaatAta catcatctga ttaacgaaaa aaccctgctt tcggagaagg tgatctcgac ctacctaagc atccccttcc ctttcttaaa actggttgat aaagaaaaga tttttctttt attcttagtt cagtaacttg tgaatcttta ttg tattcac gctgttga§gt tcgcgtctta cttgattttt cacag agcga ctattattta ttactttatt tgaaattaat gaaagttttg aaatattaag aggacaggta gtaggtccttt agttctgaa cttcctactt gtttatcttg cactttaatg ttagatggtt agcaatttgt ccaaggtaat g aaataa-tct tcattactga tctgtgtgaa gagttttttt tgtggcatat gtgctttccg ggttggcatt ttattacttg gtttgtttgc tccatgactt tttcctactg ctgccaagct gtgggtagca ctgacatcat gtcaagcaag aagctgtaa atgtgctatt cggcttgcat gttgtaaaat cgtggcaagg ctggaaaact atattggcaa attacatttt tca'atttgga ggactctctt ttcttcatgt tttctgatca ataaa a ttg t atatatagat attgccggca tggcttattg cagaaaaaaa g tgtcagatc ccacatgtaa ccaagcctta tgaattagtc. tgagcaaatt, tcataattta aaatcactta tgttggaaat at-ctaccgcc aatctactac tgtatgtttt tgtcgataca agaatttagc tttaatacag gaacttattc gtgaccgcga gtttgattca cattctatgc tagtagtatg tatatttgac caggagatcg ccatggggcg cacaacaatc, gcctttggta tgacggagta gattaaaaag gaaaatattg atgttatgca aaggcagtta gtgttcttta ttctagctga. atggttgaaa tcttccatta tggttgcttt aagcattgaa actcctgagt atttatgcac tgtagcactt tgaagcaact ctattaatga tcgctaaaga cacttgaaga actttttcta ttgctggatt 9gtgtatgct cgtg ttaaaa aaccccaata gccatgagct cttacacAtc aaaagccaag acglgttgtat tgtcagcatc tcaatgcaaa actgaagaag ttcattgtcc tgctactact gcgatgccag tatgacatgg atct 'ccagtt ctactccaat cccttttttt atgccggaat ggaacaagcg tcactttaaa ttttcattca cgcctccgat 3420 3480 3540 3600 3660 3720 3780 3840 3900 3960 4020 4080 4140 4200 4260 4320 4380 4440 4500 4560 4620 -4680 4740 4800 4860 4920 4980 5040 5100 5160 5220 5280 5340 5400 5460 5520 5580 5640 5700 5760 5820 5859 100401 SEQ ID NO: 5 is the protein-encoding nucleotide sequence of Nipponbare Os2AP. SEQ ID NO: atggccacgg ctcggccgcc gcgggcacgg aaccggggcc atcgcggcca gggaagcctc tttgcagatc gaaaacttta tggaactatc acagctgtac tgtaaagagg gccggtgctc gaaactggta cttggtggaa tggactctct attcttcata attaaggtgt cagtatgaga actggtgggg actgatgtcg gtgaaagaat ctggctggtg gatgccggaa gggaacaagc agcgtcaagc tccaagctgt cgatcccgca gcctccccgt cggaggacgt gcgactgggc agataatcga ttgatgaagc ttgcagaatc aatgctatct ctctcctgat taaaaccatc ttggtcttcc ctttgtcatc aaaagattat aaagtcctat ttggttgctt aaaaaatcgc c agat ccact agattaagca ttagacccaa atacatcaat ttagcactga ctgtgctttc ttatctgggt gcagcggctt aagtgacgga aa gcggcagctc cgtcaacccc ggacgcggcg gcgcgcgccg gaggaaatct agcatgggac cttggacaaa tcggaaagag ggcaacatgg tgaattggct ttcaggtgtg acaccctggt ggcttcagct agtggtgttt ttggaccaat taaagaattt tgaagagggt atttgtatct gcatctggag gcaaatttgg agaagaagcc cggtgaccgc gaactg ctcg tggacgcgag gtacgcctcc ttcgtcgccg gccaccgagt gtggcggcgg ggcgccgtcc gagctggcta atggacgAtg aggcaaaatg cctatcggtg aaggtagctc tccgtgajctt ctaaacatag gtagacaagg gctcctatgg gatgatgttg ggccagattt caagaaagga tgcaggcttg accgccaaaa aaaggtttct agggaagaag attgaattgg gagcgatgcc Caaccctgct ctcggagaag gatgagccgt gcgagtggcg cccccatcgg cgcgggaggc gggccaagta gactagagac ttgctggatg cacctgtct c tagttgggtt ctgccctggc gtttggagct tgactggatt ttgcatttac, ttaagcctgt atgttgaaaa gcagtgcaac tggttgcatg ggcccgttgt gccaaggtgc atattgaacc tttttggtcc ccaacgatac agagattaac tctgccaagc ggggcattga ggggatggta cgcccccgcg cgagatcccg gctgaagagg cctccgcgca gcttgattgt ctttgagtac tcttccaatg gatcacacct tgctggctgt tgctgatgtg aggttctgaa tgggagttat ttcactggaa agctgttgag atcgcgtctt ggccaaaaat tagtgaagga taccattctg cacaatcatt agtgctctgt tcatttatggt 'tgaggagatc tccatggggc caactaccta caaatcccct 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1512 [0041] SEQ ID NO: 6 is the amino acid sequence of Nipponbare Os2AP Protein. Z SEQ ID NO: 6 met Ala Thr Ala le Pro Gin Arg Gin Leu Phe /al Ala Gly GlU Trp 1 5 10 Arg Ala Pro Ala Leu Gly Arg Arg Leu Pro Val Val Asn Pro Ala Thr 25 Glu Ser Pro Ile Gly Glu Ile Pro Ala Gly Thr Ala GlU ASP Val ASP 40 IND Ala Ala Val Ala Ala Ala Arg Glu Ala LeU Lys Arg Asn Arg Gly Arg 50 55 ASP Trp Ala Arg Ala Pro Gly Ala Val Arg Ala Lys Tyr Leu Arg Ala 70 75 Ile Ala Ala Lys le Ile Giu Arg Lys Ser Glu Leu Ala Arg Leu Glu 90 C1Thr Leu ASP Cys Gly Lys Pro LeU ASP GlU Ala Ala Trp ASP Met ASP 100 105 110 ASP Val Ala Gly Cys Phe Glu Tyr Phe Ala ASP Leu Ala Giu Ser Leu 115 120 125 ASP Lys Arg Gin Asfl Ala Pro Val Ser Leu Pro Met GiU Asn Phe Lys 130 135 140 Cys Tyr LeU Arg Lys Giw Pro Ile Gly Val Val Gly Leu Ile Thr Pro 145 150 155 160 Trp Asfl Tyr Pro Leu Leu Met Ala Thr Trp Lys Val Ala Pro Ala Leu 165 170 175 Ala Ala Giy Cys Thr Ala Val LeU LYS Pro Ser GlU LeU Ala Ser Val 180 185 190 Thr Cys Leu Giu LeU Ala ASP Val Cys Lys Glu Val Gly Leu Pro Ser 195 200 205 Gly Vai Leu Asn Ile Val Thr Gly Leu Gly Ser Giu Ala Gly Ala Pro 210 215 220 Leu Ser ser His Pro Gly Val ASP LYS Val Ala Phe Thr Gly ser Tyr 225 230 235 240 GiU Thr Gly LYS LYS le Met Ala Ser Ala Ala Pro met Val Lys Pro 245 250 255 Val ser LeU Glu LeU Gly Gly LYSsSer Pro Ilie Val Val Phe ASP ASP 260 265 270 V'al ASP Val Giu Lys Ala 'al GlU Trp Thr Leu Phe Gly Cys Phe Trp 275 280 285 Thr Asfl Gly Gin Ile Cys Ser Ala Thr Sen Arg Leu Ile LeU His Lys 290 295 300 Lys lie Ala LYS Giu Phe Gin Giu Ang Met Val Ala Trp Ala Lys Asn 305 310 315 320 Ilie LYS Val Ser ASP Pro Leu Giu Giu Gly Cys Arg Leu Gly Pro Val 325 330 335 Val ser Giu Gly Gin Tyr GiU Lys le Lys Gin Phe Val Sen Thr Ala 340 345 350 Lys Sen Gin Gly Ala Thr Ile Leu Thr Gly Gly Val Arg Pro Lys His 355 360 365 LeU GlU Lys Gly Phe Tyr Ilie Giu Pro Thr lie Ilie Thr ASP Val ASP 370 375 380 Thr Sen Met Gin Ile Trp Arg Giu' GlU Val Phe Gly Pro Val Leu Cys 385 390 395 400 Val Lys GlU Phe Sen Thr Giu Glu. Glu Ala le GiU LeU Ala Asfl ASP 405 410 415 Thr His Tyr Gly Leu Ala Gly Ala Val Leu Sen Gly ASP Arg GlU Arg 420 425 430 Cys Gin Ang LeU Thr Giu GiTu le ASP Ala Gly le Ile Trp Val Asn 435 440 445 Cys Sen Gin Pro Cys Phe Cys Gin Ala Pro Trp Gly Gly Asn Lys Ang 450 455 460 Sen Gly Phe Gly Arg Giu Leu Gly Giu Gly Gly Ile Asp Asn Tyr Leu 465 470. 475 480 Sen Val Lys Gin Val Thr Giu Tyr Ala Sen ASP Giu Pro Trp Gly Trp 485 490 495 Tyr Lys Sen Pro Ser LYS Leu 500 O [0042] SEQ ID NO: 7 through 88 are primers which can be used to amplify portions of the Os2AP nucleotide sequence, GFP, or actin. (TABLE 1). S[0043] SEQ ID NO: 89 through 95 are sequences of fragments of 2AP gene orthologs. SEQ ID NO: 89 tgcatttact gggagttatg aaactggtat atatttcagc tgctcctatg gttaag 56 O SEQ ID NO: 90 tgcatttact gggagttatg aaactggtat atatttcagc tgctcctatg gttaag 56 SSEQ ID NO: 91 tgcatttact gggagttatg aaactggtat atatttcagc tgctcctatg gttaag 56 SEQ ID NO: 92 tgcatttact gggagttatg aaactggtaa aaagattatg gcttcagctg ctcctatggt taag 64 0 SEQ ID NO: 93 tgcatttact gggagttatg aaactggtat atatttcagc tgctcctatg gttaag 56 SEQ ID NO: 94 tgcatttact gggagttatg aaactggtaa aaagattatg gcttcagctg ctcctatggt taag 64 SEQ ID NO: 95 tgcatttact gggagttatg aaactggtaa aaagattatg gcttcagctg ctcctatggt taag 64 [0044] SEQ ID NO: 96 is the decapeptide which is highly conserved among general aldehyde dehydrogenases. Val-Thr-Leu-Glu-Leu-Gly-Gly-Lys-Ser-Pro DETAILED DESCRIPTION OF THE INVENTION [0045] The invention relates to 2-acetyl-l-pyrroline (2AP) genes. Nucleic acid sequences from 2AP genes are used to enhance the levels of 2-acetyl-l-pyrroline, a major aromatic compound found in rice, wheat, maize, oat, pandan leaf, aromatic coconut, and some bacteria and fungi. [0046] Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of the components set forth in the following description. The invention is capable of other embodiments or of being practiced or carried out in various ways. Also, it is to be understood that the phraseology and terminology employed herein is for the purpose of description and should not be regarded as limiting. [0047] Throughout this disclosure, various publications, patents and published patent specifications are referenced. The disclosures of these publications, patents and published patent O specifications are hereby incorporated by reference into the present disclosure to more fully describe the state of the art to which this invention pertains. The practice of the present Sinvention will employ, unless otherwise indicated, conventional techniques of plant breeding, CN immunology, molecular biology, microbiology, cell biology and recombinant DNA, which are within the skill of the art. See, Sambrook, Fritsch and Maniatis, MOLECULAR tn CLONING: A LABORATORY MANUAL, 2nd edition (1989); CURRENT PROTOCOLS IN S MOLECULAR BIOLOGY M. Ausubel, et al. eds., (1987); Plant Breeding: Principles and CI Prospects (Plant Breeding, Vol 1) M.D. Hayward, N.O. Bosemark, I. Romagosa; Chapman 0 Hall, (1993.); Coligan, Dunn, Ploegh, Speicher and Wingfeld, eds. (1995) CURRENT (N PROTOCOLS IN PROTEIN SCIENCE (John Wiley Sons, Inc.); the series METHODS IN ENZYMOLOGY (Academic Press, Inc.): PCR 2: A PRACTICAL APPROACH (M.J. MacPherson, B.D. Hames and G.R. Taylor eds. (1995), Harlow and Lane, eds. (1988) ANTIBODIES, A LABORATORY MANUAL, and ANIMAL CELL CULTURE R.I. Freshney, ed. (1987). (0048] Unless otherwise noted, technical terms are used according to conventional usage. Definitions of common terms in molecular biology may be found in Lewin, Genes V, published by Oxford University Press, 1994 (SBN 0-19-854287-9); Kendrew et al. The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (SBN 0-632- 02182-9); and Robert A. Meyers Molecular Biology and Biotechnology, a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8); Ausubel et al. (1987) Current Protocols in Molecular Biology, Green Publishing; Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York. Definitions of common terms in plant biology may be found in Esau, Plant Anatomy, published by John Wiley Sons (1977) (ISBN 0-471-24520-8); and Solomon et al., Biology, published by Saunders College Publishing (1993). Definitions [0049] In order to facilitate review of the various embodiments of the invention, the following 0 definitions are provided: [0050] 2-acetyl-l-pyrroline (2AP) polynucleotide sequence: The genes according to the )n subject invention include not only the full-length sequences disclosed herein but also fragments M of these sequences, which retain the characteristic activity of the sequences specifically CI exemplified herein. [0051] The rice 2AP gene sequence is disclosed herein. It is apparent to a person of skill in this art that 2AP polynucleotide sequences from another plant can be readily identified and obtained through several means using the rice 2AP gene sequence. The specific genes, or portions thereof, may be obtained from a culture depository, or constructed synthetically, for example, by use of a gene machine. [0052] Variations of these genes may be readily constructed using standard techniques for making point mutations. Also, fragments of these genes can be made using commercially available exonucleases or endonucleases according to standard procedures. For example, enzymes such as Bal31 or site-directed mutagenesis can be used to systematically cut off nucleotides from the ends of these genes. Also, genes which code for active fragments may be obtained using a variety of other restriction enzymes. Various enzymes may be used to directly obtain active fragments of these 2AP polynucleotide sequences. [0053] Equivalent 2AP polynucleotide sequences and/or genes encoding these equivalent 2AP polynucleotide sequences can also be isolated from strains and/or DNA libraries using the teachings provided herein. For example, antibodies to the 2AP proteins disclosed herein can be used to identify and isolate other 2AP proteins from a mixture of proteins. Specifically, antibodies may be raised to the portions of the 2AP proteins which are most constant and most distinct from other proteins. These antibodies can then be used to specifically identify equivalent 2AP proteins with the characteristic activity by immunoprecipitation, enzyme linked immunoassay (ELISA), or Western blotting. 1[0054] A further method for identifying the genes of the subject invention is through the use of oligonucleotide probes. These probes are nucleotide sequences having a detectable label. As is Swell known in the art, if the probe molecule and nucleic acid sample hybridize by forming a (N strong bond between the two molecules, it can be reasonably assumed that the probe and sample are essentially identical. The probe's detectable label provides a means for determining in a Sknown manner whether hybridization has occurred. Such a probe analysis provides a rapid rn method for identifying genes of the subject invention. Exemplary probes are described in Table 1. 0 [0055] The nucleotide segments which are used as probes according to the invention can be synthesized by use of DNA synthesizers using standard procedures. In the use of the nucleotide segments as probes, the particular probe is labeled with any suitable label known to those skilled in the art, including radioactive and non-radioactive labels. Typical radioactive labels include 32 P, 1251, 35S, or the like. A probe labeled with a radioactive isotope can be constructed from a nucleotide sequence complementary to the DNA sample by a conventional nick translation reaction, using a DNase and DNA polymerase. The probe and sample can then be combined in a hybridization buffer solution and held at an appropriate temperature until annealing occurs. Thereafter, the membrane is washed free of extraneous materials, leaving the sample and bound probe molecules typically detected and quantified by autoradiography and/or liquid scintillation counting. [0056] Non-radioactive labels include, for example, ligands such as biotin or thyroxine, as well as enzymes such as hydrolases or peroxidases, or the various chemiluminescers such as luciferin, or fluorescent compounds like fluorescein and its derivatives. The probe may also be labeled at both ends with different types of labels for ease of separation, as, for example, by using an isotopic label at the end mentioned above and a biotin label at the other end. [0057] Duplex formation and stability depend on substantial complementarity between the two strands of a hybrid, and, as noted above, a certain degree of mismatch can be tolerated. Therefore, the probes of the subject invention include mutations (both single and multiple), deletions, insertions of the described sequences, and combinations thereof, wherein said mutations, insertions and deletions permit formation of stable hybrids with the target O polynucleotide of interest. Mutations, insertions, and deletions can be produced in a given Spolynucleotide sequence in many ways, by methods currently known to an ordinarily skilled Sartisan, and perhaps by other methods which may become known in the future. [0058] The potential variations in the probes listed is due, in part, to the redundancy of the genetic code. Because of the redundancy of the genetic code, more than one coding rn* nucleotide triplet (codon) can be used for most of the amino acids used to make proteins. 0 Therefore different nucleotide sequences can code for a particular amino acid. Thus, the amino t¢C acid sequences of the 2AP proteins and peptides can be prepared by equivalent nucleotide sequences encoding the same amino acid sequence of the protein or peptide. Accordingly, the subject invention includes such equivalent nucleotide sequences. Also, inverse or complement sequences are an aspect of the subject invention and can be readily used by a person skilled in this art. In addition it has been shown that proteins of identified structure and function may be constructed by changing the amino acid sequence if such changes do not alter the protein secondary structure (Kaiser and Kezdy, 1984). Thus, the subject invention includes mutants of the amino acid sequence depicted herein which do not alter the protein secondary structure, or if the structure is altered, the biological activity is substantially retained. Further, the invention also includes mutants of organisms hosting all or part of a 2AP polynucleotide sequence encoding a gene of the invention. Such mutants can be made by techniques well known to persons skilled in the art. For example, UV irradiation can be used to prepare mutants of host organisms, tDNA insertion mutagenesis can be used, or TILLING (Targeted Induced Local Lesion of Genome) can be used. Likewise, such mutants may include asporogenous host cells which also can be prepared by procedures well known in the art. [00591 2AP polynucleotide sequences, including those from rice, may be obtained using the PCR primers of the invention. These primers are shown in TABLE 1, and correspond to the sequences depicted in SEQ ID NO: 7 through SEQ ID NO: 79. Combinations of these primers may be used to amplify different regions of the Os2AP gene. PCR conditions which work for this amplification are as follows: 10 jtL reaction mixture with 10 ng of template DNA, 0.1 mM dNTP, 0.5 (M of each primer, 0.5 unit of Taq polymerase, 2.0 mM MgCI2, and 1 x Thermophilic Polymerase Buffer (Promega). This mixture should undergo 30 cycles of PCR with the following times and temperatures: 94 'C for 30 seconds (denaturation), 60 'C for 30 seconds (annealing), and 72 'C for 2 min (extension). TABLE 1: List of primers SEQ ID Primer name Sequence NO: 7 OS2AP-8L GCCATGCCAACTGAGTAAAG 8 OS2AP-8R CAATTTTATTCGCTCTGTGC 9 OS2AP-9L TGCAACATCGCGTCTTATTC OS2AP-9R GCAACTAGCAAGAGCATACACC I11 OS2AP-12L ACCTGACATCATGCCTTTGG 12 OS2AP-12R CCGGTCATCAGCTAACTTCC 13 OS2AP-1I3R CCCTTCGTCATAAAATATACTAGCAA 14 OS2AP-1I4L TCCTCCAACATGCTCTTTCG OS2AP-14R CAGAGAAGTTTACGCCGTTG 16 OS2AP- 15L TTTTTAAATAAGATGAACGGTCAAA 17 OS2AP-16L CTCTCCACCCTCTGCTTCTG 18 OS2AP-16R CTCTCCGCTTGAACCCATC 19 OS2AP-1 7L GCATGGCTGATTGTGTATCTG OS2AP- 17R TTCCAAACCTACGGACAAAAG 21 OS2AP- 18L TTCCTCTTCTCTTGTGCAAAC 22 OS2AP-1I8R CACGGAAGCCAATTCAGATG 23 OS2AP- 19L CTATCCTCTCCTGATGGCAAC 24 OS2AP- 19R TGGCTACTAGAATGATGCTCAAAG OS2AP20L CCTTTTGTGTCGCTTTTGAG 26 OS2AP20R AAAATAGCCTTCACTCGTTGC 27 OS2AP2 IL CCATCGATTTCGAGGGTAAC 28 OS2AP2 IR CGCATCCGATAATATGTTG 29 OS2AP22L GTAATTAGGAGTACGACTCTCGTC 052AP22R GCTTATAGCCTACTGTATCCTCCTC 31 OS2AP23L AATTGGTTAACCCAGCAAGC 32 OS2AP23R ACATTGTGAAACGGAGGAAG 33 OS2AP24L GCTATAAGCCAGCTGCAAAC 34 OS2AP24R GCAGTTGGTACGGACTTCG OS2AP25L CCTAAATATTTGACGCCGTTG 36 OS2AP25R TGAAGAGGAGGGTACCGATG 37 OS2AP26L CACCACTCCACACCTGACAC 38 OS2AP26R GTACGGAACACACGCACAAG 39 OS2AP27L TGTTGTTGTTGTTGCTGCTG OS2AP27R GCCGTGAGCCATATACACTTG 41 023088C02 IL AGCTCCAGCTCCTCCTCGAT 42 023088C02 2L TATCTCTCACCGACCCCAAA 43 023 088C02 2R TGTTGCCATCAGGAGAGGA 44 023088C02 3R CTCTTGATGAAGCAGCATGG 023088C02 3R CCCAGTAAATGCAACCTTGTC 46 023088C02 4R GGCAACATGGAAGGTAGCTC 47 023088C02 4R CCATGCAACCATCCTTTCTT 48 023088C02 5R TTATGGCTTCAGCTGCTCCT 49 023088C02 5R CAATGGCTTCTTCTTCAGTGC 023088C02 6R GCCCGTTGTTAGTGAAGGAC 51 023088C02 6R GTACCATCCCCACGGCTCAT 52 023088C02 7L CGAGCGATGCCAGAGATTA 53 023088C02 7R AGCACATGGCAAATCAAACA 54 OS2AP-exon 7.1 -delF TGCTCCTTTGTCATCACACC OS2AP-exon 7. 1-delR TTTCCACCAAGTTCCAGTGA 56 OS2AP-inIL TTCGCTGCAGAACAGATGAC 57 OS2AP inIR CTGATGGTTACGCGACAATTT 58 OS2AP-inTA2L ATTTGAACCGGGACAGAACA 59 OS2AP inTA2R TTTTGATGTGCCCTCTCCTT OS2AP rnAAT3L TGGGTAATCTTGTTCTGGAG 61 OS2AP inAAT3R AGTGCCAAATGCATGCTAGA 62 OS2AP-inG4L TGGGGCTCAAAAACCTACTG 63 OS2AP-inG4R GTCCGGGCCAAGTACCTC 64 OS2AP-5-UTR-EXI-5 F ATCTCTCACCGACCCCAAAT OS2AP-5-UTR-EX1-5 R CCATTGGAAGAGAGACAGGTG 66 OS2AP-ATG1-600 F TGTTGTTGTTGTTGCTGCTG 67 OS2AP-ATGI1-600 R TGGGGCTCAAAAACCTACTG 68 OS2AP-EX5-12 F GGTTGGTCTTCCTTCAGGTG 69 OS2AP-EX5-12 R GGTCCAAAAGCAACCAAAGA Aromarker BigL ACTGGTAAAAAGATTATGGC 71 Aromarker BigR CAAGCCGATCAACCAGTACA 72 Aromarker SmaliL CCATGCTGCAAGCAATGTA 73 Aromarker SmalIR AACCATAGGAGCAGCTGAAATA 74 OS2AP8_OUT F ACCCTGGTGTAGACAAGGTA OS2AP8_INF GGGAGTTATGAAACTGGTATAT 76 OS2AP8_I NR ATAGGAGCAGCTGAAGCCAT 77 OS2AP8_OUTR GTCCCGCACTTCAGAATTAG 78 OS2AP8_ex2_F CTCTGCTTCTGCCTCTGATT 79 OS2AP-exon9.1I-delRN CTGGCTACTAGAATGATGCTC Exon6to9NcoIF AATTCCATGGGGTTGGTCTTCCTTCAGGTG 81 Exon6to9SpeIR AATTACTAGTTTCCACCAAGTTCCAGTGA A 82 Exon6to8NhelR AATTCCATGGGGTTGGTCTTCCTTCAGGTG 83 GFPU CTTGTTGAATTAGATGGTGATGTT 84 GFPL GTTGTGGGAGTTGTAGTTGTATTC Os2APCH4U TAGCTTCACATCCCCATGTG 86 Os2APCH4L GCACCTTCACATCTTGCTGT 87 ActinU ACATCGCCCTGGACTATGAC 88 ActinL TGCTGAGAGATGCCAAGATG -100601 Oryza sativa 2-acetyl-l-pyrroline (Os2AP) homologs: Sequences that show similarity to those described in this application can be identified by computer-based methods, using public domain sequence alignment algorithms and sequence similarity search tools to search sequence C1 databases (public domain databases include Genbank, EMBL, Swiss-Prot, PIR and others). [0061] Similarity searches retrieve and align sequences for comparison with a target sequence to Crr be analyzed a query sequence). The optimal alignment between local regions of the Scompared sequences is known as a local alignment. Sequence comparison algorithms use n) scoring matrices to assign an overall score to each of the alignments. C [00621 Polynucleotide and polypeptide sequences may be aligned, and percentage of identical residues in a specified region may be determined against other polynucleotide and polypeptide sequences, using computer algorithms that are publicly available. The percentage identity score is dependent on the length of the overlap region of the sequences being compared. [00631 The similarity between two nucleic acid sequences or two amino acid sequences may be expressed in terms of sequence identity (or, for proteins, also in terms of sequence similarity). Sequence identity is frequently measured in terms of percentage identity; the higher the percentage, the more similar the two sequences are. As described herein, homologs and variants of the 2AP protein-encoding nucleic acid molecules may be used in the present invention. Homologs and variants of these nucleic acid molecules will possess a relatively high degree of sequence identity when aligned using standard methods. Such homologs and variants will hybridize under high stringency conditions to one another. [0064] Methods of alignment of sequences for comparison are well known in the art. Various programs and alignment algorithms are described in: Smith and Waterman (1981); Needleman and Wunsch (1970); Pearson and Lipman (1988); Higgins and Sharp (1988); Higgins and Sharp (1989); Corpet et al. (1988); Huang et al. (1992); and Pearson et al. (1994). Altschul et al. (1994) presents a detailed consideration of sequence alignment methods and homology calculations. [00651 The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al., 1990) is available from several sources, including the National Center for Biotechnology Information O (NCBI, Bethesda, MD) and on the Internet, for use in connection with the sequence analysis programs blastp, blastn, blastx, tblastn and tblastx. It can be accessed at the NCBI Website. A Sdescription of how to determine sequence identity using this program is available at the NCBI C website. [00661 Homologs of the disclosed protein sequences are typically characterized by possession of rr, at least 40% sequence identity counted over the full length alignment with the amino acid sequence of the disclosed sequence using the NCBI Blast 2.0, gapped blastp set to default 'n parameters. The adjustable parameters are preferably set with the following values: overlap span 1, overlap fraction 0.125, word threshold 11. The HSP S and HSP S2 parameters are dynamic values and are established by the program itself dependiig upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched; however, the values may be adjusted to increase sensitivity. Proteins with even greater similarity to the reference sequences will show increasing percentage identities when assessed by this method, such as at least about 50%, at least about 60%, at least about at least about 75%, at least about 80%, at least about 85%, at least about 90% or at least about sequence identity. 100671 Homologs of the disclosed nucleic acid sequences are typically characterized by possession of at least 40% sequence identity counted over the full length alignment with the amino acid sequence of the disclosed sequence using the NCBI Blast 2.0, gapped blastn set to default parameters. In addition, such sequences hybridize to homologous sequences under high stringency conditions. A preferred method utilizes the BLASTN module of WU-BLAST-2 (Altschul et al., 1996); set to the default parameters, with overlap span and overlap fraction set to 1 and 0.125, respectively. Nucleic acid sequences with even greater similarity to the reference sequences will show increasing percentage identities when assessed by this method, such as at least about 50%, at least about 60%, at least about 70%, at least about 75%, at least about at least about 85%, at least about 90% or at least about 95% sequence identity. [00681 The alignment may include the introduction of gaps in the sequences to be aligned. In addition, for sequences which contain either more or fewer amino acids than the protein depicted in SEQ ID NO: 3 or SEQ ID NO: 6, it is understood that in one embodiment, the percentage of O sequence identity will be determined based on the number of identical amino acids in relation to the total number of amino acids. Thus, for example, sequence identity of sequences shorter than Sthat shown in the figures as discussed below, will be determined using the number of amino N acids in the longer sequence, in one embodiment. In percent identity calculations relative weight is not assigned to various manifestations of sequence variation, such as, insertions, deletions, t substitutions, etc. [0069] In one embodiment, only identities are scored positively and all forms of sequence tl variation including gaps are assigned a value of which obviates the need for a weighted scale or parameters as described herein for sequence similarity calculations. Percent sequence identity can be calculated, for example, by dividing the number of matching identical residues by the total number of residues of the "shorter" sequence in the aligned region and multiplying by 100. The "longer" sequence is the one having the most actual residues in the aligned region. [0070] Proteins can be classified according to their sequence relatedness to other proteins in the same genome (paralogs) or a different genome (orthologs). Ortholog genes are genes that evolved by speciation from a common ancestral gene. These genes normally retain the same function as they evolve. Paralog genes are genes that are duplicated within a genome. These genes may acquire new specificities or modified functions which may be related to the original one. Phylogenetic analysis methods are well-known to those with ordinary skill in bioinformatics. [0071] As will be appreciated by those skilled in the art, the sequences of the present invention may contain sequencing errors. That is, there may be incorrect amino acid sequences, nucleotides, frameshifts, unknown nucleotides, or other types of sequencing errors in any of the sequences; however, the correct sequences will fall within the homology and stringency definitions herein for nucleic acids, and the protein homology described for proteins or polypeptides. [0072] 2-acetyl-l-pyrroline (2AP) polypeptide: As used herein, the term "2AP polypeptide" means a gene product having substantially the amino acid sequence of a 2AP ortholog. A 2AP polypeptide is characterized, in part, in that a decrease in its expression, reduction in its mRNA levels, or reduction in protein amount or activity results in an increase in the levels of the O compound 2-acetyl-l-pyrroline in the plant. A 2AP polypeptide also is characterized, in part, by having an amino acid sequence with at least about 30%, 40%, 50%, 60%, 70%, 80%, 90% or amino acid identity with the amino acid sequence depicted in SEQ ID NO: 3 or SEQ ID N NO: 6. sC [0073] Substantially identical: By "substantially identical" is meant a polypeptide or nucleic rcn acid exhibiting at least 30%, preferably 50%, more preferably 80%, and most preferably 90%, or 0 even 95% homology to a reference amino acid sequence (for example, the amino acid sequence V) depicted in SEQ ID NO: 3 or SEQ ID NO: 6) or nucleic acid sequence (for example, the nucleic 0 acid sequence depicted in SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: For polypeptides, the length of comparison sequences will generally be at least 16 amino acids, preferably at least 20 amino acids, more preferably at least 25 amino acids, and most preferably amino acids or greater. For nucleic acids, the length of comparison sequences will generally be at least 50 nucleotides, preferably at least 60 nucleotides, more preferably at least nucleotides, and most preferably 110 nucleotides or greater. [0074] Sequence identity is typically measured using sequence analysis software (for example, Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705, BLAST, or PILEUP/PRETTYBOX programs). For example, such software when set to standard parameters matches identical or similar sequences by assigning degrees of homology to various substitutions, deletions, and/or other modifications. Conservative substitutions typically include substitutions within the following groups: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. [0075] Elevated level: Elevated level, as used herein, means an increase in the average level of the compound 2-acetyl-l-pyrroline in a non-naturally occurring plant when compared to the average level of the compound 2-acetyl-l-pyrroline in the corresponding naturally occurring plant. Given that the level of the compound 2-acetyl-l-pyrroline in a plant will vary from plant to plant depending upon a number of variables, one of skill in the art would understand that in comparing the average level of the compound 2-acetyl-l-pyrroline in a non-naturally occurring plant and the corresponding naturally occurring plant, a reasonably sized sample population of each type of plant grown under similarly controlled conditions should be compared. The level of Sthe compound 2-acetyl-1-pyrroline is preferably measured in several plants from each population C1 and averaged to determine whether the non-naturally occurring plant contained an elevated level. The elevated level in the non-naturally occurring plant of the present invention is, on average, at ttn least about 20% greater, 40% greater, 60% greater, 80% greater, 100% greater, 150% greater, rn 200% greater, 250% greater, 300% greater, 400% greater, or 500% greater than in the C1 corresponding naturally occurring plant. 0 [0076] Promoter: A recognition site on a DNA sequence or group of DNA sequences that provide an expression control element for a structural gene and to which RNA polymerase specifically binds and initiates RNA synthesis (transcription) of that gene. [0077] Examples of plant expression constructs using these promoters are found in Fraley et al., U.S. Pat. No. 5,352,605. In most tissues of transgenic plants, the CaMV 35S promoter is a strong promoter (see, Odell et al., Nature 313:810, 1985). The CaMV promoter is also highly active in monocots (see, Dekeyser et al., Plant Cell 2:591, 1990; Terada and Shimamoto, Mol. Gen. Genet. 220:389, 1990). Moreover, activity of this promoter can be further increased between 2-10 fold) by duplication of the CaMV 35S promoter (see Kay et al., Science 236:1299, 1987; Ow et al., Proc. Natl. Acad. Sci., U.S.A. 84:4870, 1987; and Fang et al., Plant Cell 1:141, 1989, and McPherson and Kay, U.S. Pat. No. 5,378,142). [0078] Other useful plant promoters include, without limitation, the nopaline synthase (NOS) promoter (An et al., Plant Physiol. 88:547, 1988 and Rodgers and Fraley, U.S. Pat. No. 5,034,322), the octopine synthase promoter (Fromm et al., Plant Cell 1:977, 1989), figwort mosaic virus (FMV) promoter (Rogers, U.S. Pat. No. 5,378,619), and the rice actin promoter (Wu and McElroy, W091/09948). [0079] Exemplary monocot promoters include, without limitation, commelina yellow mottle virus promoter, sugar cane badna virus promoter, rice tungro bacilliform virus promoter, maize streak virus element, and wheat dwarf virus promoter. O [0080] Construct: Unless otherwise stated, the term "construct" refers to a recombinant genetic molecule comprising one or more isolated polynucleotide sequences of the invention. [0081] Genetic constructs used for transgene expression in a host organism include a gene promoter sequence operably linked to an open reading frame and optionally a gene termination O, sequence 3' downstream of the open reading frame. The open reading frame may be orientated in CM either a sense or anti-sense direction, depending upon the intended use of the gene sequence. 0 The construct may also include selectable marker gene(s) and other regulatory elements for gene l expression. C [0082] Vector: A DNA molecule capable of replication in a host cell and/or to which another DNA segment can be operatively linked so as to bring about replication of the attached segment. Plasmids, phagemids, cosmids, phage, virus, YACs, and BACs are all exemplary vectors. [0083] The term "vector" refers to a nucleic acid molecule which is used to introduce a polynucleotide sequence into a host cell, thereby producing a transformed host cell. A "vector" may include genetic material in addition to the above-described genetic construct, one or more nucleic acid sequences that permit it to replicate in one or more host cells, such as origin(s) of replication, selectable marker genes and other genetic elements known in the art sequences for integrating the genetic material into the genome of the host cell, and so on). [0084] Transformed: A transformed cell is a cell into which has been introduced a nucleic acid molecule by molecular biology techniques. As used herein, the term transformation encompasses all techniques by which a nucleic acid molecule might be introduced into such a cell, plant or animal cell, including transfection with viral vectors, transformation by Agrobacterium, with plasmid vectors, and introduction of naked DNA by electroporation, lipofection, and particle gun acceleration and includes transient as well as stable transformants. [00851 Methods for DNA transformation of plant cells include Agrobacterium-mediated plant transformation, protoplast transformation, gene transfer into pollen, injection into reproductive organs, injection into immature embryos and particle bombardment. Each of these methods has distinct advantages and disadvantages. Thus, one particular method of introducing genes into a S particular plant strain may not necessarily be the most effective for another plant strain, but it is well known which methods are useful for a particular plant strain. O [0086] There are many methods for introducing transforming DNA segments into cells, but not all are suitable for delivering DNA to plant cells. Suitable methods are believed to include O virtually any method by which DNA can be introduced into a cell, such as by Agrobacterium CMi infection, direct delivery of DNA such as, for example, by PEG-mediated transformation of Cc€ 0 protoplasts (Omirulleh et al., 1993), by desiccation/inhibition-mediated DNA uptake, by tl) electroporation, by agitation with silicon carbide fibers, by acceleration of DNA coated particles, 0 etc. In certain embodiments, acceleration methods are preferred and include, for example, microprojectile bombardment and the like. [0087] Technology for introduction of DNA into cells is well-known to those of skill in the art. Four general methods for delivering a gene into cells have been described: chemical methods (Graham and van der Eb, 1973; Zatloukal et al., 1992); physical methods such as microinjection (Capecchi, 1980), electroporation (Wong and Neumann, 1982; Fromm et al., 1985) and the gene gun (Johnston and Tang, 1994; Fynan et al., 1993); viral vectors (Clapp, 1993; Lu et al., 1993; Eglitis and Anderson, 1988a; 1988b); and receptor-mediated mechanisms (Curiel et al., 1991; 1992; Wagner et al., 1992). [0088] Electroporation: The application of brief, high-voltage electric pulses to a variety of animal and plant cells leads to the formation of nanometer-sized pores in the plasma membrane. DNA is taken directly into the cell cytoplasm either through these pores or as a consequence of the redistribution of membrane components that accompanies closure of the pores. Electroporation can be extremely efficient and can be used both for transient expression of cloned genes and for establishment of cell lines that carry integrated copies of the gene of interest. Electroporation, in contrast to calcium phosphate-mediated transfection and protoplast fusion, frequently gives rise to cell lines that carry one, or at most a few, integrated copies of the foreign DNA. [0089] The introduction of DNA by means of electroporation is well-known to those of skill in the art. In this method, certain cell wall-degrading enzymes, such as pectin-degrading enzymes, are employed to render the target recipient cells more susceptible to transformation by Selectroporation than untreated cells. Alternatively, recipient cells are made more susceptible to transformation by mechanical wounding. To effect transformation by electroporation one may Semploy either friable tissues such as a suspension culture of cells or embryogenic callus, or C alternatively one may transform immature embryos or other organized tissues directly. One would partially degrade the cell walls of the chosen cells by exposing them to pectin-degrading t enzymes (pectolyases) or mechanically wounding in a controlled manner. Such cells would then rn* be receptive to DNA transfer by electroporation, which may be carried out at this stage, and C transformed cells can then identified by a suitable selection or screening protocol dependent on the nature of the newly incorporated DNA. [0090] Microprojetile Bombardment: A further advantageous method for delivering transforming DNA segments to plant cells is microprojectile bombardment. In this method, particles may be coated with nucleic acids and delivered into cells by a propelling force. Exemplary particles include those comprised of tungsten, gold, platinum, and the like. [0091] An advantage of microprojectile bombardment, in addition to it being an effective means of reproducibly obtaining stably transforming monocots, is that neither the isolation of protoplasts (Cristou et al., 1988) nor the susceptibility to Agrobacterium infection is required. An illustrative embodiment of a method for delivering DNA into maize cells by acceleration is a Biolistics Particle Delivery System, which can be used to propel particles coated with DNA or cells through a screen, such as a stainless steel or Nytex screen, onto a filter surface covered with corn cells cultured in suspension. The screen disperses the particles so that they are not delivered to the recipient cells in large aggregates. It is believed that a screen intervening between the projectile apparatus and the cells to be bombarded reduces the size of projectiles aggregate and may contribute to a higher frequency of transformation by reducing damage inflicted on the recipient cells by projectiles that are too large. [0092] For the bombardment, cells in suspension are preferably concentrated on filters or solid culture medium. Alternatively, immature embryos or other target cells may be arranged on solid culture medium. The cells to be bombarded are positioned at an appropriate distance below the macroprojectile stopping plate. If desired, one or more screens are also positioned between the acceleration device and the cells to be bombarded. Through the use of techniques set forth herein one may obtain up to 1000 or more foci of cells transiently expressing a marker gene. The number of cells in a focus which express the exogenous gene product 48 hours postbombardment often range from 1 to 10 and average 1 to 3. [0093] In bombardment transformation, one may optimize the prebombardment culturing ,O conditions and the bombardment parameters to yield the maximum numbers of stable Mr, transformants. Both the physical and biological parameters for bombardment are important in this technology. Physical factors are those that involve manipulating the DNA/microprojectile V) precipitate or those that affect the flight and velocity of either the macro- or microprojectiles. O Biological factors include all steps involved in manipulation of cells before and immediately after bombardment, the osmotic adjustment of target cells to help alleviate the trauma associated with bombardment, and also the nature of the transforming DNA, such as linearized DNA or intact supercoiled plasmids. It is believed that pre-bombardment manipulations are especially important for successful transformation of immature embryos. [0094] Accordingly, it is contemplated that one may wish to adjust some of the bombardment parameters in small scale studies to fully optimize the conditions. One may particularly wish to adjust physical parameters such as gap distance, flight distance, tissue distance, and helium pressure. One may also minimize the trauma reduction factors (TRFs) by modifying conditions which influence the physiological state of the recipient cells and which may therefore influence transformation and integration efficiencies. For example, the osmotic state, tissue hydration and the subculture stage or cell cycle of the recipient cells may be adjusted for optimum transformation. The execution of other routine adjustments will be known to those of skill in the art in light of the present disclosure. [0095] Agrobacterium-Mediated Transfer: Agrobacterium-mediated transfer is a widely applicable system for introducing genes into plant cells because the DNA can be introduced into whole plant tissues, thereby bypassing the need for regeneration of an intact plant from a protoplast. The use of Agrobacterium-mediated plant integrating vectors to introduce DNA into plant cells is well known in the art. See, for example, the methods described (Fraley et al., 1985; Rogers et al., 1987). Further, the integration of the Ti-DNA is a relatively precise process resulting in few rearrangements. The region of DNA to be transferred is defined by the border O sequences, and intervening DNA is usually inserted into the plant genome as described (Spielmann et al., 1986; Jorgensen et al., 1987). [0096] Moder Agrobacterium transformation vectors are capable of replication in E. coli as well as Agrobacterium, allowing for convenient manipulations as described (Klee et al., 1985). O Moreover, recent technological advances in vectors for Agrobacterium-mediated gene transfer C have improved the arrangement of genes and restriction sites in the vectors to facilitate 0 construction of vectors capable of expressing various polypeptide coding genes. The vectors It described (Rogers et al., 1987), have convenient multi-linker regions flanked by a promoter and O a polyadenylation site for direct expression of inserted polypeptide coding genes and are suitable for present purposes. In addition, Agrobacterium containing both armed and disarmed Ti genes can be used for the transformations. In those plant strains where Agrobacterium-mediated transformation is efficient, it is the method of choice because of the facile and defined nature of the gene transfer. [0097] Agrobacterium-mediated transformation of leaf disks and other tissues such as cotyledons and hypocotyls appears to be limited to plants that Agrobacterium naturally infects. Agrobacterium-mediated transformation is most efficient in dicotyledonous plants. Few monocots appear to be natural hosts for Agrobacterium, although transgenic plants have been produced in asparagus using Agrobacterium vectors as described (Bytebier et al., 1987). Therefore, commercially important cereal grains such as rice, corn, and wheat must usually be transformed using alternative methods. [0098] A transgenic plant formed using Agrobacterium transformation methods typically contains a single gene on one chromosome. Such transgenic plants can be referred to as being heterozygous for the added gene. However, inasmuch as use of the word "heterozygous" usually implies the presence of a complementary gene at the same locus of the second chromosome of a pair of chromosomes, and there is no such gene in a plant containing one added gene as here, it is believed that a more accurate name for such a plant is an independent segregant, because the added, exogenous gene segregates independently during mitosis and meiosis. [0099] More preferred is a transgenic plant that is homozygous for the added structural gene; a transgenic plant that contains two added genes, one gene at the same locus on each S chromosome of a chromosome pair. A homozygous transgenic plant can be obtained by sexually mating (selfing) an independent segregant transgenic plant that contains a single added gene, Sgerminating some of the seed produced and analyzing the resulting plants produced for elevated CN 2-acetyl-l -pyrroline levels relative to a control (native, non-transgenic) or an independent segregant transgenic plant. tIN M~f [0100] Application of these systems to different plant strains depends upon the ability to 0 regenerate that particular plant strain from protoplasts. Illustrative methods for the regeneration 'C of cereals from protoplasts are described (Fujimura et al., 1985; Toriyama et al., 1986; Yamada 0 et al., 1986; Abdullah et al., 1986). [0101] To transform plant strains that cannot be successfully regenerated from protoplasts, other ways to introduce DNA into intact cells or tissues can be utilized. For example, regeneration of cereals from immature embryos or explants can be effected as described (Vasil, 1988). In addition, "particle gun" or high-velocity microprojectile technology can be utilized (Vasil, 1992). [0102] Using that latter technology, DNA is carried through the cell wall and into the cytoplasm on the surface of small metal particles as described (Klein et al., 1987; Klein et al., 1988; McCabe et al., 1988). The metal particles penetrate through several layers of cells and thus allow the transformation of cells within tissue explants. [0103] Isolated: An "isolated" biological component (such as a nucleic acid or protein or organelle) has been substantially separated or purified away from other biological components in the cell or the organism in which the component naturally occurs, other chromosomal and extra-chromosomal DNA and RNA, proteins and organelles. Nucleic acids and proteins that have been "isolated" include nucleic acids and proteins purified by standard purification methods. The term embraces nucleic acids including chemically synthesized nucleic acids and also embraces proteins prepared by recombinant expression in vitro or in a host cell and recombinant nucleic acids as defined below. As an example, a gene in a large genomic DNA fragment such as a contig is not sufficiently purified away from other biological components to be considered isolated due to the relatively large amount of extra DNA found in the average contig. As outlined below "recombinant nucleic acids" and "recombinant proteins" also are "isolated" as described above. O [0104] Recombinant: By "recombinant nucleic acid" herein is meant a nucleic acid that has a sequence that is not naturally occurring or has a sequence that is made by an artificial Scombination of two otherwise separated segments of sequence. This artificial combination is C often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of nucleic acids, by genetic engineering techniques, such as by the manipulation of at least one tn nucleic acid by a restriction enzyme, ligase, recombinase, and/or a polymerase. Once introduced into a host cell, a recombinant nucleic acid is replicated by the host cell; however, the recombinant nucleic acid once replicated in the cell remains a recombinant nucleic acid for purposes of this invention. By "recombinant protein" herein is meant a protein produced by a i method employing a recombinant nucleic acid. As outlined above "recombinant nucleic acids" and "recombinant proteins" also are "isolated" as described above. A gene in a large fragment such as a contig would not be a "recombinant nucleic acid" given that such artificial combination does not relate to the gene. However, if sequences around or within a gene in a contig have been manipulated for purposes relating to that gene not merely because the gene is near the end of the contig), then such a gene in a contig would constitute a "recombinant nucleic acid" due to the relative proximity of the recombinant portion of the nucleic acid to the gene in question. [0105] Non-naturally occurring plant: As used herein, the term "non-naturally occurring," when used in reference to a plant, means a plant that has been genetically modified by human intervention to alter the levels of the compound 2-acetyl-1-pyrroline in the plant. The naturally occurring plant that has been genetically modified is referred to as the "control" plant. Within the context of the invention, the control plant may be a transgenic plant. For example, the control plant may be an herbicide resistant transgenic plant. In this example, a non-naturally occuring plant would be herbicide resistant and would have elevated levels of the compound 2acetyl- I -pyrroline compared to the control herbicide resistant plant. A transgenic plant of the invention, for example, is a non-naturally occurring plant that contains an exogenous nucleic acid molecule encoding a 2AP gene or fragment thereof and, therefore, has been genetically modified by human intervention. In addition, a plant that contains a mutation in, for example, a 2AP gene regulatory element or coding sequence as a result of calculated exposure to a mutagenic agent, such as a chemical mutagen, or an "insertional mutagen," such as a transposon or T-DNA, also is considered a non-naturally occurring plant, since it has been genetically modified by human intervention. In contrast, a plant containing only spontaneous or naturally O occurring mutations is not a "non-naturally occurring plant" as defined herein. One skilled in the art understands that, while a non-naturally occurring plant typically has a nucleotide sequence Sthat is altered as compared to a naturally occurring plant, a non-naturally occurring plant also can CI be genetically modified by human intervention without altering its nucleotide sequence, for example, by modifying its methylation pattern. |