Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_2087 |
Symbol | dhs |
ID | 7386892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 1711720 |
End bp | 1712997 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643651301 |
Product | 2-dehydro-3-deoxyphosphoheptonate aldolase |
Protein accession | YP_002549496 |
Protein GI | 222148539 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGGCGACCT ATCCGCCGCT GGTCTTTGCT GGTGAAGCGC GCCGGTTGAA AAAGGCGCTT GCCAATGTGG CTGATGGCAA TGGCTTCCTG CTTCAGGGCG GCGATTGTGC CGAAAGCTTT GCCGAACACG GCGCCGACAC GATCCGCGAC TTCTTCCGCG CCTTCCTGCA GATGGCCGTT GTCCTGACCT TTGGCGCCCA GCTTCCGGTC GTCAAGGTCG GCCGCATCGC TGGCCAGTTC GCCAAGCCGC GCTCGTCGGA TTTCGAGCGT CAGGGCGATG TCGAGTTGCC GAGCTACCGT GGCGATATCA TCAATGGCAT CGATTTCACC GAAGAGTCTC GCGTTCCCGA TCCGCATCGT CAGTTGATGG CCTATCGCCA GTCAGCCGCG ACGCTGAACC TGCTGCGCGC TTTCGCCATG GGTGGCTATG CCAATCTCGA AAACGTTCAT CAATGGATGC TGGGCTTCGT CAAGGACAGC CCGCAGGCAG AGCGTTACCG CAAGCTTGCC GACCGGATTT CCGAGACCAT GGATTTCATG AAGGCGGTCG GCATCACGGC GGAAACCAAT GCCAGCCTGC GCGAAACCGA TTTCTTCACC AGCCATGAAG CGCTGCTTCT TGGCTATGAA GAGGCGCTGA CCCGCGTCGA CTCGACATCT GGCGATCATT ACGCCACATC AGGCCACATG ATCTGGATTG GCGACCGTAC CCGTCAGGCC GATCATGCCC ATATCGAATA TTGCCGCGGA ATCAAAAACC CGCTGGGTCT CAAATGCGGC CCGTCGCTTC AGGCTGACGA TCTTCTCAAC CTGATCGACA TTCTCAATCC GCAAAATGAA GCGGGTCGTC TGACGCTGAT CTGCCGCTTC GGCCACGACA AGGTTGCTGA CCATCTGCCG CGCCTGATCC GCGCGGTGGA GCGGGAAGGG CGCAAGGTCG TATGGTCCTG CGATCCGATG CATGGCAACA CCATCACGCT CAACCACTAC AAGACCCGGC CCTTTGACCG GATCCTGTCG GAAGTGGAAA GCTTCTTCCA GATCCACCGG GCTGAAGGCT CGCATCCAGG CGGCATCCAT ATCGAGATGA CCGGCAACGA CGTGACCGAA TGCACCGGTG GCGCACGCGC CGTTTCCGCT GAAGATTTGC AGGATCGCTA CCATACCCAT TGCGACCCGC GTCTCAATGC GGACCAGGCG CTGGAACTGG CCTTCCTTCT GGCCGAGCGC ATGAAGGGCG GACGCGACGA GAAGCGGCTG AGAACAGTCG GGGCCTGA
|
Protein sequence | MATYPPLVFA GEARRLKKAL ANVADGNGFL LQGGDCAESF AEHGADTIRD FFRAFLQMAV VLTFGAQLPV VKVGRIAGQF AKPRSSDFER QGDVELPSYR GDIINGIDFT EESRVPDPHR QLMAYRQSAA TLNLLRAFAM GGYANLENVH QWMLGFVKDS PQAERYRKLA DRISETMDFM KAVGITAETN ASLRETDFFT SHEALLLGYE EALTRVDSTS GDHYATSGHM IWIGDRTRQA DHAHIEYCRG IKNPLGLKCG PSLQADDLLN LIDILNPQNE AGRLTLICRF GHDKVADHLP RLIRAVEREG RKVVWSCDPM HGNTITLNHY KTRPFDRILS EVESFFQIHR AEGSHPGGIH IEMTGNDVTE CTGGARAVSA EDLQDRYHTH CDPRLNADQA LELAFLLAER MKGGRDEKRL RTVGA
|
| |