Gene Avi_2087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_2087 
Symboldhs 
ID7386892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp1711720 
End bp1712997 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content60% 
IMG OID643651301 
Product2-dehydro-3-deoxyphosphoheptonate aldolase 
Protein accessionYP_002549496 
Protein GI222148539 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGGCGACCT ATCCGCCGCT GGTCTTTGCT GGTGAAGCGC GCCGGTTGAA AAAGGCGCTT 
GCCAATGTGG CTGATGGCAA TGGCTTCCTG CTTCAGGGCG GCGATTGTGC CGAAAGCTTT
GCCGAACACG GCGCCGACAC GATCCGCGAC TTCTTCCGCG CCTTCCTGCA GATGGCCGTT
GTCCTGACCT TTGGCGCCCA GCTTCCGGTC GTCAAGGTCG GCCGCATCGC TGGCCAGTTC
GCCAAGCCGC GCTCGTCGGA TTTCGAGCGT CAGGGCGATG TCGAGTTGCC GAGCTACCGT
GGCGATATCA TCAATGGCAT CGATTTCACC GAAGAGTCTC GCGTTCCCGA TCCGCATCGT
CAGTTGATGG CCTATCGCCA GTCAGCCGCG ACGCTGAACC TGCTGCGCGC TTTCGCCATG
GGTGGCTATG CCAATCTCGA AAACGTTCAT CAATGGATGC TGGGCTTCGT CAAGGACAGC
CCGCAGGCAG AGCGTTACCG CAAGCTTGCC GACCGGATTT CCGAGACCAT GGATTTCATG
AAGGCGGTCG GCATCACGGC GGAAACCAAT GCCAGCCTGC GCGAAACCGA TTTCTTCACC
AGCCATGAAG CGCTGCTTCT TGGCTATGAA GAGGCGCTGA CCCGCGTCGA CTCGACATCT
GGCGATCATT ACGCCACATC AGGCCACATG ATCTGGATTG GCGACCGTAC CCGTCAGGCC
GATCATGCCC ATATCGAATA TTGCCGCGGA ATCAAAAACC CGCTGGGTCT CAAATGCGGC
CCGTCGCTTC AGGCTGACGA TCTTCTCAAC CTGATCGACA TTCTCAATCC GCAAAATGAA
GCGGGTCGTC TGACGCTGAT CTGCCGCTTC GGCCACGACA AGGTTGCTGA CCATCTGCCG
CGCCTGATCC GCGCGGTGGA GCGGGAAGGG CGCAAGGTCG TATGGTCCTG CGATCCGATG
CATGGCAACA CCATCACGCT CAACCACTAC AAGACCCGGC CCTTTGACCG GATCCTGTCG
GAAGTGGAAA GCTTCTTCCA GATCCACCGG GCTGAAGGCT CGCATCCAGG CGGCATCCAT
ATCGAGATGA CCGGCAACGA CGTGACCGAA TGCACCGGTG GCGCACGCGC CGTTTCCGCT
GAAGATTTGC AGGATCGCTA CCATACCCAT TGCGACCCGC GTCTCAATGC GGACCAGGCG
CTGGAACTGG CCTTCCTTCT GGCCGAGCGC ATGAAGGGCG GACGCGACGA GAAGCGGCTG
AGAACAGTCG GGGCCTGA
 
Protein sequence
MATYPPLVFA GEARRLKKAL ANVADGNGFL LQGGDCAESF AEHGADTIRD FFRAFLQMAV 
VLTFGAQLPV VKVGRIAGQF AKPRSSDFER QGDVELPSYR GDIINGIDFT EESRVPDPHR
QLMAYRQSAA TLNLLRAFAM GGYANLENVH QWMLGFVKDS PQAERYRKLA DRISETMDFM
KAVGITAETN ASLRETDFFT SHEALLLGYE EALTRVDSTS GDHYATSGHM IWIGDRTRQA
DHAHIEYCRG IKNPLGLKCG PSLQADDLLN LIDILNPQNE AGRLTLICRF GHDKVADHLP
RLIRAVEREG RKVVWSCDPM HGNTITLNHY KTRPFDRILS EVESFFQIHR AEGSHPGGIH
IEMTGNDVTE CTGGARAVSA EDLQDRYHTH CDPRLNADQA LELAFLLAER MKGGRDEKRL
RTVGA