Gene EcHS_A0808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0808 
SymbolaroG 
ID5592996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp812234 
End bp813286 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content52% 
IMG OID640919980 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001457547 
Protein GI157160229 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value0.147419 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATC AGAACGACGA TTTACGCATC AAAGAAATCA AAGAGTTACT TCCTCCTGTC 
GCATTGCTGG AAAAATTCCC CGCTACTGAA AATGCCGCGA ATACGGTTGC CCATGCCCGA
AAAGCGATCC ATAAGATCCT GAAAGGTAAT GATGATCGCC TGTTGGTTGT GATTGGCCCA
TGCTCAATTC ATGATCCTGT CGCGGCAAAA GAGTATGCCA CTCGCTTGCT GGCGCTGCGT
GAAGAGCTGA AAGATGAGCT GGAAATCGTA ATGCGCGTCT ATTTTGAAAA GCCGCGTACC
ACGGTGGGCT GGAAAGGGCT GATTAACGAT CCGCATATGG ATAATAGCTT CCAGATCAAC
GACGGTCTGC GTATAGCCCG TAAATTGCTG CTTGATATTA ACGACAGCGG TCTGCCAGCG
GCAGGTGAGT TTCTCGATAT GATCACCCCA CAATATCTCG CTGACCTGAT GAGCTGGGGC
GCAATTGGCG CACGTACCAC CGAATCGCAG GTGCACCGCG AACTGGCATC AGGGCTTTCT
TGTCCGGTCG GCTTCAAAAA TGGCACCGAC GGTACGATTA AAGTGGCTAT CGATGCCATT
AATGCCGCCG GTGCGCCGCA CTGCTTCCTG TCCGTAACGA AATGGGGGCA TTCGGCGATT
GTGAATACCA GCGGTAACGG CGATTGCCAT ATCATTCTGC GCGGCGGTAA AGAGCCTAAC
TACAGCGCGA AGCACGTTGC TGAAGTGAAA GAAGGGCTGA ACAAAGCAGG CCTGCCAGCA
CAGGTGATGA TCGATTTCAG CCATGCTAAC TCGTCCAAAC AATTCAAAAA GCAGATGGAT
GTTTGTGCTG ACGTTTGCCA GCAGATTGCC GGTGGCGAAA AGGCCATTAT TGGCGTGATG
GTGGAAAGCC ATCTGGTGGA AGGCAATCAG AGCCTCGAGA GCGGGGAGCC GCTGGCCTAC
GGTAAGAGCA TCACCGATGC CTGCATCGGC TGGGAAGATA CCGATGCTCT GTTACGTCAA
CTGGCGAATG CAGTAAAAGC GCGTCGCGGG TAA
 
Protein sequence
MNYQNDDLRI KEIKELLPPV ALLEKFPATE NAANTVAHAR KAIHKILKGN DDRLLVVIGP 
CSIHDPVAAK EYATRLLALR EELKDELEIV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN
DGLRIARKLL LDINDSGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS
CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKEPN
YSAKHVAEVK EGLNKAGLPA QVMIDFSHAN SSKQFKKQMD VCADVCQQIA GGEKAIIGVM
VESHLVEGNQ SLESGEPLAY GKSITDACIG WEDTDALLRQ LANAVKARRG