Gene ECH74115_0856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0856 
SymbolaroG 
ID6966646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp872858 
End bp873910 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content52% 
IMG OID643384881 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002269381 
Protein GI209400570 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0352016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.842103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTATC AGAACGACGA TTTACGCATC AAAGAAATCA AAGAGTTACT TCCTCCTGTC 
GCATTGCTGG AAAAATTCCC CGCTACTGAA AATGCCGCGA ATACGGTTGC CCATGCCCGA
AAAGCGATCC ATAAGATCCT GAAAGGTAAT GATGATCGCC TGTTGGTGGT GATTGGCCCG
TGCTCAATTC ATGATCCTGT CGCGGCTAAA GAGTATGCCA CTCGCTTGCT GGCGCTGCGT
GAAGAGCTGA AAGATGAGCT GGAAATCGTA ATGCGCGTCT ATTTTGAAAA GCCGCGTACT
ACGGTGGGCT GGAAAGGGCT GATTAACGAT CCGCATATGG ATAACAGCTT CCAGATCAAC
GACGGTCTGC GTATAGCCCG TAAATTGCTG CTTGATATTA ACGACAGCGG TCTGCCAGCG
GCGGGTGAAT TCCTCGATAT GATCACTCCT CAGTATCTCG CTGACCTGAT GAGCTGGGGC
GCAATTGGTG CACGTACCAC GGAATCGCAG GTGCACCGCG AACTGGCGTC TGGTCTTTCT
TGTCCGGTAG GTTTTAAAAA TGGCACAGAC GGTACGATTA AAGTGGCTAT CGATGCCATT
AATGCCGCCG GTGCGCCGCA CTGCTTCCTG TCCGTAACTA AATGGGGGCA TTCGGCGATT
GTGAATACCA GCGGTAACGG CGATTGCCAT ATCATTCTGC GCGGCGGTAA AGAGCCTAAC
TACAGCGCGA AGCACGTTGC TGAAGTGAAA GAAGGGCTGA ACAAAGCAGG TCTGCCAGCT
CAGGTGATGA TCGATTTCAG CCATGCTAAT TCGTCCAAAC AATTCAAAAA GCAGATGGAT
GTTTGTGCTG ACGTTTGCCA GCAGATTGCC GGTGGCGAAA AGGCCATTAT CGGTGTGATG
GTGGAAAGTC ATCTGGTGGA AGGCAATCAG AGCCTGGAGA GCGGGGAGCC GCTGGCCTAT
GGTAAGAGCA TCACCGATGC CTGCATTGGC TGGGAAGATA CCGATGCTCT GTTACGTCAA
CTGGCGAACG CAGTGAAAGC GCGTCGCGGG TAA
 
Protein sequence
MNYQNDDLRI KEIKELLPPV ALLEKFPATE NAANTVAHAR KAIHKILKGN DDRLLVVIGP 
CSIHDPVAAK EYATRLLALR EELKDELEIV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN
DGLRIARKLL LDINDSGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS
CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKEPN
YSAKHVAEVK EGLNKAGLPA QVMIDFSHAN SSKQFKKQMD VCADVCQQIA GGEKAIIGVM
VESHLVEGNQ SLESGEPLAY GKSITDACIG WEDTDALLRQ LANAVKARRG