Gene ECH74115_5224 details

Gene Information

Locus tag: ECH74115_5224
Symbol: wecE
ID: 6971312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4871648
End bp: 4872778
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 55%
IMG OID: 643388889
Product: TDP-4-oxo-6-deoxy-D-glucose transaminase
Protein accession: YP_002273309
Protein GI: 209400869
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0399] Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis
TIGRFAM ID: [TIGR02379] TDP-4-keto-6-deoxy-D-glucose transaminase
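
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4871648
end_bp = 4872778

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1131, matches "Gene Length"
print(protein_length)  # 376, matches "Protein Length"
```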


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.744835
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCCAT TTAACGCACC GCCGGTGGTG GGAACCGAAC TCGACTATAT GCAGTCGGCA 
ATGGGTAGCG GCAAACTGTG TGGCGATGGC GGTTTTACCC GTCGCTGCCA GCAGTGGCTG
GAGCAACGTT TTGGCAGCGC CAAAGTGTTA CTGACGCCGT CCTGCACCGC TTCGCTGGAG
ATGGCGGCGC TGCTGCTCGA TATCCAGCCT GGCGATGAAG TGATCATGCC GAGCTACACC
TTTGTCTCCA CCGCCAATGC CTTTGTGCTG CGTGGCGCAA AAATCGTTTT TGTGGATGTT
CGCCCGGACA CCATGAACAT CGACGAAACG TTGATTGAAG CGGCGATCAC CGACAAAACG
CGCGTTATCG TGCCGGTGCA TTACGCGGGC GTGGCCTGCG AAATGGACAC CATTATGGCG
TTGGCGAAAA AGCATAATCT GTTTGTGGTG GAAGATGCTG CTCAGGGCGT GATGTCCACT
TACAAAGGGC GTGCACTGGG AACCATTGGT CATATTGGCT GCTTTAGCTT CCATGAAACC
AAAAACTACA CGGCGGGCGG TGAAGGCGGC GCGACGCTGA TTAACGATAA AGCGTTGATC
GAACGAGCCG AGATCATCCG TGAAAAAGGC ACAAACCGCA GCCAGTTCTT CCGTGGTCAG
GTCGATAAAT ATACCTGGCG CGATATCGGC TCCAGCTATT TGATGTCCGA TCTGCAAGCT
GCGTACCTGT GGGCGCAACT GGAAGCAGCG GATCGTATCA ACCAGCAACG TCTGGCGCTG
TGGCAAAACT ACTACGATGC GTTAGCACCT CTGGCGAAAG CCGGGCGTAT CGAGCTGCCG
TCGATTCCCG ATGGCTGCTT GCAGAACGCG CATATGTTTT ACATTAAACT GCGGGATATT
GGTGACCGGA GCGCGTTGAT TAACTTTCTG AAAGAAGCGG AAATCATGGC GGTGTTCCAT
TACATTCCGC TGCACGGTTG CCCTGCGGGG GAACGCTTTG GTGAGTTCCA CGGTGAAGAT
CGCTACACCA CCAAAGAGAG CGAGCGCCTG CTGCGCCTGC CGCTGTTCTA CAACCTGTCG
CCCGTCAATC AGCGTACGGT AATTGCGACT TTGTTGAACT ACTTCTCCTG A
 
Protein sequence
MIPFNAPPVV GTELDYMQSA MGSGKLCGDG GFTRRCQQWL EQRFGSAKVL LTPSCTASLE 
MAALLLDIQP GDEVIMPSYT FVSTANAFVL RGAKIVFVDV RPDTMNIDET LIEAAITDKT
RVIVPVHYAG VACEMDTIMA LAKKHNLFVV EDAAQGVMST YKGRALGTIG HIGCFSFHET
KNYTAGGEGG ATLINDKALI ERAEIIREKG TNRSQFFRGQ VDKYTWRDIG SSYLMSDLQA
AYLWAQLEAA DRINQQRLAL WQNYYDALAP LAKAGRIELP SIPDGCLQNA HMFYIKLRDI
GDRSALINFL KEAEIMAVFH YIPLHGCPAG ERFGEFHGED RYTTKESERL LRLPLFYNLS
PVNQRTVIAT LLNYFS
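
The protein sequence is the translation of the gene sequence under translation table 11, and the GC content is the share of G and C bases in the gene. The sketch below shows one way to reproduce both from the sequence text. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built by hand on the assumption that table 11 assigns amino acids to codons the same way the standard code does (the two differ in permitted start codons). Only the first 60 bases are used in the usage lines; pass the full gene sequence to check the whole record.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid mapping as the
# standard code; only the set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 60 bases of the gene sequence, copied from the record.
first_line = (
    "ATGATTCCAT TTAACGCACC GCCGGTGGTG GGAACCGAAC TCGACTATAT GCAGTCGGCA"
)
print(translate(first_line))  # MIPFNAPPVVGTELDYMQSA
print(round(gc_percent(first_line), 1))
```

The translated fragment matches the first 20 residues of the protein sequence listed above.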