Gene ECH74115_2333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2333 
SymbolmalX 
ID6968923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2205279 
End bp2206871 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content53% 
IMG OID643386207 
Productbifunctional PTS system maltose and glucose-specific transporter subunits IICB 
Protein accessionYP_002270691 
Protein GI209400964 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
[TIGR02004] PTS system, maltose and glucose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGA AAACAGCACC GAAAGTCACG CTGTGGGAGT TCTTCCAGCA GTTAGGCAAA 
ACCTTCATGT TACCCGTGGC ATTATTGTCG TTCTGCGGCA TTATGCTCGG CATTGGTAGT
TCCCTTAGTA GCCACGATGT CATCACCCTG CTCCCGGTCC TGGGCAACCC CGTGTTGCAG
GCTATCTTTA CCTGGATGAG TAAGATTGGC TCGTTTGCTT TTAGTTTCCT GCCAGTCATG
TTCTGTATCG CCATCCCGCT GGGTCTGGCA CGAGAAAACA AAGGCGTAGC GGCATTCGCT
GGCTTCGTCG GTTATGCGGT AATGAACCTC GCGGTAAACT TCTGGTTGAC CAATAAAGGC
ATTCTGCCAA CCACGGATGC CGCCGTTCTG AAAGCCAATA ACATCCAGAG CATTCTTGGG
ATCCAGTCGA TCGACACCGG GATCCTCGGT GCGGTGATCG CCGGTATTAT CGTCTGGATG
CTGCATGAGC GATTCCATAA TATCCGCCTG CCGGATGCGC TGGCATTCTT CGGCGGTACG
CGCTTCGTAC CAATCATCTC CTCGCTGGTG ATGGGCCTTG TCGGCCTGGT GATTCCATTA
GTCTGGCCGA TTTTCGCCAT GGGTATTAGC GGCTTAGGCC ATATGATCAA TAGCGCGGGT
GATTTCGGAC CGATGCTGTT TGGTACCGGT GAACGTCTGC TGTTGCCGTT TGGTCTGCAT
CACATTCTGG TGGCATTAAT TCGCTTTACC GACGCAGGTG GCACGCAGGA AGTCTGCGGT
CAAACCGTCA GCGGCGCATT GACCATCTTC CAGGCGCAAT TGAGTTGCCC GACCACTCAC
GGTTTTTCTG AAAGCGCCAC GCGTTTCCTT TCGCAAGGCA AAATGCCTGC GTTTCTCGGC
GGTCTGCCAG GTGCAGCGTT AGCGATGTAT CACTGCGCGC GCCCGGAAAA TCGCCATAAA
ATTAAAGGGC TGCTGATTTC TGGCCTGATC GCCTGTGTCG TTGGCGGCAC GACCGAACCG
CTGGAATTCC TGTTCCTGTT CGTAGCGCCA GTTCTGTATG TCATCCACGC GCTGTTAACC
GGCCTCGGCT TCACCGTCAT GTCTGTGCTC GGCGTCACCA TCGGTAATAC CGACGGCAAT
ATCATCGACT TCGTGGTGTT CGGTATTTTG CATGGTCTGT CAACCAAGTG GTACATGGTG
CCAGTGGTGG TGGCAATCTG GTTTGTCGTT TACTACGTCA TCTTCCGTTT CGCTATCACC
CGCTTCAATC TGAAAACCCC GGGGCGCGAT AGCGAAGTGG CCAGTTCAAT CGAAAAAGCC
GTTGCCGGTG CGCCGGGTAA ATCAGGTTAC AACGTTCCGG CAATCCTCGA AGCCTTAGGC
GGTGCCGACA ATATTGTTAG CCTCGATAAC TGCATTACCC GTCTGCGTTT GTCTGTGAAA
GATATGTCGC TTGTTAATGT GCAGGCACTG AAGGACAATC GGGCAATTGG CGTAGTACAA
CTTAATCAAC ATAACCTGCA GGTTGTTATC GGGCCACAAG TTCAGTCAGT AAAAGATGAA
ATGGCCGGTC TGATGCATAC TGTCCAGGCA TAA
 
Protein sequence
MTAKTAPKVT LWEFFQQLGK TFMLPVALLS FCGIMLGIGS SLSSHDVITL LPVLGNPVLQ 
AIFTWMSKIG SFAFSFLPVM FCIAIPLGLA RENKGVAAFA GFVGYAVMNL AVNFWLTNKG
ILPTTDAAVL KANNIQSILG IQSIDTGILG AVIAGIIVWM LHERFHNIRL PDALAFFGGT
RFVPIISSLV MGLVGLVIPL VWPIFAMGIS GLGHMINSAG DFGPMLFGTG ERLLLPFGLH
HILVALIRFT DAGGTQEVCG QTVSGALTIF QAQLSCPTTH GFSESATRFL SQGKMPAFLG
GLPGAALAMY HCARPENRHK IKGLLISGLI ACVVGGTTEP LEFLFLFVAP VLYVIHALLT
GLGFTVMSVL GVTIGNTDGN IIDFVVFGIL HGLSTKWYMV PVVVAIWFVV YYVIFRFAIT
RFNLKTPGRD SEVASSIEKA VAGAPGKSGY NVPAILEALG GADNIVSLDN CITRLRLSVK
DMSLVNVQAL KDNRAIGVVQ LNQHNLQVVI GPQVQSVKDE MAGLMHTVQA